
Can you share the hyper-parameters you used in training? I want to do a simple replay. #6

GuangtaoLyu opened this issue Apr 11, 2024 · 1 comment

@GuangtaoLyu

Hello,
Can you share the hyper-parameters you used in training? I want to do a simple replay.
Thank you.

@exitudio (Owner)

Hi,
The default parameters are the ones we used for training, except for the batch size: we use a batch size of 512 because we train on 4 GPUs. Accordingly, we divide total-iter, lr-scheduler, and eval-iter by 4, in proportion to the 4x larger batch size. The learning rate stays the same. Here is the batch script we use:

# Experiment name and the pretrained VQ-VAE checkpoint to build on
name='HML3D_45_crsAtt1lyr_20breset'
vq_name='2023-07-19-04-17-17_12_VQVAE_20batchResetNRandom_8192_32'
export CUDA_VISIBLE_DEVICES=0,1,2,3
# 4 GPUs: multiply the batch size by 4 and divide the iteration counts by 4
MULTI_BATCH=4

# Resulting values: batch-size 128*4 = 512; total-iter 300000/4 = 75000;
# lr-scheduler 150000/4 = 37500; eval-iter 20000/4 = 5000.
# Note: ${dataset_name} is not defined in this snippet; it is presumably
# set elsewhere in the author's environment.
python3 train_t2m_trans.py  \
    --exp-name ${name} \
    --batch-size $((128*MULTI_BATCH)) \
    --vq-name ${vq_name} \
    --out-dir output/${dataset_name} \
    --total-iter $((300000/MULTI_BATCH)) \
    --lr-scheduler $((150000/MULTI_BATCH)) \
    --dataname t2m \
    --eval-iter $((20000/MULTI_BATCH))
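
For reference, setting MULTI_BATCH=1 in the script above recovers the single-GPU settings, which should match the repository defaults the reply refers to. A minimal sketch, not from the thread; the output/t2m value for --out-dir is an assumption:

# Hypothetical single-GPU equivalent, derived from the script above by
# substituting MULTI_BATCH=1; --out-dir is an assumed value.
python3 train_t2m_trans.py \
    --exp-name ${name} \
    --batch-size 128 \
    --vq-name ${vq_name} \
    --out-dir output/t2m \
    --total-iter 300000 \
    --lr-scheduler 150000 \
    --dataname t2m \
    --eval-iter 20000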
