File tree 1 file changed +5
-4
lines changed
1 file changed +5
-4
lines changed Original file line number Diff line number Diff line change @@ -70,7 +70,9 @@ python -m torch.distributed.launch --nproc_per_node=2 imagenet_fast.py -a sge_re
70
70
```
71
71
#### WS-ResNet with e-shifted L2 regularizer, e = 1e-3
72
72
```
73
- python -m torch.distributed.launch --nproc_per_node=8 imagenet_fast.py -a ws_resnet50 --data /share1/public/public/imagenet1k/ --epochs 100 --schedule 30 60 90 --wd 1e-4 --gamma 0.1 -c checkpoints/imagenet/es1e-3_ws_resnet50 --train-batch 32 --opt-level O0 --label-smoothing 0. --warmup 0 --nowd-conv --mineps 1e-3 --el2
73
+ python -m torch.distributed.launch --nproc_per_node=8 imagenet_fast.py -a ws_resnet50 --data /share1/public/public/imagenet1k/ \
74
+ --epochs 100 --schedule 30 60 90 --wd 1e-4 --gamma 0.1 -c checkpoints/imagenet/es1e-3_ws_resnet50 --train-batch 32 \
75
+ --opt-level O0 --label-smoothing 0. --warmup 0 --nowd-conv --mineps 1e-3 --el2
74
76
```
75
77
76
78
--------------------------------------------------------
@@ -131,13 +133,12 @@ Note that the following models are with bias wd = 0.
131
133
| WS-ShuffleNetV2(e = 1e-8) | 69.70 | [ GoogleDrive] ( https://drive.google.com/file/d/1Oc04IvP9JTFM8yDnlbmB5wnugr_3Cd0I/view?usp=sharing ) |
132
134
| WS-MobileNetV1(e = 1e-6) | 73.60 | [ GoogleDrive] ( https://drive.google.com/file/d/17oAS8W2Mr83qhgI-gTRG1H6WJGMQdFMB/view?usp=sharing ) |
133
135
134
- ### Detection
135
-
136
136
--------------------------------------------------------
137
137
## Results of "Generalization Bound Regularizer: A Unified Framework for Understanding Weight Decay"
138
138
139
+ ### To appear
139
140
140
-
141
+ --------------------------------------------------------
141
142
## Citation
142
143
143
144
If you find our related works useful in your research, please consider citing the paper:
You can’t perform that action at this time.
0 commit comments