-
Notifications
You must be signed in to change notification settings - Fork 309
Open
Labels
questionFurther information is requestedFurther information is requested
Description
The GMACs are the same regardless of any frozen layers.
All layers are trainable:
Computational complexity: 7.63 GMac
Number of parameters: 128.92 M
Only classifier is trainable:
Computational complexity: 7.63 GMac
Number of parameters: 155.69 k
In my understanding, if "param.requires_grad' is set to 'False' in some of the layers, those layers would not be computed however they would remain part of the graph.
So, in the calculation of GMACs, such layers should be excluded as they would not be computed during the training hence reducing the number of operations?
Please correct me if my understanding is wrong.
Metadata
Metadata
Assignees
Labels
questionFurther information is requestedFurther information is requested