Overwrite weights (initializers) with fixed data or random data #27

zhenhuaw-me · 2022-09-14T01:49:41Z

Bert series ONNX models are very large (x GB) thus not easy to share the real file. We can improve this process by overwriting the weights (initializers)

It can be fixed data (e.g. all 0.1 or other value specified), thus the model can be compressed.
After sharing, we can recover with numpy style random numbers.

This can only be used as a sharing method, the generated model are not useful when evaluate accuracy.

For better usage:

Annotation will be added when writing fixed data, thus when re-random we can detect automatically.
The tensors can be specified with names or size.
Only works for FP32/FP16.
0 removed.

The text was updated successfully, but these errors were encountered:

zhenhuaw-me added the enhancement New feature or request label Sep 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Overwrite weights (initializers) with fixed data or random data #27

Overwrite weights (initializers) with fixed data or random data #27

zhenhuaw-me commented Sep 14, 2022

Overwrite weights (initializers) with fixed data or random data #27

Overwrite weights (initializers) with fixed data or random data #27

Comments

zhenhuaw-me commented Sep 14, 2022