These are the release notes for v0.4.0.
Highlights
- Proper ONNX export, with optimized peak memory usage and execution speed on some ONNX inference engines
- Split c6 computation to reduce peak memory usage #17
- Fix tracing batch size #18
- Reduce the number of Gather ops in the ONNX-exported model #20
Others