Conversation

@abhishek-singh591
Contributor

Memory Optimization

Added periodic memory cleanup to FP16ClipTransform and SplitTensorsTransform to reduce peak memory usage during large tensor processing. External data is also no longer reloaded for tensors whose data is already present in memory.
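A minimal sketch of what such a cleanup pass could look like, assuming the transform walks the graph initializers (the function name and `CLEANUP_INTERVAL` below are illustrative, not the actual implementation):

```python
import gc

import onnx
from onnx import external_data_helper

# Hypothetical cleanup interval; the value used in the PR may differ.
CLEANUP_INTERVAL = 100


def clip_initializers_with_cleanup(model: onnx.ModelProto, onnx_base_dir: str) -> None:
    """Illustrative sketch: load external data only when it is not already
    present, and release memory periodically while processing large models."""
    for idx, tensor in enumerate(model.graph.initializer):
        # Skip the redundant external-data load when raw_data is already populated.
        if external_data_helper.uses_external_data(tensor) and not tensor.raw_data:
            external_data_helper.load_external_data_for_tensor(tensor, onnx_base_dir)

        # ... FP16 clipping / tensor splitting on the loaded data goes here ...

        # Periodic collection keeps peak memory bounded on large models.
        if (idx + 1) % CLEANUP_INTERVAL == 0:
            gc.collect()
```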

Time-Optimized ONNX Transform via Class Merging and Thread Pooling

This change merges the FP16 and Split ONNX transform classes into a single implementation, eliminating redundant tensor loading and iteration. The transform logic has also been refactored to use a thread pool in place of the previous sequential loop, parallelizing the per-tensor operations (see the sketch below).
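A rough sketch of how the merged, threaded transform might look, with a worker function mapped over the graph initializers (`apply_merged_transform` and `_process_tensor` are illustrative names, not the actual API):

```python
import os
from concurrent.futures import ThreadPoolExecutor

import numpy as np
import onnx
from onnx import numpy_helper

FP16_MAX = np.finfo(np.float16).max  # clip bound used by the FP16 transform


def _process_tensor(tensor: onnx.TensorProto) -> None:
    """Hypothetical per-tensor worker: load the data once, then apply both
    the FP16 clip and any split bookkeeping in a single pass.
    Assumes external data has already been loaded into the tensor."""
    if tensor.data_type != onnx.TensorProto.FLOAT:
        return
    array = numpy_helper.to_array(tensor)
    clipped = np.clip(array, -FP16_MAX, FP16_MAX)
    tensor.CopyFrom(numpy_helper.from_array(clipped, tensor.name))
    # ... splitting / external-data handling for large tensors would go here ...


def apply_merged_transform(model: onnx.ModelProto) -> onnx.ModelProto:
    # The thread pool replaces the previous sequential loop; tensor loading is
    # I/O-bound, so oversubscribing the CPU count tends to help throughput.
    with ThreadPoolExecutor(max_workers=os.cpu_count() * 4) as pool:
        list(pool.map(_process_tensor, model.graph.initializer))
    return model
```

Each worker mutates a distinct initializer, so no locking is needed in this sketch; the gain comes from overlapping the I/O of loading tensor data across threads.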

Performance Benchmarks:

| Model | Original Duration (s) | Optimized Duration (s) |
| --- | --- | --- |
| LLaMA 3.1 8B | 88.35 | 58.55 |
| LLaMA 3.1 70B | 1029.82 | 727.37 |

Note: The thread count is set to `os.cpu_count() * 4` to better handle I/O-bound workloads. Performance may vary depending on system hardware and threading capabilities.
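For reference, a guarded version of that worker-count calculation (`os.cpu_count()` can return None, so a fallback is a reasonable precaution; the multiplier itself is a tunable, not a hard requirement):

```python
import os
from concurrent.futures import ThreadPoolExecutor

# Oversubscribe the core count because tensor loading is I/O-bound.
num_workers = (os.cpu_count() or 1) * 4
executor = ThreadPoolExecutor(max_workers=num_workers)
```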
