v0.8.0
New Changes:
- NuGet package for the .NET platform. This package offers comprehensive platform support, including Windows, Linux, MacOS, Android, and iOS. Both x64 and arm64 architectures are supported, where applicable.
- Support for pre-processing and post-processing of the Whisper model, inclusive of Audio and Tokenizer decoding operators.
- Extends support for pre-processing and post-processing of object-detection models, including a new DrawBoundingBoxes operator. Pre/post processing tools can add non-max-suppression to the model to select the best bounding boxes, and scale those to the original image. See the end-to-end example in
tutorials/yolo_e2e.py
. - Introduces the Audio Domain, complemented with AudioCodec and optimized STFT Operators, enhancing audio processing capabilities.
- Enabled optional input/output support for some operators such as GPT2Tokenizer, ClipTokenizer, and RobertaTokenizer.
- Refined the implementation of offset mapping for BBPE-style tokenizers for more operators and efficiency improvement.
- Other bug and security fixes.
Contributions
Contributors to ONNX Runtime Extensions include members across teams at Microsoft, along with our community members: @edgchen1 @kunal-vaishnavi @sayanshaw24 @skottmckay @snnn @VishalX @wenbingl @wejoncy
Full Changelog: v0.7.0...v0.8.0