Skip to content

Releases: intel/neural-compressor

Intel® Low Precision Optimization Tool v1.0 Beta Release

31 Aug 10:16
Compare
Choose a tag to compare

Intel® Low Precision Optimization Tool v1.0 beta release is featured by:

  • Built-in dataloaders and evaluators
  • Add random and exhaustive tuning strategies
  • Mix precision tuning support on TensorFlow (INT8/BF16/FP32)
  • Quantization-aware training POC support on Pytorch
  • TensorFlow mainstream version support, including 1.15.2, 1.15UP1 and 2.1.0
  • 50+ models validated

Supported Models:

TensorFlow Model Category
ResNet50 V1 Image Recognition
ResNet50 V1.5 Image Recognition
ResNet101 Image Recognition
Inception V1 Image Recognition
Inception V2 Image Recognition
Inception V3 Image Recognition
Inception V4 Image Recognition
ResNetV2_50 Image Recognition
ResNetV2_101 Image Recognition
ResNetV2_152 Image Recognition
Inception ResNet V2 Image Recognition
SSD ResNet50 V1 Object Detection
Wide & Deep Recommendation
VGG16 Image Recognition
VGG19 Image Recognition
Style_transfer Style Transfer
PyTorch Model Category
BERT-Large RTE Language Translation
BERT-Large QNLI Language Translation
BERT-Large CoLA Language Translation
BERT-Base SST-2 Language Translation
BERT-Base RTE Language Translation
BERT-Base STS-B Language Translation
BERT-Base CoLA Language Translation
BERT-Base MRPC Language Translation
DLRM Recommendation
BERT-Large MRPC Language Translation
ResNext101_32x8d Image Recognition
BERT-Large SQUAD Language Translation
ResNet50 V1.5 Image Recognition
ResNet18 Image Recognition
Inception V3 Image Recognition
YOLO V3 Object Detection
Peleenet Image Recognition
ResNest50 Image Recognition
SE_ResNext50_32x4d Image Recognition
ResNet50 V1.5 QAT Image Recognition
ResNet18 QAT Image Recognition
MxNet Model Category
ResNet50 V1 Image Recognition
MobileNet V1 Image Recognition
MobileNet V2 Image Recognition
SSD-ResNet50 Object Detection
SqueezeNet V1 Image Recognition
ResNet18 Image Recognition
Inception V3 Image Recognition

Known Issues:

  • TensorFlow ResNet50 v1.5 int8 model will crash on TensorFlow 1.15 UP1 branch

Validated Configurations:

  • Python 3.6 & 3.7
  • Centos 7
  • Intel TensorFlow 1.15.2, 2.1.0 and 1.15UP1
  • PyTorch 1.5
  • MxNet 1.6

Distribution:

  Channel Links Install Command
Source Github https://github.com/intel/lp-opt-tool.git $ git clone https://github.com/intel/lp-opt-tool.git
Binary Pip https://pypi.org/project/ilit $ pip install ilit
Binary Conda https://anaconda.org/intel/ilit $ conda config --add channels intel $ conda install ilit

Contact:

Please feel free to contact [email protected], if you get any questions.

Intel® Low Precision Optimization Toolkit (iLiT) v1.0 Alpha Release

11 Aug 12:14
Compare
Choose a tag to compare

Intel® Low Precision Optimization Tool (iLiT) is an open-sourced python library which is intended to deliver a unified low-precision inference solution cross multiple Intel optimized DL frameworks on both CPU and GPU. It supports automatic accuracy-driven tuning strategies, along with additional objectives like performance, model size, or memory footprint. It also provides the easy extension capability for new backends, tuning strategies, metrics and objectives.

Feature List:

  • Unified low precision quantization interface cross multiple Intel optimized frameworks (TensorFlow, PyTorch, and MXNet)
  • Built-in tuning strategies, including Basic, Bayesian, and MSE
  • Built-in evaluation metrics, including TopK (image classification), F1 (NLP), and CocoMAP (object detection)
  • Built-in tuning objectives, including Performance, ModelSize, and Footprint
  • Extensible API design to add new strategy, framework backend, metric, and objective
  • KL-divergence calibration for TensorFlow and MXNet
  • Tuning process resume from certain checkpoint

Supported Models:

Model Framework Model Framework Model Framework
ResNet50 V1 MXNet BERT-Large RTE PyTorch ResNet18 PyTorch
MobileNet V1 MXNet BERT-Large QNLI PyTorch ResNet50 V1 TensorFlow
MobileNet V2 MXNet BERT-Large CoLA PyTorch ResNet50 V1.5 TensorFlow
SSD-ResNet50 MXNet BERT-Base SST-2 PyTorch ResNet101 TensorFlow
SqueezeNet V1 MXNet BERT-Base RTE PyTorch Inception V1 TensorFlow
ResNet18 MXNet BERT-Base STS-B PyTorch Inception V2 TensorFlow
Inception V3 MXNet BERT-Base CoLA PyTorch Inception V3 TensorFlow
DLRM PyTorch BERT-Base MRPC PyTorch Inception V4 TensorFlow
BERT-Large MRPC PyTorch ResNet101 PyTorch Inception ResNet V2 TensorFlow
BERT-Large SQUAD PyTorch ResNet50 V1.5 PyTorch SSD ResNet50 V1 TensorFlow

Known Issues:

  • Statistics collection for KL algorithm is slow in TensorFlow due to lack of tensor inspector APIs
  • MSE tuning strategy is not supported in PyTorch

Validated Configurations:

  • Python 3.6 & 3.7
  • Centos 7
  • TensorFlow 1.15, 2.0 and 2.1
  • PyTorch 1.5
  • MxNet 1.6

Distribution:

  Channel Links Install Command
Source Github https://github.com/intel/lp-opt-tool.git $ git clone https://github.com/intel/lp-opt-tool.git
Binary Pip https://pypi.org/project/ilit $ pip install ilit
Binary Conda https://anaconda.org/intel/ilit $ conda config --add channels intel $ conda install ilit

Contact:

Please feel free to contact [email protected], if you get any questions.