This repository contains the code and articles from the Neural Bits Newsletter, showcasing:
- how to optimize and quantize models for optimal performance
- how to serve models efficiently in production environments at scale
ID | 📝 Article | 💻 Code | Details | Complexity | Tech Stack |
---|---|---|---|---|---|
001 | Inference Engines Profiling | Here | Profile a CNN model across PyTorch, ONNX, TensorRT, and TorchCompile | 🟩🟩⬜ | Python, Jupyter |
002 | Deploying DL models with NVIDIA Triton Inference Server | Here | Full tutorial on how to set up and deploy ML models with Triton Inference Server | 🟩🟩🟩 | Python, Docker, Bash |
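Entry 001 compares inference latency across several runtimes; the core measurement loop is the same regardless of backend. Below is a minimal, framework-agnostic sketch of that idea. The `benchmark` helper and its parameters are illustrative assumptions, not code taken from the repository:

```python
import statistics
import time


def benchmark(fn, warmup=10, iters=100):
    """Time a zero-argument callable: run warmup passes, then measure `iters` runs.

    Returns (mean_ms, p95_ms) over the measured iterations.
    """
    for _ in range(warmup):  # warmup runs let caches/JIT compilation settle
        fn()
    times_ms = []
    for _ in range(iters):
        start = time.perf_counter()
        fn()
        times_ms.append((time.perf_counter() - start) * 1e3)
    times_ms.sort()
    mean_ms = statistics.mean(times_ms)
    p95_ms = times_ms[int(0.95 * (len(times_ms) - 1))]  # 95th-percentile latency
    return mean_ms, p95_ms


# Stand-in workload; in practice fn would wrap a model's forward pass
# (e.g. a PyTorch eager call vs. its TensorRT or torch.compile counterpart).
mean_ms, p95_ms = benchmark(lambda: sum(i * i for i in range(10_000)))
print(f"mean={mean_ms:.3f} ms  p95={p95_ms:.3f} ms")
```

Reporting a percentile alongside the mean matters when comparing engines, since tail latency often differs between runtimes even when averages look similar.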