[English|中文版]
FlagDNN is part of FlagOS. FlagDNN is a deep neural network computing library oriented towards multiple chip backends. It provides high-performance implementations of common deep learning operators, supporting efficient computation in fields such as deep learning, computer vision, natural language processing, and artificial intelligence.
FlagDNN is a high-performance deep learning operator library implemented using the Triton programming language launched by OpenAI.
- Operators have undergone deep performance tuning
- Triton kernel call optimization
- Flexible multi-backend support mechanism
- Support for common deep learning operators (ReLU, etc.)
pip install -U scikit-build-core>=0.11 pybind11 ninja cmakegit clone https://github.com/flagos-ai/FlagDNN.git
cd FlagDNN
pip install .import torch
import flag_dnn
# Create a tensor
x = torch.randn(1024, device='cuda')
# Apply ReLU activation
y = flag_dnn.ops.relu(x)This project is licensed under the Apache (version 2.0) License.