Change the repository type filter
All
Repositories list
1.3k repositories
- SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
ai-reference-models
PublicIntel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Intel® Data Center GPUsmlir-extensions
Public