Tag: QAT
2 articles
AI Research 42 - Multimodal Large Model Quantization: Fro...
Survey outline for multimodal large model quantization schemes: from FP32 to INT4. Core goal is model capability retention, compression efficiency 50-75%, inference speedup 2-4x. Analyze comparison...
AI Research 36 - Comprehensive Analysis of Multimodal Lar...
This comprehensive overview systematically introduces mainstream quantization techniques in multimodal models, including principles and practices of...