Model Quantization and Hardware Acceleration for Vision Transformers: A Comprehensive Survey Paper • 2405.00314 • Published May 1, 2024
Low-bit Model Quantization for Deep Neural Networks: A Survey Paper • 2505.05530 • Published May 8, 2025 • 1