HLE Leaderboard for Agents with Tools 🥇 Humanity's Last Exam Leaderboard for LLM Agents with Tools • Running • 5
Exploring the Performance Improvement of Tensor Processing Engines through Transformation in the Bit-weight Dimension of MACs Paper • 2503.06342 • Published Mar 8, 2025 • 1
SeerAttention/SeerAttention-Llama-3.1-8B-AttnGates Text Generation • Updated Mar 3, 2025 • 945 • 4
EN-T: Optimizing Tensor Computing Engines Performance via Encoder-Based Methodology Paper • 2404.11887 • Published Apr 18, 2024
LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration Paper • 2408.06003 • Published Aug 12, 2024
SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs Paper • 2410.13276 • Published Oct 17, 2024 • 29
Daniel-Duda/bitdistiller-llama3-8b-instruct-int2g128 Text Generation • 8B • Updated Jul 23, 2024 • 1 • 1
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge Paper • 2407.00088 • Published Jun 25, 2024 • 12