CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark Paper • 2401.11944 • Published Jan 22, 2024 • 27
SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval Paper • 2401.13478 • Published Jan 24, 2024 • 3
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models Paper • 2402.13109 • Published Feb 20, 2024
ChatMusician: Understanding and Generating Music Intrinsically with LLM Paper • 2402.16153 • Published Feb 25, 2024 • 58
m2mKD: Module-to-Module Knowledge Distillation for Modular Transformers Paper • 2402.16918 • Published Feb 26, 2024
DEEP-ICL: Definition-Enriched Experts for Language Model In-Context Learning Paper • 2403.04233 • Published Mar 7, 2024 • 1
COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning Paper • 2403.18058 • Published Mar 26, 2024 • 4
MuPT: A Generative Symbolic Music Pretrained Transformer Paper • 2404.06393 • Published Apr 9, 2024 • 16
MMRA: A Benchmark for Multi-granularity Multi-image Relational Association Paper • 2407.17379 • Published Jul 24, 2024 • 3
EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models Paper • 2502.06663 • Published Feb 10, 2025 • 2
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published Nov 23, 2025 • 283
Encyclo-K: Evaluating LLMs with Dynamically Composed Knowledge Statements Paper • 2512.24867 • Published 9 days ago • 1
Encyclo-K: Evaluating LLMs with Dynamically Composed Knowledge Statements Paper • 2512.24867 • Published 9 days ago • 1
WideSearch: Benchmarking Agentic Broad Info-Seeking Paper • 2508.07999 • Published Aug 11, 2025 • 110