daily-papers
updated
Fast Matrix Multiplications for Lookup Table-Quantized LLMs
Paper
• 2407.10960
• Published
• 13
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG
Capabilities
Paper
• 2407.14482
• Published
• 26
EVLM: An Efficient Vision-Language Model for Visual Understanding
Paper
• 2407.14177
• Published
• 45
Knowledge Mechanisms in Large Language Models: A Survey and Perspective
Paper
• 2407.15017
• Published
• 34
Compact Language Models via Pruning and Knowledge Distillation
Paper
• 2407.14679
• Published
• 39
DDK: Distilling Domain Knowledge for Efficient Large Language Models
Paper
• 2407.16154
• Published
• 22
PERSONA: A Reproducible Testbed for Pluralistic Alignment
Paper
• 2407.17387
• Published
• 20
LAMBDA: A Large Model Based Data Agent
Paper
• 2407.17535
• Published
• 37
Wolf: Captioning Everything with a World Summarization Framework
Paper
• 2407.18908
• Published
• 32
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains
Paper
• 2407.18961
• Published
• 40
Tails Tell Tales: Chapter-Wide Manga Transcriptions with Character Names
Paper
• 2408.00298
• Published
• 11
Finch: Prompt-guided Key-Value Cache Compression
Paper
• 2408.00167
• Published
• 17
Improving Text Embeddings for Smaller Language Models Using Contrastive
Fine-tuning
Paper
• 2408.00690
• Published
• 25
Gemma 2: Improving Open Language Models at a Practical Size
Paper
• 2408.00118
• Published
• 78
SAM 2: Segment Anything in Images and Videos
Paper
• 2408.00714
• Published
• 120
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented
Generation
Paper
• 2408.02545
• Published
• 40
MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Paper
• 2408.01800
• Published
• 92
Synthesizing Text-to-SQL Data from Weak and Strong LLMs
Paper
• 2408.03256
• Published
• 10