Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction Paper • 2512.04987 • Published Dec 4, 2025 • 76
Revisiting Long-context Modeling from Context Denoising Perspective Paper • 2510.05862 • Published Oct 7, 2025 • 20
EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning Paper • 2509.22576 • Published Sep 26, 2025 • 134
On Path to Multimodal Generalist: General-Level and General-Bench Paper • 2505.04620 • Published May 7, 2025 • 82
Taming the Titans: A Survey of Efficient LLM Inference Serving Paper • 2504.19720 • Published Apr 28, 2025 • 12 • 2
OTC: Optimal Tool Calls via Reinforcement Learning Paper • 2504.14870 • Published Apr 21, 2025 • 35
Test-time Computing: from System-1 Thinking to System-2 Thinking Paper • 2501.02497 • Published Jan 5, 2025 • 45
Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch Paper • 2410.18693 • Published Oct 24, 2024 • 42
LOGO -- Long cOntext aliGnment via efficient preference Optimization Paper • 2410.18533 • Published Oct 24, 2024 • 43
L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding? Paper • 2410.02115 • Published Oct 3, 2024 • 10
RicardoLee/Llama2-base-7B-Chinese-50W-pre_release Text Generation • Updated Jul 23, 2023 • 17 • 11
RicardoLee/Llama2-base-7B-Chinese-50W-Full2LoRA Text Generation • Updated Jul 24, 2023 • 13 • 9