Nikita Arsenin

lumirey

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 months ago

InteractComp: Evaluating Search Agents With Ambiguous Queries

upvoted a paper 3 months ago

SIM-CoT: Supervised Implicit Chain-of-Thought

upvoted a paper 4 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

View all activity

Organizations

upvoted a paper 2 months ago

InteractComp: Evaluating Search Agents With Ambiguous Queries

Paper • 2510.24668 • Published Oct 28, 2025 • 97

upvoted a paper 3 months ago

SIM-CoT: Supervised Implicit Chain-of-Thought

Paper • 2509.20317 • Published Sep 24, 2025 • 41

upvoted a paper 4 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18, 2025 • 114

upvoted a paper 5 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 267

upvoted 2 papers 7 months ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30, 2025 • 277

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263

liked a model 10 months ago

Qwen/QwQ-32B

Text Generation • 33B • Updated Mar 11, 2025 • 123k • • 2.88k

upvoted a paper about 1 year ago

Facilitating large language model Russian adaptation with Learned Embedding Propagation

Paper • 2412.21140 • Published Dec 30, 2024 • 18

liked a model about 1 year ago

nvidia/Hymba-1.5B-Instruct

Text Generation • 2B • Updated Jan 2, 2025 • 204 • 242

upvoted a paper about 1 year ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 45

upvoted 2 papers over 1 year ago

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Paper • 2405.21060 • Published May 31, 2024 • 68

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published Apr 18, 2024 • 55

upvoted 6 papers almost 2 years ago

Stealing Part of a Production Language Model

Paper • 2403.06634 • Published Mar 11, 2024 • 91

Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU

Paper • 2403.06504 • Published Mar 11, 2024 • 56

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 189

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Paper • 2402.19427 • Published Feb 29, 2024 • 56

Beyond Language Models: Byte Models are Digital World Simulators

Paper • 2402.19155 • Published Feb 29, 2024 • 53

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 152

liked a model almost 2 years ago

sander-wood/bgpt

Updated Mar 17, 2024 • 34

liked a Space almost 2 years ago

BigCode - Editor

💻

Run a web application server

Nikita Arsenin

AI & ML interests

Recent Activity

Organizations

lumirey's activity

BigCode - Editor