zuijiang's picture

1 13 4

zuijiang

zuijiang

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

The Reasoning-Creativity Trade-off: Toward Creativity-Driven Problem Solving

upvoted a paper 6 days ago

Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling

upvoted a paper 20 days ago

Coupled Variational Reinforcement Learning for Language Model General Reasoning

View all activity

Organizations

upvoted a paper 3 days ago

The Reasoning-Creativity Trade-off: Toward Creativity-Driven Problem Solving

Paper • 2601.00747 • Published 6 days ago • 17

upvoted a paper 6 days ago

Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling

Paper • 2512.23959 • Published 10 days ago • 94

upvoted a paper 20 days ago

Coupled Variational Reinforcement Learning for Language Model General Reasoning

Paper • 2512.12576 • Published 25 days ago • 2

upvoted a paper about 1 month ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 245

upvoted a paper 4 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2, 2025 • 83

upvoted 2 papers 7 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263

GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic Tasks

Paper • 2504.12764 • Published Apr 17, 2025 • 41

upvoted a paper 9 months ago

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published Apr 14, 2025 • 85

upvoted a paper 11 months ago

DeepRAG: Thinking to Retrieval Step by Step for Large Language Models

Paper • 2502.01142 • Published Feb 3, 2025 • 24

upvoted 2 papers about 1 year ago

Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models

Paper • 2501.01830 • Published Jan 3, 2025 • 17

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published Nov 18, 2024 • 24

upvoted 2 papers over 2 years ago

RAIN: Your Language Models Can Align Themselves without Finetuning

Paper • 2309.07124 • Published Sep 13, 2023 • 3

RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

Paper • 2309.00267 • Published Sep 1, 2023 • 52