Jiahao Xu

Jiahao004

AI & ML interests

Sentence Embeddings; Neural Machine Translation

upvoted a paper 2 months ago

The End of Manual Decoding: Towards Truly End-to-End Language Models

Paper • 2510.26697 • Published Oct 30, 2025 • 116

upvoted 2 papers 7 months ago

DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

Paper • 2505.23754 • Published May 29, 2025 • 15

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Paper • 2504.11456 • Published Apr 15, 2025 • 12

upvoted a collection 7 months ago

DeepTheorem

A dataset and RL-zero pipeline for advanced mathematical reasoning of informal theorem proving. • 6 items • Updated Jun 11, 2025 • 2

upvoted a paper 9 months ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21, 2025 • 88

upvoted a paper about 1 year ago

Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 62