Critique to Verify: Accurate and Honest Test-Time Scaling with RL-Trained Verifiers (https://arxiv.org/abs/2509.23152)
Zhicheng YANG
yangzhch6
AI & ML interests
reasoning with LLMs
Recent Activity
updated
a dataset 13 days ago
yangzhch6/Accordion-Thinking-Synthetic-Data published
a dataset 13 days ago
yangzhch6/Accordion-Thinking-Synthetic-Data Organizations
None yet