Guided Self-Evolving LLMs with Minimal Human Supervision Paper • 2512.02472 • Published Dec 2, 2025 • 54
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information Paper • 2510.03632 • Published Oct 4, 2025 • 42
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information Paper • 2510.03632 • Published Oct 4, 2025 • 42
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information Paper • 2510.03632 • Published Oct 4, 2025 • 42 • 3
Self-Rewarding Vision-Language Model via Reasoning Decomposition Paper • 2508.19652 • Published Aug 27, 2025 • 84
MobileGUI-RL: Advancing Mobile GUI Agent through Reinforcement Learning in Online Environment Paper • 2507.05720 • Published Jul 8, 2025 • 2
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning Paper • 2505.16421 • Published May 22, 2025 • 19
MedEdit: Model Editing for Medical Question Answering with External Knowledge Bases Paper • 2309.16035 • Published Sep 27, 2023 • 1
ECHOPulse: ECG controlled echocardio-grams video generation Paper • 2410.03143 • Published Oct 4, 2024