Jonathan Yinhan He
jonathanhe123
AI & ML interests
None yet
Recent Activity
upvoted a paper 9 days ago
IAPO: Information-Aware Policy Optimization for Token-Efficient Reasoning
updated a model 16 days ago
jonathanhe123/iapo
authored a paper 21 days ago
Rank-GRPO: Training LLM-based Conversational Recommender Systems with Reinforcement Learning
Organizations
None yet