Hongli Zhou

Joe-Hall-Lee

https://Joe-Hall-Lee.github.io

AI & ML interests

Large Language Models

Recent Activity

upvoted a paper about 2 months ago

Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory

liked a Space 4 months ago

allenai/reward-bench

authored a paper 5 months ago

Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory

Organizations

None yet

authored 3 papers 5 months ago

Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory

Paper • 2505.15055 • Published May 21 • 1

Mitigating the Bias of Large Language Model Evaluation

Paper • 2409.16788 • Published Sep 25, 2024

Think-J: Learning to Think for Generative LLM-as-a-Judge

Paper • 2505.14268 • Published May 20