Yupeng Cao's picture

7

Yupeng Cao PRO

YupengCao

·

https://cyp0630.github.io/

CYP0630

AI & ML interests

NLP, Multimodal, Audio, Truthworthy

Recent Activity

updated a model about 1 month ago

YupengCao/t4-Qwen2-7B-Instruct-GRPO

updated a Space about 1 month ago

YupengCao/t4-Qwen2-7B-Instruct-GRPO

published a Space about 1 month ago

YupengCao/t4-Qwen2-7B-Instruct-GRPO

View all activity

Organizations

updated a model about 1 month ago

YupengCao/t4-Qwen2-7B-Instruct-GRPO

updated a Space about 1 month ago

T4 Qwen2 7B Instruct GRPO

Show a live tracking dashboard

published a Space about 1 month ago

T4 Qwen2 7B Instruct GRPO

Show a live tracking dashboard

published a model about 1 month ago

YupengCao/t4-Qwen2-7B-Instruct-GRPO

updated a Space about 1 month ago

Trackio

Display a visual summary of your program’s I/O activity

published a Space about 1 month ago

Trackio

Display a visual summary of your program’s I/O activity

published a model about 1 month ago

YupengCao/Qwen3-VL-4B-Instruct-trl-grpo

updated a dataset about 1 month ago

Financial-Misinformation-Detection/MultilingualFMD

Viewer • Updated Feb 27 • 42 • 43

updated a dataset 2 months ago

Financial-Misinformation-Detection/PersonaReasoning-v2

Viewer • Updated Feb 1 • 7 • 54

upvoted a paper 3 months ago

Same Claim, Different Judgment: Benchmarking Scenario-Induced Bias in Multilingual Financial Misinformation Detection

Paper • 2601.05403 • Published Jan 8 • 11

published a dataset 4 months ago

YupengCao/FinMCP

Viewer • Updated Dec 9, 2025 • 8.11k • 8

updated a dataset 4 months ago

YupengCao/FinMCP

Viewer • Updated Dec 9, 2025 • 8.11k • 8

updated a model 5 months ago

YupengCao/qwen2-7b-instruct-amazon-description

Updated Nov 12, 2025

published a model 5 months ago

YupengCao/qwen2-7b-instruct-amazon-description

Updated Nov 12, 2025

updated a dataset 5 months ago

YupengCao/OCR-evaluation

Preview • Updated Nov 12, 2025 • 4

published a dataset 5 months ago

YupengCao/OCR-evaluation

Preview • Updated Nov 12, 2025 • 4

upvoted a paper 6 months ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26, 2025 • 137

upvoted a paper 9 months ago

Truth Neurons

Paper • 2505.12182 • Published May 18, 2025 • 8

upvoted a paper 10 months ago

MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation

Paper • 2506.14028 • Published Jun 16, 2025 • 94

updated a model 11 months ago

YupengCao/halsci_lora_8bit

Updated May 23, 2025