39 36 30

Shizhe Diao

shizhediao2

https://shizhediao.github.io/

AI & ML interests

LLM pre-training and reasoning

Recent Activity

upvoted a paper 9 days ago

PhyCritic: Multimodal Critic Models for Physical AI

upvoted a paper 10 days ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

upvoted a paper about 1 month ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

View all activity

Organizations

New activity in nvidia/ToolScale 2 months ago

Add metadata and refactor to ToolScale Dataset Card

#3 opened 3 months ago by

nielsr

New activity in nvidia/Nemotron-Orchestrator-8B 3 months ago

Adding `transformers` as the library name

#18 opened 3 months ago by

ariG23498

Upload merges.txt with huggingface_hub

#1 opened 3 months ago by

bestluck123

Upload config.json with huggingface_hub

#2 opened 3 months ago by

bestluck123

Upload model-00006-of-00007.safetensors with huggingface_hub

#3 opened 3 months ago by

bestluck123

Upload model-00003-of-00007.safetensors with huggingface_hub

#4 opened 3 months ago by

bestluck123

Upload model-00001-of-00007.safetensors with huggingface_hub

#5 opened 3 months ago by

bestluck123

Upload special_tokens_map.json with huggingface_hub

#6 opened 3 months ago by

bestluck123

Upload vocab.json with huggingface_hub

#7 opened 3 months ago by

bestluck123

Upload model-00005-of-00007.safetensors with huggingface_hub

#8 opened 3 months ago by

bestluck123

Upload tokenizer_config.json with huggingface_hub

#9 opened 3 months ago by

bestluck123

Upload added_tokens.json with huggingface_hub

#10 opened 3 months ago by

bestluck123

Upload tokenizer.json with huggingface_hub

#12 opened 3 months ago by

bestluck123

Upload generation_config.json with huggingface_hub

#13 opened 3 months ago by

bestluck123

Upload model.safetensors.index.json with huggingface_hub

#14 opened 3 months ago by

bestluck123

Upload model-00002-of-00007.safetensors with huggingface_hub

#15 opened 3 months ago by

bestluck123

Upload model-00004-of-00007.safetensors with huggingface_hub

#16 opened 3 months ago by

bestluck123

Upload model-00007-of-00007.safetensors with huggingface_hub

#17 opened 3 months ago by

bestluck123

New activity in nvidia/ToolScale 3 months ago

Upload dataset

#1 opened 3 months ago by

bestluck123

New activity in nvidia/Nemotron-Research-Reasoning-Qwen-1.5B 3 months ago

Update README.md

#9 opened 3 months ago by

jianh-nvidia

Shizhe Diao

AI & ML interests

Recent Activity

Organizations

shizhediao2's activity

Add metadata and refactor to ToolScale Dataset Card

Adding `transformers` as the library name

Upload merges.txt with huggingface_hub

Upload config.json with huggingface_hub

Upload model-00006-of-00007.safetensors with huggingface_hub

Upload model-00003-of-00007.safetensors with huggingface_hub

Upload model-00001-of-00007.safetensors with huggingface_hub

Upload special_tokens_map.json with huggingface_hub

Upload vocab.json with huggingface_hub

Upload model-00005-of-00007.safetensors with huggingface_hub

Upload tokenizer_config.json with huggingface_hub

Upload added_tokens.json with huggingface_hub

Upload tokenizer.json with huggingface_hub

Upload generation_config.json with huggingface_hub

Upload model.safetensors.index.json with huggingface_hub

Upload model-00002-of-00007.safetensors with huggingface_hub

Upload model-00004-of-00007.safetensors with huggingface_hub

Upload model-00007-of-00007.safetensors with huggingface_hub

Upload dataset

Update README.md