AI & ML interests
None yet
Organizations
None yet
models
19
MisDrifter/min_judge_seed555135
Text Generation
•
3B
•
Updated
•
5
MisDrifter/min_judge_model
Text Generation
•
3B
•
Updated
•
6
MisDrifter/reward1e5-whole
Text Generation
•
3B
•
Updated
•
7
MisDrifter/qwen2.5_3B_Instruct_rebel_reward_1e5
Text Generation
•
3B
•
Updated
•
7
MisDrifter/qwen2.5_3B_Instruct_rebel_reward_1e4
Text Generation
•
3B
•
Updated
•
5
MisDrifter/qwen2.5_3B_Instruct_rebel_1e5
Text Generation
•
3B
•
Updated
•
4
MisDrifter/qwen2.5_3B_Instruct_rebel_1e4
Text Generation
•
3B
•
Updated
•
4
MisDrifter/verl_rebel_actor
Text Generation
•
1B
•
Updated
•
4
MisDrifter/1e6_rebel_rerun
Text Generation
•
3B
•
Updated
•
6
Text Generation
•
3B
•
Updated
•
6
MisDrifter/iter2_scores_base_0
Viewer
•
Updated
•
10
•
3
MisDrifter/1029_test_soft
Viewer
•
Updated
•
23
•
16
Viewer
•
Updated
•
23
•
2
Viewer
•
Updated
•
23
•
8
MisDrifter/game_stage3_base
Viewer
•
Updated
•
500
•
9
MisDrifter/1019_Qwen__Qwen2.5-1.5B-Instruct
Viewer
•
Updated
•
10
•
16
MisDrifter/1019_Qwen__Qwen2.5-3B-Instruct
Viewer
•
Updated
•
10
•
3
Viewer
•
Updated
•
10
•
4
MisDrifter/1013_chk_mean_maxlenp_1024_beta_1.0_nocheck_tokenized
Viewer
•
Updated
•
21
•
14
MisDrifter/1013_7b_mean_maxlenp_1024_beta_1.0_nocheck_tokenized
Viewer
•
Updated
•
21
•
5