MultiRL

non-profit

Recent Activity

KimSHine updated a model about 8 hours ago

MultiRL/qwen3_1.7b_sudoku_multi_action_easy_21_30

KimSHine published a model about 8 hours ago

MultiRL/qwen3_1.7b_sudoku_multi_action_easy_21_30

KimSHine updated a model about 8 hours ago

MultiRL/qwen3_1.7b_sudoku_multi_action_easy_21_30_epoch3

MultiRL's models: 137

MultiRL/qwen3_4b_base_sft_final

4B • Updated Dec 17, 2025 • 75

MultiRL/qwen3_4b_easy_rl_new

4B • Updated Dec 16, 2025 • 74

MultiRL/qwen3_1.7b_easy_rl_gspo

2B • Updated Dec 16, 2025 • 4

MultiRL/qwen3_4b_sft_new

4B • Updated Dec 15, 2025 • 52

MultiRL/qwen3_1.7b_easy_rl_final_step120

2B • Updated Dec 15, 2025 • 237

MultiRL/qwen3_4b_medium_rl_final

4B • Updated Dec 15, 2025 • 167

MultiRL/qwen3_4b_sft_one_act

4B • Updated Dec 14, 2025 • 54

MultiRL/qwen3_1.7b_easy_rl_reinforce_ori

2B • Updated Dec 14, 2025 • 89

MultiRL/qwen3_1.7b_easy_rl_reinforce_alpha_0.5

2B • Updated Dec 14, 2025 • 4

MultiRL/qwen3_1.7b_easy_rl_reinforce_alpha_1

2B • Updated Dec 14, 2025 • 4

MultiRL/qwen3_1.7b_easy_rl_reinforce_alpha_0

2B • Updated Dec 14, 2025 • 3

MultiRL/qwen3_1.7b_sft_one_act

2B • Updated Dec 14, 2025 • 99

MultiRL/qwen3_1.7b_easy_rl_final

2B • Updated Dec 13, 2025 • 866

MultiRL/qwen3_4b_easy_rl_final

4B • Updated Dec 13, 2025 • 56

MultiRL/qwen3_1.7b_sft_final

2B • Updated Dec 11, 2025 • 2.91k

MultiRL/qwen3_4b_sft_final

4B • Updated Dec 11, 2025 • 77

MultiRL/qwen3_1.7b_easy_rl_new

2B • Updated Dec 6, 2025 • 1

MultiRL/qwen3_4b_standard_medium_rl

4B • Updated Dec 6, 2025 • 43

MultiRL/qwen3_4b_standard_easy_rl

4B • Updated Dec 5, 2025 • 46

MultiRL/qwen3_4b_medium_rl_progress_C

4B • Updated Dec 5, 2025

MultiRL/qwen3_4b_medium_rl

4B • Updated Dec 4, 2025 • 42

MultiRL/qwen3_4b_instruct_sft

4B • Updated Dec 1, 2025 • 57

MultiRL/qwen3_1.7b_easy_rl_test_task_group

2B • Updated Dec 1, 2025

MultiRL/qwen3_1.7b_easy_rl_test

2B • Updated Nov 30, 2025 • 36

MultiRL/qwen3_1.7b_sudoku_sft

2B • Updated Nov 28, 2025 • 99

MultiRL/qwen3_1.7b_easy_reinforce_batch_32_by_pass

2B • Updated Nov 26, 2025 • 10

MultiRL/qwen3_1.7b_easy_reinforce_batch_64_by_pass

2B • Updated Nov 25, 2025

MultiRL/qwen3_1.7b_easy_reinforce_test

2B • Updated Nov 23, 2025

MultiRL/qwen3_1.7b_C_easy_gspo_test

2B • Updated Nov 22, 2025 • 1

MultiRL/qwen3_1.7b_base_C_normal_short_sft_lr_1e_5_C_easy_grpo_step70

2B • Updated Nov 17, 2025 • 1