sleeepeer/sleeepeer-OPI-SEP-warmup-pisanitizer-dolly_OPI-SEP-2_alpacafarm-42-202601102247 Updated 28 days ago
sleeepeer/sleeepeer-OPI-SEP-warmup-pisanitizer-dolly_OPI-SEP-2_alpacafarm-42-202601102246 Updated 28 days ago
sleeepeer/sleeepeer-OPI-SEP-warmup-pisanitizer-dolly_OPI-SEP-2_alpacafarm-42-202601102244 Updated 28 days ago
sleeepeer/sleeepeer-OPI-SEP-warmup-pisanitizer-dolly_OPI-SEP-2_alpacafarm-42-202601102242 Updated 28 days ago
sleeepeer/sleeepeer-OPI-SEP-warmup-pisanitizer-dolly_OPI-SEP-2_alpacafarm-42-202601102239 Updated 28 days ago
sleeepeer/sleeepeer-OPI-SEP-warmup-pisanitizer-dolly_OPI-SEP-2_alpacafarm-42-202601102236 Updated 28 days ago
sleeepeer/meta-llama-Llama-3.1-8B-Instruct-pisanitizer-dolly_OPI-SEP-2_alpacafarm-42-202601101739 Updated 28 days ago
sleeepeer/Llama-3.1-8B-Instruct-pisanitizer-MIX-0110-42 Text Generation • 8B • Updated 28 days ago • 74
sleeepeer/meta-llama-Llama-3.1-8B-Instruct-pisanitizer-squad_v2-sanitization-42-202601082138 Text Generation • 8B • Updated 30 days ago • 92
sleeepeer/meta-llama-Llama-3.1-8B-Instruct-pisanitizer-squad_v2-llm-judge-42-20260108-1706 Text Generation • 8B • Updated 30 days ago • 111
sleeepeer/meta-llama-Llama-3.1-8B-Instruct-pisanitizer-squad_v2-llm-judge-42 Text Generation • 8B • Updated 30 days ago • 11
sleeepeer/meta-llama-Meta-Llama-3-8B-Instruct-DPO-dpo_anchor_3epoch_llama3_2000-42 Updated Oct 4, 2025
sleeepeer/meta-llama-Llama-3.1-8B-Instruct-DPO-dpo_anchor_3epoch_no_instruction-42 Updated Oct 3, 2025
sleeepeer/Llama-3.1-8B-Instruct-GRPO-alpaca_mix_combine_naive-llm-judge-42 Text Generation • 8B • Updated Jul 16, 2025 • 2
sleeepeer/Llama-3.1-8B-Instruct-GRPO-alpaca_mix_combine_naive_least_similar-llm-judge-42 Text Generation • 8B • Updated Jul 16, 2025 • 3