ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-F16 • Reinforcement Learning • 8B • Updated Mar 25, 2025 • 439 • 91
NousResearch/DeepHermes-AscensionMaze-RLAIF-8b-Atropos • Reinforcement Learning • 8B • Updated Apr 29, 2025 • 14 • 8
AdityaaXD/Multi-Agent_Reinforcement_Learning_Trading_System_Models • Reinforcement Learning • Updated Feb 1 • 132 • 5
ValueFX9507/Tifa-Deepsex-14b-CoT-Q8 • Reinforcement Learning • 15B • Updated Feb 13, 2025 • 7.79k • 185