arxiv:2502.04313
Ilze Amanda Auzina
iaa01
AI & ML interests
RL Post-Training | Reasoning and Exploration | Open-ended
Recent Activity
updated a model about 23 hours ago: iaa01/qwen3-4b-elicit-pos
published a model about 23 hours ago: iaa01/qwen3-4b-elicit-pos
updated a model about 1 month ago: iaa01/llama-8b-merge-alpha1-freq10