Hugging Face
Jonatan Borkowski
PRO
j14i
6 followers · 23 following
jborkowski
AI & ML interests
None yet
Recent Activity
reacted to sergiopaniego's post with ❤️ about 3 hours ago
This super detailed tutorial by @Paulescu is pure gold: "Fine-tuning a Small Language Model for browser control with GRPO and OpenEnv". LFM2-350M (@LiquidAI) + BrowserGym (OpenEnv) + GRPO (TRL) for learning browser control. https://paulabartabajo.substack.com/p/fine-tuning-lfm2-350m-for-browser
liked a Space 1 day ago
hysts/daily-papers
reacted to sergiopaniego's post 2 days ago
Google DeepMind releases FunctionGemma, a 240M model specialized in tool calling and built for fine-tuning. TRL has day-0 support. To celebrate, we're sharing 2 new resources:
> Colab guide to fine-tune it for browser control with BrowserGym OpenEnv
> Standalone training script
> Colab notebook: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_functiongemma_browsergym_openenv.ipynb
> Training script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/browsergym_llm.py (command to run it inside the script)
> More notebooks in TRL: https://huggingface.co/docs/trl/example_overview#notebooks
Organizations
j14i's models: 1
j14i/fa_agents
Updated 4 days ago