Reinforcement Learning
Transformers
Safetensors
qwen2
text-generation
ufb
fsdp
text-generation-inference
Instructions to use ZihanWang314/test_global_step_10_ragen with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use ZihanWang314/test_global_step_10_ragen with Transformers:
# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("ZihanWang314/test_global_step_10_ragen") model = AutoModelForCausalLM.from_pretrained("ZihanWang314/test_global_step_10_ragen") - Notebooks
- Google Colab
- Kaggle
test_global_step_10_ragen
This model was exported from a UFO training checkpoint.
- Base model:
Qwen/Qwen2.5-3B-Instruct - Source checkpoint:
/workspace/ufb/outputs/checkpoints/exp1_MetamathQA/top_k/global_step_10/actor - Exported step:
global_step_10
Usage
from transformers import AutoModelForCausalLM, AutoTokenizer
model_name = "/workspace/ufb/outputs/hf/test_global_step_10_ragen"
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_name, trust_remote_code=True)
- Downloads last month
- 1