kangdawei commited on
Commit
c2892b6
·
verified ·
1 Parent(s): 36b6839

Training in progress, step 200

Browse files
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a781e3aef45fa1e809cf060aa100e2170c8f018aed60b767c3ebe4fc56a5d81e
3
  size 3554214752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6bd5edfcc9cd8360d40be777e38400fbb09142ebfeebe305d37dfac3dd5ef29f
3
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e2ae14ffe97dec71eb500030c0fa8d5f736c14eaf52ef4f6ff4e495b34e5a734
3
- size 128037689
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:201db9aeafea34bffbbed038550d9ee9c0601060d195574adcaff84f32fc9e38
3
+ size 162266208
reward_plots/advantage_plot_step_150.png ADDED
reward_plots/advantage_plot_step_160.png ADDED
reward_plots/advantage_plot_step_170.png ADDED
reward_plots/advantage_plot_step_180.png ADDED
reward_plots/advantage_plot_step_190.png ADDED
reward_plots/reward_comparison_step_150.png ADDED
reward_plots/reward_comparison_step_160.png ADDED
reward_plots/reward_comparison_step_170.png ADDED
reward_plots/reward_comparison_step_180.png ADDED
reward_plots/reward_comparison_step_190.png ADDED
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bcd3ddd48f2fd39f8e5dfb1f680f3d32c60c9c8e6913a69ab0306059bc98d643
3
  size 8504
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c35427105f08dd66e4f9765e50c93467db2ad339acfa5baa425376f928e85763
3
  size 8504