Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
24
2
8
Tim Lawson
tim-lawson
Follow
https://timlawson.dev
tslwn
tim-lawson
timothyspencerlawson
AI & ML interests
language models, interpretability
Organizations
None yet
tim-lawson
's models
441
Sort: Recently updated
tim-lawson/pretrain_llama_tinystories_v8192_d64_l12_h8
1.84M
•
Updated
Oct 6, 2025
•
7
tim-lawson/pretrain_llama_tinystories_v4096_d768_l8_h8
81.8M
•
Updated
Oct 6, 2025
•
4
tim-lawson/pretrain_llama_tinystories_v4096_d512_l12_h8
54.5M
•
Updated
Oct 6, 2025
•
4
tim-lawson/pretrain_llama_tinystories_v4096_d768_l4_h8
44M
•
Updated
Oct 6, 2025
•
4
tim-lawson/pretrain_llama_tinystories_v4096_d256_l12_h8
14.7M
•
Updated
Oct 6, 2025
•
7
tim-lawson/pretrain_llama_tinystories_v4096_d768_l2_h8
25.2M
•
Updated
Oct 6, 2025
•
4
tim-lawson/pretrain_llama_tinystories_v4096_d128_l12_h8
4.2M
•
Updated
Oct 6, 2025
•
4
tim-lawson/pretrain_llama_tinystories_v4096_d64_l12_h8
1.31M
•
Updated
Oct 6, 2025
•
5
tim-lawson/pretrain_llama_tinystories_v4096_d768_l1_h8
15.7M
•
Updated
Oct 6, 2025
•
4
tim-lawson/pretrain_llama_tinystories_v4096_d1024_l12_h8
Updated
Oct 6, 2025
tim-lawson/pretrain_llama_tinystories_v4096_d1024_l12_h12
Updated
Oct 6, 2025
tim-lawson/pretrain_llama_tinystories_v4096_d128_l12_h12
Updated
Oct 6, 2025
tim-lawson/pretrain_llama_tinystories_v4096_d64_l12_h12
Updated
Oct 6, 2025
tim-lawson/pretrain_llama_tinystories_v4096_d256_l12_h12
Updated
Oct 6, 2025
tim-lawson/pretrain_llama_tinystories_v4096_d768_l2_h12
25.2M
•
Updated
Oct 6, 2025
•
4
tim-lawson/pretrain_llama_tinystories_v4096_d768_l1_h12
15.7M
•
Updated
Oct 6, 2025
•
5
tim-lawson/pretrain_llama_tinystories_v4096_d512_l12_h12
Updated
Oct 6, 2025
tim-lawson/pretrain_llama_tinystories_v4096_d768_l6_h12
Updated
Oct 6, 2025
tim-lawson/pretrain_llama_tinystories_v4096_d768_l4_h12
Updated
Oct 6, 2025
tim-lawson/pretrain_llama_tinystories_v4096_d768_l10_h12
Updated
Oct 6, 2025
tim-lawson/pretrain_llama_tinystories_v4096_d768_l8_h12
Updated
Oct 6, 2025
tim-lawson/pretrain_gemma3_c4_kl_270m_1b-pt_weights
0.3B
•
Updated
Sep 29, 2025
•
6
tim-lawson/pretrain_gemma3_slimpajama_v2
0.3B
•
Updated
Sep 29, 2025
•
10
tim-lawson/pretrain_gemma3_c4_kl_270m_1b-pt_topk_0.75
0.3B
•
Updated
Sep 29, 2025
•
6
tim-lawson/pretrain_gemma3_fineweb_v2
0.3B
•
Updated
Sep 29, 2025
•
11
tim-lawson/pretrain_gemma3_openwebtext_v2
0.3B
•
Updated
Sep 29, 2025
•
12
tim-lawson/pretrain_gemma3_c4_kl_270m_1b-pt_topk_0.5
0.3B
•
Updated
Sep 29, 2025
•
6
tim-lawson/pretrain_gemma3_c4_kl_270m_1b-pt_topk_0.25
0.3B
•
Updated
Sep 29, 2025
•
6
tim-lawson/pretrain_gemma3_c4_v2
0.3B
•
Updated
Sep 28, 2025
•
11
tim-lawson/pretrain_gemma3_pile_v2
0.3B
•
Updated
Sep 28, 2025
•
12
Previous
1
2
3
4
...
15
Next