Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop 🔄
36.4
TFLOPS
2
Giles Thomas
gpjt
Follow
alishabhale's profile picture
Arp25's profile picture
stochastically's profile picture
11 followers
·
23 following
https://www.gilesthomas.com/
gpjt
gpjt
gilesthomas
gilesthomas.com
AI & ML interests
Doing my best to speedrun 20 years of AI research. YMMV
Recent Activity
updated
a collection
2 days ago
LLM from scratch
updated
a model
2 days ago
gpjt/1xrtx3090-stacked-interventions
published
a model
2 days ago
gpjt/1xrtx3090-stacked-interventions
View all activity
Organizations
None yet
gpjt
's models
33
Sort:Â Recently updated
gpjt/1xrtx3090-stacked-interventions
Text Generation
•
0.2B
•
Updated
2 days ago
•
300
gpjt/1xrtx3090-baseline
Text Generation
•
0.2B
•
Updated
3 days ago
•
204
gpjt/8xa100m40-stacked-interventions-3
Text Generation
•
0.2B
•
Updated
8 days ago
•
195
gpjt/8xa100m40-stacked-interventions-2
Text Generation
•
0.2B
•
Updated
9 days ago
•
253
gpjt/8xa100m40-stacked-interventions-1
Text Generation
•
0.2B
•
Updated
9 days ago
•
242
gpjt/8xa100m40-baseline-8
Text Generation
•
0.2B
•
Updated
9 days ago
•
512
gpjt/8xa100m40-baseline-7
Text Generation
•
0.2B
•
Updated
9 days ago
•
511
gpjt/8xa100m40-baseline-6
Text Generation
•
0.2B
•
Updated
9 days ago
•
501
gpjt/8xa100m40-baseline-4
Text Generation
•
0.2B
•
Updated
9 days ago
•
313
gpjt/8xa100m40-baseline-5
Text Generation
•
0.2B
•
Updated
9 days ago
•
496
gpjt/8xa100m40-baseline-3
Text Generation
•
0.2B
•
Updated
9 days ago
•
563
gpjt/8xa100m40-baseline-2
Text Generation
•
0.2B
•
Updated
9 days ago
•
521
gpjt/8xa100m80-no-amp
Text Generation
•
0.2B
•
Updated
14 days ago
•
252
gpjt/8xa100m40-weight-decay-cerebras
Text Generation
•
0.2B
•
Updated
24 days ago
•
354
gpjt/8xa100m40-weight-decay-gpt2
Text Generation
•
0.2B
•
Updated
24 days ago
•
340
gpjt/8xa100m40-qkv-bias
Text Generation
•
0.2B
•
Updated
24 days ago
•
86
gpjt/8xa100m40-schedule-learning-rate
Text Generation
•
0.2B
•
Updated
24 days ago
•
85
gpjt/8xa100m40-remove-dropout
Text Generation
•
0.2B
•
Updated
24 days ago
•
90
gpjt/8xa100m40-baseline
Text Generation
•
0.2B
•
Updated
24 days ago
•
85
gpjt/1xrtx3090m24-fineweb-edu-2x
Text Generation
•
0.2B
•
Updated
24 days ago
•
79
gpjt/1xrtx3090m24-fineweb-edu
Text Generation
•
0.2B
•
Updated
24 days ago
•
81
gpjt/1xrtx3090m24-fineweb
Text Generation
•
0.2B
•
Updated
24 days ago
•
87
gpjt/8xh100m80-latest
Text Generation
•
0.2B
•
Updated
24 days ago
•
79
gpjt/8xh100m80-best
Text Generation
•
0.2B
•
Updated
24 days ago
•
83
gpjt/8xb200m160
Text Generation
•
0.2B
•
Updated
24 days ago
•
89
gpjt/8xa100m80
Text Generation
•
0.2B
•
Updated
24 days ago
•
86
gpjt/8xa100m40
Text Generation
•
0.2B
•
Updated
24 days ago
•
121
gpjt/8xa100m40-gradient-clipping
Text Generation
•
0.2B
•
Updated
24 days ago
•
81
gpjt/test5
Text Generation
•
0.2B
•
Updated
Jan 28
•
7
gpjt/test4
Text Generation
•
0.2B
•
Updated
Jan 28
•
3
Previous
1
2
Next