Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Mechanist Interpretability for Alignment Algorithms
community
Activity Feed
Follow
5
AI & ML interests
AI Safety, Mechanist Interpretability
Recent Activity
ArthT
updated
a model
8 days ago
MInAlA/Qwen3-4B-ORPO-merged
ArthT
published
a model
9 days ago
MInAlA/Qwen3-4B-ORPO-merged
ArthT
updated
a model
9 days ago
MInAlA/Llama-3.2-3B-ORPO-merged
View all activity
Team members
5
MInAlA
's models
15
Sort: Recently updated
MInAlA/Llama-3.2-3B-SimPO-merged
Text Generation
•
3B
•
Updated
1 day ago
•
263
MInAlA/Qwen3-4B-Instruct-2507-SimPO-merged
Text Generation
•
4B
•
Updated
1 day ago
•
21
MInAlA/SmolLM3-3B-SimPO-merged
Text Generation
•
3B
•
Updated
1 day ago
•
10
MInAlA/Llama-3.2-3B-Instruct-GRPO-merged
Text Generation
•
3B
•
Updated
3 days ago
•
27
MInAlA/Qwen3-4B-Instruct-2507-GRPO-merged
Text Generation
•
4B
•
Updated
4 days ago
•
190
MInAlA/SmolLM3-3B-GRPO-merged
Text Generation
•
3B
•
Updated
7 days ago
•
12
MInAlA/Llama-3.2-3B-Instruct-KTO-merged
Text Generation
•
3B
•
Updated
7 days ago
•
226
MInAlA/Qwen3-4B-Instruct-2507-KTO-merged
Text Generation
•
4B
•
Updated
8 days ago
•
20
MInAlA/Qwen3-4B-ORPO-merged
4B
•
Updated
8 days ago
•
34
MInAlA/Llama-3.2-3B-ORPO-merged
Text Generation
•
Updated
9 days ago
•
387
MInAlA/SmolLM3-3B-KTO-merged
Text Generation
•
3B
•
Updated
9 days ago
•
312
MInAlA/SmolLM3-3B-ORPO-merged
Text Generation
•
3B
•
Updated
13 days ago
•
212
MInAlA/Llama-3.2-3B-DPO-merged
Text Generation
•
3B
•
Updated
14 days ago
•
300
MInAlA/Qwen3-4B-Instruct-2507-DPO-merged
Text Generation
•
4B
•
Updated
14 days ago
•
360
MInAlA/SmolLM3-3B-DPO-merged
Text Generation
•
3B
•
Updated
14 days ago
•
623