Hugging Face
John Schaefer
johnschaefer
AI & ML interests
None yet
Recent Activity
upvoted a paper 1 day ago
Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories
upvoted a paper 1 day ago
VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding
upvoted a paper about 1 month ago
Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems
Organizations
None yet
models (10)
johnschaefer/GRPO-olmo3_7b_creativity_grpo_purerl (Updated Apr 11)
johnschaefer/GRPO-olmo3_7b_physics_grpo_purerl (Updated Apr 11)
johnschaefer/GRPO-qwen3_8b_creativity_grpo_purerl (Updated Apr 11)
johnschaefer/GRPO-qwen3_8b_creativity_grpo_weighted_mul (Updated Apr 11)
johnschaefer/GRPO-qwen3_8b_physics_grpo_purerl (Updated Apr 11)
johnschaefer/DAPO-RLVR-with-full-tokens-Qwen3-8B (Updated Apr 11)
johnschaefer/GRPO-olmo3_7b_physics_grpo_weighted_mul (Updated Apr 11)
johnschaefer/GRPO-qwen3_8b_physics_grpo_weighted_mul (Updated Apr 11)
johnschaefer/DAPO-RLVR-with-only-high-entropy-tokens-Qwen3-8B (Updated Apr 11)
johnschaefer/GRPO-olmo3_7b_creativity_grpo_weighted_mul (Updated Apr 11)
datasets (0)
None public yet