Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
2
Xintong Li
Kaylee0501
Follow
vintropl's profile picture
1 follower
·
1 following
https://kaylee0501.github.io/
XintongLi0501
Kaylee0501
xintong-li-970ab31b5
AI & ML interests
NLP, Multimodal, LLM Reasoning
Recent Activity
upvoted
a
paper
7 days ago
Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning
updated
a model
9 days ago
Kaylee0501/qwen2_7b_grpo_150
published
a model
9 days ago
Kaylee0501/qwen2_7b_grpo_150
View all activity
Organizations
Kaylee0501
's models
25
Sort: Recently updated
Kaylee0501/qwen2_7b_grpo_150
8B
•
Updated
9 days ago
•
43
Kaylee0501/qwen2_vl_7b_COT_grpo_LLM-judge_nat_460
8B
•
Updated
10 days ago
•
19
Kaylee0501/qwen2_vl_7b_COT_grpo_LLM-judge_930
8B
•
Updated
10 days ago
•
19
Kaylee0501/qwen2_vl_7b_COT_grpo_800
8B
•
Updated
10 days ago
•
176
Kaylee0501/qwen2_vl_7b_COT_grpo_LLM-judge_nat_690
8B
•
Updated
10 days ago
•
19
Kaylee0501/qwen3_vl_8b_COT_grpo_LLM-judge_400
9B
•
Updated
10 days ago
•
33
Kaylee0501/qwen3_vl_8b_wo-COT_grpo_800
9B
•
Updated
10 days ago
•
21
Kaylee0501/qwen3_vl_8b_COT_grpo_LLM-judge_nat_680
9B
•
Updated
10 days ago
•
14
Kaylee0501/qwen3_vl_8b_wo-COT_grpo_90
9B
•
Updated
11 days ago
•
21
Kaylee0501/qwen3_vl_8b_COT_grpo_reward0.3_90
9B
•
Updated
11 days ago
•
21
Kaylee0501/qwen3_vl_8b_COT_grpo_180
9B
•
Updated
11 days ago
•
24
Kaylee0501/qwen2_vl_7b_COT_grpo_reward0.3_110
8B
•
Updated
11 days ago
•
43
Kaylee0501/qwen2_vl_7b_COT_grpo_490
8B
•
Updated
11 days ago
•
54
Kaylee0501/Qwen2.5-Coder-32B-Instruct-SFT-planning3
33B
•
Updated
Mar 18
•
1
Kaylee0501/Qwen2.5-Coder-32B-Instruct-SFT-planning2
33B
•
Updated
Mar 17
•
2
Kaylee0501/Qwen2.5-Coder-32B-Instruct-SFT-planning1
33B
•
Updated
Mar 17
•
1
Kaylee0501/Qwen2.5-Coder-32B-Instruct-onpolicy
Updated
Feb 26
Kaylee0501/Qwen2.5-Coder-32B-Instruct-llm
Updated
Feb 26
Kaylee0501/Qwen2.5-Coder-32B-Instruct-edit
Updated
Feb 26
•
1
Kaylee0501/Qwen3-Coder-30B-A3B-Instruct-DPO-llm
Updated
Feb 19
Kaylee0501/Qwen3-Coder-30B-A3B-Instruct-DPO-Edit
Updated
Feb 19
•
2
Kaylee0501/Qwen3-Coder-30B-A3B-Instruct-SFT2
Updated
Feb 19
•
1
Kaylee0501/Qwen3-Coder-30B-A3B-Instruct-SFT1
Updated
Feb 19
Kaylee0501/trained_models_llava8b_from_aot
8B
•
Updated
Sep 25, 2025
•
1
Kaylee0501/trained_models_qwen3_shorten
4B
•
Updated
Sep 24, 2025
•
2