7 22 8

Zhengyang Tang

tangzhy

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago

Safe, or Simply Incapable? Rethinking Safety Evaluation for Phone-Use Agents

submitted a paper 2 days ago

Safe, or Simply Incapable? Rethinking Safety Evaluation for Phone-Use Agents

authored a paper 10 days ago

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

View all activity

Organizations

authored a paper 1 day ago

Safe, or Simply Incapable? Rethinking Safety Evaluation for Phone-Use Agents

Paper • 2605.07630 • Published 6 days ago

submitted a paper to Daily Papers 2 days ago

Safe, or Simply Incapable? Rethinking Safety Evaluation for Phone-Use Agents

Paper • 2605.07630 • Published 6 days ago

authored a paper 10 days ago

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

Paper • 2604.28139 • Published 14 days ago • 42

upvoted a paper 13 days ago

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

Paper • 2604.28139 • Published 14 days ago • 42

authored a paper 24 days ago

Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning

Paper • 2604.16029 • Published 27 days ago • 23

upvoted a paper 24 days ago

Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning

Paper • 2604.16029 • Published 27 days ago • 23

upvoted a paper 29 days ago

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

Paper • 2604.10866 • Published Apr 13 • 65

authored a paper about 1 month ago

Do Phone-Use Agents Respect Your Privacy?

Paper • 2604.00986 • Published Apr 1 • 9

upvoted a paper about 1 month ago

Do Phone-Use Agents Respect Your Privacy?

Paper • 2604.00986 • Published Apr 1 • 9

submitted a paper to Daily Papers about 1 month ago

Do Phone-Use Agents Respect Your Privacy?

Paper • 2604.00986 • Published Apr 1 • 9

upvoted a paper 3 months ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published Feb 5 • 353

authored 2 papers 3 months ago

Teaching Language Models to Reason with Tools

Paper • 2510.20342 • Published Oct 23, 2025

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published Feb 2 • 270

upvoted a paper 3 months ago

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published Feb 2 • 270

published a model 4 months ago

tangzhy/STORM-Qwen3-4B

4B • Updated Oct 7, 2025 • 6

upvoted a paper 5 months ago

DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry

Paper • 2512.11558 • Published Dec 12, 2025 • 45

upvoted a paper 7 months ago

CALM Before the STORM: Unlocking Native Reasoning for Optimization Modeling

Paper • 2510.04204 • Published Oct 5, 2025 • 21

commented a paper 7 months ago

CALM Before the STORM: Unlocking Native Reasoning for Optimization Modeling

Paper • 2510.04204 • Published Oct 5, 2025 • 21 •

authored a paper 7 months ago

CALM Before the STORM: Unlocking Native Reasoning for Optimization Modeling

Paper • 2510.04204 • Published Oct 5, 2025 • 21

updated a model 7 months ago

tangzhy/STORM-Qwen3-4B

4B • Updated Oct 7, 2025 • 6

Zhengyang Tang

AI & ML interests

Recent Activity

Organizations

tangzhy's activity