Jiaming Wang

Jessamine

6 28

AI & ML interests

None yet

Recent Activity

commented on a paper 24 days ago

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

authored a paper 26 days ago

CoVEBench: Can Video Editing Models Handle Complex Instructions?

upvoted a paper 26 days ago

OmniCap-IF: Benchmarking and Improving Instruction Following Abilities for Omni-Video Captioning


Organizations

commented on a paper 24 days ago

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

Paper • 2606.02060 • Published Jun 1 • 57

authored a paper 26 days ago

CoVEBench: Can Video Editing Models Handle Complex Instructions?

Paper • 2606.08415 • Published 28 days ago • 51

upvoted 2 papers 26 days ago

OmniCap-IF: Benchmarking and Improving Instruction Following Abilities for Omni-Video Captioning

Paper • 2606.08572 • Published 28 days ago • 14

On the Geometry of On-Policy Distillation

Paper • 2606.07082 • Published about 1 month ago • 75

commented on a paper 26 days ago

CoVEBench: Can Video Editing Models Handle Complex Instructions?

Paper • 2606.08415 • Published 28 days ago • 51

updated a collection 26 days ago

Video Edit

Collection

Video Edit Paper • 2 items • Updated 26 days ago

upvoted a paper 26 days ago

CoVEBench: Can Video Editing Models Handle Complex Instructions?

Paper • 2606.08415 • Published 28 days ago • 51

submitted a paper to Daily Papers 26 days ago

CoVEBench: Can Video Editing Models Handle Complex Instructions?

Paper • 2606.08415 • Published 28 days ago • 51

New activity in NJU-LINK/CoVEBench 26 days ago

UPDATE CITATION

#3 opened 26 days ago by

Jessamine

Update README.md

#2 opened 26 days ago by

Jessamine

upvoted 2 papers 27 days ago

Socratic-SWE: Self-Evolving Coding Agents via Trace-Derived Agent Skills

Paper • 2606.07412 • Published about 1 month ago • 12

UniSHARP: Universal Sharp Monocular View Synthesis

Paper • 2606.07514 • Published about 1 month ago • 14

commented on a paper about 1 month ago

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

Paper • 2606.02060 • Published Jun 1 • 57

updated a dataset about 1 month ago

NJU-LINK/TELBench

Updated Jun 4 • 10.5k • 2

updated a collection about 1 month ago

Agent Papers

Collection

4 items • Updated Jun 4 • 1

upvoted a collection about 1 month ago

Agent Papers

Collection

4 items • Updated Jun 4 • 1
