arxiv:2508.19205
zhiliang
zzliang
AI & ML interests
multimodal
Recent Activity
upvoted a paper 2 days ago
Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models
upvoted a paper about 2 months ago
Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding
upvoted a paper 2 months ago
Online Experiential Learning for Language Models