-
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
Paper • 2402.10986 • Published • 82 -
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Paper • 2408.02657 • Published • 35 -
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale
Paper • 2508.10711 • Published • 146 -
Qwen3-Omni Technical Report
Paper • 2509.17765 • Published • 153
Charles Cai
charlescai2016
AI & ML interests
None yet
Recent Activity
liked a dataset 2 days ago
allenai/olmOCR-bench liked a model 2 days ago
datalab-to/chandra-ocr-2 upvoted an article 2 days ago
How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs