3 45 15

Beichen Zhang

BeichenZhang

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

liked a dataset 9 days ago

internlm/DL3DV-2k

liked a dataset 9 days ago

internlm/ETCHR-GRPO-10K

View all activity

Organizations

None yet

upvoted a paper 1 day ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

Paper • 2606.14777 • Published 7 days ago • 168

liked 3 datasets 9 days ago

liked a model 9 days ago

internlm/ETCHR-FLUX.2-klein-9B

Image-to-Image • Updated 23 days ago • 144 • 8

upvoted a paper 23 days ago

ETCHR: Editing To Clarify and Harness Reasoning

Paper • 2605.23897 • Published 26 days ago • 13

updated a dataset 23 days ago

BeichenZhang/ETCHR-SFT-400K

Viewer • Updated 23 days ago • 405k • 513 • 2

published a dataset 24 days ago

BeichenZhang/ETCHR-SFT-400K

Viewer • Updated 23 days ago • 405k • 513 • 2

liked a model 28 days ago

internlm/CapRL-Qwen3VL-4B

Image-Text-to-Text • 4B • Updated Apr 16 • 396 • 12

liked a dataset 3 months ago

internlm/WildClawBench

Benchmark • Updated May 15 • 11.7k • 61

upvoted 2 papers 3 months ago

Visual-ERM: Reward Modeling for Visual Equivalence

Paper • 2603.13224 • Published Mar 13 • 21

EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models

Paper • 2603.12252 • Published Mar 12 • 12

upvoted 2 papers 4 months ago

DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing

Paper • 2602.12205 • Published Feb 13 • 83

Unified Personalized Reward Model for Vision Generation

Paper • 2602.02380 • Published Feb 2 • 20

upvoted a paper 6 months ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 50

upvoted 2 papers 7 months ago

ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation

Paper • 2512.03036 • Published Dec 2, 2025 • 22

Think Visually, Reason Textually: Vision-Language Synergy in ARC

Paper • 2511.15703 • Published Nov 19, 2025 • 9

commented a paper 7 months ago

Think Visually, Reason Textually: Vision-Language Synergy in ARC

Paper • 2511.15703 • Published Nov 19, 2025 • 9 •

upvoted 2 papers 8 months ago

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

Paper • 2510.27606 • Published Oct 31, 2025 • 31

STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence

Paper • 2510.24693 • Published Oct 28, 2025 • 19

Beichen Zhang

AI & ML interests

Recent Activity

Organizations

BeichenZhang's activity