H-EmbodVis

university
https://github.com/H-EmbodVis

Recent Activity

LMD0311  authored a paper about 8 hours ago
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models
dkliang  authored a paper about 14 hours ago
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models
dkliang  submitted a paper about 19 hours ago
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Papers

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Team members: Dingkang Liang, Xin Zhou, Cheng, Cheng Zhang, Xianjin-Wu, HENG FANG, Ellery Kant
H-EmbodVis's Papers (5)
Submitted by
Dingkang Liang
104

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

H-EmbodVis
23 2
Submitted by
Dingkang Liang
153

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

H-EmbodVis
220 4
Submitted by
Dingkang Liang
95

Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding

H-EmbodVis
329 5
Submitted by
Dingkang Liang
3

Towards Generalizable Robotic Manipulation in Dynamic Environments

H-EmbodVis
136 2
Submitted by
Dingkang Liang
7

Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution

H-EmbodVis
359 2