
Hengyi Wang

aaronwhy
https://www.linkedin.com/in/hengyi-wang-86605b175/
  • AaronWhy

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 7 months ago

R-WoM: Retrieval-augmented World Model For Computer-use Agents

Paper • 2510.11892 • Published Oct 13, 2025 • 23
upvoted a paper about 1 year ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published Mar 6, 2025 • 97
upvoted a paper over 1 year ago

Unifying Specialized Visual Encoders for Video Language Models

Paper • 2501.01426 • Published Jan 2, 2025 • 20
upvoted 2 papers almost 2 years ago

Probabilistic Conceptual Explainers: Trustworthy Conceptual Explanations for Vision Foundation Models

Paper • 2406.12649 • Published Jun 18, 2024 • 16

Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models

Paper • 2406.11230 • Published Jun 17, 2024 • 33