AI & ML interests
None defined yet.
Recent Activity
Papers
FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling
MoRight: Motion Control Done Right
Articles
Music Flamingo
Analyze music and answer questions from audio or YouTube links
VoMP
Volumetric physics materials for interactive worlds
LLM RTL Coding Errors Explainer
NVR - How LLMs Fail and Generalize in RTL Coding
Kimodo
Generate high-quality motions from text prompts
KVPress Leaderboard
KVPress leaderboard: benchmark KV Cache compression methods
Audio Flamingo 3 Demo
Audio Flamingo 3 Demo
Judge's Verdict Leaderboard
Judge's Verdict: Benchmarking LLM as a Judge
Llm Robustness Leaderboard
LLM Robustness leaderboard
Nemotron OCR v2
Extract text and bounding boxes from images
ProfBench
Human-annotated rubrics in Professional Tasks
Audio Flamingo Next
Generate detailed answers about any audio or YouTube video
RE USE
A universal speech enhancement model for diverse degradation
Magpietts Demo
Generate natural speech from text in multiple languages
MMOU Eval
Evaluate prediction files against MMOU benchmark data
Cosmos Embed1
Cosmos-Embed1 demo app
Parakeet-TDT-0.6b-V2
Transcribe audio files with timestamps and download transcripts
Aic Demo
Configure and estimate AI model performance for deployment
Earth2 Inference Demo
Visualize weather forecasts for any date and time range
Nemotron Speech Streaming
Real-time speech recognition with NVIDIA Triton
Difix3D
Interface to interact with NVIDIA's Difix3D+ model
Parakeet-tdt_ctc-1.1b
Transcribe audio with timestamps
DoMINO with Ahmed Body Dataset - Multi-Scale Neural Operator for CFD
Access JupyterLab for interactive coding
Voice Agent WebRTC + LangGraph
Voice agent with LangGraph, WebRTC, ASR & TTS
NV-Reason-CXR-3B Demo
Analyze chest X-rays and answer your medical questions