AI & ML interests
None defined yet.
Recent Activity
Papers
Light Interaction: Training-Free Inference Acceleration for Interactive Video World Models
Why Far Looks Up: Probing Spatial Representation in Vision-Language Models
Articles
NV-Generate Synthetic Medical Imaging
Synthetic 3D CT and MR generation with NVIDIA NV-Generate.
Music Flamingo
Analyze music and answer questions from audio or YouTube links
VoMP
Volumetric physics materials for interactive worlds
LLM RTL Coding Errors Explainer
NVR - How LLMs Fail and Generalize in RTL Coding
Kimodo
Generate high-quality motions from text prompts
KVPress Leaderboard
KVPress leaderboard: benchmark KV Cache compression methods
Audio Flamingo 3 Demo
Audio Flamingo 3 Demo
Judge's Verdict Leaderboard
Judge's Verdict: Benchmarking LLM as a Judge
Llm Robustness Leaderboard
LLM Robustness leaderboard
Cosmos3 Action Viewer
Open and explore the Viser web client
LocateAnything
Detect and label objects in images and videos
Simready Validator
Validate a HuggingFace dataset against a SimReady profile
ProfBench
Human-annotated rubrics in Professional Tasks
GeoTransolver DrivAerML Demo
Predict pressure, shear stress, drag & lift on car models
Parakeet-TDT-0.6b-V2
Transcribe audio files with timestamps and downloadable subtitles
Parakeet TDT 0.6b V3
Transcribe Speech with Multilingual parakeet-tdt-0.6b-v3
RE USE
A universal speech enhancement model for diverse degradation
NV-Reason-CXR-3B Demo
Analyze chest X‑ray images and get detailed medical findings
Magpietts Demo
Generate multilingual speech from text
NVIDIA Hugging Face Organization
Asset Harvester
Image-to-3D for autonomous-vehicle simulation assets
Audio Flamingo Next
Answer questions about uploaded audio or YouTube videos
Audio Flamingo Next Captioner
Generate detailed captions and summaries for audio or YouTube videos
Audio Flamingo Next Think
Generate timestamped answers from audio or YouTube videos