AI & ML interests

None defined yet.

Recent Activity

Chris-Alexiuk  updated a dataset about 16 hours ago
nvidia/Nemotron-RL-Agentic-SWE-Pivot-v1
zhiyucheng  updated a model about 17 hours ago
nvidia/GLM-5.2-NVFP4
donglaix  updated a Space about 20 hours ago
nvidia/PhysicalAI-Robotics-VoMP-Demo
View all activity

Articles

View all articles

nvidia 's collections 118

NVIDIA Nemotron v3
Open, Production-ready Enterprise Models
Inference Optimized Checkpoints (with Model Optimizer)
A collection of generative models quantized and optimized for inference with Model Optimizer.
Nemotron Chat & Instruction Following
Datasets for building helpful, multi-turn, instruction-following conversational models across single and multi-turn settings.
Nemotron Speech
Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S
Cosmos2
⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3
Nemotron-Post-Training-v3
Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3.
Nemotron RAG
Set of tools to build retrieval-augmented generation (RAG) systems, improve search and ranking accuracy, and extract structured data from complex docs
Nemotron-Personas
A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions.
OpenReasoning-Nemotron
Collection of models for OpenReasoning-Nemotron which are trained on 5M reasoning traces for Math, Code and Science.
OpenMathReasoning
Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset"
NemoGuard
Essential datasets and models for content safety, topic-following, and security guardrails
Canary ASR/AST
A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤
PS3: Scaling Vision Pre-Training to 4K Resolution
Enabling 4k resolution for VLMs, CVPR 2025, https://nvlabs.github.io/PS3/
Nemotron-Labs-Diffusion
A Tri-Mode Language Model Family Unifying Autoregressive, Diffusion, and Self-Speculation Decoding
Cosmos1
⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3
Nemotron Safety & Content Moderation
Datasets for building safe models with refusals, content moderation, PII detection, agentic safety, and audio safety capabilities.
Nemotron-Cascade 2
Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
MedTech Open Models
Open models for physical AI and medical imaging — robot control, surgical simulation, segmentation, reconstruction, generation, and reasoning.
Clara Medical
NVIDIA Clara Open Models for medical imaging AI: segment, generate, and reason across CT, MRI, and X-ray. Built on MONAI by NVIDIA.
Reward Models 06-2025
Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge
OpenCodeReasoning
Reasoning data for supervised finetuning of LLMs to advance data distillation for competitive coding
Llama Nemotron Feedback-Edit Inference-Time Scaling
Novel ITS approach for open-ended tasks - No. 1 on Arena Hard on 18 Mar 2025
AceMath
We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark.
Eagle
Eagle is a family of frontier vision-language models with data-centric strategies. The model supports both HD image and long-context video input.
Parakeet ASR
NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants.
MambaVision
MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models.
Minitron
A family of compressed models obtained via pruning and knowledge distillation
NVIDIA Nemotron v3
Open, Production-ready Enterprise Models
Nemotron-Labs-Diffusion
A Tri-Mode Language Model Family Unifying Autoregressive, Diffusion, and Self-Speculation Decoding
Inference Optimized Checkpoints (with Model Optimizer)
A collection of generative models quantized and optimized for inference with Model Optimizer.
Cosmos1
⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3
Nemotron Safety & Content Moderation
Datasets for building safe models with refusals, content moderation, PII detection, agentic safety, and audio safety capabilities.
Nemotron Chat & Instruction Following
Datasets for building helpful, multi-turn, instruction-following conversational models across single and multi-turn settings.
Nemotron-Cascade 2
Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
MedTech Open Models
Open models for physical AI and medical imaging — robot control, surgical simulation, segmentation, reconstruction, generation, and reasoning.
Nemotron Speech
Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S
Cosmos2
⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3
Nemotron-Post-Training-v3
Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3.
Nemotron RAG
Set of tools to build retrieval-augmented generation (RAG) systems, improve search and ranking accuracy, and extract structured data from complex docs
Clara Medical
NVIDIA Clara Open Models for medical imaging AI: segment, generate, and reason across CT, MRI, and X-ray. Built on MONAI by NVIDIA.
Nemotron-Personas
A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions.
OpenReasoning-Nemotron
Collection of models for OpenReasoning-Nemotron which are trained on 5M reasoning traces for Math, Code and Science.
Reward Models 06-2025
Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge
OpenMathReasoning
Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset"
OpenCodeReasoning
Reasoning data for supervised finetuning of LLMs to advance data distillation for competitive coding
Llama Nemotron Feedback-Edit Inference-Time Scaling
Novel ITS approach for open-ended tasks - No. 1 on Arena Hard on 18 Mar 2025
AceMath
We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark.
NemoGuard
Essential datasets and models for content safety, topic-following, and security guardrails
Eagle
Eagle is a family of frontier vision-language models with data-centric strategies. The model supports both HD image and long-context video input.
Parakeet ASR
NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants.
Canary ASR/AST
A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤
MambaVision
MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models.
PS3: Scaling Vision Pre-Training to 4K Resolution
Enabling 4k resolution for VLMs, CVPR 2025, https://nvlabs.github.io/PS3/
Minitron
A family of compressed models obtained via pruning and knowledge distillation