< 1B – 30B MoE

NVIDIA

The Nemotron 3 family — efficient open models for reasoning, multimodal, speech and document AI.

NVIDIA’s Nemotron 3 family spans tiny edge models to MoE reasoning models — all efficient, open, and built to run on hardware you can actually get. Mix and match across reasoning, multimodal, speech and document extraction.

The kit, sized small

Nemotron 3 Nano

30B total · 3B active MoE — efficient reasoning for long-running agents.

  • MoE
  • reasoning
30B parameters32B cap
Nemotron 3 Nano 4B

Edge-optimised 4B model for constrained hardware.

  • edge
  • text
4B parameters32B cap
Nemotron 3 Nano Omni

Multimodal nano model across modalities.

  • omni
  • multimodal
Nemotron 3 ASR

Speech recognition built for real-time use.

  • speech
  • ASR
Nemotron Parse

Sub-1B parameter document extraction.

  • documents
  • extraction
1B parameters32B cap
Nemotron Embed VL

Vision-language embeddings for retrieval & search.

  • embeddings
  • vision

If you want to build…

A long-running agent / reasoning app
Nemotron 3 Nano3B active MoE
To run on the edge
Nemotron 3 Nano 4BEdge-ready
A multimodal app
Nemotron 3 Nano OmniOmni-modal
Speech recognition
Nemotron 3 ASRReal-time ASR
Document extraction
Nemotron Parse< 1B params
STARTER SPACE
Nemotron-3 Nano Omni demo

Fork this Gradio Server Space to start from a working Nemotron-3 Nano Omni app.

Fork it
NVIDIA · SPONSOR PRIZE

Nemotron Hardware Prize

To qualify ·Build with Nemotron models. Awarded for best Space (judged) and community engagement.

2× RTX 5080
All prizes