
Tiny, capable models for text, vision, audio and omni — small enough to live on your own hardware.
The MiniCPM family proves you don’t need a giant to get real work done. Each model is tuned to punch far above its parameter count and runs happily on a laptop — or a phone. OpenBMB provide free hosted API access for the jam, and every model also runs locally via llama.cpp or transformers. Pick the modality you need and go.
Full-duplex omni model — voice, vision and language in, speech out. Real-time capable.
Fork this Gradio Server Space to start from a working MiniCPM-V-4.6 app.
To qualify ·Build with MiniCPM models. The pool is split $5k per track (1st $2,500 · 2nd $1,500 · 3rd $1,000).