Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents
https://huggingface.co/blog/nvidia/nemotron-3-nano-omni-multimodal-intelligence (huggingface.co)
NVIDIA Nemotron 3 Nano Omni is a new omni-modal understanding model built for real-world applications. It is designed for document analysis, reasoning across multiple images, and automatic speech recognition. The model also supports long audio-video understanding, making it suitable for creating advanced AI agents.
0 points • by hdt • 3 hours ago