Speech Generation
Aug 18, 2025
Identify Speakers in Meetings, Calls, and Voice Apps in Real-Time with NVIDIA Streaming Sortformer
In every meeting, call, crowded room, or voice-enabled app, technology has a core question: who is speaking, and when? For decades, answering that question in...
5 MIN READ
Jul 17, 2025
NVIDIA Canary‑Qwen‑2.5B: Open‑Source ASR/LLM for Superior Transcription and Summarization
Top‑ranked on the HuggingFace Open‑ASR leaderboard, the model is production‑ready.
1 MIN READ
Apr 29, 2025
Spotlight: Personal AI Brings AI Receptionists to Small Business Owners with NVIDIA Riva
It's 10 p.m. on a Tuesday when the phone rings at the Sapochnick Law Firm, a specialized law practice in San Diego, California. The caller, a client of the...
6 MIN READ
Jul 01, 2024
Google's New Gemma 2 Model Now Optimized and Available on NVIDIA API Catalog
Gemma 2, the next generation of Google Gemma models, is now optimized with TensorRT-LLM and packaged as NVIDIA NIM inference microservice.
1 MIN READ
Jun 26, 2024
Generate High-Quality, Context-Aware Responses for Chatbots and Search Engines with Llama 3-ChatQA
Experience and test Llama3-ChatQA models at scale with performance optimized NVIDIA NIM inference microservice using the NVIDIA API catalog.
1 MIN READ
Jun 03, 2024
NVIDIA Collaborates with Hugging Face to Simplify Generative AI Model Deployments
As generative AI experiences rapid growth, the community has stepped up to foster this expansion in two significant ways: swiftly publishing state-of-the-art...
4 MIN READ
May 20, 2024
Supercharge Generative AI Development with Firebase Genkit, Optimized by NVIDIA RTX GPUs
At Google I/O 2024, Google announced Firebase Genkit, a new open-source framework for developers to add generative AI to web and mobile applications using...
4 MIN READ
Apr 18, 2024
New Standard for Speech Recognition and Translation from the NVIDIA NeMo Canary Model
NVIDIA NeMo is an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises. The NeMo team...
4 MIN READ
Apr 18, 2024
Turbocharge ASR Accuracy and Speed with NVIDIA NeMo Parakeet-TDT
NVIDIA NeMo, an end-to-end platform for developing multimodal generative AI models at scale anywhere—on any cloud and on-premises—recently released...
6 MIN READ
Feb 07, 2024
Featured Large Language Models Sessions at NVIDIA GTC 2024
Speakers from NVIDIA, Meta, Microsoft, OpenAI, and ServiceNow will be talking about the latest tools, optimizations, trends and best practices for large...
1 MIN READ
Feb 06, 2024
Top Retrieval-Augmented Generation (RAG) Sessions at NVIDIA GTC 2024 Sessions
Join us in-person or virtually and learn about the power of RAG with insights and best practices from experts at NVIDIA, visionary CEOs, data scientists, and...
1 MIN READ