Voice Technology RSS Feeds

0 voice technology feeds · 17 articles

Latest

Voice Technology articles

KugelAudio launches real-time TTS with voice cloning, sub-60ms latency, and on-premise deployment

KugelAudio launches a real-time text-to-speech model with voice cloning capabilities on Product Hunt. The model can clone a voice from just 30-60 seconds of audio, achieves sub-60ms latency (excluding network), supports input/output streaming, and offers both API and on-premise d…

Product · technology · business · ai & machine learning

Product Hunt — The best new products, every day·4d ago·1 min read

ElevenLabs launches pre-built voice and chat agent templates for customer support and sales

Product · technology · business · ai agents

Product Hunt · 1mo ago

VoiceOS: Voice Control Software for Computer Workflows on Mac and Windows

Product · technology · productivity · software

Product Hunt · 2mo ago

FnKey: Open-Source macOS Dictation App with Real-Time Deepgram Transcription

Product · technology · programming · macos apps

Product Hunt · 2mo ago

ElevenAgents Launches Expressive Mode: AI Voice Agents That Adapt Tone, Timing, and Emotion by Context

Product · technology · artificial intelligence · product launch

Product Hunt · 3mo ago

Wispr Flow: Voice-to-Text Tool for Faster Writing Across All Apps

Product · technology · productivity · software tools

Product Hunt · 3mo ago

Emra: Voice-to-Text Tool with Always-On Transcription for Productivity

Product · technology · productivity · software

Product Hunt · 4mo ago

Sparrow-1: AI Model for Human-Like Conversational Timing in Real-Time Voice Systems

Sparrow-1 is a specialized multilingual audio model designed to achieve human-level conversational timing in real-time voice interactions. Unlike traditional voice systems that wait for silence before responding, Sparrow-1 continuously models conversational flow and floor transfe…

Insight · technology · artificial intelligence · human-computer interaction

tavus.io·Hacker News: Front Page·4mo ago·10 min read

Building Ultra-Low-Latency Voice Agents with NVIDIA Open Models

This technical guide demonstrates how to build ultra-low-latency voice agents using NVIDIA's open models, including the newly launched Nemotron Speech ASR for sub-25ms transcription, Nemotron 3 Nano LLM for natural language processing, and Magpie TTS for text-to-speech. The artic…

technology · artificial intelligence · programming

daily.co·Hacker News: Front Page·4mo ago·18 min read

Nexorify: Voice-Activated Calendar and Reminder Creation Tool

Nexorify is a voice-powered productivity tool that converts spoken commands into calendar events and reminders, eliminating the need for typing. The app is designed for users who can speak faster than they type, allowing them to create reminders and schedule events through natura…

Product · technology · productivity · software tools

Product Hunt — The best new products, every day·6mo ago·1 min read

OpenAI Launches GPT-Realtime Model and Voice API for Advanced Voice Agent Development

Product · technology · programming · ai development

Product Hunt · 9mo ago

OpenAI Launches GPT-Realtime Model for Advanced Voice Agent Capabilities

Product · technology · programming · ai development

Product Hunt · 9mo ago

Vogent Voicelab: Platform for Optimized Open-Source Voice Model Inference

Product · technology · artificial intelligence · voice technology

Product Hunt · 10mo ago

AI Voice Note Taker: Browser-Based Real-Time Voice Transcription Tool

AI Voice Note Taker is a browser-based tool that provides real-time voice transcription with features including auto-punctuation, support for 30+ languages, notes history, file uploads, and export capabilities for hands-free writing.

Product · technology · productivity · software tools

Product Hunt — The best new products, every day·11mo ago·1 min read

Ito: Open Source Voice Assistant for Natural Computer Interaction

Voice Gecko: Desktop Voice Dictation App for Coding and Writing

Hume AI Launches Octave 2: Next-Generation Multilingual Text-to-Speech Model
