Text to Speech Models

A new, open source text-to-speech model called Dia has arrived to challenge ElevenLabs, OpenAI and more

A two-person startup by the name of Nari Labs has introduced Dia, a 1.6 billion parameter text-to-speech (TTS) model designed to produce naturalistic dialogue directly from text prompts — and one of ...

Geeky Gadgets

OpenAI AI Audio : TTS Speech-to-Text Audio Integrated Agents

OpenAI has introduced a series of AI audio models, fundamentally redefining how voice-based AI can be integrated into modern applications wit&h ChatGPT. These advancements include state-of-the-art ...

TechRepublic

OpenAI Gives Its Agents a Voice – Now a ‘Medieval Knight’ Can Read Your Work Emails

The text-to-speech and speech-to-text tools are all based on GPT-4o. OpenAI hinted it may take a similar path with video. OpenAI is expanding its controversial stable of AI voices to include agentic ...

TechCrunch

Largest text-to-speech AI model yet shows ’emergent abilities’

Researchers at Amazon have trained the largest ever text-to-speech model yet, which they claim exhibits “emergent” qualities improving its ability to speak even complex sentences naturally. The ...

SiliconANGLE

Amazon researchers develop cutting-edge Base TTS text-to-speech model

Amazon.com Inc. researchers have developed a new text-to-speech model, Base TTS, that can pronounce words more naturally than earlier neural networks. TechCrunch reported the project late Wednesday.

ZDNet

Text-to-speech with feeling - this new AI model does everything but shed a tear

I wore the world's first HDR10 smart glasses TCL's new E Ink tablet beats the Remarkable and Kindle Anker's new charger is one of the most unique I've ever seen Best laptop cooling pads Best flip ...

Kotoba Technologies Raises $10 Million in Seed Funding to Expand Real-Time Voice AI Platform Across East Asia

Kotoba Technologies, a developer of real-time speech models optimized for East Asian languages, today announced an additional ...

Neowin

ElevenLabs unveils text-to-speech Turbo 2.5 model with 32 languages0 0

The AI company ElevenLabs has launched a new text-to-speech model called Turbo 2.5. It introduces support for three new languages: Vietnamese, Hungarian, and Norwegian. The API is available too. The ...

Geeky Gadgets

ChatTTS a new open source AI voice text-to-speech AI model

ChatTTS is an open-source AI voice text-to-speech (TTS) model that has gained significant popularity on GitHub due to its impressive features and user-friendly design. This model is specifically ...

TechCrunch

OpenAI launches DALL-E 3 API, new text-to-speech models

OpenAI launched a slew of new APIs during its first-ever developer day. The DALL-E 3 API offers different format and quality options and resolutions ranging from 1024×1024 to 1792×1024, with prices ...

VentureBeat

Meta Introduces Spirit LM open source model that combines text and speech inputs/outputs

Just in time for Halloween 2024, Meta has unveiled Meta Spirit LM, the company’s first open-source multimodal language model capable of seamlessly integrating text and speech inputs and outputs.

Forbes

How Large Scale Speech Models Will Impact Voice AI

Gautam Jha is the Co-Founder & CTO of Kalpa Labs, an SF-based YC backed startup building large scale Foundational speech models. Voice is quickly becoming a primary interface for enterprise software, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results