Speech to Text Open Source

Meta Introduces Spirit LM open source model that combines text and speech inputs/outputs

Just in time for Halloween 2024, Meta has unveiled Meta Spirit LM, the company’s first open-source multimodal language model capable of seamlessly integrating text and speech inputs and outputs.

Geeky Gadgets

5 Best Free Speech-to-Text APIs in 2025 Compared & Tested

What if you could transform hours of audio into precise, actionable text with just a few lines of code? In 2025, this is no longer a futuristic dream but a reality powered by innovative speech-to-text ...

Geeky Gadgets

OpenAI AI Audio: TTS Speech-to-Text Audio Integrated Agents

OpenAI has introduced a series of AI audio models, fundamentally redefining how voice-based AI can be integrated into modern applications with ChatGPT. These advancements include state-of-the-art ...

Nature

Striving for open-source and equitable speech-to-speech translation

• Billions of people around the world regularly communicate online in languages other than their own.
• This has created huge demand for artificial intelligence (AI) models that can translate both ...
