
ElevenLabs Unveils New Text-to-Sound-Effects API for Developers
In a significant innovation, ElevenLabs has introduced its latest offering: the Text to Sound Effects API. This cutting-edge technology is designed to empower developers to create dynamic audio experiences, revolutionizing the way we interact with text and sound in various applications.
A Showcase of Innovation: The First Video to Sounds Effects App
To demonstrate the capabilities of this innovative API, ElevenLabs has developed the first-ever Video to Sounds Effects app. This app is available for free and is fully open-source, allowing developers to explore its functionality and potential uses. You can watch a showcase of this app on X here: https://link-to-showcase
The Text to Speech API: A Comprehensive Solution
ElevenLabs’ text-to-speech API is an advanced tool that converts text into audio across 29 languages, offering thousands of realistic voices. Developers can seamlessly integrate these voices into their React apps or use the Python library to get started. The Voice Lab provides a platform for exploring pre-made voices and cloning your own, while the Voices Library showcases user-generated voices.
API Features
High-Quality Voices
- Thousands of voices in 29 languages, delivered at 128kbps
- Realistic voices that can be integrated into various applications
Ultra-Low Latency
- Achieve ~400ms audio generation times with the Turbo model
- Fast and efficient processing for dynamic audio experiences
Contextual Awareness
- The API understands text nuances for appropriate intonation and resonance
- Enhanced accuracy in voice synthesis, ensuring a more natural sound
API Models
The Text to Sound Effects API features various models, each designed to cater to different needs:
- Turbo v2: Supports up to 30k characters (~30 minutes of audio) per request
- Other models: Support up to 10k characters (~10 minutes of audio) per request
Subscription Tiers and Concurrency Limits
ElevenLabs offers various subscription tiers, each with its own concurrency limits:
| Tier | Concurrent Requests |
| — | — |
| Free | 2 concurrent requests |
| Starter | 3 concurrent requests |
| Creator | 5 concurrent requests |
| Pro | 10 concurrent requests |
| Scale | 15 concurrent requests |
For higher limits, users can contact the Enterprise team to discuss custom plans.
Supported Languages
The multilingual TTS API currently supports the following languages:
- Chinese
- Korean
- Dutch
- Turkish
- Swedish
- Indonesian
- Filipino
- Japanese
- Ukrainian
- Greek
- Czech
- Finnish
- Romanian
- Russian
- Danish
- Bulgarian
- Malay
- Slovak
- Croatian
- Classic Arabic
- Tamil
- English
- Polish
- German
- Spanish
- French
- Italian
- Hindi
- Portuguese
To use these languages, simply provide the input text in the desired language.
Explore More
Learn more about the API features and how to use the Text to Sound Effects API through the following links:
- API Features: A comprehensive overview of the API’s capabilities and features.
- How to Use Text to Sound Effects API: A step-by-step guide for developers to integrate the API into their applications.
A New Era in Audio Experiences
By launching the Text to Sound Effects API, ElevenLabs continues to support developers in creating dynamic audio experiences, enhancing the way we interact with text and sound in various applications. This innovative technology is poised to revolutionize the industry, enabling developers to push the boundaries of what is possible with audio synthesis.
Get Started Today
Join the growing community of developers who are leveraging the Text to Sound Effects API to create innovative audio experiences. Sign up for a subscription tier and start exploring the possibilities today!