AI Multilingual Audio Content Agent: TTS, Transcription & Translation
Integrates with:
Overview
Unlock Global Reach with this AI Agent
This AI Agent takes French text as input, converts it to high-quality French audio using ElevenLabs, transcribes this audio back to text with OpenAI's Whisper for verification or repurposing, translates the original text to English using an OpenAI LLM, and finally generates English audio with ElevenLabs. It’s a powerful tool for solopreneurs, founders, and CTOs looking to create engaging multilingual audio content effortlessly and expand their market reach.
Key Features & Benefits
- AI-Driven Text-to-Speech (TTS): Generates natural-sounding audio from text in both French and English using ElevenLabs, allowing for voice customization to maintain brand consistency.
- Accurate AI Translation: Leverages OpenAI's large language models (via LangChain) for precise text translation between French and English, breaking language barriers.
- Advanced Speech-to-Text (STT): Utilizes OpenAI's Whisper model to accurately transcribe spoken French audio back into text, perfect for content verification or creating written accompaniments.
- End-to-End Automation: Streamlines the entire workflow from text input to multilingual audio output and transcription, saving significant time and resources.
- Voice Customization: Enables the use of specific ElevenLabs voice IDs, ensuring a consistent and professional audio brand across different languages.
Use Cases
- Generate multilingual audio for B2C e-commerce product descriptions to cater to international customers.
- Create localized audio versions of B2B SaaS product demos and tutorial videos.
- Automate voiceover production for marketing campaigns targeting diverse linguistic audiences.
- Develop audio content for language learning platforms or educational materials.
- Repurpose blog posts and articles into podcasts or audio summaries in multiple languages to increase accessibility.
Prerequisites
- An n8n instance (Cloud or self-hosted).
- OpenAI API Key with access to a suitable model (e.g., gpt-3.5-turbo or gpt-4 for translation, and Whisper API access for transcription).
- ElevenLabs API Key and at least one Voice ID from your ElevenLabs Voice Lab.
Setup Instructions
- Download the n8n workflow JSON file.
- Import the workflow into your n8n instance.
- Configure ElevenLabs Voice ID & Text: Open the 'Set ElevenLabs voice ID and text' node. Update the
voice_id
field with your desired ElevenLabs Voice ID (find this in your ElevenLabs Voice Lab). You can also change the initial Frenchtext
here. - Configure ElevenLabs API Credentials:
- In the 'Generate French Audio' node, under 'Credentials', select or create new 'HTTP Header Auth' credentials. Set 'Name' to
xi-api-key
and 'Value' to your ElevenLabs API key. - Repeat this for the 'Translate English text to speech' node.
- In the 'Generate French Audio' node, under 'Credentials', select or create new 'HTTP Header Auth' credentials. Set 'Name' to
- Configure OpenAI API Credentials:
- For the 'Transcribe Audio' node (HTTP Request): Under 'Authentication', select 'Predefined Credential Type'. For 'Credential Type', choose 'OpenAI API'. Select or create your OpenAI API credentials.
- For the 'OpenAI Chat Model' node (used by 'Translate Text to English'): Select or create your OpenAI API credentials in this node.
- Ensure all credential nodes are correctly configured and showing as active.
- Activate the workflow. You can then test it by clicking 'Execute Workflow' on the 'When clicking "Execute Workflow"' node.
Want your own unique AI agent?
Talk to us - we know how to build custom AI agents for your specific needs.
Schedule a Consultation