🎙️ Clone Your Voice - AI-powered voice cloning with browser recording. Record 10 seconds, generate speech in your voice. Simple Docker deployment.

aldervall/clone-your-voice

🎙️ Clone Your Voice 2.0

AI-powered voice cloning made simple - Optimized Edition

Record your voice for 10 seconds, then generate speech in your cloned voice from any text. Recording happens in the browser, so no separate microphone setup is required.

✨ Features

  • 🎤 Browser Recording - No microphone setup, record directly in your browser
  • 🤖 AI Voice Cloning - Powered by NeuTTS-Air (Qwen 0.5B backbone)
  • 📝 Auto-Generated Prompts - Read a random sentence, we handle the rest
  • 🎵 Text-to-Speech - Generate speech from any text in your voice
  • 📱 Mobile Friendly - Works on phones, tablets, and desktops
  • 🐳 Docker Ready - One-command deployment
  • 💾 Persistent Storage - Your recordings and outputs are saved

🚀 Quick Start

Docker (Recommended)

# Clone the repository
git clone https://github.com/aldervall/clone-your-voice.git
cd clone-your-voice

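# Start with Docker Compose (with the Compose v2 plugin: docker compose up -d)
docker-compose up -d

# Open in browser (open is macOS-only; on Linux use xdg-open, or paste the URL into any browser)
open http://localhost:5000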

That's it! The interface is ready to use.

For more detailed setup options, including local development, see the Quick Start Guide.
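
If the page does not load right away after `docker-compose up -d`, a small polling helper can wait for the interface before you open the browser. The sketch below is not part of the project: `retry`, `wait_for_ui`, and the `CLONE_VOICE_URL` variable are hypothetical names, and it assumes `curl` is installed and that the interface answers a plain HTTP GET on the Quick Start address.

```shell
# Retry a command until it succeeds or the attempt budget runs out.
# Usage: retry <max_attempts> <delay_seconds> <command> [args...]
retry() {
  local max_attempts="$1" delay="$2"
  shift 2
  local attempt=1
  until "$@"; do
    if [ "$attempt" -ge "$max_attempts" ]; then
      return 1   # give up: every attempt failed
    fi
    attempt=$((attempt + 1))
    sleep "$delay"
  done
  return 0
}

# Assumption: the web interface responds to HTTP GET at the Quick Start URL.
# Usage: wait_for_ui [max_attempts] [delay_seconds]
wait_for_ui() {
  retry "${1:-30}" "${2:-2}" \
    curl --silent --fail --output /dev/null \
    "${CLONE_VOICE_URL:-http://localhost:5000}"
}
```

For example, `wait_for_ui && open http://localhost:5000` opens the page only once the server responds, and `wait_for_ui 10 1` gives up after ten one-second-spaced attempts.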

🎯 How It Works

3 Simple Steps

Step 1: Record Your Voice (10 seconds)

  • Click the microphone button
  • Read the displayed prompt aloud
  • Preview your recording

Step 2: Generate Speech

  • Type any text you want to hear
  • Click "Generate Speech"
  • Wait for AI processing (~10-30 seconds)

Step 3: Download

  • Listen to your generated audio
  • Download the file
  • Generate more!

📖 Documentation & AI Context

🛠️ Technology Stack

  • AI Model: NeuTTS-Air - Qwen 0.5B backbone
  • Audio Codec: NeuCodec (50 Hz neural codec)
  • Backend: Python 3.11 + Flask
  • Frontend: Vanilla JavaScript + CSS
  • Deployment: Docker + Docker Compose
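
To put the codec figure in context, the sketch below works out how many codec frames a clip of a given length would occupy. It assumes that "50Hz" means the codec emits 50 frames per second of audio; `FRAMES_PER_SECOND` and `codec_frames` are illustrative names, not part of the project.

```shell
# Assumption: "50Hz" is the codec frame rate (frames per second of audio).
FRAMES_PER_SECOND=50

# Usage: codec_frames <whole_seconds>
# Prints the number of codec frames for a clip of that length.
codec_frames() {
  echo $(( $1 * FRAMES_PER_SECOND ))
}
```

Under that assumption, `codec_frames 10` prints 500, so the 10-second reference recording corresponds to roughly 500 frames.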

🤝 Contributing

Contributions are welcome: fork the repository, make your changes, and submit a pull request.

📝 License

MIT License - see LICENSE

🙏 Acknowledgments

📧 Contact


Clone Your Voice - AI-powered voice cloning made simple 🎙️
