Table of Contents
Documentation
Learn how to use every feature of ClonyVoice.
Getting Started
Installation
Download the installer from the link provided after purchase. Run the installer and follow the on-screen instructions. The installer will automatically set up all required dependencies including the AI models.
- Download the installer from your account or the email you received after purchase
- Run the .exe installer and follow the setup wizard
- Wait for the installation to complete (this may take a few minutes as AI models are set up)
- Launch ClonyVoice from your desktop or Start menu
License Activation
On first launch, you'll be asked to enter your license key. You can find it in your account dashboard or in the confirmation email. Enter the key and click Activate. Your license supports up to 2 machines simultaneously.
Interface Overview
The main interface is divided into several areas: the voice selector on the left, the text input area in the center, and the controls panel on the right. The top bar provides access to settings, voice management, and the Voice Store.
Voice Cloning
How Voice Cloning Works
Voice cloning creates a digital model of a real voice from audio samples. Once cloned, this voice model can speak any text in any of the 10 supported languages.
Quick Mode
Quick mode creates an instant voice clone from your audio sample. Perfect for testing and previewing voices.
- Click "Clone Voice" in the main interface
- Select "Quick Mode"
- Upload or record an audio sample (3-60 seconds)
- Give your voice a name and click "Clone"
- Your cloned voice appears in the voice selector, ready to use
Precise Mode
Precise mode uses transcription to align the audio sample with its text content, producing a higher-quality voice clone. Recommended for production use.
- Click "Clone Voice" and select "Precise Mode"
- Upload or record an audio sample (10-60 seconds recommended)
- The transcription is generated automatically — you can edit it for accuracy
- Click "Clone" and wait for processing (30-60 seconds)
- Your high-fidelity voice model is ready
Multi-Sample Cloning
For the best voice fidelity, combine up to 5 different audio samples of the same voice. The AI will learn from all samples to create a more accurate voice model.
Tips for Best Results
docs_vc_tips_text
- Use a quiet environment with minimal background noise
- Record at a consistent volume and distance from the microphone
- Include varied intonation — don't read in a monotone
- Longer samples (10-30 seconds) give better results than very short ones
- Avoid samples with music, other speakers, or sound effects
Voice Design
Creating Voices from Text
Voice Design lets you create entirely new voices by describing them in natural language. No audio sample is needed.
- Click "Design Voice" in the main interface
- Enter a description of the voice you want (e.g., "A warm, deep male voice with a calm tone")
- Click "Generate" to create the voice
- Preview the generated voice and regenerate if needed
- Save the voice to your library when satisfied
Description Tips
docs_vd_tips_text
- Describe age, gender, pitch, and tone
- Mention accents or speaking styles if desired
- Be specific: "energetic young woman" works better than "nice voice"
- Generate multiple variations and pick your favorite
Text-to-Speech
Generating Speech
Once you have a voice (cloned, designed, or from the Voice Store), you can generate speech from any text.
- Select a voice from the voice selector
- Type or paste your text in the input area
- Choose the output language
- Optionally select an emotion preset
- Click "Generate" and listen to the result
- Save the audio file (WAV format) to your computer
Emotion Presets
Apply emotional presets to make the generated speech more expressive. Available emotions: Happy, Sad, Angry, Fearful, Disgusted, Surprised, Whisper. Each preset adjusts the pitch, speed, and intonation of the voice.
Multi-language Output
Any voice can speak in any of the 10 supported languages. Simply select the target language before generating. The voice characteristics are preserved while adapting pronunciation to the target language.
Voice Store
Browsing the Voice Store
The Voice Store is an online marketplace of voice models shared by the community. Browse, preview, and download voices for your projects.
- Click "Voice Store" in the top navigation bar
- Browse voices by category or use the search function
- Preview any voice by clicking the play button
- Click "Download" to add it to your local voice library
Import & Export
Exporting Voice Models
Export your voice models for backup or sharing.
- Right-click a voice in the voice selector
- Select "Export Voice"
- Choose a save location and click "Export"
Importing Voice Models
Import voice models from files or other users.
- Click "Import Voice" in the voice management menu
- Select the voice model file
- The voice appears in your voice selector, ready to use
Settings
Application Settings
Access settings from the gear icon in the top bar. Available settings include:
- GPU/CPU mode: Choose between NVIDIA GPU acceleration or CPU-only processing
- Output format: Configure the default audio output settings
- Interface language: Change the application language
- Model management: Download or update AI models
Troubleshooting
Slow generation
If generation is slow, make sure you're using GPU mode (requires NVIDIA GPU with CUDA). Close other GPU-intensive applications. On CPU-only mode, generation is naturally slower.
Poor voice quality
Use Precise mode with clean audio samples (10-30 seconds). Minimize background noise. Multiple samples improve fidelity. Avoid samples with music or multiple speakers.
Application crashes or won't start
Verify system requirements (Windows 10/11, 16 GB RAM). Try running as administrator. Ensure your antivirus isn't blocking the application. If the issue persists, reinstall or contact support.
License activation issues
Make sure you're entering the correct license key from your account dashboard. Internet connection is required for activation only. If you've reached the machine limit, deactivate a machine from your dashboard.