The voice & video studio that runs on your machine

Clone your voice.
Generate without counting.

ClonyVoice runs voice synthesis on your own GPU: unlimited speech, at no extra cost, and your voice never leaves your computer. Then paste a URL — the Studio turns it into a branded video with your cloned voiceover, in about three minutes.

Signups open soon — leave your email and be notified the day it does.

Windows · NVIDIA GPU (CUDA) required · macOS on the roadmap

Local generation Unlimited voice Your voice stays with you 10 languages Consent-first cloning French company

Why creators switch

Generate without counting

The voice runs on your GPU: unlimited on every plan. Elsewhere, every sentence has a price.

Your voice stays with you

A voice is biometric data. Voice generation is 100% local — nothing to upload, nothing to leak.

From your website to a video

Paste a URL: AI script, your images, your logo, your voice. MP4 rendered on your machine.

4 tools, 1 subscription

Cloning + TTS + videos + dubbing in 10 languages. The equivalent stack costs $50–120/month elsewhere.

How it works

Clone your voice

A few minutes of audio are enough — recorded or imported, with the speaker's consent.

Generate

Text to speech, or URL to Studio video. Listen sentence by sentence, regenerate what you want.

Export MP4 or WAV

Rendered on your machine, with subtitles and sound design for videos.

Steps 1 and 3 are 100% local. The Studio and translation use our online AI, covered by your AI budget.

From your website to a finished video

No editing skills, no stock-footage collage. The Studio reads your site and builds a video that looks like your brand — because it is your brand.

Paste a URL

ClonyVoice analyzes the page: your logo, your images, your colors, your message.

The AI writes and directs

Script, scenes, rhythm and design are composed on our servers — this is what your AI budget pays for.

Your machine renders

The voiceover is generated locally with your cloned voice, and the MP4 is rendered on your GPU. Revise the script until it's right.

About 3 minutes from URL to MP4

Why we can offer unlimited voice — and cloud tools can't

Cloud voice tools pay for GPU time on every sentence you generate. So they sell credits, meter every character, and bill overages. That's not greed — it's their architecture.

ClonyVoice generates speech on your GPU. Once the app runs, a sentence costs us nothing — so we don't count it. Voice generation is unlimited on every plan, including Free.

The only thing we meter is the only thing that costs us: the server-side AI that writes scripts, directs videos and translates dubbing. That's what your AI budget is — a transparent meter on our costs, never on your voice.

Hours of audio per day? Same price.
No overage bills. Ever.
Paid plans are never blocked — even with zero AI budget left.

Your voice never leaves your machine

A voice is biometric data. With ClonyVoice, your recordings, your voice models and every audio file you generate live on your computer — nothing is uploaded to clone or to speak. Publishing a voice to the VoiceStore is a separate, explicit choice.

What does reach our servers — in plain words

Studio and dubbing use AI on our servers: the text of your scripts, the analysis of the URL you submit, and translation requests. These requests count against your AI budget and are logged for billing and abuse prevention. Your audio is not part of them.

Working offline? Voice generation keeps running without a connection for up to 48 hours between license checks.

One video. Ten languages. Still your voice.

ClonyVoice translates your script (on your AI budget), then your cloned voice speaks each language locally, synchronized to the timing of the original audio. English, French, German, Spanish, Italian, Portuguese, Russian, Japanese, Korean, Chinese.

To be precise: we synchronize the audio to the original timing. We do not alter the image or the lips.

A store of voices — and a stage for yours

Need a voice you don't have? Browse the VoiceStore and download ready-to-use voices, included from the Creator plan.

Voice artist? Publish your voice on your terms, with consent built into the process, and earn when it is downloaded. Your voice becomes an asset — one you control.

Explore the VoiceStore

Voice AI를 위한 모든 것

복제에서 생성까지, 하나의 플랫폼으로.

음성 복제

모든 목소리를 즉시 복제

단 3초의 오디오로 어떤 목소리의 본질도 포착합니다. 1~5개 샘플로 더 높은 충실도를 달성하세요. 패스트 모드로 즉시 결과 또는 프리사이스 모드로 스튜디오 품질 복제.

3초부터 — 최대 5개 샘플로 최고 품질
자동 다국어 지원
톤, 억양, 음성 정체성 보존

보이스 디자인

텍스트로 목소리 생성

원하는 목소리를 설명하면 AI가 구현합니다. 독특한 캐릭터, 브랜드 보이스, 또는 이전에 존재하지 않았던 가상의 페르소나 제작에 완벽합니다.

자연어 설명
나이, 성별, 억양 조정
무제한 변형 생성

보이스 라이브러리

표현력 있는 스튜디오 보이스

모든 내장 언어로 말할 수 있는 고품질 음성 라이브러리를 이용하세요. 따뜻한 내레이터부터 에너지 있는 발표자까지, 어떤 프로젝트에도 맞는 음성을 찾을 수 있습니다.

9개 프리미엄 스튜디오 보이스 내장
모든 음성이 10개 언어를 지원
전문가 수준 품질 출력

모델 가져오기

나만의 모델 가져오기

이미 음성 모델이 있으신가요? 바로 가져올 수 있습니다. ClonyVoice는 XTTS, Coqui 등 프레임워크의 주요 형식을 지원합니다.

XTTS & Coqui 호환
.pth, .onnx 형식 지원
간편한 드래그 앤 드롭 가져오기

멀티보이스 스튜디오

멀티보이스 대화 & 비디오

Assign different voices to each sentence for realistic dialogues. Import scripts from .txt, .srt or .vtt files. Export as audio or as subtitled MP4 video.

문장별 다른 음성
스크립트 가져오기 (.txt, .srt, .vtt)
Video export with subtitles (MP4)

스마트 편집

실시간 생성 & 편집

생성되는 동안 실시간으로 각 문장을 들을 수 있습니다. 전체를 다시 하지 않고 개별 문장만 재생성. 멀티트랙 타임라인이 있는 비디오 편집기 내장.

생성 중 문장별 실시간 청취
개별 문장 재생성
멀티트랙 타임라인 비디오 편집기

오디오 소스

녹음, 업로드 또는 다운로드

실시간 VU 미터로 마이크에서 직접 녹음하세요. 모든 형식의 오디오 파일을 업로드하세요. 또는 YouTube URL을 붙여넣어 자동으로 음성을 추출하세요.

VU 미터 포함 내장 마이크 녹음
YouTube URL 오디오 추출
자동 노이즈 제거 및 Whisper 전사

내보내기 & 공유

음성 모델 내보내기

생성한 음성을 암호화된 .clonyvoice 패키지로 저장하세요. 기기 간 안전한 가져오기/내보내기. 테이크 히스토리가 있는 프로젝트 관리.

AES 암호화 음성 패키지
테이크 히스토리가 있는 프로젝트 관리
협업자와 음성 공유

로컬 API

완전한 REST API 내장

포괄적인 로컬 API로 ClonyVoice를 워크플로우에 통합하세요. 음성 생성, 보이스 관리, 모든 것을 프로그래밍으로 제어 — 클라우드 의존 없음.

localhost의 RESTful API
WebSocket 실시간 이벤트
속도 제한이 있는 스코프 API 키

What you should know before starting

Windows 10/11 + NVIDIA GPU (CUDA) required — macOS is on the roadmap.
Free plan: 720p export with a "Created with ClonyVoice" watermark and outro; paid plans: clean 1080p.
Voice: unlimited, local. Studio & translation: online, covered by your monthly AI budget.
Offline: the app keeps working for 48 hours without a connection.

The questions everyone asks

What do I need to run ClonyVoice?

A Windows PC with an NVIDIA GPU (CUDA). As a rule of thumb: if your machine runs a recent game, it runs ClonyVoice. macOS is on the roadmap.

Is voice generation really unlimited?

Yes. It runs on your hardware, so it costs us nothing — there is no meter to run out. Generate as much as you want, on every plan.

Then what is the AI budget for?

Only for what runs on our servers: Studio script writing and video direction, and dubbing translation. Each plan includes a monthly AI budget shown as a 0-100% gauge; your voice never touches it.

Is my voice sent to the cloud?

No. Recordings, voice models and generated audio stay on your machine. Our servers only receive the text and URL-analysis requests used by Studio and dubbing — and those are logged for billing. Publishing to the VoiceStore is always an explicit opt-in.

Can I use it commercially?

Yes, from the Creator plan. The Free plan is for personal use, and its videos carry a discreet "Created with ClonyVoice" watermark and outro.

Can I clone any voice?

You may clone your own voice, or a voice whose owner has given you explicit permission. Consent is a condition of use — and it's what the upcoming EU AI Act expects of everyone.

모든 질문 보기

Compare like for like

We compare our Creator plan with same-tier monthly plans, no commitment.

	ClonyVoiceCreator — $12/월	ElevenLabsCreator — $22/월	FlikiStandard — $28/월
Voice generation	Unlimited — runs on your GPU	121,000 credits/month, overages billed	Credit-based (cloud rendering)
Voice cloning	Unlimited cloned voices	Included	Premium plan only ($88/mo)
Videos with YOUR images and logo (site analysis)	URL → MP4, ≈ 30 videos/month	—	Generic stock media
Dubbing / translation	10 languages, audio sync	Separate Dubbing product	Not offered
Where your voice goes	Never leaves your machine	Sent to and processed in the cloud	Sent to and processed in the cloud
Commercial use	✓	✓	✓
Runs in the browser	— (Windows app, NVIDIA GPU required)	✓	✓
¹ Prices and allowances as listed on elevenlabs.io/pricing and fliki.ai/pricing on 6 July 2026 (monthly plans, USD, before tax). Offers change: check the vendors' sites. ² "Unlimited": voice synthesis runs locally on your GPU; Studio video and translation usage is covered by your monthly AI budget (≈ 30 videos/month on Creator at current rates). ³ ClonyVoice requires Windows 10/11 and a CUDA-compatible NVIDIA GPU. macOS is on the roadmap.

Four subscriptions. Replaced by one.

Creators typically stack $50–120 a month of separate tools to do what ClonyVoice does from $12/month.

Voice cloning & text-to-speech	$6–99 per month, metered by credits
Faceless & marketing videos	$25–88 per month, minutes capped
Dubbing & translation	$25–60 per month
Open voice marketplace	No mainstream equivalent
ClonyVoice: all four, from $12/month — with unlimited local voice generation.

Published prices of leading tools in each category, observed July 2026.

Start free. Upgrade when it pays off.

Free 무료 Hear your voice cloned. Decide for yourself. Creator $12/월 For creators who publish every week. Pro $24/월 For professionals producing at pace. Studio $49/월 For teams, agencies and heavy publishers.

See plans →

Ready for the EU AI Act — ahead of the deadline

From August 2, 2026, the AI Act (art. 50) requires synthetic audio and video to be disclosed. ClonyVoice was built for that world: consent-first cloning, content labeling built in, by a French company. Create with a tool designed for the rules — before they apply.

Your GPU is sitting idle. Put it to work.

Clone your voice today, free. Two voices, about 3 Studio videos a month, and voice generation you'll never have to count.

One email at launch. Nothing else, no sharing.

더 많은 AI 음성 사용 사례 탐색

ClonyVoice가 다양한 산업과 애플리케이션에서 음성 제작을 혁신하는 방법을 알아보세요.

모든 사용 사례 보기 →

Clone your voice.Generate without counting.