Skip to main content

Choose a voice

Search the user-created library, open a recent voice, or train your own.

The library

A voice library built like an instrument

Every voice on VoiceDub is a trained model you can play any song through — searchable, filterable, and ready to render.

10,000+

Voices

Creator-trained clones.

7

Languages

From English and Spanish to Korean and Thai.

7

Genres

Pop to jazz, with rap, rock, and R&B between.

< 60s

To a cover

Upload to finished render, start to finish.

Signal path

From any song to any voice in three moves

01

Pick a voice

Search 10,000+ user-created voices, filter by genre, language, or style — or train a clone of your own voice and use that instead.

02

Drop in a track

Upload an audio file, paste a song link, or type text. Vocal separation isolates the performance automatically.

03

Render and download

The vocal is converted, re-mixed over the instrumental, and ready to download — with the isolated vocal stem included when the voice is your own clone.

Index

Browse the library your way

Every entry below is a live filter on the full library — jump straight to the corner of it you need.

Engine

Under the hood: a full vocal chain

Most tools stop at conversion. VoiceDub runs the whole chain a producer would.

SEPARATION

Vocals isolated from any mix.

Source separation pulls the performance out of a finished song, so you can convert tracks — not just dry vocal takes.

CONVERSION

A model per voice, not a filter.

Each library voice is an individually trained, creator-made model. Conversion reproduces tone and delivery instead of layering an effect.

PITCH / FORMANT

Melody preserved, timbre natural.

Covers keep the original melody note for note. Shift pitch in semitones for range mismatches and formants are handled for you.

MIXDOWN

Re-blended, not bolted on.

The converted vocal is mixed back over the original instrumental at render time, so the result sounds like a record, not a demo.

STEMS

Take the parts with you.

Download the finished mix — and on dubs made with your own cloned voice, grab the isolated converted vocal and keep full control of your own mixdown.

In the wild

What creators render here

Song covers

The classic. Any song, re-sung in the voice of your favorite character — melody, phrasing, and instrumental untouched. Most covers go from upload to download in under a minute, which is fast enough to try a song in three different voices and keep the one that lands.

Fan edits & shorts

Voice swaps for edits, trailers, and short-form video — rendered clean enough to sit in a final cut.

Parodies & mashups

Put the wrong singer on the right song. The fastest route from a dumb idea to a finished joke.

Demos & scratch vocals

Sketch a topline in your own voice, then render it in the voice the song is actually for.

Multilingual covers

Conversion follows the source vocal, so a voice can sing in languages its owner never recorded.

The ledger

Most converters stop where covers start

A typical converter

  • Raw converted vocal, mix it yourself
  • Dry vocal input only
  • One-size pitch shifting
  • Conversion output only
  • Desktop installs and GPU setup

VoiceDub

  • Finished cover, mixed over the instrumental
  • Full songs in — separation built into the chain
  • Pitch-aware conversion with natural formants
  • Finished mix, plus isolated vocals with your own clone
  • Runs in the browser, on anything

FAQ

AI voice covers, answered

What is an AI voice cover?

An AI voice cover is a song re-sung by a voice model. VoiceDub separates the vocal from your track, converts it with the voice you picked, and mixes it back over the original instrumental — so the melody, timing, and energy of the performance stay intact.

How long does a cover take to make?

Usually under a minute from upload to finished cover. Long tracks and busy mixes can take a little longer to separate, but the whole pipeline runs automatically — there is nothing to configure.

What can I use as input?

Upload an audio file, paste a link to a song, or type text. Audio inputs go through vocal separation first; text inputs are sung or spoken directly by the voice model.

Do covers keep the original melody and key?

Yes. Conversion is pitch-aware, so the melody is preserved note for note. If the target voice sits in a different range, you can shift pitch in semitones and the formants stay natural.

Can I use my own voice?

Yes — train a clone of your own voice from a few minutes of clean recordings, then use it on any track in the library like any other voice.

Which languages are covered?

The library spans English, Spanish, Korean, Japanese, French, Russian, Thai, and many others, and conversion itself is language-agnostic: the model reproduces whatever the source vocal sings.

Can I download stems?

For dubs made with your own cloned voice, yes. When one of those runs with separation, you can download the isolated converted vocal alongside the full mix — useful if you want to do your own mixdown.

Do I need to install anything?

No. Everything — browsing voices, uploading, rendering, previewing — runs in the browser on any device.

Ready when you are

Pick a voice. Press render.

The next cover in your head is about a minute away from being a file on your desktop.