this post was submitted on 01 Jul 2025
1031 points (98.3% liked)

memes

15965 readers
1987 users here now

Community rules

1. Be civilNo trolling, bigotry or other insulting / annoying behaviour

2. No politicsThis is non-politics community. For political memes please go to !politicalmemes@lemmy.world

3. No recent repostsCheck for reposts when posting a meme, you can only repost after 1 month

4. No botsNo bots without the express approval of the mods or the admins

5. No Spam/AdsNo advertisements or spam. This is an instance rule and the only way to live.

A collection of some classic Lemmy memes for your enjoyment

Sister communities

founded 2 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] SinAdjetivos@lemmy.world 7 points 1 day ago (1 children)

A single voice actor couldn't produce enough lines to fully train an AI model...

The model is trained on a massive corpus of existing data and then fine tuned to match the target voice actor. Using less than ~30s of reference audio you can get a pretty decent fine tuning the main issue is that it currently isn't on par with the quality and consistency of an in studio voice actor, especially over long time domains.

[–] XM34@feddit.org 0 points 1 day ago (1 children)

Hence my usage of the words "fully train". The other commentor wants to license every piece of audio used in training the model which obviously includes the base model...

[–] SinAdjetivos@lemmy.world 3 points 1 day ago

You can feed an infinite amount of data into existing models and it won't improve the issues. The problem is with the models themselves.

And the audio used to train the base model are licensed. Usually under an MIT, creative commons, etc. license.