this post was submitted on 16 Nov 2025
407 points (98.6% liked)

Entertainment

Disney Channel animator Dana Terrace did not hold back in her social media response to CEO Bob Iger's AI announcement, encouraging fans to 'unsubscribe' from Disney+

[–] Binturong@lemmy.ca 1 points 1 day ago (1 children)

In that case, yeah I fully agree, and that's an interesting argument. Like with Bezos' new AI initiative, Amazon would have an immense pool of their own data to pull from, and Disney certainly owns a hell of a lot of properties. I do think it's naive to assume that's what's going on, and that Disney wouldn't be doing what every other multinational corporation engaging in AI training is doing, which is scraping any and all datasets they can get access to regardless of propriety, since arguably ALL data is useful. Could be I'm just cynical, but the fastest, laziest profit turns out to be plan A in almost every case these days.

[–] brucethemoose@lemmy.world 2 points 1 day ago* (last edited 1 day ago) (1 children)

"Disney wouldn’t be doing what every other multinational corporation engaging in AI training is doing, which is scraping any and all datasets they can get access to regardless of propriety, since arguably ALL data is useful."

There are actually very few 'big' model trainers, or at least trainers worth anything.

OpenAI, Anthropic, xAI, and Google (and formerly Meta) are the big names to investors. You have Mistral in the EU, LG in Korea, the 'Chinese Dragons' like Alibaba and DeepSeek, a few enterprise niches like Palantir, Cohere, or AI21, Perplexity and such for search, and...

That's it, mostly?

The vast, vast majority of corporations don't even finetune. They just use APIs of others and say they're making 'AI.' And you do have a few niches pursuing, say, TTS or imagegen, but the training sets for that are much more specialized.

...And actually, a lot of research and 'new' LLMs are largely trained on mixes of public datasets (so no need to scrape), synthetically generated data, outputs of other LLMs, and/or more specifically formatted stuff. Take this one, which uses 5.5T completely synthetic tokens:

https://old.reddit.com/r/LocalLLaMA/comments/1p20zry/gigachat3702ba36bpreview/

That, and rumor on the street is the Chinese govt provides the Chinese trainers with a lot of data (since their outputs/quirks are so suspiciously similar).


Hence, 'scraping the internet' is not actually the trend folks think it is. On the contrary, Meta seems to have refuted the 'quantity over quality' data approach with how hard their Llama 4 models flopped vs. how well DeepSeek did. It's not very efficient, training models is generally not profitable, and it's done less than you think.


Point I'm making, along with just dumping my thinking, is that Disney is a special case.

Their focus is narrow: they want to generate TikTok-style images/videos of their characters, and only their characters. Not code, not long chats, not spam articles, just that. They have no financial incentive to 'scrape all the internet' beyond the excellent archives that already exist; the only temptation is the 'quick and dirty' solution of using Sora instead of properly making something themselves.

[–] Binturong@lemmy.ca 2 points 1 day ago

I appreciate the well thought out response.