It looks, to me, like you're reading the briefing without understanding how the legal system functions. You're making some incredibly basic mistakes. Copyright violations and theft are two distinct legal concepts, for example. You're treating the case summary as if it were the legal argument in the brief and you're misinterpreting some pretty clear legal language written by the judge.
No, that is not their argument.
Their legal argument, in the appeal of the class certification, is that the judge did not apply the required analysis in order to certify the three plaintiffs as being part of a class. He instead relied on his intuition, not any discovered facts or evidence. This isn't allowed when analyzing a case for class certification.
In addition, Anthropic adds, it is well supported in case law (cited in the motion) that copyright claims are a bad fit for class action.
This is because copyright law focuses on individual works. Each work has to be examined for its eligibility for copyright protection, for the plaintiff's standing, and for whether, and to what extent, the defendant is responsible for infringing it.
This can be done when 3 people claim a copyright violation, because they have a limited set of work which a court can reasonably examine.
A class action would require a court to consider hundreds or thousands of claimants and millions of individual works, each of which can be challenged individually by the defendant.
Courts typically don't like to take on cases that can require millions of briefings, hearings and rulings. Because of this, courts almost always deny class action certification for copyright violations.
The court, in its order, did not address this or apply any of the required analysis. The class was certified based on vibes, something that doesn't follow clearly established case law.
This is because training an LLM results in a language model.
A language model is in no way similar to a book and so training one is a transformative use of copyrighted material and protected under fair use.
No, the judge didn't make any claim about the model's output after training. That isn't an issue that's being addressed in this case. You're misunderstanding how judges address issues in writing.
Here, the judge is addressing a very narrow issue, specifically the exact claim made by the plaintiff (training with copyrighted material = copyright violation).
The subject of the paragraph is concerned with training the LLM. The claim by the plaintiff is that using copyrighted works to train LLMs is a violation of copyright. That's what the judge is addressing.
The judge dismissed this argument because training the LLM was a transformative use and so protected by fair use.
The judge further noted that the plaintiffs did not show that training the LLM resulted in "any exact copies nor even infringing knockoffs of their works being provided to the public", and that if they could show this, they could bring a case in the future. This is the judge hinting that they can amend their filings in this case to clarify their argument, if they have any evidence to support their claim.
The judge is telling the plaintiffs that in order to succeed in their claim, which is that training an LLM on their work is a violation of their copyright, they need to show that the training resulted in infringing copies or knockoffs.
The training resulted in a model. Creating a model is transformative (a model and a book are two completely different things), and the plaintiffs didn't show that any infringing works were produced by the training. They therefore have no way of succeeding with their argument that training the model violated their rights.
You're reading a lot into that statement that isn't there. The plaintiffs never made a claim about the output of a trained model, and so that argument wasn't examined by the judge.