self

joined 2 years ago
MODERATOR OF
[–] self@awful.systems 8 points 5 months ago* (last edited 5 months ago) (3 children)

Is the R1 model better than all existing models? Well, it benchmarks well. But everyone trains their models to the benchmarks hard. The benchmarks exist to create headlines about model improvements while everyone using the model still sees lying slop machines. No, no, sir, this is much finer slop, with a bouquet from the rotting carcass side of the garbage heap.

[…]

This crash doesn’t mean AI sucks now or that it’s good now. It just means OpenAI, and everyone else whose stock dipped, was just throwing money into a fire. But we knew that.

Slop generators are cheap now, and that’s a sea change — but the output is still terrible slop, just more of it.

this bares repeating. I’ve seen quite a few people declare that DeepSeek fixes all of the issues with LLMs as a technology, but that just isn’t true. a DeepSeek LLM is still an unreliable plagiarism machine with no known use case trained on massive amounts of stolen data, even if OpenAI and other American ghouls were the ones who did the theft in the first place.

there’s a small victory in that Altman and friends were exposed very publicly as lying grifters, and that’s worth celebrating. but it’s very important to not get swept up in a hype wave, especially one crafted by people who are much more competent at managing public opinion than Altman & co. from what I understand: no, this thing isn’t meaningfully open source. ~~no, you can’t run the good version at home.~~ sure, it performs great at the benchmarks we know were designed to be cheated. yeah, DeepSeek LLMs are probably still an environmental disaster for the same reason most supposedly more efficient blockchains are — perverse financial incentives across the entire industry.

but hey, good news for the boy genius Prompt Engineer at your company: he gets to requisition another top end gaming PC, absolutely drowning in RGB, to run ~~the shit version of~~ DeepSeek on. maybe in a couple months he can spin switching from OpenAI’s rentseeking to a DeepSeek LLM startup’s slightly cheaper rentseeking into a mild pay bump.

e: see david’s reply, I’m wrong about not being able to run the full version at home — but you need $6000 of fairly specific hardware and it’s molasses slow

[–] self@awful.systems 7 points 5 months ago

I love both the content of this post and the fact that it’s a self-contained torture test for our pict-rs upgrade

also, lol @ musk, war genius, starting a domestic dispute with his ex-girlfriend cause she dared to betray him in his baby mobile 4x game when betrayals are a core part of every 4x I know

I’m getting the strong mental image of musk being the guy who flips the board 12 hours into Twilight Imperium cause the other players didn’t let him win

[–] self@awful.systems 5 points 5 months ago

18 minutes isn’t even long

[–] self@awful.systems 8 points 5 months ago

from what I’ve been told, a digital nomad visa and EU citizenship by descent are a couple of routes worth looking into. I have frustratingly little detail on the expectations around the visa though, and citizenship by descent laws vary by country.

[–] self@awful.systems 13 points 5 months ago (1 children)

then I’d tell it to shove itself into a fucking locker, that’s what

[–] self@awful.systems 11 points 5 months ago* (last edited 5 months ago)

oh cool, the logo’s just a barely modified sparkle emoji so you know it’s horseshit, and it’s directly funded by Scale AI and a Rationalist thinktank so the chances the models weren’t directly trained on the problem set are vanishingly thin. this is just the FrontierMath grift with new, more dramatic, paint.

e: also, slightly different targeting — FrontierMath was looking to grift institutional dollars, I feel. this one’s designed to look good in a breathless thinkpiece about how, I dunno…

When A.I. Passes This Test, Look Out

yeah, whatever the fuck they think this means. this one’s designed to be talked about, to be brought up behind closed doors as a reason why your pay’s being cut. this is vile shit.

[–] self@awful.systems 8 points 5 months ago

gonna start referring to awful.systems like how a twitch streamer refers to chat

[–] self@awful.systems 9 points 5 months ago* (last edited 5 months ago) (1 children)

oh boy: https://social.wake.st/@liaizon/113868769104056845 iOS devices send the contents of Signal chats to Apple Intelligence by default

e: this fortunately doesn’t seem to be accurate; excuse my haste. here’s the word from the signal forums

[–] self@awful.systems 12 points 5 months ago (1 children)

Chuds keep posting pictures of Democratic Party politicians (particularly Kamala Harris) with their arm raised

of course they are. there’s no convincing these fuckers because they’re collaborators looking to strengthen the conviction of other collaborators by any inane means necessary.

[–] self@awful.systems 5 points 5 months ago (2 children)

do you figure it’s $1000/query because the algorithms they wrote with their insider knowledge to cheat the benchmark are very expensive to run, or is it $1000/query because they’re grifters and all high mode does is use the model trained on frontiermath and allocate more resources to the query? and like any good grifter, they’re targeting whales and institutional marks who are so invested that throwing away $1000 on horseshit feels like a bargain

[–] self@awful.systems 8 points 5 months ago (2 children)

it must be ethiccal, it has two dolphins

[–] self@awful.systems 13 points 5 months ago (7 children)

holy shit, that’s the excuse they’re going for? they cheated on a benchmark so hard the results are totally meaningless, sold their most expensive new models yet on the back of that cheated benchmark, further eroded the scientific process both with their cheating and by selling those models as better for scientific research… and these weird fucks want that to be fine and normal? fuck them

view more: ‹ prev next ›