this post was submitted on 14 Jun 2026
820 points (99.5% liked)

People Twitter

10072 readers
950 users here now

People tweeting stuff. We allow tweets from anyone.

RULES:

  1. Mark NSFW content.
  2. No doxxing people.
  3. Must be a pic of the tweet or similar. No direct links to the tweet.
  4. No bullying or international politcs
  5. Be excellent to each other.
  6. Provide an archived link to the tweet (or similar) being shown if it's a major figure or a politician. Archive.is the best way.

founded 3 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] errer@lemmy.world 10 points 22 hours ago (3 children)

I mean the poster above you is wrong, they use math tools internally now when you ask math questions. Very obvious in Gemini. Yes the raw LLM trying to autocomplete the answer to a math problem is gonna be wrong but that’s not the way they are used to solve problems like that anymore.

[–] sbv@sh.itjust.works 7 points 22 hours ago

The LLM has to choose to use the calculating tools. Gemini tried to do this one solo:

4 + 2 + 2 + 2 + 1+ 2 + 0 = 15

Tbf, it did four of these calculations, and 75% were correct.

[–] baines@lemmy.cafe 5 points 20 hours ago

no way i’d want to drive on a bridge built on their supposed math

[–] wonderingwanderer@sopuli.xyz 2 points 22 hours ago

That makes sense. I clearly don't keep up on the frontier models...