this post was submitted on 26 Apr 2025
698 points (97.9% liked)
Microblog Memes
But don't LLMs skip the actual math and just look at how often tokens show up next to each other? I don't think it's doing any real prime-number math over there.
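For what it's worth, the "how often tokens show up next to each other" picture can be sketched as a toy bigram counter. This is only an illustration of that simplified view, not how a real LLM works; the function names `train_bigrams` and `predict_next` and the tiny corpus are made up for the example.

```python
from collections import Counter, defaultdict

def train_bigrams(tokens):
    """Count, for each token, which tokens follow it and how often."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts

def predict_next(counts, token):
    """Return the most frequent follower of `token`, or None if unseen."""
    followers = counts.get(token)
    if not followers:
        return None
    return followers.most_common(1)[0][0]

# Made-up toy corpus: "is" is followed by "prime" twice and "odd" once.
corpus = "7 is prime . 9 is odd . 11 is prime .".split()
model = train_bigrams(corpus)

print(predict_next(model, "is"))   # most common follower of "is"
print(predict_next(model, "13"))   # never seen, so no prediction
```

A counter like this never checks divisibility at all; it only echoes what tended to follow a token in its training text, which is the worry being raised here.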
If I fed it a big enough number, it would report back that a particular Python math library had failed to complete the task, so it must be both neural-netting its answer AND crunching the numbers with SymPy on its big supercomputer.
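For contrast, here is roughly what "crunching the numbers" looks like when actual code does it. This is a minimal sketch using plain trial division from the standard library, not whatever the service really runs; SymPy's `isprime` does the same job with much faster algorithms. The helper name `is_prime` is just for illustration.

```python
import math

def is_prime(n: int) -> bool:
    """Deterministic trial division: exact, but slow for very large n."""
    if n < 2:
        return False
    if n < 4:
        return True  # 2 and 3 are prime
    if n % 2 == 0:
        return False
    # Only odd divisors up to the integer square root need checking.
    for d in range(3, math.isqrt(n) + 1, 2):
        if n % d == 0:
            return False
    return True

print(is_prime(97))  # 97 has no divisor between 2 and 9
print(is_prime(91))  # 91 = 7 * 13
```

The running time grows with the square root of the input, which fits the observation that a big enough number makes the library give up rather than return a wrong guess.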
Is it running arbitrary Python code server side? That sounds like an attack vector. Maybe they constrained it to run only some trusted libraries in specific ways or something.
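One way the "only trusted libraries in specific ways" idea could look is an allow-list: the model picks a named operation and arguments, and nothing else ever gets executed. This is purely a guess at the general shape, not a description of any real service; `ALLOWED` and `run_tool` are hypothetical names.

```python
import math

# Hypothetical allow-list: the only operations the tool will ever run.
ALLOWED = {
    "gcd": math.gcd,
    "isqrt": math.isqrt,
    "factorial": math.factorial,
}

def run_tool(name: str, *args: int) -> int:
    """Run an allow-listed function on plain integers; reject anything else."""
    if name not in ALLOWED:
        raise PermissionError(f"tool {name!r} is not on the allow-list")
    if not all(isinstance(a, int) and not isinstance(a, bool) for a in args):
        raise TypeError("only plain integers are accepted")
    return ALLOWED[name](*args)

print(run_tool("gcd", 12, 18))

try:
    run_tool("eval", 1)
except PermissionError as exc:
    print(exc)
```

Since no model-written source code is ever evaluated, there is far less room for a cleverly worded prompt to smuggle something in, though a real deployment would still want resource limits and process isolation on top.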
Given the track record of these things, I would not be surprised if you just had to finagle the prompt just right to slip through the cracks now and then and pull off some ACE (arbitrary code execution).
They do math, just in a very weird (and obviously not super reliable) way. There is a recent paper by Anthropic that explains it; I can track it down if you're interested.
Broadly speaking, the weights in a model form "circuits" of sorts that can perform certain tasks. On something hard like factoring numbers, the performance is probably abysmal, but I'd guess the model is still trying to approximate the task somehow.