this post was submitted on 07 May 2026
240 points (85.7% liked)
Technology
84502 readers
3758 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related news or articles.
- Be excellent to each other!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
- Check for duplicates before posting, duplicates may be removed
- Accounts 7 days and younger will have their posts automatically removed.
Approved Bots
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
The picture you post contradict your claims. The 2 groups are getting the same question, but one has AI assistance, the other has not.
Again you fail to show anything to support your claims.
No, what they meant is: The control group had 12 questions to get into the flow of solving math problems and then solved three more math problems for good measure.
The AI group on the other hand got into the flow of formulating math problems to ChatGPT and then had to actually solve three math problems themselves
Their critique is, that solving math problems yourself and prompting ChatGPT to solve math problems are not necessarily comparable tasks and require different skill sets so disabling AI after 12 tasks meant the first group had to switch context and therefore had worse performance.
If you want to analyze the first groups general ability of problem solving you should give them again twelve tasks after disabling AI so they get used to this new type of task (solving math problems yourself vs. prompting math problems to the AI) before measuring their performance.
That's what the friggin test is about! So of course they did.
I also wrote text.
If you're just going to cherry pick a single point and dismiss everything else then we're done here.
Maybe they're unable to switch contexts
I hear that can cause a loss of performance.