This is what should've been in the description, imo
The new methods developed by DeepSeek (and published in its latest paper) could help overcome this issue. Instead of storing words as text tokens, its system packs written information into image form, almost as if it's taking a picture of pages from a book. This lets the model retain nearly the same information while using far fewer tokens, the researchers found.
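To get a feel for why that saves tokens, here's a rough back-of-the-envelope sketch (not DeepSeek's actual pipeline): it compares how many tokens a page of text costs when tokenized as subwords versus when rendered as an image and split into vision-encoder patches. The numbers used (characters per text token, patch size, image resolution) are illustrative assumptions, not figures from the paper.

```python
def text_token_estimate(text: str, chars_per_token: float = 4.0) -> int:
    """Crude subword-token estimate: roughly 4 characters per token."""
    return max(1, round(len(text) / chars_per_token))

def vision_token_count(width: int, height: int, patch: int = 16) -> int:
    """A ViT-style encoder turns an image into (W/patch) * (H/patch) patch tokens."""
    return (width // patch) * (height // patch)

page = "lorem ipsum " * 500  # about 6,000 characters of page text
print("text tokens  :", text_token_estimate(page))    # ~1,500 subword tokens
print("vision tokens:", vision_token_count(448, 448)) # 784 patches at 448x448
```

The point of the toy comparison: one rendered page produces a fixed, relatively small number of patch tokens regardless of how densely the text is packed, whereas text tokenization grows linearly with the number of characters.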
Besides using visual tokens instead of just text tokens, the model is built on a kind of tiered compression that is not unlike how human memories fade: older or less critical content is stored in a slightly blurrier form to save space. Even so, the paper's authors argue, this compressed content remains accessible in the background while keeping the system efficient.
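A minimal sketch of how that "fading memory" tiering could look in code, assuming older pages are simply re-rendered at lower resolution so they cost fewer vision tokens. The tier thresholds and resolutions below are made-up illustrations, not values from the paper.

```python
from dataclasses import dataclass

@dataclass
class StoredPage:
    age: int          # how many turns ago this page entered the context
    resolution: int   # side length of the rendered page image, in pixels

def tier_resolution(age: int) -> int:
    """Older content gets a blurrier rendering (illustrative thresholds)."""
    if age < 10:
        return 1024   # recent: full detail
    if age < 50:
        return 512    # mid-term: half resolution
    return 256        # old: heavily compressed but still readable

def vision_tokens(resolution: int, patch: int = 16) -> int:
    """Patch-token cost of a square page image at the given resolution."""
    return (resolution // patch) ** 2

memory = [StoredPage(age=a, resolution=tier_resolution(a)) for a in (1, 20, 80)]
for page in memory:
    print(f"age={page.age:>3}  res={page.resolution}  tokens={vision_tokens(page.resolution)}")
```

Running it shows the token budget dropping from 4,096 patches for a recent page to 256 for an old one, which is the basic trade the article describes: less fidelity for older material in exchange for a much longer effective context.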