this post was submitted on 04 Nov 2025
13 points (100.0% liked)

AI - Artificial intelligence

162 readers
6 users here now

AI related news and articles.

Rules:

founded 6 months ago
MODERATORS
 

An AI model released by the Chinese AI company DeepSeek uses new techniques that could significantly improve AI’s ability to “remember.”

Released last week, the optical character recognition (OCR) model works by extracting text from an image and turning it into machine-readable words. This is the same technology that powers scanner apps, translation of text in photos, and many accessibility tools.

top 2 comments
sorted by: hot top controversial new old
[–] sukhmel@programming.dev 5 points 2 weeks ago

This is what should've been in the description, imo

The new methods developed by DeepSeek (and published in its latest paper) could help to overcome this issue. Instead of storing words as tokens, its system packs written information into image form, almost as if it’s taking a picture of pages from a book. This allows the model to retain nearly the same information while using far fewer tokens, the researchers found.

Besides using visual tokens instead of just text tokens, the model is built on a type of tiered compression that is not unlike how human memories fade: Older or less critical content is stored in a slightly more blurry form in order to save space. Despite that, the paper’s authors argue, this compressed content can still remain accessible in the background while maintaining a high level of system efficiency.

[–] Oka@sopuli.xyz 0 points 2 weeks ago

That doesn't seem possible