AI - Artificial intelligence

26

2

The Learning Loop and LLMs (martinfowler.com)

submitted 2 weeks ago by codeinabox@programming.dev to c/Aii@programming.dev

0 comments

27

2

Findings from DX’s 2025 report: AI won’t save you from your engineering culture (blog.robbowley.net)

submitted 2 weeks ago by codeinabox@programming.dev to c/Aii@programming.dev

0 comments

The two key points:

Meetings, interruptions, review delays, and slow CI pipelines cost more than AI saves. Individual productivity tools can’t fix organisational dysfunction.
AI amplifies existing engineering culture. Strong quality practices get faster. Weak practices accumulate debt faster.

28

3

Language models cannot reliably distinguish belief from knowledge and fact (www.nature.com)

submitted 2 weeks ago by cm0002@lemmy.zip to c/Aii@programming.dev

0 comments

Abstract

As language models (LMs) increasingly infiltrate into high-stakes domains such as law, medicine, journalism and science, their ability to distinguish belief from knowledge, and fact from fiction, becomes imperative. Failure to make such distinctions can mislead diagnoses, distort judicial judgments and amplify misinformation. Here we evaluate 24 cutting-edge LMs using a new KaBLE benchmark of 13,000 questions across 13 epistemic tasks. Our findings reveal crucial limitations. In particular, all models tested systematically fail to acknowledge first-person false beliefs, with GPT-4o dropping from 98.2% to 64.4% accuracy and DeepSeek R1 plummeting from over 90% to 14.4%. Further, models process third-person false beliefs with substantially higher accuracy (95% for newer models; 79% for older ones) than first-person false beliefs (62.6% for newer; 52.5% for older), revealing a troubling attribution bias. We also find that, while recent models show competence in recursive knowledge tasks, they still rely on inconsistent reasoning strategies, suggesting superficial pattern matching rather than robust epistemic understanding. Most models lack a robust understanding of the factive nature of knowledge, that knowledge inherently requires truth. These limitations necessitate urgent improvements before deploying LMs in high-stakes domains where epistemic distinctions are crucial.

29

13

DeepSeek may have found a new way to improve AI’s ability to remember (www.technologyreview.com)

submitted 2 weeks ago by cm0002@lemmy.zip to c/Aii@programming.dev

2 comments

An AI model released by the Chinese AI company DeepSeek uses new techniques that could significantly improve AI’s ability to “remember.”

Released last week, the optical character recognition (OCR) model works by extracting text from an image and turning it into machine-readable words. This is the same technology that powers scanner apps, translation of text in photos, and many accessibility tools.

30

3

In a First, AI Models Analyze Language As Well As a Human Expert (www.quantamagazine.org)

submitted 2 weeks ago by cm0002@lemmy.zip to c/Aii@programming.dev

0 comments

Among the myriad abilities that humans possess, which ones are uniquely human? Language has been a top candidate at least since Aristotle, who wrote that humanity was “the animal that has language.” Even as large language models such as ChatGPT superficially replicate ordinary speech, researchers want to know if there are specific aspects of human language that simply have no parallels in the communication systems of other animals or artificially intelligent devices.

In particular, researchers have been exploring the extent to which language models can reason about language itself. For some in the linguistic community, language models not only don’t have reasoning abilities, they can’t. This view was summed up by Noam Chomsky, a prominent linguist, and two co-authors in 2023, when they wrote in The New York Times that “the correct explanations of language are complicated and cannot be learned just by marinating in big data.” AI models may be adept at using language, these researchers argued, but they’re not capable of analyzing language in a sophisticated way.

31

3

TSU 101: An Entirely New Type of Computing Hardware to run AI models (extropic.ai)

submitted 2 weeks ago by cm0002@infosec.pub to c/Aii@programming.dev

0 comments

32

3

On Bullshit and Generative AI (adamcoster.com)

submitted 3 weeks ago by cm0002@infosec.pub to c/Aii@programming.dev

0 comments

33

5

LLMs Will Always Hallucinate (arxiv.org)

submitted 3 weeks ago by cm0002@infosec.pub to c/Aii@programming.dev

2 comments

34

6

Agentic AI and Security (martinfowler.com)

submitted 3 weeks ago by codeinabox@programming.dev to c/Aii@programming.dev

0 comments

35

2

Small language models: Why the future of AI agents might be tiny (blog.logrocket.com)

submitted 3 weeks ago by codeinabox@programming.dev to c/Aii@programming.dev

3 comments

36

7

Making The Smallest And Dumbest LLM With Extreme Quantization (hackaday.com)

submitted 3 weeks ago by cm0002@lemmy.zip to c/Aii@programming.dev

0 comments

Large language models are called ‘large’ not because of how smart they are, but because of their sheer size in bytes. With billions of parameters at four bytes each, they pose a serious challenge not just for disk space but also for RAM, specifically the RAM of your video card (VRAM). Reducing this immense size, as is done routinely for the smaller pretrained models which one can download for local use, involves quantization. This process is explained and demonstrated by [Codeically], who takes it to its logical extreme: reducing what could be a GB-sized model down to a mere 63 MB by reducing the bits per parameter.
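
To make the arithmetic in the excerpt above concrete, here is a minimal Python sketch. It is not [Codeically]'s method, which the excerpt does not detail. The helper names (`model_size_bytes`, `quantize`, `dequantize`), the symmetric uniform scheme, and the one-billion-parameter figure are all assumptions chosen for illustration.

```python
from typing import List, Tuple


def model_size_bytes(num_params: int, bits_per_param: float) -> float:
    """Storage needed for the weights alone, ignoring any metadata."""
    return num_params * bits_per_param / 8


def quantize(weights: List[float], bits: int) -> Tuple[List[int], float]:
    """Symmetric uniform quantization (an assumed, simple scheme).

    Maps each float to a signed integer in [-qmax, qmax] and returns
    the integers together with the scale needed to map them back.
    """
    if bits < 2:
        raise ValueError("this sketch needs at least 2 bits per value")
    qmax = 2 ** (bits - 1) - 1
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    quantized = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize(quantized: List[int], scale: float) -> List[float]:
    """Approximate reconstruction of the original floats."""
    return [q * scale for q in quantized]


if __name__ == "__main__":
    # Hypothetical model: one billion parameters (not a figure from the article).
    params = 1_000_000_000
    for bits in (32, 16, 8, 4, 2):
        gigabytes = model_size_bytes(params, bits) / 1e9
        print(f"{bits:>2} bits per parameter: {gigabytes:.2f} GB")

    # Round-trip a few made-up weights at 4 bits to see the precision loss.
    weights = [0.82, -0.31, 0.05, -1.0, 0.47]
    q, scale = quantize(weights, bits=4)
    print(q, [round(w, 3) for w in dequantize(q, scale)])
```

Under these assumptions, the hypothetical model shrinks from 4 GB at 32 bits per parameter to 0.5 GB at 4 bits. The price is precision: each reconstructed weight can differ from the original by up to half of `scale`.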

37

9

LLMs aren't databases (robingower.com)

submitted 3 weeks ago by codeinabox@programming.dev to c/Aii@programming.dev

0 comments

38

2

Foxconn Approves NT$42 Billion Investment to Boost AI and Supercomputing Capabilities (www.econotimes.com)

submitted 3 weeks ago by cm0002@lemmy.zip to c/Aii@programming.dev

1 comment

39

12