Microblog Memes

7664 readers

3405 users here now

A place to share screenshots of Microblog posts, whether from Mastodon, tumblr, ~~Twitter~~ X, KBin, Threads or elsewhere.

Created as an evolution of White People Twitter and other tweet-capture subreddits.

Rules:

Please put at least one word relevant to the post in the post title.
Be nice.
No advertising, brand promotion or guerilla marketing.
Posters are encouraged to link to the toot or tweet etc in the description of posts.

Related communities:

founded 2 years ago

MODERATORS

ReadyUser31@lemmy.world

aeronmelon@lemmy.world

needanke@feddit.org

1709

firefox also isn't immune (lemmy.blahaj.zone)

submitted 1 day ago by not_IO@lemmy.blahaj.zone to c/microblogmemes@lemmy.world

147 comments fedilink hide all child comments

https://mastodon.social/@gwynnion/114541537909461004

you are viewing a single comment's thread
view the rest of the comments

[–] cm0002@lemmy.world 25 points 1 day ago (3 children)

I wouldn't mind a decent LOCAL open source AI helping

[–] TheTechnician27@lemmy.world 13 points 23 hours ago* (last edited 23 hours ago) (2 children)

Large X models lack a crucial component of "open-source". Freely redistributable and modifiable for any purpose, sure, but there's no chance in hell of auditing one, let alone if the training data is kept a secret. It's literally impossible; human beings cannot look at a trillion weights and biases representing a single highly chaotic, unfathomably complex nonlinear function whose input and output space are the totality of human language/images/etc. and say "yup, looks good to me." Deep learning models – contrasted with traditional machine learning models – learn their own features which almost 100% of the time would be nonsense to a human. You just have a blob of shareware when you run DeepSeek.

(They also just outright steal from billions of copyright-protected sources to create it, so calling it "open-source" is pretty funny.)

[–] cm0002@lemmy.world 7 points 23 hours ago

Auditing for bias purposes, yea true. But my primary concern is it having the capability to "phone home" which you don't really need to audit the model itself to be able to detect or prevent

[–] brucethemoose@lemmy.world 1 points 20 hours ago* (last edited 20 hours ago)

There are a few that are "truly" open like IBM Granite, and a handful of others over the 7B range.

[–] Areldyb@lemmy.world 6 points 22 hours ago (1 children)

Firefox can use a local llamafile model, but you have to enable it in about:config first.

[–] gamermanh@lemmy.dbzer0.com 1 points 17 hours ago

Honestly it's easier to find an addon that'll hook to ollama instead, fire fox's inbuilt support is shit

[–] thatKamGuy@sh.itjust.works 7 points 1 day ago (2 children)

DeepSeek’s model is open-sourced and can be run locally; though I think there some bits related to its training data they have been kept obscured (if I remember correctly) - likely due to the dubious nature of how it was acquired.

[–] rImITywR@lemmy.world 10 points 1 day ago

Unless training data is made available, a model is not open source. DeepSeek is better described as "open weight".

[–] brucethemoose@lemmy.world 2 points 20 hours ago* (last edited 20 hours ago)

some bits related to its training data

AKA ANY details about its training data, and its training hyperparameters, and literally any other details about its training. An 'open' secret among LLM tinkerers is that the Chinese companies seem to have particularly strong English/Chinese training data (not so much other languages though), and I'll give you one guess on how.

Deepseek is unusal in that they are open sourcing the general techniques they used and even some (not all) of the software frameworks they use.

Don't get me wrong, I think any level of openness should be encouraged (unlike OpenAI being as closed as physically possible), but they are still very closed. Unlike, say, IBM Granite models which should be reproducible.