this post was submitted on 26 Dec 2024

Absolutely humongous model: a mixture of 256 experts, with 8 activated per token.
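For anyone wondering what "8 of 256 activated" means in practice, here is a rough sketch of generic top-k expert routing. It is not DeepSeek's actual router: the softmax weighting, the `route` helper, and the random scores standing in for router logits are all assumptions for illustration. Only the 256 and 8 come from the post.

```python
import math
import random

NUM_EXPERTS = 256  # expert count quoted in the post
TOP_K = 8          # experts activated per token, as quoted in the post


def route(scores, k=TOP_K):
    """Hypothetical helper: pick the k highest-scoring experts and
    softmax-normalize their scores into mixing weights."""
    # Indices of the k largest scores, best first.
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    # Subtract the max before exponentiating for numerical stability.
    peak = max(scores[i] for i in top)
    exps = [math.exp(scores[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


random.seed(0)
# Random stand-in for the router logits of a single token (assumption).
scores = [random.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]
chosen = route(scores)
print(len(chosen), "of", NUM_EXPERTS, "experts active for this token")
```

The point is that only the chosen experts' feed-forward blocks run for a given token, so compute per token is a small fraction of the full parameter count, even though all the weights still have to sit in memory somewhere.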

Aider leaderboard: The only model above 🐋 v3 here is ~~Open~~AI o1. DeepSeek is known to make amazing models and Aider rotates their benchmark over time, so it is unlikely that this is a train-on-benchmark situation.

Some more benchmarks: on Reddit.

[–] [email protected] 5 points 3 months ago* (last edited 3 months ago) (1 children)

For the user whose VRAM knob goes to 11

[–] [email protected] 3 points 3 months ago

Someone managed to run it on a cluster of Mac Minis lol https://blog.exolabs.net/day-2/