llama
Links
Llama 3.3 is a text-only 70B instruction-tuned model that Meta says delivers performance approaching the 405B model at a fraction of the serving cost.
Meta introduces Llama 3.1, including a 405B model. Zuck restates their commitment to open source. Models are up on Hugging Face, with the 405B having a 200 GB+ VRAM requirement.
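A quick back-of-the-envelope check on that 200 GB+ figure. This is a sketch that only counts weight storage at different precisions; a real deployment also needs memory for the KV cache and activations, so actual requirements are higher:

```python
def weight_vram_gb(n_params: float, bytes_per_param: float) -> float:
    """Approximate GB needed just to hold the model weights."""
    return n_params * bytes_per_param / 1e9

N = 405e9  # Llama 3.1 405B parameter count

# fp16/bf16 is the native precision; int8 and 4-bit are common quantizations.
for name, bpp in [("fp16/bf16", 2.0), ("int8", 1.0), ("4-bit", 0.5)]:
    print(f"{name}: ~{weight_vram_gb(N, bpp):.0f} GB")
```

At full fp16 precision the weights alone come to ~810 GB; the 200 GB+ figure corresponds roughly to a 4-bit quantized copy of the weights.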
Mark Zuckerberg - Llama 3, $10B Models, Caesar Augustus, & 1 GW Datacenters - fascinating interview - the least robot-like I’ve ever seen Zuck, he’s getting that billion-dollar media training. Highlights include:
- They got the edge in the GPU race because in 2022 they realised they were short on GPUs for training their Reels recommendation system, so they purchased double what they needed.
- They foresee the bottleneck being energy production (not chips), both in the regulatory lead time and in the technology required to produce enough energy to power the chips.
- They have their own chips now, so they can lessen their reliance on more expensive Nvidia chips - they won't train Llama 4 with their own silicon but might train Llama 5.