Today's AI/ML headlines are brought to you by ThreatPerspective

DailyAI

Meta releases Llama 3.1 models, sticks with open strategy


Meta has released its upgraded Llama 3.1 models in 8B, 70B, and 405B versions and committed to Mark Zuckerberg’s open source vision for the future of AI. The new additions to Meta’s Llama family of models come with an expanded context length of 128k and support across eight languages. Meta says its highly anticipated 405B model demonstrates “unmatched flexibility, control, and state-of-the-art capabilities that rival the best closed source models.” It also claims that Llama 3.1 405B is the “the world’s largest and most capable openly available foundation model.” With eye-watering computing costs being spent to train ever-larger models, there

The post Meta releases Llama 3.1 models, sticks with open strategy appeared first on DailyAI.

Meta has released its upgraded Llama 3.1 models in 8B, 70B, and 405B versions and committed to Mark Zuckerberg’s open source vision for the future of AI.

The new additions to Meta’s Llama family of models come with an expanded context length of 128k and support across eight languages.

Meta says its highly anticipated 405B model demonstrates “unmatched flexibility, control, and state-of-the-art capabilities that rival the best closed source models.” It also claims that Llama 3.1 405B is the “the world’s largest and most capable openly available foundation model.”

With eye-watering computing costs being spent to train ever-larger models, there was a lot of speculation that Meta’s flagship 405B model could be its first paid model.

Llama 3.1 405B was trained on over 15 trillion tokens using 16,000 NVIDIA H100s, likely costing hundreds of millions of dollars.

In a blog post, Meta CEO Mark Zuckerberg reaffirmed the company’s view that open source AI is the way forward and that the release of Llama 3.1 is the next step “towards open source AI becoming the industry standard.”

The Llama 3.1 models are free to download and modify or fine-tune with a suite of services from Amazon, Databricks, and NVIDIA.

The models are also available on cloud service providers including AWS, Azure, Google, Oracle.




Published: 2024-07-24T09:42:00











© Digital Event Horizon . All rights reserved.

Privacy | Terms of Use | Contact Us