Description
In honor of Archimedes, whose 2311th anniversary we celebrate this year, we are introducing our first Mathstral model, a specialized 7B model designed for mathematical reasoning and scientific discovery. The model has a 32k context window and is released under the Apache 2.0 license. We are contributing Mathstral to the scientific community to support work on advanced mathematical problems that require complex, multi-step logical reasoning. The release is part of our broader effort to support academic projects and was produced in collaboration with Project Numina. Much as Isaac Newton stood on the shoulders of giants, Mathstral builds on Mistral 7B and specializes in STEM subjects. It delivers top-tier reasoning performance for its size category on various industry-standard benchmarks, scoring 56.6% on MATH and 63.47% on MMLU. A subject-by-subject comparison with its predecessor, Mistral 7B, highlights the gains in mathematical modeling. This initiative aims to foster innovation and collaboration within the mathematical community.
Description
Smaug-72B is a formidable open-source large language model (LLM) distinguished by several prominent features:
Exceptional Performance: At the time of writing, it ranks first on the Hugging Face Open LLM Leaderboard and outperforms models such as GPT-3.5 in multiple evaluations, reflecting strong language understanding and fluent, human-like text generation.
Open Source Availability: In contrast to many high-end LLMs, Smaug-72B is accessible to everyone for use and modification, which encourages cooperation and innovation within the AI ecosystem.
Emphasis on Reasoning and Mathematics: This model excels particularly in reasoning and mathematical challenges, a capability attributed to specialized fine-tuning methods developed by its creators, Abacus AI.
Derived from Qwen-72B: It is a fine-tuned version of Qwen-72B, a robust LLM released by Alibaba, refined to improve on the base model's overall performance.
In summary, Smaug-72B is a notable advance in open-source AI and a valuable resource for developers and researchers alike.
API Access
Has API
API Access
Has API
Integrations
1min.AI
Amazon Bedrock
DataChain
Deep Infra
Graydient AI
Hugging Face
HumanLayer
Keywords AI
Kiin
LM-Kit.NET
Integrations
1min.AI
Amazon Bedrock
DataChain
Deep Infra
Graydient AI
Hugging Face
HumanLayer
Keywords AI
Kiin
LM-Kit.NET
Pricing Details
Free
Free Trial
Free Version
Pricing Details
Free
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name
Mistral AI
Founded
2023
Country
France
Website
mistral.ai/news/mathstral/
Vendor Details
Company Name
Abacus AI
Founded
2019
Country
United States
Website
huggingface.co/abacusai/Smaug-72B-v0.1