Description
We are excited to present MPT-7B, the newest addition to the MosaicML Foundation Series. MPT-7B is a transformer trained from scratch on 1 trillion tokens of text and code. It is open source, licensed for commercial use, and matches the quality of LLaMA-7B. Training took 9.5 days on the MosaicML platform with no human intervention, at a cost of approximately $200,000.
With MPT-7B, you can train, fine-tune, and deploy your own MPT models, starting either from one of our checkpoints or from scratch. Alongside the base MPT-7B, we are also releasing three fine-tuned variants: MPT-7B-Instruct, MPT-7B-Chat, and MPT-7B-StoryWriter-65k+, the last of which supports a context length of 65,000 tokens for long-form generation. Developers and researchers can build on any of these models in their own projects.
Description
StarCoder and StarCoderBase are Large Language Models for code, trained on permissively licensed data from GitHub that covers more than 80 programming languages, Git commits, GitHub issues, and Jupyter notebooks. Similar to LLaMA, we trained a model with approximately 15 billion parameters on 1 trillion tokens. We then fine-tuned StarCoderBase on 35 billion Python tokens, producing the model we call StarCoder.
In our evaluations, StarCoderBase outperforms existing open Code LLMs on popular programming benchmarks and matches or exceeds closed models such as code-cushman-001 from OpenAI, the original Codex model that powered early versions of GitHub Copilot. With a context length of more than 8,000 tokens, the StarCoder models can process more input than any other open LLM, which enables a wide range of new applications. For example, by prompting the StarCoder models with a series of dialogues, we can have them act as technical assistants for a variety of programming tasks.
API Access
Has API
Integrations
Axolotl
ChatGPT
CodeQwen
Git
GitHub
LM Studio
MosaicML
OpenAI
Python
Tabby
Pricing Details
Free
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name: MosaicML
Founded: 2021
Country: United States
Website: www.mosaicml.com/blog/mpt-7b
Vendor Details
Company Name: BigCode
Founded: 2023
Website: huggingface.co/blog/starcoder