MPT-7B Description

We are excited to present MPT-7B, the newest addition to the MosaicML Foundation Series. MPT-7B is a transformer model trained from scratch on 1 trillion tokens of text and code. It is open source, licensed for commercial use, and matches the quality of LLaMA-7B. Training took 9.5 days on the MosaicML platform with no human intervention, at a cost of approximately $200,000.
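To put the training figures above in perspective, the short sketch below derives a few rough averages from them. It uses only the three numbers quoted in the description (1 trillion tokens, 9.5 days, about $200,000). It assumes the whole 9.5 days was spent processing tokens and that the cost was spread evenly over the run, so the results are back-of-the-envelope estimates rather than measured values. The variable names are illustrative.

```python
# Figures quoted in the description (the cost is approximate).
TOKENS = 1_000_000_000_000   # 1 trillion training tokens
DAYS = 9.5                   # wall-clock training time
COST_USD = 200_000           # approximate total cost

# Assumption: the full run was spent processing tokens at a steady rate.
hours = DAYS * 24
seconds = hours * 3600

tokens_per_second = TOKENS / seconds             # average throughput of the whole cluster
cost_per_hour = COST_USD / hours                 # average spend per wall-clock hour
cost_per_billion_tokens = COST_USD / (TOKENS / 1e9)

print(f"Training time: {hours:.0f} hours")
print(f"Average throughput: {tokens_per_second:,.0f} tokens/second")
print(f"Average cost: ${cost_per_hour:,.0f} per hour")
print(f"Cost per billion tokens: ${cost_per_billion_tokens:,.0f}")
```

Under these assumptions the run works out to about 228 hours, roughly 1.2 million tokens per second across the whole cluster, and around $200 per billion tokens.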

With MPT-7B, you can train, fine-tune, and deploy your own MPT models, starting either from one of our checkpoints or from scratch. Alongside the base MPT-7B, we are also releasing three fine-tuned variants: MPT-7B-Instruct, MPT-7B-Chat, and MPT-7B-StoryWriter-65k+, the last of which supports a context length of 65,000 tokens for long-form generation. Developers and researchers can use any of these models as a starting point for their own projects.

Pricing

Pricing Starts At: Free
Pricing Information: Open source
Free Version: Yes

Company Details

Company: MosaicML
Year Founded: 2021
Headquarters: United States
Website: www.mosaicml.com/blog/mpt-7b

Product Details

Platforms: Web-Based, Windows, Mac, Linux, On-Premises
Types of Training: Training Docs
