Average Ratings 0 Ratings
Average Ratings 0 Ratings
Description
The MiniMax‑M1 model, introduced by MiniMax AI and licensed under Apache 2.0, represents a significant advancement in hybrid-attention reasoning architecture. With an extraordinary capacity for handling a 1 million-token context window and generating outputs of up to 80,000 tokens, it facilitates in-depth analysis of lengthy texts. Utilizing a cutting-edge CISPO algorithm, MiniMax‑M1 was trained through extensive reinforcement learning, achieving completion on 512 H800 GPUs in approximately three weeks. This model sets a new benchmark in performance across various domains, including mathematics, programming, software development, tool utilization, and understanding of long contexts, either matching or surpassing the capabilities of leading models in the field. Additionally, users can choose between two distinct variants of the model, each with a thinking budget of either 40K or 80K, and access the model's weights and deployment instructions on platforms like GitHub and Hugging Face. Such features make MiniMax‑M1 a versatile tool for developers and researchers alike.
Description
Reka Flash 3 is a cutting-edge multimodal AI model with 21 billion parameters, crafted by Reka AI to perform exceptionally well in tasks such as general conversation, coding, following instructions, and executing functions. This model adeptly handles and analyzes a myriad of inputs, including text, images, video, and audio, providing a versatile and compact solution for a wide range of applications. Built from the ground up, Reka Flash 3 was trained on a rich array of datasets, encompassing both publicly available and synthetic information, and it underwent a meticulous instruction tuning process with high-quality selected data to fine-tune its capabilities. The final phase of its training involved employing reinforcement learning techniques, specifically using the REINFORCE Leave One-Out (RLOO) method, which combined both model-based and rule-based rewards to significantly improve its reasoning skills. With an impressive context length of 32,000 tokens, Reka Flash 3 competes effectively with proprietary models like OpenAI's o1-mini, making it an excellent choice for applications requiring low latency or on-device processing. The model operates at full precision with a memory requirement of 39GB (fp16), although it can be efficiently reduced to just 11GB through the use of 4-bit quantization, demonstrating its adaptability for various deployment scenarios. Overall, Reka Flash 3 represents a significant advancement in multimodal AI technology, capable of meeting diverse user needs across multiple platforms.
API Access
Has API
API Access
Has API
Pricing Details
No price information available.
Free Trial
Free Version
Pricing Details
No price information available.
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name
MiniMax
Founded
2021
Country
Singapore
Website
github.com/MiniMax-AI/MiniMax-M1
Vendor Details
Company Name
Reka
Founded
2022
Country
United States
Website
www.reka.ai/news/introducing-reka-flash