Average Ratings 0 Ratings

Total
ease
features
design
support

Description

Stable LM builds on our earlier open-source work with EleutherAI, a nonprofit research organization. That work includes GPT-J, GPT-NeoX, and the Pythia suite, all trained on The Pile open-source dataset; many recent open-source models, such as Cerebras-GPT and Dolly-2, build on it in turn. Stable LM is instead trained on a new dataset three times the size of The Pile, containing 1.5 trillion tokens. We plan to share more details about this dataset in the near future. The scale of this dataset lets Stable LM perform well on conversational and coding tasks despite its small size of 3 to 7 billion parameters, compared with the 175 billion parameters of GPT-3. Stable LM 3B is a compact model that can run on portable devices such as laptops and handheld devices, and we are excited about the practical applications this mobility opens up. Overall, Stable LM is a step toward more efficient and accessible language models for a wider audience.
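
To put the parameter counts above in perspective, the following is a rough back-of-the-envelope sketch of the memory needed just to hold each model's weights. It assumes 2 bytes per parameter (16-bit weights), which the description does not state, and it ignores activations and any other runtime overhead. The function name `weight_memory_gib` is illustrative only.

```python
def weight_memory_gib(num_parameters: int, bytes_per_parameter: int = 2) -> float:
    """Approximate memory for the weights alone, in GiB.

    Assumption: 2 bytes per parameter (16-bit weights). Real memory use is
    higher once activations and runtime overhead are included.
    """
    return num_parameters * bytes_per_parameter / 2**30


# Parameter counts taken from the description above.
MODELS = {
    "Stable LM 3B": 3_000_000_000,
    "Stable LM 7B": 7_000_000_000,
    "GPT-3 175B": 175_000_000_000,
}

for name, params in MODELS.items():
    print(f"{name}: ~{weight_memory_gib(params):.1f} GiB of weights at 16-bit precision")
```

Under these assumptions the 3B model needs roughly 5.6 GiB for its weights, against roughly 326 GiB for a 175-billion-parameter model, which is the gap behind the claim that the smaller model can run on a laptop.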

Description

We have trained and are open-sourcing a neural network called Whisper that approaches human-level accuracy and robustness in English speech recognition. Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. We show that such a large and diverse dataset improves robustness to accents, background noise, and technical language. It also enables transcription in multiple languages, as well as translation from those languages into English. We are open-sourcing the models and inference code as a foundation for building useful applications and for further research on robust speech processing. The Whisper architecture is a simple end-to-end approach, implemented as an encoder-decoder Transformer. Input audio is split into 30-second chunks, converted into log-Mel spectrograms, and then passed into the encoder. By making this technology accessible, we aim to foster innovation in speech recognition.
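
The first preprocessing step described above, splitting audio into 30-second segments, can be sketched in plain Python. This is a minimal illustration, not Whisper's actual inference code: the 16 kHz sample rate and the zero-padding of the final short segment are assumptions not stated in the description, `split_into_segments` is a hypothetical helper name, and the log-Mel spectrogram step is omitted.

```python
from typing import List

SAMPLE_RATE_HZ = 16_000  # assumption: not stated in the description
SEGMENT_SECONDS = 30     # from the description
SEGMENT_SAMPLES = SAMPLE_RATE_HZ * SEGMENT_SECONDS


def split_into_segments(
    samples: List[float], segment_samples: int = SEGMENT_SAMPLES
) -> List[List[float]]:
    """Split mono audio samples into fixed-length segments.

    The last segment is zero-padded to full length (an assumption) so that
    every segment handed to the next stage has the same size.
    """
    segments = []
    for start in range(0, len(samples), segment_samples):
        segment = samples[start:start + segment_samples]
        padding = [0.0] * (segment_samples - len(segment))
        segments.append(segment + padding)
    return segments


# Example: 70 seconds of silence becomes three 30-second segments.
audio = [0.0] * (70 * SAMPLE_RATE_HZ)
segments = split_into_segments(audio)
print(len(segments), len(segments[-1]))
```

In a full pipeline, each fixed-length segment would next be converted into a log-Mel spectrogram and passed to the encoder.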

API Access

Has API

Integrations

AI Sparks Studio
AnotherWrapper
Automi
Baseten
Bolna
LazyTyper
MacWhisper
Monster API
Nekton.ai
NoteVocal
OpenAI
SheepScript.ai
Shownotes
Snippets AI
Spark NLP
Thinkbuddy
Vocode
Whisper Notes
brancher.ai

Pricing Details

Free
Free Trial
Free Version

Pricing Details

No price information available.
Free Trial
Free Version

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Vendor Details

Company Name

Stability AI

Founded

2019

Country

United Kingdom

Website

stability.ai/

Vendor Details

Company Name

OpenAI

Country

United States

Website

openai.com/blog/whisper/

Product Features

Speech Recognition

Audio Capture
Automatic Form Fill
Automatic Transcription
Call Analysis
Concatenated Speech
Continuous Speech
Customizable Macros
Multi-Languages
Specialty Vocabularies
Speech-to-Text Analysis
Variable Frequency
Voice Recognition

Transcription

AI / Machine Learning
Annotations
Audio/Video File Upload
Automatic Transcription
Collaboration Tools
File Sharing
For Manual Transcription
Full Text Search
Multi-Language Support
Natural Language Processing (NLP)
Playback Controls
Speech Recognition
Subtitles
Text Editor
Timecoding

Alternatives

Dolly (Databricks)

Alternatives

Cerebras-GPT (Cerebras)
GPT-J (EleutherAI)
Transcribe (Wreally)
Falcon-40B (Technology Innovation Institute, TII)