Follow Slashdot stories on Twitter

 



Forgot your password?
typodupeerror
AI Microsoft

Microsoft Reveals Two In-House AI Models 17

Today, Microsoft unveiled two in-house AI models: MAI-Voice-1, a high-speed speech-generation system now live in Copilot, and MAI-1-Preview, its first end-to-end foundation model trained on 15,000 H100 GPUs. Neowin reports: MAI-Voice-1 is a speech generation model and is already available in Copilot Daily and Podcasts. To preview the full capabilities of this voice model, Microsoft has created a new Copilot Labs experience that anyone can try today. With the Copilot Audio Expressions experience, users can just paste text content and select the voice, style, and mode to generate high-fidelity, expressive audio. They can also download the generated audio if required. Microsoft also highlighted that this MAI-Voice-1 model is very fast and efficient. In fact, it can generate a full minute of audio in under a second on a single GPU.

Second, Microsoft has begun public testing of MAI-1-preview on LMArena, a popular platform for community model evaluation. This represents MAI's first foundation model trained end-to-end and offers a glimpse of future offerings inside Copilot. They are actively spinning the flywheel to deliver improved models and will have much more to share in the coming months. MAI-1-preview is an MoE (mixture-of-experts) model, pre-trained and post-trained on nearly 15,000 NVIDIA H100 GPUs. Notably, MAI-1-preview is Microsoft's first foundation model trained end-to-end in-house. Microsoft claims that this model is better at following instructions and can offer helpful responses to everyday user questions. Microsoft will be rolling out this new model to certain text use cases within Copilot over the coming weeks.
This discussion has been archived. No new comments can be posted.

Microsoft Reveals Two In-House AI Models

Comments Filter:
  • "They are actively spinning the flywheel ..."

    Wow fantastic. Finally something AI is good for.

  • Click the link, paste the text of like, the news story below, and generate. It's hilarious to hear an AI voice utter disappointment in MS making their cloud gaming cheaper.
  • Where's the Microsoft out-house models? Gives new meaning to crapification or enshittification.

    JoshK.

  • high-speed speech-generation system

    Serious question: do people really want to talk to their AI? Unless you live and work alone, I just don't see it. Plus, the overly-bubbly California valley-girl voices are just grating.

    • by PDXNerd ( 654900 )
      I have both text-to-speech and speech-to-text to run my Home Assistant instance at home so we can do things like shut off lights and intercom to the kids rooms hands-free while cooking. The text-to-speech voice is used to acknowledge commands that were run and its basically a google home/alexa replacement that runs in house for home automation, no cloud needed.

      Any improvement in this area is welcome, as long as it runs offline. Current models are....barely acceptable.
    • by Malc ( 1751 )

      I was at a video trade show earlier this year and there was a lot of talk about agentic AI being used to help in video production environments. Not sure if they need more noise in a production room but the goal was to keep eyes on what's going on instead of looking down to fumble around with a keyboard and mouse. That's a bit different to speech generation, but is definitely about talking to AI. People are finding lots of uses for this in different environments, although it remains to be seen whether it

    • I live and work alone: the last thing I want to do is talk/listen to AI. I want to listen to my music.
    • by allo ( 1728082 )

      Do people talk to Siri or Google? Voice is for many an interface. And for many it is much faster than typing a full question on a mobile phone. Not everyone is using a desktop PC.

  • by DaFallus ( 805248 ) on Friday August 29, 2025 @11:29AM (#65624350)
    Meanwhile, you can't even paste images into Copilot in a web browser from your clipboard - you have to save it as a file and upload it or paste it in Teams.
    • by Anonymous Coward

      It is a good idea to disable that your browser can read the clipboard anyway.

    • by gweihir ( 88907 )

      Microsoft has long stopped caring about actually well-working technology. As long as all the clueless fanbois think there is no alternative to Microsoft's crappy products, profits are high and that is the only thing Microsoft cares about.

I was playing poker the other night... with Tarot cards. I got a full house and 4 people died. -- Steven Wright

Working...