Microsoft's new AI models: Create transcriptions, voice, and images with MAI-Transcribe-1, MAI-Voice-1, MAI-Image-2

Microsoft AI models: On April 2, Microsoft introduced three new artificial intelligence models (MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2), expanding its AI offerings for developers and enterprise users. The models handle speech recognition, voice generation, and image creation, and are now available through the company’s Foundry platform.

MAI-Transcribe-1 is Microsoft’s first-generation speech recognition model. It supports up to 25 languages and is built to deliver accurate transcription across different accents and real-world audio conditions. The company said the model achieves competitive accuracy at nearly 50% lower GPU cost than similar tools.

MAI-Voice-1, by contrast, focuses on speech generation. It can produce up to 60 seconds of expressive audio in less than one second on a single GPU. The model is aimed at applications that require quick, realistic voice output.

Both MAI-Transcribe-1 and MAI-Voice-1 are available through Azure Speech services and are intended for real-world deployment across various industries.

Use cases across industries

The speech-based models are designed for multiple use cases, including conversational AI systems such as virtual assistants and call-centre tools. They can also support live captioning for events and meetings, helping improve accessibility.

Other applications include media production, where the models can automate subtitling and transcription, and education platforms, where lectures and training materials can be converted into text. Businesses can also use these tools to analyse customer interactions and generate insights from spoken data.

MAI-Image-2: Image generation

Microsoft also introduced MAI-Image-2, a text-to-image model that can generate visuals based on written prompts. The model has been ranked among the top image model families on the Arena.ai leaderboard.

MAI-Image-2 is designed for use in creative and business workflows. It can help designers and content creators quickly generate visual concepts, while organisations can use it to produce customised graphics for communication and branding.

Integration with existing products

According to Microsoft, these models are already used in its own products, including Copilot, Bing, PowerPoint, and Azure Speech. By making them available to developers, the company aims to expand their use across different platforms and applications.

With the release of these latest advanced AI models, Microsoft continues to focus on building AI tools that can handle a wide range of tasks, from speech and language processing to visual content creation.
