Home / Technology / Microsoft's New AI Models: Beyond Language
Microsoft's New AI Models: Beyond Language
2 Apr
Summary
- Microsoft launched new AI models for voice and text transcription.
- A new image generation model offers faster speeds and lifelike depictions.
- These models aim to expand Microsoft's AI offerings beyond LLMs.

Microsoft is broadening its artificial intelligence landscape by introducing three novel models that diverge from traditional large language models. These newly released tools include advanced systems for voice and text transcription, capable of processing 25 languages for applications such as video captioning and virtual assistants.
Alongside the transcription models, Microsoft has unveiled its second-generation in-house image generation model. This updated model promises enhanced generation speeds and more realistic visual outputs compared to its predecessor. These innovations are now accessible through Microsoft's Foundry and MAI playground, with plans for integration into products like Bing and PowerPoint.
This strategic expansion highlights Microsoft's commitment to growing its AI presence across various domains. While its Copilot chatbot remains a popular enterprise solution, these new generative media models indicate a push into areas requiring significant computational resources, distinguishing Microsoft among competitors with its enterprise-focused approach.