AI & Tech

Google Gemini is equipped with 'Lyria 3', which creates music with text and images

The latest music creation model capable of creating 30-second tracks, officially integrated into the Gemini app

AI Reporter Alpha·2026년 2월 18일 수 07:01·4 min read·

Summary

•Google DeepMind has installed the latest music creation model, Lyria 3, into the Gemini app.
•You can create a 30-second music track just by entering text or images.
•The competition for multimodal functions of AI assistants is expected to expand into the audio area.

Key takeaway: Add music creation functionality to the Gemini app

Google DeepMind has installed the latest music creation model 'Lyria 3' into its AI assistant Gemini app. This allows users to create 30-second music tracks using only text prompts or images.

Lyria 3 is the most advanced music creation model developed by Google DeepMind, and is designed to allow anyone to create their own music without any musical knowledge or composition experience. If you describe the desired mood or genre in text or upload an image, AI will automatically compose music that matches the emotion of the image.

Why It Matters: Multimodal Scaling of Generative AI

This update is an example showing that generative AI is expanding beyond text and images into the audio area. The existing Gemini has shown strengths in text generation, image analysis, and code writing, but the addition of the music creation function has greatly expanded its range of use as a creative tool.

In particular, it is expected to accelerate the democratization of creation in that even ordinary users without professional music production software (DAW) or the ability to play musical instruments can create music with just their ideas. Content creators, social media users, video producers, etc. can easily create background music (BGM).

Comparison of Competitive Landscape: AI Music Generation Market Status

service	Developer	maximum creation length	Input method	Platform integration
Lyria 3 (Gemini)	Google DeepMind	30 seconds	text, image	Gemini App
Suno	Suno AI	Up to 4 minutes	text	Web, API
Udio	Udio	Up to 2 minutes	text	web
MusicLM	Google	20 seconds	text	experimental release
Stable Audio	Stability AI	Up to 3 minutes	text	Web, API

Lyria 3's 30-second time limit is shorter than that of competing services, but it is differentiated in terms of accessibility in that it is directly integrated into the Gemini app and can be used immediately without signing up for a separate service. Additionally, image-based music creation is a unique feature not provided by competing services such as Suno and Udio.

Lyria's development flow

Google DeepMind's music AI development began in earnest with the release of MusicLM in 2023. MusicLM is an early model for converting text into music, and was released on a limited basis for academic research purposes. Afterwards, the Lyria model was announced in 2024 and applied to 'Dream Track', an AI music experiment tool on the YouTube platform.

This Lyria 3 is the 3rd generation version of this technology and is the first to be officially installed in the Gemini app for general consumers. This means that Google has transitioned its AI music creation technology from the experimental stage to the public service stage.

[AI Analysis] Future prospects and implications

Lyria 3's Gemini integration is likely to impact the market in several ways.

First, competition for AI assistant functions is intensifying. Competitors such as OpenAI, Microsoft, and Meta also have greater incentives to integrate music creation functions into their AI platforms. In particular, OpenAI has recently made few public moves related to music AI after Jukebox, but competitive pressure is expected to increase.

Second, copyright and royalty issues are emerging. It is highly likely that issues of copyright attribution to AI-generated music and compensation for existing music used as learning data will be discussed in earnest. The music industry has already expressed concerns about AI learning, and the popularization of Lyria 3 could accelerate these debates.

Third, changes in the creator ecosystem. The 30-second limit is exactly in line with the demand for BGM for short-form content (Reels, TikTok, YouTube Shorts). Creators who previously used copyright-free music libraries or stock music services are likely to move to AI-generated music, and a reorganization of the related market is expected.

Currently, Google has not disclosed any further details, including Lyria 3's exact release region, pricing policy, and whether the music produced will be available for commercial use. There is also the possibility that creation length expansion and more detailed editing functions will be added through future updates.

#deepmind-series #gemini #Lyria #AI음악생성 #멀티모달 #생성형AI #구글