AI-Enabled Text-to-Music Generation: A Comprehensive Review of Methods, Frameworks, and Future Directions
University of Science and Technology Beijing, China.
Guangxi Tourism Development One-Click Tour Digital Cultural Tourism Industry Co., Ltd., China.
Administrative Office, Chunan Academy of Governance, China.
Jinzhong University, China.
2025 (English). In: Electronics, E-ISSN 2079-9292, Vol. 14, no. 6, article ID 1197. Article, review/survey (peer-reviewed). Published.
Abstract [en]

Text-to-music generation integrates natural language processing and music generation, enabling artificial intelligence (AI) to compose music from textual descriptions. While AI-enabled music generation has advanced, challenges in aligning text with musical structures remain underexplored. This paper systematically reviews text-to-music generation across symbolic and audio domains, covering melody composition, polyphony, instrumental synthesis, and singing voice generation. It categorizes existing methods into traditional, hybrid, and end-to-end LLM-centric frameworks according to the usage of large language models (LLMs), highlighting the growing role of LLMs in improving controllability and expressiveness. Despite progress, challenges such as data scarcity, representation limitations, and long-term coherence persist. Future work should enhance multi-modal integration, improve model generalization, and develop more user-controllable frameworks to advance AI-enabled music composition. 

Place, publisher, year, edition, pages
MDPI, 2025. Vol. 14, no. 6, article ID 1197
Keywords [en]
artificial intelligence, large language model, music generation, text-to-music generation
Identifiers
URN: urn:nbn:se:bth-27688
DOI: 10.3390/electronics14061197
ISI: 001453821500001
Scopus ID: 2-s2.0-105001095759
OAI: oai:DiVA.org:bth-27688
DiVA, id: diva2:1950322
Available from: 2025-04-07. Created: 2025-04-07. Last updated: 2025-09-30. Bibliographically approved.

Open Access in DiVA

Full text (6550 kB), 779 downloads
File information
File name: FULLTEXT01.pdf. File size: 6550 kB. Checksum (SHA-512):
e9087b7477befb25d30a52b18ebad8cbc11741856c9a81a165a2a82cf6b353d107adf9f34ecbad3c1bd644dfef9ed26a1b4daed2fbed45c8827ef46dce9b7576
Type: fulltext. Mimetype: application/pdf


Person

Ding, Jianguo
