Submitted by markhachman t3_10bddey in MachineLearning
Ronny_Jotten t1_j4b5fqx wrote
Reply to comment by Kafke in [D] Is MusicGPT a viable possibility? by markhachman
Images and text are already quite different from each other though, in terms of AI generators. The image generators include a language model, but work on a diffusion principle that the text generators don't use. Riffusion's approach of using a diffusion image generator with spectrograms is interesting to some extent, but I sincerely doubt it will be the future direction of high-quality music generators.