Academic Seminar

Generative AI for Music and Audio (Delivered in English)

Speaker: Mr. Hao-Wen Dong (董皓文) (Department of Computer Science and Engineering, University of California San Diego)
Host: Li Su (蘇黎)
Time: 2023-09-18 (Mon.) 10:00 ~ 12:00
Venue: Auditorium 106, New Building, Institute of Information Science

Abstract

Generative AI has been transforming the way we interact with technology and consume content. In this talk, I will introduce two of my recent research projects to showcase how generative models can be applied to generate music and audio. First, I will introduce the Multitrack Music Transformer project, where we aimed to generate orchestral music using a multi-dimensional transformer model and a new compact representation for multi-instrument music. Second, I will introduce the CLIPSonic project, where we aimed to tackle text-to-audio generation without using any text-audio pairs, and we leveraged the visual modality as a bridge to learn the desired text-audio correspondence using pretrained vision-language models and diffusion models. Finally, I will close this talk by sharing my view on the future of generative AI in music and audio.

BIO

Hao-Wen (Herman) Dong is a PhD candidate in Computer Science at the University of California San Diego, where he works on Music x AI research with Julian McAuley and Taylor Berg-Kirkpatrick. He is broadly interested in music generation, audio synthesis, and machine learning for music and audio. He has collaborated with researchers at Adobe, Dolby, Amazon, Sony, and Yamaha through internships. Prior to his PhD, he was a research assistant at Academia Sinica working with Yi-Hsuan Yang. He received his bachelor's degree in Electrical Engineering from National Taiwan University and his master's degree in Computer Science from UC San Diego. Herman's research has been recognized by the ICASSP Rising Stars in Signal Processing, the UCSD GPSA Interdisciplinary Research Award, the Taiwan Government Scholarship to Study Abroad, the J. Yang Scholarship, and the UCSD ECE Department Fellowship. For more information, please visit his personal website (https://salu133445.github.io/).

Institute of Information Science, Academia Sinica
