學術演講

TIGP (SNHCC) -- Recent Studies on Audio and Symbolic Music Understanding

講者蘇黎博士 (中央研究院資訊科學研究所)
邀請人：TIGP (SNHCC)
時間2022-11-07 (Mon.) 14:00 ~ 16:00
地點資訊所新館106演講廳

摘要

Music is a compound of hierarchical semantics. By leveraging the multi-task learning utilities which can be easily performed by modern deep learning packages, joint learning of multiple musical attributes at a time has become feasible. Building high-quality datasets with fine-grained labels and learning all of them is the key to achieving high-level music understanding. In this talk, we will discuss some recent research on music understanding in our lab. First, we will introduce Omnizart, the first toolkit that offers transcription models for various music content including piano solo, instrument ensembles, percussion and vocal. We will take the vocal transcription model as an example and show how multi-task learning improves the performance. Second, we will introduce the voice segregation task, which seems to be simple but actually requires several layers of music understanding process in symbolic music. We will show how a simple model can serve as a general solution to this task.

活動訊息

學術演講

摘要