mirdata
Genís Plaja-Roglans, PhD student
I am a last year PhD student at Music Technology Group, Universitat Pompeu Fabra, Barcelona, under the supervision of Prof. Xavier Serra and Dr. Marius Miron. My research areas are Audio Signal Processing (ASP), Music Information Retrieval (MIR), and Machine and Deep Learning (ML & DL). Specifically, I work on vocal melody extraction, vocal source separation, and automatic melodic pattern discovery. I am also applying these problems on World traditional music repertoires, to promote the cultural diversity in MIR. On that note, I am working on challenges such as working with few or noisy data, and designing musically-tailored approaches. During the PhD, I did a research stay at Moises AI, working in the task of music source separation using generative modling under the supervision of Dr. Igor Gadelha. More recently, I joined back Moises AI as a full-time Data Scientist.
mirdata
soundata
compIAM
dmx-diffusion
Education
(Sept. 2021 - ongoing) // Universitat Pompeu Fabra
Under the supervision of Prof. Xavier Serra
(Sept. 2020 - Sept. 2021) // Universitat Pompeu Fabra
Feb. 2020 - July 2020) // Royal Melbourne Institute of Tech.
(Sept. 2016 - July 2020) // Universitat Pompeu Fabra
Work experience
(Aug. 2025 - ongoing) // Moises AI
(Apr. 2023 - Dec. 2023) // Moises AI
Working on music source separation using generative modeling under the supervision of Dr. Igor Gadelha.
(Sept. - Dec. 2021-2024) // Universitat Pompeu Fabra
Given to the 3rd year of the Bachelor's Degree in Audiovisual Systems Engineering.
(Sept. 2020 - Aug. 2021) // Universitat Pompeu Fabra
Contributing to the mirdata and soundata libraries by writing dataset loaders for datasets in the MTG,
also contributing to the core development and maintenance of both libraries.
(Nov. 2018 - Jan. 2020) // Universitat Pompeu Fabra
Leading the Sounds of Science project, an initiative to promote research through the matter of sound.
We also collected sounds from several different science departments, created a database, and performed
labelling.
Selected publications
→ G. Plaja-Roglans, Y. Hung, X. Serra & I. Pereira, 2025. Efficient and fast generative-based singing voice separation using a Latent Diffusion Model. In: Proc. of the Int. Joint Conference of Neural Networks (IJCNN), Rome, Italy. Link to paper.
→ G. Plaja-Roglans, Y. Hung, X. Serra & I. Pereira, 2025. Generating separated singing vocals using a diffusion model conditioned on music mixtures. In: Proc. of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Lake Tahoe, USA. Link to paper.
→ G. Plaja-Roglans, M. Miron, A. Shankar, X. Serra, 2023. Carnatic singing voice separation using cold diffusion on training data with bleeding. In: Proc. of the 24rd Int. Society for Music Information Retrieval Conference (ISMIR 2023), Milan, Italy. Link to paper.
→ G. Plaja-Roglans, M. Miron and X. Serra, 2022. A diffusion-inspired training strategy for singing voice extraction in the waveform domain. In: Proc. of the 23rd Int. Society for Music Information Retrieval Conference (ISMIR 2022), Bengaluru, India. Link to paper.
→ M. Fuentes, G. Plaja-Roglans, G. Cortès-Sebastà, T. Khandelwal, M. Miron, X. Serra, J. P. Bello, and J. Salamon, 2024. Soundata: Reproducible use of audio datasets. Journal of Open Source Software, 9(98), 6634. Link to paper.
→ T. Nuttall, G. Plaja-Roglans, L. Pearson and X. Serra, 2022. In search of sañcaras: tradition-informed repeated melodic pattern recognition in Carnatic Music. In: Proc. of the 23rd Int. Society for Music Information Retrieval Conference (ISMIR 2022), Bengaluru, India. Link to paper.
→ T. Nuttall,G. Plaja-Roglans, L. Pearson, X. Serra, 2021. “The Matrix Profile for Automated Discovery of Repeated Motifs in Audio - Indian Carnatic Music”. In: International Symposium on Computer Music Multidisciplinary Research (CMMR), Tokyo, Japan. Link to paper.
Additional recent research contributions
→ "mirdata: dataset loaders for reproducible research in MIR" in the Seminar for Reproducible Research organized by GdR ISIS (France).
→ Lecturer and main developer in the ISMIR 2022 Tutorial on Computational Methods for Supporting Corpus-Based Research on Indian Art Music.
→ Speaker and organizer in CompMusic workshop, organized in Chennai, India, as a satellite event of ISMIR 2022.