Zur JKU Startseite
Institut für Computational Perception
Was ist das?

Institute, Schools und andere Einrichtungen oder Angebote haben einen Webauftritt mit eigenen Inhalten und Menüs.

Um die Navigation zu erleichtern, ist hier erkennbar, wo man sich gerade befindet.


Strong rank in the HEAR 2021 Challenge

CP's team (Khaled Koutini, Jan Schlüter, Hamid Eghbal-zadeh, Gerhard Widmer) scores top places in different tasks of HEAR 2021 NeurIPS Challenge, Holistic Evaluation of Audio Representations.

HEAR 2021 Logo

The aim of the challenge is to push machine listening models to be as holistic as the human ear, i.e., develop a model that performs well across a variety of everyday domains. Models are meant to produce a general-purpose audio representation as a strong basis for audio classification and sequence labeling. Representations are evaluated using a benchmark suite across a variety of domains, including speech, environmental sound, and music. Results for all tasks are published at https://neuralaudio.ai/hear2021-leaderboard.html, öffnet eine externe URL in einem neuen Fenster.

CP's team used their latest state-of-the-art transformer model, the details of which are published in a preprint (https://arxiv.org/abs/2110.05069, öffnet eine externe URL in einem neuen Fenster). The source code for training such a model, and pre-trained models are available at https://github.com/kkoutini/PaSST, öffnet eine externe URL in einem neuen Fenster.