FMP AudioLabs
C2

MIR Resources


This notebook contains links to literature, toolboxes, and other resources that may be useful for research in music processing and music information retrieval (MIR). We would like to point out that the information provided on this website does not claim to be comprehensive or up-to-date. The selection of information merely reflects the authors' personal taste.

ISMIR

Music information retrieval (MIR) is a research area that systematically deals with processing, searching, organizing, and accessing music-related data. MIR is an interdisciplinary field which is related to various disciplines including signal processing, information retrieval, machine learning, multimedia engineering, library science, musicology, and digital humanities. The International Society for Music Information Retrieval (ISMIR) is a non-profit organization, which represents the MIR research community. As one main task, the society oversees the organization of the annual ISMIR conference. The proceedings of ISMIR conferences are an essential source of scientific contributions in music processing and MIR.

The ISMIR website not only contains information on the society, but also provides links to various resources including:

Toolboxes

There are many different well-documented toolboxes that provide modular source code for processing and analyzing music and audio signals. Here is a (non-comprehensive) list of such toolboxes.

  • ACA-Code provides MATALB scripts for hands-on experience in audio and music processing accompanying the book An Introduction to Audio Content Analysis.

  • The Audio Degradation Toolbox consists of MATLAB code for the controlled degradation of audio signals, and for the adaptation of ground-truth to the degraded audio.

  • The Chroma Toolbox contains MATLAB implementations for extracting various types of pitch-based and chroma-based audio features.

  • The Constant-Q Toolbox is a MATLAB toolbox for efficient perfect reconstruction time–frequency transforms with log-frequency resolution.

  • Essentia is an open-source library including tools for audio and music analysis, description and synthesis.

  • JAMS provides JSON annotated music specification for reproducible MIR research.

  • LibROSA is a Python package for music and audio analysis providing building blocks necessary to create MIR systems.

  • The MIRtoolbox offers an integrated set of functions written in MATLAB, dedicated to the extraction from audio files of musical features such as tonality, rhythm, and structures.

  • The MSAF toolbox is a Python package for the analysis of music structural segmentation algorithms.

  • Madmom is an audio signal processing library written in Python with a strong focus on MIR tasks.

  • Marsyas (Music Analysis, Retrieval and Synthesis for Audio Signals) is an open source software framework for audio processing with specific emphasis on Music Information Retrieval applications.

  • Muda contains tools for annotation-aware musical data augmentation.

  • The NMF Toolbox provides implementations of conceptually distinct variants of Nonnegative matrix factorization (NMF). The toolbox is available both for MATLAB and Python. Additionally, it contains unit tests to ensure that the results for both programming languages are consistent. A small dataset of example audio files is provided for demonstration purposes.

  • The SM Toolbox contains MATLAB implementations for computing and enhancing similarity matrices in various ways.

  • The TSM Toolbox provides MATLAB implementations of various classical time-scale modification (TSM) algorithms like OLA, WSOLA, and the phase vocoder.

  • The Tempogram Toolbox provides MATLAB implementations for extracting various types of proposed tempo and pulse related audio representations

  • jAudio is a Digital Signal Processing project built to provide an easy-to-use program for audio feature extraction.

  • mir_eval is a Python library for computing common heuristic accuracy scores for various music/audio information retrieval/signal processing tasks.

  • musicinformationretrieval.com is a collection of instructional materials (including Python code) for MIR.

Books

Here are some MIR-related books listed in alphabetical order of the first author's name.

  • Òscar Celma. Music Recommendation and Discovery: The Long Tail, Long Fail, and Long Play in the Digital Music Space. Springer, 2010.

  • Anssi P. Klapuri and Manuel Davy (editors). Signal Processing Methods for Music Transcription. Springer, 2006.

  • Peter Knees and Markus Schedl. Music Similarity and Retrieval: An Introduction to Audio- and Web-based Strategies. Springer, 2016

  • Alexander Lerch. An Introduction to Audio Content Analysis. Wiley, 2012.

  • Tao Li, Mitsunori Ogihara, George Tzanetakis (editors). Music Data Mining. Chapman & Hall/CRC Data Mining and Knowledge Discovery Series, 2011.

  • Meinard Müller. Fundamentals of Music Processing. Springer, 2015.

  • Meinard Müller. Information Retrieval for Music and Motion. Springer, 2007.

  • William Arthur Sethares. Rhythm and Transforms. Springer, 2007.

  • William Arthur Sethares. Tuning, Timbre, Spectrum, Scale. Springer, 1998.

  • Claus Weihs, Dietmar Jannach, Igor Vatolkin, Guenter Rudolph (editors). Music Data Analysis: Foundations and Applications. Chapman & Hall/CRC Computer Science & Data Analysis, 2016

Acknowledgment: This notebook was created by Meinard Müller.
C0 C1 C2 C3 C4 C5 C6 C7 C8