MT3: Multi-Task Multitrack Music Transcription

MT3 is a multi-instrument automatic music transcription model that uses the T5X framework.

This is not an officially supported Google product.

Installation

Option 1: Local Installation with Poetry

This project uses Poetry for dependency management.

# Install Poetry if you haven't already
curl -sSL https://install.python-poetry.org | python3 -

# Clone the repository
git clone https://github.com/magenta/mt3.git
cd mt3

# Install dependencies using Poetry (including git dependencies)
poetry install

# Activate the virtual environment
poetry shell

Option 2: Using Docker (Recommended)

For consistent and reproducible environments, we recommend using Docker:

# Clone the repository
git clone https://github.com/magenta/mt3.git
cd mt3

# Build and start the Docker container
docker-compose up -d

# Connect to the running container
docker-compose exec mt3 bash

# Once inside the container, you can run commands directly

Transcribe your own audio

Use our colab notebook to transcribe audio files of your choosing. You can use a pretrained checkpoint from either a) the piano transcription model described in our ISMIR 2021 paper or b) the multi-instrument transcription model described in our ICLR 2022 paper.

Train a model

For now, we do not (easily) support training. If you like, you can try to follow the T5X training instructions and use one of the tasks defined in tasks.py.

Development

This project is configured with Python 3.11 as the target version. We use:

Poetry for dependency management
pyenv or mise for Python version management
pytest for testing

All dependencies, including git dependencies (flax, note-seq, seqio, and t5x), are managed through Poetry in the pyproject.toml file.

Known Issues

Due to the complex dependency tree of TensorFlow and T5X, some compatibility issues may arise when installing dependencies locally. Using Docker is recommended for a more consistent environment.

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
.github/workflows		.github/workflows
mt3		mt3
.envrc		.envrc
.gitignore		.gitignore
.mise.toml		.mise.toml
.python-version		.python-version
.tool-versions		.tool-versions
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
poetry.toml		poetry.toml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MT3: Multi-Task Multitrack Music Transcription

Installation

Option 1: Local Installation with Poetry

Option 2: Using Docker (Recommended)

Transcribe your own audio

Train a model

Development

Known Issues

About

Uh oh!

Releases

Packages

Languages

License

probablyrobot/mt3

Folders and files

Latest commit

History

Repository files navigation

MT3: Multi-Task Multitrack Music Transcription

Installation

Option 1: Local Installation with Poetry

Option 2: Using Docker (Recommended)

Transcribe your own audio

Train a model

Development

Known Issues

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages