MuseGAN: Multi-track Sequential Generative Adversarial ... · Data. LPD (Lakh Pianoroll Dataset)...

MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and AccompanimentHao-Wen Dong*, Wen-Yi Hsiao*, Li-Chia Yang, Yi-Hsuan YangResearch Center of IT Innovation, Academia Sinica

Demo Page https://salu133445.github.io/musegan/

*these authors contributed equally to this work

Outline。Goals & Challenges。Data。Proposed Model。Results & Evaluation。Future Works

Source Code https://github.com/salu133445/museganDemo Page https://salu133445.github.io/musegan/

Generate pop music。of multiple tracks

。in piano-roll format

。using GAN with CNNs

[Source Code]https://github.com/salu133445/musegan

[Demo Page]https://salu133445.github.io/musegan/

Challenge IMultitrack Interdependency

bassdrums

strings

music & clip by phycause

Multi-track GAN

Challenge IIMusic Texture

melody

chord(harmony)

Convolutional Neural Networks

Challenge IIITemporal Structure

paragraph 1 paragraph 2 paragraph 3

phrase 1 phrase 2 phrase 3 phrase 4

bar 1 bar 2 bar 3 bar 4

beat 1 beat 2 beat 3 beat 4

step 1 step 2 ··· step 24

phrase 2

4/4 time

Challenge IIITemporal Structure

phrase 2 Fixed Structure

Convolutional Neural Networks

4/4 time

Data Representation

Bar 1 Bar 2 Bar 3 Bar 4time step

Piano-roll

polyphonic multi-track

(with symbolic timing)

Data Representation

Piano-roll

Bar 1 Bar 2 Bar 3 Bar 4polyphonic multi-track

Data RepresentationMulti-track Piano-roll

tracks

polyphonic multi-track

Data Representation

96 time steps

84pitches 5 tracks

4 bars

a 4×96×84×5 tensor

GuitarPiano

Strings

LPD (Lakh Pianoroll Dataset)。>170,000 multi-track piano-rolls。Derived from Lakh MIDI Dataset。Mainly pop songs

Pypianoroll (Python package)。Manipulation & Visualization。Efficient Save/Load。Parse/Write MIDI files。On PYPI (pip installable)

[Dataset]https://salu133445.github.io/musegan/dataset

[Pypianoroll]https://salu133445.github.io/pypianoroll/

Generative Adversarial Networks

real data

Gz~p(z) G(z)

random noise fake dataGenerator

D real/fake

Discriminator

4-bar phrases of 5 tracks

critic(wgan-gp)

MuseGAN – An Overview

4 latent variables1 random noise

temporalgenerator

bargenerator

4 piano-roll matrices

Bar Generator

MuseGAN

Bar Generator

zzzzzzzzz

No Coordination

Coordination

track-dependent

track-independent

MuseGAN

Bar GeneratorGz

zzzzzzzzz

MuseGAN

Bar GeneratorGz

zzzzzzzzz

TimeDependent Independent

TrackDependent Melody Groove

Independent Chords Style

MuseGAN

Bar GeneratorGz

zzzzzzzzz

ChordsStyle

Melody

Groove

Results

More Samples on Demo Pagehttps://salu133445.github.io/musegan/

Sample 1 Sample 2

BassDrumsGuitarStringsPiano

Step 0 Step 700 Step 2500 Step 6000 Step 7900

Drum pattern

Chords

Bass Line

Objective MetricsUPC

UPC number of used pitch classes per bar

QN ratio of qualified notes

Monitor the Training

step2000 4000 6000 8000104

Negative Critic Loss

User Study

H: harmoniousR: rhythmicMS: musically structuredC: coherentOR: overall rating

composer

jamming

hybrid

Summary。MuseGAN

◦ a novel GAN for multi-track sequence generation

◦ multi-track, polyphonic music

◦ human-AI cooperative scenario (see the paper)

。Lakh Pianoroll Dataset (LPD) (new dataset!!)

。Pypianoroll (new package!!)

Future Works

Full Song Generation

phrase 2

paragraph 1 paragraph 2 paragraph 3

phrase 1 phrase 2 phrase 3 phrase 4

Hierarchical Temporal Structure

Future Works

Cross-modal Generation。Music + Video。Music + Lyrics。Video + Text

Q&A MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music

Generation and Accompaniment

Source Code https://github.com/salu133445/museganDemo Page https://salu133445.github.io/musegan/

MuseGAN: Multi-track Sequential Generative Adversarial ... · Data. LPD (Lakh Pianoroll Dataset)...

Documents

Transcript of MuseGAN: Multi-track Sequential Generative Adversarial ... · Data. LPD (Lakh Pianoroll Dataset)...

Distributed DASH Dataset

Dataset Jamu Complete.xls.xlsx

Learning to Track: Online Multi-Object Tracking by Decision Making · 2020-06-20 · Experiments: Dataset •Multiple Object Tracking Benchmark [1] •11 training sequences •11

Dataset Journeys

Partially Occluded Hands: A challenging new dataset for ... · Partially Occluded Hands Dataset 5 3 A dataset of partially occluded hands We collected a dataset of 11,840 images of

Name Position Organisation Date. What is data integration? Dataset A Dataset B Integrated dataset Education data + EMPLOYMENT data = understanding education.

The inD Dataset: A Drone Dataset of Naturalistic Road User Trajectories at … · 2019. 11. 19. · The inD Dataset: A Drone Dataset of Naturalistic Road User Trajectories at German

DATASET INTRODUCTION 1. Dataset: Urine 2 From Cleveland Clinic 1981-1984.

Coding ADO.Net DataSet Objects ISYS 512. DataSet Object A DataSet object can hold several tables and relationships between tables. A DataSet is a set.

The IMHG dataset: A Multi-View Hand Gesture RGB-D Dataset ...

Vehicle Energy Dataset (VED), A Large-scale Dataset for ...

Fcv dataset zisserman_2

The highD Dataset: A Drone Dataset of Naturalistic Vehicle ...

iho.int and Standards/DQWG... · Web viewscope for a dataset can comprise a dataset series to which the dataset belongs, the dataset itself, or a smaller grouping of data located

Supporting Information S1. The benchmark dataset consists ... · 1 Supporting Information S1. The benchmark dataset consists of a positive dataset and a negative dataset . The positive

2016 conservation track: evaluating lidar derived synthetic streams as a source for national hydrography dataset flowlines by cynthia miller-corbett

DATASET - landlaeknir.is

MuseGAN: Multi-track Sequential Generative Adversarial ......MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment Hao-Wen

Delphi Client DataSet

Dataset for Rodanthe