Learning to Play Jazz with Deep Belief Networks

Greg Bickerman, Harvey Mudd College
Sam Bosley, Stanford University
Peter Swire, Brandeis University
Robert Keller, Harvey Mudd College

Motivation

•  People are able to improvise jazz on the spot

•  Jazz improvisation
    – Patterned and structured
    – Creative and novel

•  Could a machine learn to improvise as well as a human?

Motivation

•  Artificial jazz improvisers already exist
    – GenJam
        •  Supervised genetic learning
    – Impro-Visor
        •  Extensive musical knowledge built in

•  Interested in unsupervised learning

•  Minimal representational assumptions

Restricted Boltzmann Machines

•  2-layer network
    – Visible layer
    – Hidden layer

•  Nodes
    – Interconnected
    – Can be set ON or OFF

•  Weights
    – Assigned to each connection
    – Symmetric

[Figure: RBM diagram showing visible nodes and hidden nodes]

Activation

•  Nodes are activated probabilistically based on activation states of nodes in opposite layer
    – Compute weighted sum of active connections
    – Activation function determines probability of firing
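The activation rule above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the logistic sigmoid as the activation function, the bias terms, and the names `sigmoid` and `sample_layer` are all assumptions, since the slides do not specify them.

```python
import math
import random


def sigmoid(x):
    """Assumed activation function: maps a weighted sum to a firing probability."""
    return 1.0 / (1.0 + math.exp(-x))


def sample_layer(source_states, weights, biases, rng):
    """Sample the opposite layer from the binary states of the source layer.

    weights[i][j] is the weight on the connection between source node i and
    target node j. biases[j] is an assumed per-node offset (not in the slides).
    """
    target_states = []
    for j in range(len(biases)):
        # Weighted sum over active connections only (source nodes that are ON).
        total = biases[j] + sum(
            weights[i][j] for i, state in enumerate(source_states) if state
        )
        # Fire with the probability given by the activation function.
        target_states.append(1 if rng.random() < sigmoid(total) else 0)
    return target_states
```

Because the weights are symmetric, the same routine could drive either direction (visible to hidden or hidden to visible) by passing the transposed weight table.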

[Figure: activation example on the visible and hidden nodes]

Input / Output

•  Input
    – Binary data sequences
    – Mapped onto visible neurons

•  Output
    – Identically sized data sequences
    – Read off of visible neurons

[Figure: input/output example on the visible and hidden nodes, with bit patterns 1 1 0 0 and 1 0 0 1]

Training

•  Contrastive divergence method
    – Activate network normally
    – Activate network with inputs “clamped”
    – Adjust weights to make normal activation behave more like clamped activation
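A single weight update of this kind might look like the sketch below. It uses the common one-step variant (CD-1) with a sigmoid activation and no bias terms. Those choices, the learning rate, and every function name are assumptions for illustration rather than details taken from the slides.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def hidden_probs(visible, weights):
    """Firing probability of each hidden node; weights[i][j] links visible i to hidden j."""
    return [
        sigmoid(sum(weights[i][j] * v for i, v in enumerate(visible)))
        for j in range(len(weights[0]))
    ]


def visible_probs(hidden, weights):
    """Firing probability of each visible node, using the same symmetric weights."""
    return [sigmoid(sum(w * h for w, h in zip(row, hidden))) for row in weights]


def sample(probs, rng):
    return [1 if rng.random() < p else 0 for p in probs]


def cd1_step(weights, data, rng, learning_rate=0.1):
    """Apply one CD-1 update in place and return the network's reconstruction."""
    # Clamped phase: the visible nodes are held at the training vector.
    h_clamped = hidden_probs(data, weights)
    # Normal phase: the network reconstructs the visible layer on its own.
    v_free = sample(visible_probs(sample(h_clamped, rng), weights), rng)
    h_free = hidden_probs(v_free, weights)
    # Nudge the free-running statistics toward the clamped statistics.
    for i in range(len(weights)):
        for j in range(len(weights[i])):
            weights[i][j] += learning_rate * (
                data[i] * h_clamped[j] - v_free[i] * h_free[j]
            )
    return v_free
```

Calling `cd1_step` repeatedly over the training vectors is the whole training loop in this sketch; a practical version would likely start from small random weights and add biases.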

Deep Belief Networks

•  Use individual RBMs as layers in a larger network

•  Hidden layer of one RBM forms input layer of another

•  If single RBMs learn features about data, DBNs learn features about features

[Figure: stacked RBM layers labeled Layer 1, Layer 2, Layer 3]
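The stacking idea can be sketched as passing data upward through a list of RBM weight tables, where each sampled hidden layer becomes the visible layer of the next RBM. The sigmoid activation, the absence of biases, and the names `sample_hidden` and `propagate_up` are assumptions made for this illustration.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def sample_hidden(visible, weights, rng):
    """Sample one RBM's hidden layer; weights[i][j] links visible i to hidden j."""
    hidden = []
    for j in range(len(weights[0])):
        total = sum(weights[i][j] for i, v in enumerate(visible) if v)
        hidden.append(1 if rng.random() < sigmoid(total) else 0)
    return hidden


def propagate_up(data, stack, rng):
    """Return the sampled states of every layer, starting with the data itself."""
    layers = [list(data)]
    for weights in stack:
        # The hidden layer of one RBM is the input layer of the next.
        layers.append(sample_hidden(layers[-1], weights, rng))
    return layers
```

With a 4-node input and a stack of a 4x3 and a 3x2 weight table, `propagate_up` returns three lists of lengths 4, 3 and 2.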

Encoding Scheme

•  Requirements
    – Binary encoding
    – Music must be encoded in a string of standard length
    – Each note must be the same “distance” from every other note

Encoding Scheme

•  Break melody into beat subdivisions

•  Each subdivision contains 18 bits
    – 1 Sustain Bit
    – 1 Rest Bit
    – 12 Pitch Bits
    – 4 Octave Bits

Bit layout:

S | R | C C# D D# E F F# G G# A A# B | 1 2 3 4
Sustain Bit | Rest Bit | Pitch Bits | Octave Bits
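One way to realize this 18-bit layout is sketched below. It assumes that the pitch and octave fields are one-hot, that a sustain or rest slot sets only its own bit, and that octaves are numbered 1 to 4. The slides give the bit order but not these details, and `encode_slot` is a hypothetical helper name.

```python
PITCHES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]
NUM_OCTAVES = 4
SLOT_BITS = 2 + len(PITCHES) + NUM_OCTAVES  # sustain + rest + pitch + octave = 18


def encode_slot(kind, pitch=None, octave=None):
    """Encode one beat subdivision as [S, R, 12 pitch bits, 4 octave bits]."""
    bits = [0] * SLOT_BITS
    if kind == "sustain":
        bits[0] = 1  # hold whatever note is already sounding
    elif kind == "rest":
        bits[1] = 1
    elif kind == "note":
        if not 1 <= octave <= NUM_OCTAVES:
            raise ValueError("octave must be between 1 and 4")
        bits[2 + PITCHES.index(pitch)] = 1      # assumed one-hot pitch
        bits[2 + len(PITCHES) + octave - 1] = 1  # assumed one-hot octave
    else:
        raise ValueError("kind must be 'sustain', 'rest' or 'note'")
    return bits
```

For example, `encode_slot("note", "D", 2)` sets exactly two bits: the D pitch bit and the second octave bit.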

[Figure: example melody encoded bit by bit under the columns S R C C# D D# E F F# G G# A A# B 1 2 3 4]

Chord Encoding Scheme

Chord Bits: C C# D D# E F F# G G# A A# B

[Figure: example chord encodings using the 12 chord bits]
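A plausible reading of the 12 chord bits is one bit per pitch class, switched on for each tone the chord contains. That reading, the small interval table, and the helper name `encode_chord` are assumptions for illustration; the slides show only the column labels.

```python
PITCHES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]

# Semitone offsets from the root for a few chord qualities (illustrative subset).
CHORD_INTERVALS = {
    "maj": (0, 4, 7),
    "min": (0, 3, 7),
    "7": (0, 4, 7, 10),
    "m7": (0, 3, 7, 10),
}


def encode_chord(root, quality):
    """Return 12 bits, one per pitch class, with chord tones set to 1."""
    bits = [0] * len(PITCHES)
    root_index = PITCHES.index(root)
    for interval in CHORD_INTERVALS[quality]:
        bits[(root_index + interval) % len(PITCHES)] = 1
    return bits
```

Under this assumption, `encode_chord("C", "maj")` turns on the C, E and G bits.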

IniHal Dataset

•  Children’s songs
    – 2 measures
    – 8th note resolution
    – Simple chords
    – 14 melodies

•  Generated similar songs

Main Dataset

•  Jazz licks
    – 4 measures
    – 12 beat subdivisions
        •  (32nd-note triplet resolution)
    – ii-V7-I-VI7 chord progression
    – 100+ licks
        •  Transcribed
        •  Handwritten

Windowing

•  Rather than training on the entire piece at once, break music into “windows”

•  Start with the first measure of a lick and gradually move window forward

•  Allows learning/generating arbitrary length music sequences with a fixed size network
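The windowing idea can be sketched as a sliding slice over the encoded beat subdivisions. The window length and the step size are free parameters here because the slides do not state them, and `windows` is a hypothetical helper name.

```python
def windows(frames, window_size, step=1):
    """Yield each run of `window_size` consecutive frames, advancing `step` frames per window."""
    for start in range(0, len(frames) - window_size + 1, step):
        yield frames[start:start + window_size]
```

Every window has the same length, so a network with a fixed number of visible nodes can be trained on licks of any length.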

Windowing

Generating New Melodies

•  Need to specify a chord sequence over which to generate a new melody

•  Chord bits are “clamped” during generation so that they can influence the melody being generated without changing themselves

•  Use a windowing strategy analogous to our windowed training method

•  As each successive beat is generated, the whole melody and chord sequence shifts forward to make room for the next beat
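The generation loop described above might be organized as in the sketch below. The trained network is represented by a hypothetical `resample` callable that takes the flattened window bits and returns new ones. Several details are assumptions not given in the slides: the number of resampling passes, the choice to emit the oldest slot of each window, and the random initialization of new slots.

```python
import random


def generate(chords, window_slots, melody_bits, resample, rng, passes=10):
    """Generate one melody slot per chord slot with a sliding window.

    chords: list of chord bit-lists, one per beat subdivision.
    resample: hypothetical stand-in for the trained network.
    """
    if len(chords) < window_slots:
        raise ValueError("chord sequence is shorter than the window")
    slot_len = melody_bits + len(chords[0])

    def random_melody():
        return [rng.randint(0, 1) for _ in range(melody_bits)]

    melody = [random_melody() for _ in range(window_slots)]
    generated = []
    for start in range(len(chords) - window_slots + 1):
        window_chords = chords[start:start + window_slots]
        for _ in range(passes):
            # Rebuild the input from the given chords on every pass, so the
            # chord bits stay clamped whatever the network outputs for them.
            flat = [b for m, c in zip(melody, window_chords) for b in m + c]
            flat = resample(flat)
            melody = [
                flat[k * slot_len:k * slot_len + melody_bits]
                for k in range(window_slots)
            ]
        generated.append(melody[0])              # emit the oldest slot
        melody = melody[1:] + [random_melody()]  # shift forward, seed a new slot
    generated.extend(melody[:-1])                # flush the final window
    return generated
```

Passing a trivial `resample`, such as one that returns all zeros, is enough to check the bookkeeping: the output has one melody slot per chord slot.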

[Figure: a chord seed with random data, then the same chord seed with the generated melody]

Results

•  Rhythmically stable

•  Respects chord tones

•  Occasional color tones

•  Very few foreign tones

Results

•  Trained on transpositions as well
    – Generated music following key of given chord progression
    – Succeeded with up to four transpositions

Example Training Licks

Example Generated Licks

More Generated Examples

Random Music

Future Work – Repeated Notes

•  Our machines produced disproportionate numbers of repeated notes

•  Can sound static or too immobile for jazz

Future Work – Repeated Notes

•  Possible solution: post-processing
    – Merge repeated notes together
    – Results in a smoother output, but starts to cross line of unsupervised learning

•  Ideally, machine should avoid repeated notes in the first place
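The post-processing idea could be sketched as follows, working on decoded slots rather than raw bits. The tuple representation (`("note", pitch, octave)`, `("sustain",)`, `("rest",)`) and the rule that a rest ends the current note are assumptions for illustration.

```python
def merge_repeated_notes(slots):
    """Replace a re-attack of the note that is already sounding with a sustain."""
    merged = []
    current = None  # the note currently sounding, if any
    for slot in slots:
        if slot[0] == "note":
            if slot == current:
                merged.append(("sustain",))  # same note again: tie it over
            else:
                merged.append(slot)
                current = slot
        elif slot[0] == "rest":
            merged.append(slot)
            current = None  # silence ends the current note
        else:
            merged.append(slot)  # an existing sustain passes through unchanged
    return merged
```

A run such as C, C, C becomes C followed by two sustains, while a note repeated after a rest is kept as a fresh attack.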

Future Work – Training Algorithm

•  Slow
    – Optimization
    – Parallelization
    – Adaptive termination

•  Sensitive to training presentation order
    – Randomize training inputs
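Randomizing the presentation order could be as simple as reshuffling the training windows before every pass. This is a minimal sketch; `shuffled_epochs` and its parameters are hypothetical names, and the slides do not say how often the order would be redrawn.

```python
import random


def shuffled_epochs(training_windows, num_epochs, rng):
    """Yield the training windows in a fresh random order for each epoch."""
    for _ in range(num_epochs):
        order = list(training_windows)  # copy so the caller's list keeps its order
        rng.shuffle(order)
        yield order
```

Passing a seeded `random.Random` instance as `rng` keeps the runs reproducible.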

Future Work – Chord Inference

•  We believe our work naturally lends itself to the open problem of inferring unknown chords for a melody
    – Currently we provide a chord seed to generate a melody
    – If we instead provide a melody as input, we could determine which chords fit that melody

Conclusion

•  Unsupervised learning algorithm

•  Based on probabilistic neural network theory

•  Able to create novel jazz licks based on an existing corpus

•  Minimal assumptions about musical knowledge

Learning to Play Jazz with Deep Belief Networks - Conference Version
