
Applications of Memristors in ANNs

Outline

• Brief intro to ANNs

• Firing rate networks
  – Single layer perceptron experiment
  – Other (simulation) examples

• Spiking networks and STDP

ANNs

An ANN is a bio-inspired, massively parallel network, i.e. a directed graph with nodes acting as neurons and edges acting as synapses. The functionality is learned during a training phase by changing the weights of the synapses.

• By topology
• By learning paradigm
• By coding of neural information

Very good review

Applications

Challenges

Complexity:
• ~10¹¹ neurons
• ~10¹⁵ synapses

Connectivity:
• ~1 : 10,000

Massive parallelism:
• "100-step rule": neurons fire at a few to several hundred hertz, yet face recognition takes ~100 ms, so at most ~100 sequential steps are available
• Cortex: 2–3 mm thick, ~2200 cm² in area

McCulloch‐Pitts neuron

1943

Different activation functions

By topology

By learning paradigm

Key questions: capacity, sample complexity, computational complexity

By information coding

• Firing rate vs. spiking models

Perceptron: Main idea

Single layer perceptron: inputs x1, x2, …, x9 plus a bias input x0; weights w0, w1, …, w9. Output:

y = sgn[Σ_{i=0}^{9} w_i x_i]

Hebbian rule

• Learning using local information

• Orientation selectivity

Multilayer perceptron

Key questions: number of layers, number of hidden neurons

Backpropagation

Gradient descent method to minimize a cost function (see the sketch below)
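As a minimal illustration (not the talk's own code), here is a single-neuron gradient-descent step in Python, assuming a quadratic cost; backpropagation applies the same idea layer by layer via the chain rule:

```python
import numpy as np

def gradient_descent_step(w, x, d, lr=0.1):
    """One gradient-descent step for a single linear neuron.

    Cost E = 0.5 * (d - y)^2 with y = w . x, so dE/dw = -(d - y) * x;
    stepping against the gradient reduces the cost.
    """
    y = w @ x                   # neuron output
    grad = -(d - y) * x         # gradient of E with respect to w
    return w - lr * grad        # move against the gradient

# toy usage: the output converges toward the target d
w, x, d = np.zeros(3), np.array([1.0, 0.5, -0.5]), 1.0
for _ in range(100):
    w = gradient_descent_step(w, x, d)
```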

Competitive learning

Learning binary patterns with a competitive network

Instar learning law (standard form): Δw_j = α · y_j · (x − w_j), i.e. the winning unit's weight vector moves toward the input.

What happens if more than four unique patterns are presented?

What happens when an all-white pattern is presented?

Complementary coding

• Resolves the no-signal issue for a particular (instar) learning law (a minimal sketch follows below)

• How to learn invariance? (translation, size, angle, etc.)
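A minimal sketch of complementary coding, assuming the usual convention of appending the complement 1 − x of each input in [0, 1]:

```python
import numpy as np

def complement_code(x):
    """Append the complement of the input, doubling its dimension.

    The coded vector has a constant L1 norm, so an all-white
    ("no signal") pattern still drives instar-style learning.
    """
    x = np.asarray(x, dtype=float)
    return np.concatenate([x, 1.0 - x])

print(complement_code([0.0, 0.0, 0.0]))  # -> [0. 0. 0. 1. 1. 1.]
```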

With added complex cells

• AND in the bottom layer, OR in the top; present one-hot patterns to the top layer

Perceptron: Main idea

Single layer perceptron on a 3×3 binary pixel array: inputs x1, …, x9 encode the pixels (x = +1 for black, x = −1 for white) plus a bias input x0; the weights w0, …, w9 are the hardware bottleneck. Output:

y = sgn[Σ_{i=0}^{9} w_i x_i]

Considered training/test patterns

Pattern "X": class d = +1; pattern "T": class d = −1.

Perceptron training rule: Δw_i = α · x_i(p) · (d(p) − y(p)) — a software sketch follows below.
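A software sketch of this training rule in Python; the 3×3 pixel patterns below are illustrative stand-ins, not necessarily the exact patterns used in the experiment:

```python
import numpy as np

# Illustrative 3x3 patterns (+1 = black, -1 = white)
X_pat = np.array([+1, -1, +1, -1, +1, -1, +1, -1, +1])  # "X", d = +1
T_pat = np.array([+1, +1, +1, -1, +1, -1, -1, +1, -1])  # "T", d = -1
patterns = [(X_pat, +1), (T_pat, -1)]

w = np.zeros(10)      # w[0] is the bias weight
alpha = 0.1           # learning rate

for epoch in range(10):
    for x, d in patterns:
        xb = np.concatenate([[1.0], x])      # prepend bias input x0 = +1
        y = 1 if w @ xb >= 0 else -1         # y = sgn(sum_i w_i x_i)
        w += alpha * xb * (d - y)            # dw_i = alpha * x_i * (d - y)
```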


Crossbar implementation

Input voltages encode the pattern, V ∝ x; each weight is a differential pair of conductances, G⁺ − G⁻ = G ∝ w. The inputs V0, V1, …, V9 drive conductance pairs G0±, …, G9±, and the output is taken from the difference of the two summed currents:

y = sgn[I⁺ − I⁻]  (parameter-analyzer-based measurement)

Alibart et al., submitted, 2012
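A numerical sketch of this differential readout (hypothetical conductance values; in the actual hardware the sums are Kirchhoff current sums measured by the parameter analyzer):

```python
import numpy as np

def crossbar_output(x, g_plus, g_minus, v_read=0.2):
    """Differential crossbar perceptron: y = sgn(I+ - I-).

    Inputs are applied as voltages V_i ~ x_i; each weight is the
    difference of a conductance pair, w_i ~ G_i+ - G_i-.
    """
    v = v_read * np.asarray(x, dtype=float)
    i_plus = v @ g_plus        # summed current in the '+' row
    i_minus = v @ g_minus      # summed current in the '-' row
    return 1 if i_plus - i_minus >= 0 else -1

# hypothetical conductances (mS) for bias + 9 pixel inputs
rng = np.random.default_rng(0)
g_plus = rng.uniform(0.1, 0.6, 10)
g_minus = rng.uniform(0.1, 0.6, 10)
x = np.concatenate([[1.0], np.ones(9)])   # bias plus an all-black pattern
print(crossbar_output(x, g_plus, g_minus))
```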

Widrow's memistor: the ADALINE concept … and its hardware implementation

Bernard Widrow, Marcian Hoff

B. Widrow and M. E. Hoff, Jr., IRE WESCON Convention Record 4, 96 (1960)

Pt/TiO2−x/Pt devices

g = I(0.2 V) / 0.2 V

Device stack: 25 nm Au / 15 nm Pt top electrode; 30 nm TiO2−x; 5 nm Ti / 25 nm Pt bottom electrode; e-beam patterned Pt protrusion (20 nm scale). [Figure: I–V loop, current (mA) vs. voltage (V)]

– Any state between ON and OFF
– In principle a dynamic system with frequency-dependent loop size, but …
– Strongly (super-exponentially) nonlinear switching dynamics
– Gray area (between −Vswitch and +Vswitch) = no change; the state is defined within the gray area

Alibart et al., submitted, 2012

Switching dynamics

[Figure: R/R0 vs. time under set/reset/read pulse trains. RESET transient: initialize to R0 = RON; SET transient: initialize to R0 = ROFF]

– Small pulse amplitude = finer state change, but may require exponentially long time
– Large pulse amplitude = faster, but at a cruder step

[Figure: current @ −200 mV vs. time for pulse voltages from −0.5 V to −1.3 V; the switching time shrinks steeply with pulse amplitude]

F. Alibart et al., Nanotechnology 23, 075201 (2012)

Nonlinear switching dynamics 

Effective barrier modulation due to:
1. heating (~kB·ΔT)
2. electric field (~E·a·q/2, with a the hop distance) acting on ion hopping (electrons and z+ ions between the electrodes)
3. phase transition or redox (oxidation/reduction) reaction

[Figure: energy vs. position; the initial activation barrier UA is lowered by ΔUA]

J. Yang et al., submitted, 2012

Speed vs. retention

With linear ionic transport, the store/write time ratio follows the drift-velocity ratio:

t_store / t_write ~ v(V_write) / v(V_store)

which is only linear in the voltage ratio — far too small. A nonlinear effect due to temperature and/or electric field makes the ratio exponential, e.g. temperature only:

t_store / t_write ~ e^{U_A / k_B T_store} / e^{U_A / k_B T_write}

D. Strukov et al., Appl. Phys. A 94, 515 (2009)
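A back-of-the-envelope check of that exponential, with illustrative numbers only (U_A = 1 eV, write temperature 500 K from local heating, storage at 300 K):

```python
import math

k_B = 8.617e-5                   # Boltzmann constant, eV/K
U_A = 1.0                        # activation barrier, eV (illustrative)
T_write, T_store = 500.0, 300.0  # local write vs. storage temperature, K

# t_store / t_write ~ exp(U_A / (k_B * T_store)) / exp(U_A / (k_B * T_write))
ratio = math.exp(U_A / (k_B * T_store) - U_A / (k_B * T_write))
print(f"t_store / t_write ~ {ratio:.1e}")  # ~5e6 for these numbers
```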

Switching statistics

[Figure: cumulative distributions of SET and RESET switching times (current @ 200 mV vs. cumulative time) for different pulse voltages, measured on 10 TiO2−x devices]

Large switching dynamics dispersion! (Alibart et al., submitted, 2012)

Variations in switching behavior

[Figure: relative conductance change g_AFTER/g_INITIAL after a single write pulse vs. pulse voltage and initial conductance g_INITIAL; g = I(0.2 V)/0.2 V; separate SET and RESET branches]

– Continuous state change (Alibart et al., submitted, 2012)

Tuning algorithm

Start (inputs: desired state I_desired, desired accuracy A_desired; initialize the write voltage to a small non-disturbing value V_WRITE = 200 mV and the voltage step V_STEP = 10 mV):
1. Read: apply V_READ = 200 mV and read the current I_current.
2. Processing: is the state reached within the required precision, i.e. (I_desired − I_current)/I_desired < A_desired? If yes → Finish.
3. Processing: check for overshoot and set the sign of the increment, i.e. sign = sign(I_current − I_desired); if V_WRITE != V_READ and sign != oldsign, then re-initialize V_WRITE = 200 mV.
4. Processing: V_WRITE = V_WRITE + sign × V_STEP; oldsign = sign.
5. Write: apply pulse V_WRITE; go to step 1.

Intuitive algorithm vs. implemented algorithm: [waveform diagrams of set/reset write pulses interleaved with non-disturbing read pulses]

F. Alibart et al., Nanotechnology 23, 075201 (2012)
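A software sketch of the implemented tuning loop, following the flowchart above; read() and write() are hypothetical stand-ins for the instrument calls, and the sign-to-polarity mapping (SET vs. RESET) is device-specific:

```python
def tune(read, write, i_desired, a_desired,
         v_read=0.2, v_step=0.01, max_pulses=10000):
    """Feedback-tune a memristor to the target read current i_desired.

    read(): apply V_READ = 200 mV and return the current.
    write(v): apply a single write pulse of amplitude v.
    """
    v_write, old_sign = v_read, 0   # start from a non-disturbing amplitude
    for _ in range(max_pulses):
        i_current = read()
        if abs(i_desired - i_current) / abs(i_desired) < a_desired:
            return True             # state reached within required precision
        sign = 1 if i_current > i_desired else -1  # sign of (I_current - I_desired)
        if v_write != v_read and sign != old_sign:
            v_write = v_read        # overshoot detected: restart from 200 mV
        v_write += sign * v_step    # ramp the write amplitude by V_STEP
        old_sign = sign
        write(v_write)
    return False                    # did not converge within max_pulses
```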

High precision tuning

[Figure: current @ −200 mV vs. pulse number while tuning TiO2−x devices (w/o protrusion) to 7, 15, 30, 60, and 120 µA targets, with increase-weight, decrease-weight, and stand-by (read-only) phases]

(g_des − g_act)/g_des < 1%, i.e. ~8-bit precision

F. Alibart et al., Nanotechnology 23, 075201 (2012)

Limitation to tuning accuracy: Random telegraph noise

[Figure: resistance time traces (R/R fluctuations, %) and normalized current noise spectra PSD/I² (Hz⁻¹) vs. frequency for device resistances from 0.5 kΩ to 5 kΩ]

– Solid-state electrolyte (electrochemical) devices are noisier
– The higher the resistance, the larger the noise
– For a-Si devices the limit is ~5–6-bit precision (but no optimization was attempted)

Ligang Gao et al., VLSI-SoC, 2012

Perceptron experimental setup

– Switching matrix (Agilent E5250A)
– Arbitrary waveform generator: B1530
– Current measurement: B1530 (fast IV mode)
– Ground (GNDU, Agilent)
– Agilent B1500 parameter analyzer
– Wires implementing the crossbar circuit
– Chip-packaged, wire-bonded memristive devices

Alibart et al., submitted, 2012

Perceptron: Ex-situ training

Evolution of synaptic conductance upon sequential tuning: [Figure: synaptic weights g_i+, g_i− (mS), i = 1…10, vs. pulse number; alternating read and write pulses with amplitudes within ±Vswitch; final weights after programming]

– Weight import accuracy ~10%
– Crossbar half-select trick: half-selected devices are only slightly affected (>5-bit precision)

Alibart et al., submitted, 2012

Perceptron: In‐situ training

Evolution of synaptic conductance upon parallel tuning (V_train = 0.9 V and 1 V):

In-situ training rule: Δg_i± = ±α · x_i · (d(p) − y(p)), with α a function of (V, g). Each update is applied in four steps, s1–s4, covering the sign combinations (s1: x = +1; s2: x = −1; s3: d = +1; s4: d = −1); selected devices see ±V_train while half-selected ones see only ±V_train/2, within the ±Vswitch gray area.

[Figure: Δg (mS) for g1± and g4± vs. training epoch, with the corresponding pulse waveforms at each electrode]

Alibart et al., submitted, 2012
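In software, the four pulse steps collapse into one signed, fixed-magnitude update per conductance pair (all devices of the row are updated in parallel); a sketch under that simplification, ignoring α's dependence on V and g:

```python
import numpy as np

def insitu_epoch(g_plus, g_minus, patterns, alpha=0.01):
    """One in-situ training epoch over differential conductance pairs."""
    for x, d in patterns:            # x entries are +1/-1, d is the class
        y = 1 if x @ (g_plus - g_minus) >= 0 else -1
        if y == d:
            continue                 # no error -> no training pulses
        delta = alpha * x * (d - y)  # dg_i(+/-) = (+/-)alpha * x_i * (d - y)
        g_plus += delta
        g_minus -= delta
        np.clip(g_plus, 0.0, None, out=g_plus)    # conductances stay positive
        np.clip(g_minus, 0.0, None, out=g_minus)
    return g_plus, g_minus
```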

Results

[Figure: histograms of the number of patterns vs. I⁺ − I⁻ for the "X" and "T" classes]

Ex-situ training:
– Initial (random weights)
– Weight import accuracy ~40% → ~10% → ~2%: the two classes separate progressively

In-situ training:
– Initial (random weights)
– After 10 epochs with V_train = 0.9 V
– After 7 more epochs with V_train = 1 V

– 3-bit precision is enough for the considered task

Alibart et al., submitted, 2012

Big picture

Tight integration with CMOS logic (CMOL): a memristive crossbar add-on on top of the CMOS stack implements a multi-layer perceptron network. Each weight w_ji is stored as a memristor conductance g_ji, and each CMOS cell computes the neuron output y_j from the weighted sum Σ_i x_i g_ji.

Spiking Networks and Spike-Timing-Dependent Plasticity (STDP)

Spiking vs. firing-rate neural networks:

Firing-rate networks: the average frequency matters (high frequency → level 1, low frequency → level 0).

Spiking networks: the relative timing of the spikes matters, and the delay between neurons matters; this enriches the functionality.

Spiking neural networks: spatiotemporal processing. Known to happen in biology, e.g. detecting the direction of a sound with two sensors and two neurons.

Polychronization: Computation with Spikes

• According to Izhikevich, accounting for the timing of spikes allows the capacity of the network to be increased beyond that of Hopfield networks.

Hopfield Networks

Binary Hopfield network

v_i(t+1) = sgn[Σ_j w_ij v_j(t)]

Capacity: p_max = N / log N (a minimal sketch of the network follows below)
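A minimal sketch of the binary Hopfield dynamics quoted above, with standard Hebbian storage (illustrative; the capacity statement concerns random patterns in the large-N limit):

```python
import numpy as np

def hopfield_store(patterns):
    """Hebbian weights w_ij = sum_p v_i^p v_j^p, zero diagonal."""
    P = np.asarray(patterns, dtype=float)
    w = P.T @ P
    np.fill_diagonal(w, 0.0)
    return w

def hopfield_recall(w, v, steps=10):
    """Iterate v_i(t+1) = sgn(sum_j w_ij v_j(t))."""
    for _ in range(steps):
        v = np.where(w @ v >= 0, 1, -1)
    return v

stored = np.array([1, -1, 1, -1, 1, -1, 1, -1])
w = hopfield_store([stored])
noisy = stored.copy()
noisy[0] = -noisy[0]                                      # flip one bit
print(np.array_equal(hopfield_recall(w, noisy), stored))  # True
```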

Polychronization: Computation with Spikes

Due to STDP, the system can self-organize to activate various polychronous groups.

Spike Timing Dependent Plasticity 

STDP Implementation (first attempt)

"… we have implemented a CMOS neuron circuit to convert the relative timing information of the neuron spikes into pulse-width information seen by the memristor synapse"

STDP Implementation Proposal for Memristors

Assumed rate of conductance change as a function of the applied voltage (a sketch of the targeted STDP window follows below).
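For reference, the exponential STDP window such proposals aim to reproduce, with illustrative constants (the actual conductance-change-vs-voltage curve is device-specific):

```python
import math

def stdp_dw(dt, a_plus=0.01, a_minus=0.012, tau=0.02):
    """Weight change vs. spike timing dt = t_post - t_pre (seconds).

    Pre-before-post (dt > 0) potentiates; post-before-pre depresses.
    """
    if dt > 0:
        return a_plus * math.exp(-dt / tau)    # potentiation branch
    return -a_minus * math.exp(dt / tau)       # depression branch

for dt in (-0.04, -0.01, 0.01, 0.04):
    print(f"dt = {dt:+.2f} s -> dw = {stdp_dw(dt):+.5f}")
```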

STDP Implementation with PCM

Long-Term Depression and Short-Term Potentiation

Electronic Pavlov’s Dog

Snider's Spiking Networks

Example: Network Self-Organization (Spatial Orientation Filter Array)

[Diagram: adaptive recurrent network; inputs x_i feed an output through excitatory (+) and inhibitory (−) connections]

G. Snider, Nanotechnology 18, 365202 (2007)