June 2003 Neural Computation for Time Series 1
Neural Computation and Applications in Time Series and Signal Processing

Georg Dorffner
Dept. of Medical Cybernetics and Artificial Intelligence, University of Vienna
and
Austrian Research Institute for Artificial Intelligence
Neural Computation
• Originally biologically motivated (information processing in the brain)
• Simple mathematical model of the neuron → neural network
• Large number of simple "units"
• Massively parallel (in theory)
• Complexity through the interplay of many simple elements
• Strong relationship to methods from statistics
• Suitable for pattern recognition
A Unit
• Propagation rule:
  – Weighted sum
  – Euclidean distance
• Transfer function f:
  – Threshold fct. (McCulloch & Pitts)
  – Linear fct.
  – Sigmoid fct.
  – Gaussian fct.

[Figure: a unit (neuron) with weights $w_1, w_2, \ldots, w_i$ on its inputs, the (net) input $x_j$, and the activation/output $y_j = f(x_j)$]
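A unit of this kind fits in a few lines. The sketch below assumes the weighted-sum propagation rule and implements the four transfer functions listed above; the weights and inputs are purely illustrative.

```python
import math

# one unit: weighted-sum propagation rule plus a choice of transfer function f
# (weights and inputs below are illustrative)
def unit_output(inputs, weights, transfer="sigmoid", threshold=0.0):
    net = sum(w * x for w, x in zip(weights, inputs))  # (net) input x_j
    if transfer == "threshold":                        # McCulloch & Pitts
        return 1.0 if net >= threshold else 0.0
    if transfer == "linear":
        return net
    if transfer == "sigmoid":
        return 1.0 / (1.0 + math.exp(-net))
    if transfer == "gaussian":
        return math.exp(-net * net)
    raise ValueError(transfer)

print(unit_output([1.0, 2.0], [0.5, -0.25], "linear"))   # 0.0
print(unit_output([0.0], [1.0], "sigmoid"))              # 0.5
```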
Multilayer Perceptron (MLP), Radial Basis Function Network (RBFN)
• 2 (or more) layers (= connections)
• Input units → hidden units (typically nonlinear) → output units (typically linear)

MLP: $x_j^{\text{out}} = f_2\Big(\sum_l v_{jl}\, f_1\big(\sum_i w_{li}\, x_i^{\text{in}}\big)\Big)$

RBFN: $x_j^{\text{out}} = \sum_l v_{jl} \exp\Big(-\frac{\lVert \mathbf{x}^{\text{in}} - \mathbf{w}_l \rVert^2}{2\sigma_l^2}\Big)$
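The two forward passes can be sketched side by side. This is a minimal illustration with random weights (not trained networks); the layer sizes and the shared output weights are assumptions for the example.

```python
import numpy as np

rng = np.random.default_rng(0)
n_in, n_hid, n_out = 3, 5, 2
x = rng.normal(size=n_in)                 # input units

# MLP: sigmoid hidden layer, linear output layer
W = rng.normal(size=(n_hid, n_in))        # first layer of connections (w_li)
V = rng.normal(size=(n_out, n_hid))       # second layer of connections (v_jl)
hidden_mlp = 1.0 / (1.0 + np.exp(-(W @ x)))
out_mlp = V @ hidden_mlp                  # linear output units

# RBFN: Gaussian hidden units around centres, linear output layer
centres = rng.normal(size=(n_hid, n_in))  # the hidden "weights" w_l act as centres
sigma = 1.0
hidden_rbf = np.exp(-np.sum((x - centres) ** 2, axis=1) / (2 * sigma ** 2))
out_rbf = V @ hidden_rbf

print(out_mlp.shape, out_rbf.shape)       # (2,) (2,)
```

The only structural difference is the hidden layer: a sigmoid of a weighted sum versus a Gaussian of a distance to a centre.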
MLP as Universal Function Approximator
• E.g.: 1 input, 1 output, 5 hidden units
• An MLP can approximate arbitrary functions (Hornik et al. 1990)
• through superposition of weighted sigmoids:

  $x_k^{\text{out}} = g_k(\mathbf{x}^{\text{in}}) = \sum_{j=1}^{m} w_{kj}^{\text{out}} f\Big(\sum_{i=1}^{n} w_{ji}^{\text{hid}} x_i^{\text{in}} + w_{j0}^{\text{hid}}\Big) + w_{k0}^{\text{out}}$

  the bias $w_{j0}$ moves each sigmoid; the weights stretch and mirror it
• The same is true for RBFNs
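The roles of weight and bias can be checked directly: a minimal sketch (the parameter values are illustrative) showing that the weight mirrors/stretches a sigmoid, the bias shifts it, and the output is a weighted sum of such units.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# one hidden unit: the weight w stretches (|w| > 1) or mirrors (w < 0) the sigmoid,
# the bias b moves it along the input axis
def hidden(x, w, b):
    return sigmoid(w * x + b)

# the output is a weighted sum of such units: the superposition used above
def g(x, params):
    return sum(v * hidden(x, w, b) for v, w, b in params)

print(hidden(1.0, -1.0, 0.0) == sigmoid(-1.0))  # True: w = -1 mirrors the curve
print(hidden(0.0, 2.0, 3.0) == sigmoid(3.0))    # True: b = 3 shifts it
```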
Training (Model Estimation)
• Typical error function (summed squared error):

  $E = \sum_{i=1}^{n} \sum_{k=1}^{m} (x_{ik}^{\text{out}} - t_{ik})^2$   (summed over all patterns $i$ and all outputs $k$; $t$ = target)

• "Backpropagation" (application of the chain rule):

  $\frac{\partial E}{\partial w_i} = \frac{\partial E}{\partial x^{\text{out}}} \cdot \frac{\partial x^{\text{out}}}{\partial w_i}$

  (contribution of the error function · contribution of the network)

  $\delta_j^{\text{out}} = f'(x_j^{\text{out}})\,(t_j - y_j^{\text{out}})$

  $\delta_j^{\text{hid}} = f'(x_j^{\text{hid}}) \sum_{k=1}^{n} \delta_k^{\text{out}} w_{kj}$

• Iterative optimisation based on the gradient (gradient descent, conjugate gradient, quasi-Newton)
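A minimal numeric check of these formulas, assuming a one-hidden-layer network with sigmoid hidden units and linear outputs (sizes and data are illustrative): the backpropagated gradient of one weight is compared with a finite-difference estimate.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(1)
x = rng.normal(size=3)                    # one input pattern
t = rng.normal(size=2)                    # its target
W = rng.normal(size=(4, 3))               # input -> hidden weights
V = rng.normal(size=(2, 4))               # hidden -> output weights

def forward(W, V):
    h = sigmoid(W @ x)                    # sigmoid hidden layer
    y = V @ h                             # linear output layer
    return h, y, np.sum((y - t) ** 2)     # summed squared error

h, y, E = forward(W, V)
delta_out = 2.0 * (y - t)                    # f' = 1 for linear output units
delta_hid = (V.T @ delta_out) * h * (1 - h)  # sigmoid' = h(1 - h)
grad_V = np.outer(delta_out, h)              # dE/dV
grad_W = np.outer(delta_hid, x)              # dE/dW

# sanity check: one backpropagated gradient against a numerical one
eps = 1e-6
W_pert = W.copy()
W_pert[0, 0] += eps
numeric = (forward(W_pert, V)[2] - E) / eps
print(abs(numeric - grad_W[0, 0]) < 1e-4)  # True
```

A gradient-descent step would then be `W -= lr * grad_W; V -= lr * grad_V`.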
Recurrent Perceptrons
• Recurrent connection = feedback loop
• From the hidden layer ("Elman") or the output layer ("Jordan")
• Learning: "backpropagation through time"

[Figure: input layer plus a state or context layer, filled by a copy of the hidden or output activations]
Time series processing
• Given: time-dependent observables $x_t,\ t = 0, 1, \ldots$
• Scalar: univariate; vector: multivariate
• Typical tasks:
  – Forecasting
  – Noise modeling
  – Pattern recognition
  – Modeling
  – Filtering
  – Source separation
• Time series (minutes to days) vs. signals (milliseconds to seconds)
Examples

• Standard & Poor's, preprocessed (returns): $r_t = x_t - x_{t-1}$
• Sunspots, preprocessed (de-seasoned): $s_t = x_t - x_{t-11}$
Autoregressive models
• Forecasting: making use of past information to predict (estimate) the future
• AR: Past information = past observations
  $x_t = F(x_{t-1}, x_{t-2}, \ldots, x_{t-p}) + \epsilon_t$

  with the past observations $X_{t,p} = (x_{t-1}, \ldots, x_{t-p})$, the expected value $\hat{x}_t = F(X_{t,p})$, and the noise ("random shock") $\epsilon_t$

• Best forecast: the expected value $\hat{x}_t$
Linear AR models
• Most common case:

  $x_t = \sum_{i=1}^{p} a_i x_{t-i} + \epsilon_t$

• Simplest form: random walk

  $x_t = x_{t-1} + \epsilon_t; \quad \epsilon_t \sim N(0, 1)$

• Nontrivial forecast impossible
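A quick simulation makes the random-walk case concrete (length and seed are arbitrary choices): the best forecast of $x_t$ is just $x_{t-1}$, so the forecast errors are exactly the shocks and average out near zero.

```python
import random

random.seed(0)
# random walk: x_t = x_{t-1} + eps_t, eps_t ~ N(0, 1)
x = [0.0]
for _ in range(1000):
    x.append(x[-1] + random.gauss(0, 1))

# forecasting x_t by x_{t-1} leaves only the shocks as errors
errors = [x[t] - x[t - 1] for t in range(1, len(x))]
mean_err = sum(errors) / len(errors)
print(abs(mean_err) < 0.2)  # True: nothing systematic is left to predict
```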
MLP as NAR
• A neural network can approximate a nonlinear AR model
• "time window" or "time delay" network: the past p observations serve as the input vector
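Building the time-window inputs for such a network is a simple sliding-window transformation; the helper below is an illustrative sketch.

```python
def time_windows(series, p):
    """Turn a series into (past p values, next value) training pairs for a NAR model."""
    X, y = [], []
    for t in range(p, len(series)):
        X.append(series[t - p:t])  # time window: the p most recent observations
        y.append(series[t])        # the value to forecast
    return X, y

X, y = time_windows([1, 2, 3, 4, 5], p=2)
print(X, y)  # [[1, 2], [2, 3], [3, 4]] [3, 4, 5]
```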
Noise modeling
• Regression is density estimation of (Bishop 1995):

  $p(\mathbf{t}, \mathbf{x}) = p(\mathbf{t} \mid \mathbf{x})\, p(\mathbf{x})$

• Likelihood:

  $L = \prod_{i=1}^{n} p(\mathbf{t}_i \mid \mathbf{x}_i)\, p(\mathbf{x}_i)$

  where $p(\mathbf{t} \mid \mathbf{x})$ is a distribution with expected value $F(\mathbf{x}_i)$; target = future, input = past
Gaussian noise
• Likelihood:

  $L = \prod_{t=1}^{n} p(x_t \mid X_{t,p}) = \prod_{t=1}^{n} \frac{1}{\sqrt{2\pi}\,\sigma} \exp\Big(-\frac{(x_t - F(X_{t,p}; \mathbf{W}))^2}{2\sigma^2}\Big)$

• Maximization = minimization of $-\log L$ (constant terms can be dropped, incl. $p(\mathbf{x})$)
• Corresponds to the summed squared error (typical backpropagation):

  $E = \sum_{t=1}^{n} \big(x_t - F(X_{t,p}; \mathbf{W})\big)^2$
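The equivalence is easy to verify numerically: under unit-variance Gaussian noise, $-\log L$ equals a constant plus half the summed squared error, so both are minimised by the same weights. The observations and model outputs below are made up for illustration.

```python
import math

xs = [0.5, -1.2, 0.3]      # observations x_t (illustrative)
preds = [0.4, -1.0, 0.0]   # model outputs F(X_{t,p}; W) (illustrative)

# Gaussian likelihood with unit variance
neg_log_L = sum(0.5 * math.log(2 * math.pi) + 0.5 * (x - f) ** 2
                for x, f in zip(xs, preds))
sse = sum((x - f) ** 2 for x, f in zip(xs, preds))

# -log L = const + SSE / 2: minimising one minimises the other
const = len(xs) * 0.5 * math.log(2 * math.pi)
print(abs(neg_log_L - (const + sse / 2)) < 1e-12)  # True
```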
Complex noise models
• Assumption: an arbitrary distribution $\epsilon_t \sim D(\theta)$
• Parameters are time dependent (dependent on the past): $\theta_t = g(X_{t,p})$
• Likelihood:

  $L = \prod_{i=1}^{N} d\big(x_i; g(X_{i,p})\big)$

  where $d$ is the probability density function of $D$
Heteroskedastic time series
• Assumption: the noise is Gaussian with time-dependent variance

  $L = \prod_{t=1}^{N} \frac{1}{\sqrt{2\pi\,\sigma^2(X_{t,p})}} \exp\Big(-\frac{(x_t - F(X_{t,p}))^2}{2\sigma^2(X_{t,p})}\Big)$

• ARCH model:

  $\sigma_t^2 = \sum_{i=1}^{p} a_i r_{t-i}^2$

• An MLP is a nonlinear ARCH model (when applied to returns/residuals):

  $\hat{r}_t = F(r_{t-1}, r_{t-2}, \ldots, r_{t-p}), \quad \sigma_t^2 = F'(r_{t-1}, r_{t-2}, \ldots, r_{t-p})$
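The linear ARCH recursion above can be sketched directly; the coefficients and returns below are illustrative, not fitted values.

```python
# ARCH(p): sigma_t^2 = sum_{i=1..p} a_i * r_{t-i}^2
def arch_variance(returns, a):
    p = len(a)
    sig2 = []
    for t in range(p, len(returns)):
        sig2.append(sum(a[i] * returns[t - 1 - i] ** 2 for i in range(p)))
    return sig2

r = [0.1, -0.2, 0.05, 0.3]               # returns (illustrative)
sig2 = arch_variance(r, a=[0.5, 0.3])    # one sigma_t^2 per forecastable step
print(sig2)
```

Replacing the weighted sum of squared returns by a network output gives the nonlinear version.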
Non-Gaussian noise
• Other parametric pdfs (e.g. the t-distribution)
• Mixture of Gaussians (mixture density network, Bishop 1994):

  $d(x_t; X_{t,p}) = \sum_{i=1}^{k} \alpha_i(X_{t,p})\, \frac{1}{\sqrt{2\pi}\,\sigma_i(X_{t,p})} \exp\Big(-\frac{(x_t - \mu_i(X_{t,p}))^2}{2\sigma_i^2(X_{t,p})}\Big)$

• Network with 3k outputs (or separate networks): a mixing coefficient $\alpha_i$, a mean $\mu_i$, and a variance $\sigma_i^2$ per component
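Evaluating the mixture density itself is straightforward; in a mixture density network the 3k parameters would be network outputs conditioned on the past, while here they are fixed illustrative values.

```python
import math

def mixture_pdf(x, alphas, mus, sigmas):
    """Mixture of Gaussians: d(x) = sum_i alpha_i * N(x; mu_i, sigma_i^2)."""
    return sum(a / (math.sqrt(2 * math.pi) * s)
               * math.exp(-(x - m) ** 2 / (2 * s ** 2))
               for a, m, s in zip(alphas, mus, sigmas))

# illustrative parameters; an MDN would emit these as functions of X_{t,p}
d = mixture_pdf(0.0, alphas=[0.7, 0.3], mus=[0.0, 1.0], sigmas=[1.0, 0.5])
print(d)
```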
Identifiability problem

• Mixture models (like neural networks) are not identifiable (parameters cannot be interpreted)
• No distinction between model and noise, e.g. in the sunspot data
• Models have to be treated with care
Recurrent networks: Moving Average
• Second model class: moving average (MA) models
• Past information: random shocks

  $x_t = \sum_{i=0}^{q} b_i \epsilon_{t-i}$

• Recurrent (Jordan) network: nonlinear MA, feeding back $\epsilon_t = x_t - \hat{x}_t$
• However, convergence is not guaranteed
GARCH
• Extension of ARCH:

  $\sigma_t^2 = \sum_{i=1}^{p} a_i r_{t-i}^2 + \sum_{i=1}^{p} b_i \sigma_{t-i}^2$

• Explains "volatility clustering"
• A neural network can again serve as a nonlinear version
• Using past estimates: a recurrent network
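The GARCH recursion in its simplest one-lag form can be sketched as follows; the coefficients, the starting variance, and the returns are illustrative, not estimates.

```python
# GARCH(1,1)-style recursion: sigma_t^2 = a1*r_{t-1}^2 + b1*sigma_{t-1}^2
def garch11(returns, a1, b1, sig2_0):
    sig2 = [sig2_0]                       # starting variance (assumed)
    for t in range(1, len(returns)):
        sig2.append(a1 * returns[t - 1] ** 2 + b1 * sig2[-1])
    return sig2

sig2 = garch11([0.1, 0.2, 0.1], a1=0.2, b1=0.7, sig2_0=1.0)
print(sig2)
```

The feedback on past variance estimates is what makes the neural analogue a recurrent network.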
State space models
• Observables depend on a (hidden) time-variant state:

  $\mathbf{s}_t = \mathbf{A}\mathbf{s}_{t-1} + \mathbf{B}\boldsymbol{\epsilon}_t, \quad \mathbf{x}_t = \mathbf{C}\mathbf{s}_t + \boldsymbol{\eta}_t$

• Strong relationship to recurrent (Elman) networks
• Nonlinear version only with additional hidden layers
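Simulating such a model shows the separation between hidden state and observable; the matrices and noise scales below are illustrative choices, not estimated quantities.

```python
import numpy as np

rng = np.random.default_rng(2)
# s_t = A s_{t-1} + B eps_t (hidden state),  x_t = C s_t + eta_t (observable)
# A, B, C are illustrative, not estimated from data
A = np.array([[0.9, 0.1], [0.0, 0.8]])
B = 0.1 * np.eye(2)
C = np.array([[1.0, 0.5]])

s = np.zeros(2)
xs = []
for _ in range(50):
    s = A @ s + B @ rng.normal(size=2)             # state evolves, never observed
    xs.append((C @ s)[0] + 0.05 * rng.normal())    # only x_t is observed

print(len(xs))  # 50
```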
Symbolic time series
• Examples:
  – DNA
  – Text
  – Quantised time series (e.g. "up" and "down")
• $x_t = s_i \in \Sigma$ (a symbol from an alphabet)
• Past information: the past p symbols → a probability distribution

  $p(x_t \mid x_{t-1}, x_{t-2}, \ldots, x_{t-p})$

• Markov chains
• Problem: long substrings are rare
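A Markov-chain estimate of these conditional probabilities is just substring counting; the "up"/"down" sequence below is illustrative. For large p the contexts become long substrings, which makes the rarity problem visible immediately.

```python
from collections import Counter

def markov_probs(sequence, p=1):
    """Estimate p(x_t | previous p symbols) from substring counts."""
    context_counts = Counter()
    joint_counts = Counter()
    for t in range(p, len(sequence)):
        ctx = sequence[t - p:t]                    # the past p symbols
        context_counts[ctx] += 1
        joint_counts[(ctx, sequence[t])] += 1
    return {key: cnt / context_counts[key[0]] for key, cnt in joint_counts.items()}

probs = markov_probs("uudduud", p=1)   # quantised series: "u" = up, "d" = down
print(probs[("u", "d")])  # 0.5
```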
Fractal prediction machines
• Similar subsequences are mapped to points close in space
• Clustering = extraction of stochastic automaton
Relationship to recurrent network
• Network of 2nd order
Other topics
• Filtering: corresponds to ARMA models; NNs as nonlinear filters
• Source separation: independent component analysis
• Relationship to stochastic automata
Practical considerations
• Stationarity is an important issue
• Preprocessing (trends, seasonalities)
• N-fold cross-validation done time-wise (each validation set must come after its training set)
• Mean and standard deviation → model selection

[Figure: the series split into train, validation, and test segments in temporal order]
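The time-wise split can be sketched as a small helper; the fold count and validation fraction are illustrative choices, and the essential property is that every validation range lies after its training range.

```python
def timewise_folds(n, n_folds, val_frac=0.2):
    """Time-wise N-fold cross-validation: each validation set lies AFTER its training set."""
    folds = []
    for k in range(1, n_folds + 1):
        end = k * (n // n_folds)            # use the first k segments of the series
        split = int(end * (1 - val_frac))   # the last stretch becomes validation
        folds.append((range(0, split), range(split, end)))
    return folds

folds = timewise_folds(100, n_folds=4)
for train, val in folds:
    print(len(train), len(val))  # validation always follows training in time
```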
Summary
• Neural networks are powerful semi-parametric models for nonlinear dependencies
• Can be considered as nonlinear extensions of classical time series and signal processing techniques
• Applying semi-parametric models to noise modeling adds another interesting facet
• Models must be treated with care; much data is necessary