Introduction to Numerical Optimizationcsg.sph.umich.edu/abecasis/class/2008/615.14.pdfNumerical...

Introduction to Numerical Optimization

Biostatistics 615/815Lecture 14Lecture 14

Course is More Than Half Done!

If you have comments…… they are very welcomey y• Lectures• Lecture notes• Weekly Homework • Midterm• Content

Last Lecture

Computer generated “random” numbers

Linear congruential generators• Improvements through shuffling summingImprovements through shuffling, summing

Importance of using validated generatorsImportance of using validated generators• Beware of problems with the default rand()

functionu c o

Today …

Root finding

Minimization for functions of one variable

Ideas:• Li it• Limits on accuracy• Local approximations

Numerical Optimization

Consider some function f(x)• e.g. Likelihood for some model …

Find the value of x for which f takes a maximum or minimum valuemaximum or minimum value

Maximization and minimization are equivalentMaximization and minimization are equivalent• Replace f(x) with –f(x)

Algorithmic Objectives

Solve problem…• Conserve CPU time• Conserve memory

Most often, the CPU time is dominated by the cost of evaluating f(x)• Minimize the number of evaluations

The Minimization Problem

Specific Objectives

Finding global minimum• The lowest possible value of the function• Extremely hard problem

Finding local minimum• Smallest value within finite neighborhood

Typical Quality Checks

When solving an optimization problem it is good practice to check the quality of the solution

Try different starting values …

Perturb solution and repeat …

A Quick Detour

Consider the problem of finding zeros for f(x)

Assume that you know:• Point a where f(a) is positive• P i t b h f(b) i ti• Point b where f(b) is negative• f(x) is continuous between a and b

How would you proceed to find x such that f(x)=0?

Root Finding in Cdouble zero(double (*func)(double), double lo, double hi, double e)

{while (true)

{double d = hi – lo;double d = hi – lo;double point = lo + d * 0.5;double fpoint = (*func)(point);

if (fpoint < 0.0)( po t . ){ d = lo – point; lo = point; }

else{ d = point – hi; hi = point; }

if (fabs(d) < e || fpoint == 0.0) return point;

Improvements to Root Finding

Consider the following approximation:

abafbfaxafxf

−−

−+=)()()()()(*

Select new trial point such that f*(x) is zero.

Improved Root Finding in Cdouble zero (double (*func)(double), double lo, double hi, double e)

{double flo = (*func)(lo);double fhi = (*func)(hi);while (1)while (1)

{double d = hi – lo;double point = lo + d * flo / (flo – fhi);double fpoint = (*func)(point);doub e po t ( u c)(po t);

if (fpoint < 0.0){ d = lo – point; lo = point; flo = fpoint; }

else{ d = point – hi; hi = point; fhi = fpoint; }

if (fabs(d) < e || fpoint == 0.0) return point;

Performance Comparison

Find the zero for sin(x) • In the interval -π/4 to π/2• Accuracy parameter set to 10-5

Bi ti th d d 17 ll t i ( )Bisection method used 17 calls to sin(x)

Approximation used 5 calls to sin(x)

Program That Uses Root Findingdouble zero (double (*func)(double), double lo, double hi, double e);

double my_function(double x){t (4* 3)return (4*x – 3);

int main(int argc, char ** argv){{double solution = zero(my_function, -5, +5, 1e-5);

printf(“Zero for my function is %.3f at %.3f\n”,f ti ( l ti ) l ti )my_function(solution), solution);

Notes on Root Finding

The 2nd method we implemented is the False Position Method

In the bisection method, the bracketing i t l i h l d t h tinterval is halved at each step

For well-behaved functions, the False Position Method will converge faster, but there is no performance guaranteethere is no performance guarantee

Questions on Root Finding

What care is required in setting precision?

How to set starting brackets for minimum?• If the function was monotonic?If the function was monotonic?• If there is a specific target interval?

What would happen for a function such as f(x) = 1 / (x – c)f(x) 1 / (x c)

Back to Numerical Optimization

Consider some function f(x)• e.g. Likelihood for some model …

Find the value of x for which f takes a maximum or minimum valuemaximum or minimum value

Maximization and minimization are equivalentMaximization and minimization are equivalent• Replace f(x) with –f(x)

Notes from Root Finding

Introduces two useful ideas that we’ll apply to function minimization

BracketingBracketing• Keep track of interval containing solution

Accuracy• Recognize that solution has limited precisionRecognize that solution has limited precision

Note on Accuracy

When estimating minima and bracketing intervals, floating point accuracy must be considered

In general, if the machine precision is εthe achievable accuracy is no more than ysqrt(ε)

Note on Accuracy II

The error results from the second term in the Taylor approximation:

1 ))(()()( bxbfbfxf −′′+≈

For functions where higher order terms are i t t ld b limportant, accuracy could be even lower. • For example, the minimum for f(x) = 1 + x4 is only

estimated to about ε1/4

Outline of Minimization Strategy

Part I• Bracket minimum

Part II• Successively tighten bracketing intervalSuccessively tighten bracketing interval

Detailed Minimization Strategy

Find 3 points such that • a < b < c• f(b) < f(a) and f(b) < f(c)

Then search for minimum by• Selecting trial point in intervalSelecting trial point in interval• Keep minimum and flanking points

Minimization after Bracketing

Part I: Part I: Finding a Bracketing Interval

Consider two points• a, b• f(a) > f(b)

Take successively larger steps beyond b until function starts increasing

Bracketing in C##define SCALE 1.618

void bracket (double (*f)(double), double* a, double* b, double* c){double fa = (*f)( *a);double fa = (*f)( *a);double fb = (*f)( *b);double fc = (*f)( *c = *b + SCALE * (*b - *a) );

while (fb > fc)e ( b c){*a = *b; fa = fb;*b = *c; fb = fc;*c = *b + SCALE * (*b - *a);fc = (*f) (*c);}

Bracketing in C++##define SCALE 1.618

void bracket (double (*f)(double), double & a, double & b, double & c){double fa = (*f)(a);double fa = (*f)(a);double fb = (*f)(b);double fc = (*f)(c = b + SCALE * (b - a) );

while (fb > fc)e ( b c){a = b; fa = fb;b = c; fb = fc;c = b + SCALE * (b - a);fc = (*f) (c);}

Part II:Part II:Finding Minimum after Bracketing

Given 3 points such that • a < b < c• f(b) < f(a) and f(b) < f(c)

How do we select new trial point?

Consider …

BA B C

What is the best location for a new point X?

Consider …

BA B CX

We want to minimize the size of the next h i t l hi h ill b ithsearch interval which will be either

from A to X or from B to C

Formulae …

−−

length have willSegments

soypossibilitcaseworstminimizewant toWe

or 1 zww +−

so...y possibilitcaseworst minimize want toWe

Effectively …ii lTh

21iscaseoptimal The

−= wz

givesThis

38197.05-3w

gives This

Golden Search

The Golden Ratio

Bracketing Triplet

The Golden Ratio

A B CX

New Point

A B CX

0.38196 0.38196

The number 0.38196 is related to the golden mean studied by Pythagoras

The Golden Ratio

New Bracketing Triplet

0.38196

Alternative New Bracketing Triplet

0.38196

Golden Search

Reduces bracketing by ~40% after each function evaluation

Performance is independent of the functionPerformance is independent of the function that is being minimized

In many cases, better schemes are available Any ideas?available… Any ideas?

Golden Step##define GOLD 0.38196#define ZEPS 1e-10

double golden step (double a double b double c)double golden_step (double a, double b, double c){double mid = (a + c) * 0.5;

if (b > mid)return GOLD * (a - b);

elsereturn GOLD * (c - b);

Golden SearchGolden Searchdouble golden_search(double (*func)(double),

double a, double b, double c, double e){d bl fb (*f )(b)double fb = (*func)(b);

while ( fabs(c - a) > fabs(b * e) + ZEPS){d bl b + ld t ( b )double x = b + golden_step(a, b, c);double fx = (*func)(x);

if (fx < fb) {{if (x > b) { a = b; } else { c = b; }b = x; fb = fx;}

lelseif (x < b) { a = x; } else { c = x; }

}return b;}}

Further Improvements

As with root finding, performance can improve substantially when a local approximation is used …

However, a linear approximation won't do in this case!

Approximating The Function

Introduction to Numerical Optimizationcsg.sph.umich.edu/abecasis/class/2008/615.14.pdfNumerical...

Documents

Transcript of Introduction to Numerical Optimizationcsg.sph.umich.edu/abecasis/class/2008/615.14.pdfNumerical...

Do Now: Lab Safety (add to your table of contents) zGrab all of your needed papers zFind your new assigned seat zIn the space provided (Do Now Tracker)

615.19 -- Simulated Annealingcsg.sph.umich.edu/abecasis/class/2006/615.19.pdf · Simulated Annealing Strategy zConsider decreasing series of temperatures zFor each temperature, iterate

The dying experience - CareSearch, palliative care ... · PDF fileThe Dying Experience ... zConsider the environment of care zSpiritual care - rituals, ceremonies etc . 4 ... zPhysical

Random Number Generation - Statistical geneticscsg.sph.umich.edu › abecasis › class › 815.16.pdf · 2004-12-13 · Random Number Generation Biostatistics 615/815 Lecture 16.

Multidimensional Optimization: The Simplex Methodcsg.sph.umich.edu/abecasis/class/2006/615.17.pdf · Multidimensional Optimization: The Simplex Method Lecture 17 Biostatistics 615/815.

666.25 -- Parametric Linkage Analysiscsg.sph.umich.edu/abecasis/class/666.25.pdf · zParametric methods define explicit relationship between phenotypic and genetic similarity zNon-parametric

Association Mapping in Families Gonçalo Abecasis University of Oxford.

Biostatistics 615/815 Statistical Computingcsg.sph.umich.edu/abecasis/class/2006/615.01.pdfStatistical Computing Gonçalo Abecasis Course Objective zIntroduce skills required for executing

Practical With Merlin Gonçalo Abecasis. MERLIN Website Reference FAQ Source.

Genotype Imputation - Statistical geneticscsg.sph.umich.edu/abecasis/publications/pdf/Annual...2Istituto di Neurogenetica e Neurofarmacologia, Consiglio Nazionale delle Ricerche, Cagliari,

Human Genetics Zeitgeist - Institute for Behavioral Geneticsibg.colorado.edu/cdrom2017/abecasis/humanGeneticsZeitgeist/2017… · Human Genetics Zeitgeist Goncalo Abecasis University

Recursion and Dynamic Programming - Statistical geneticscsg.sph.umich.edu › abecasis › class › 2008 › 615.04.pdf · 2008-09-23 · Key Applications of Recursion zDynamic Programming

Calculation of IBD State Probabilities Gonçalo Abecasis University of Michigan.

Design Considerations in Large- Scale Genetic Association Studies Michael Boehnke, Andrew Skol, Laura Scott, Cristen Willer, Gonçalo Abecasis, Anne Jackson,

Accomplishments & Future Directions · zFind a model for class attribute as a function of the values of other attributes. zGoal: previously unseen records should be assigned a class

Lecture 02 -- Genotypes, Phenotypes and Hardy Weinberg ...csg.sph.umich.edu/abecasis/class/2003/Lecture02.pdfGenotype frequencies are function of allele frequencies Equilibrium reached

Conference "comment vivre ensemble ?" Jacques Abecasis

MBAex The Greek Letters - CoMPUscompus.uom.gr/.../MBAex_The_Greek_Letters.pdfThe Greek Letters 27 Vega (continued) zConsider a portfolio that is delta neutral, with Γ = - 5,000 and

Monitoring Volume Level Application - End of Project Presentation Made by: Roi Abecasis Maxim Meltsin Supervisor: Boaz Mizrahi.

Errors in Genetic Data Gonçalo Abecasis. Errors in Genetic Data Pedigree Errors Genotyping Errors Phenotyping Errors.