Numerical Optimization 13: Sampling Plansqzhu/Teaching/Optimization/13... · 2020. 5. 20. ·...

13
Numerical Optimization 13: Sampling Plans Qiang Zhu University of Nevada Las Vegas May 20, 2020 1 / 13

Transcript of Numerical Optimization 13: Sampling Plansqzhu/Teaching/Optimization/13... · 2020. 5. 20. ·...

Page 1: Numerical Optimization 13: Sampling Plansqzhu/Teaching/Optimization/13... · 2020. 5. 20. · Numerical Optimization 13: Sampling Plans Qiang Zhu University of Nevada Las Vegas May

Numerical Optimization 13: Sampling Plans

Qiang Zhu

University of Nevada Las Vegas

May 20, 2020

1 / 13

Page 2: Numerical Optimization 13: Sampling Plansqzhu/Teaching/Optimization/13... · 2020. 5. 20. · Numerical Optimization 13: Sampling Plans Qiang Zhu University of Nevada Las Vegas May

Overview

1 Sampling

2 Full Factorial

3 Random Sampling

4 Uniform Projection Plans

5 Stratified Sampling

6 Space-Filling Metrics

7 Quasi-Random Sequences

8 Summary

2 / 13

Page 3: Numerical Optimization 13: Sampling Plansqzhu/Teaching/Optimization/13... · 2020. 5. 20. · Numerical Optimization 13: Sampling Plans Qiang Zhu University of Nevada Las Vegas May

Sampling

Optimization with expensive function evaluations

For many optimization problems, function evaluations can be quiteexpensive.

an aircraft design may require a wind tunnel test

deep learning hyperparameters may require a week of GPU training

· · ·A common approach for optimizing in these contexts is to build asurrogate model, Further evaluations of the true objective function can beused to improve the model. Fitting such models requires an initial set ofpoints, ideally points that are space-filling; that is, points that cover theregion as well as possible.

3 / 13

Page 4: Numerical Optimization 13: Sampling Plansqzhu/Teaching/Optimization/13... · 2020. 5. 20. · Numerical Optimization 13: Sampling Plans Qiang Zhu University of Nevada Las Vegas May

Full Factorial

Full Factorial

The full factorial sampling plan places a grid of evenly spaced points overthe search space.

a lower/upper-bound vector a, b such that ai ≤ xi ≤ bimi samples in each xi separated by a distance (bi − ai )/(mi − 1)

4 / 13

Page 5: Numerical Optimization 13: Sampling Plansqzhu/Teaching/Optimization/13... · 2020. 5. 20. · Numerical Optimization 13: Sampling Plans Qiang Zhu University of Nevada Las Vegas May

Random Sampling

Random Sampling

In some cases, it may be possible to transform a problem so thatconstraints can be removed. For example, bound constraints a x b canbe removed by passing x through a transform

5 / 13

Page 6: Numerical Optimization 13: Sampling Plansqzhu/Teaching/Optimization/13... · 2020. 5. 20. · Numerical Optimization 13: Sampling Plans Qiang Zhu University of Nevada Las Vegas May

Uniform Projection Plans

Uniform Projection Plans

A uniform projection plan with m samples on an m ×m grid can beconstructed using an m-element permutation. There are therefore m!possible uniform projection plans.

6 / 13

Page 7: Numerical Optimization 13: Sampling Plansqzhu/Teaching/Optimization/13... · 2020. 5. 20. · Numerical Optimization 13: Sampling Plans Qiang Zhu University of Nevada Las Vegas May

Stratified Sampling

Uniform Projection Plans

Stratified sampling modifies any grid-based sampling plan, including fullfactorial and uniform projection plans. Cells are sampled at a point chosenuniformly at random from within the cell rather than at the cells center

7 / 13

Page 8: Numerical Optimization 13: Sampling Plansqzhu/Teaching/Optimization/13... · 2020. 5. 20. · Numerical Optimization 13: Sampling Plans Qiang Zhu University of Nevada Las Vegas May

Space-Filling Metrics

Space-Filling Metrics

A good sampling plan fills the design space since the ability for a surrogatemodel to generalize from samples decays with the distance from thosesamples. Not all plans, even uniform projection plans, are equally good atcovering the search space.

Discrepancy, the maximum difference between the fraction of samplesin a hyper-rectangular subset H and that subsets volume:

d(X ) = supH|#X ∩ H

#X− λ(H)|

where #X and #X ∩ H are the numbers of X points and X in H.

Pairwise Distances between all points within each sampling plan

8 / 13

Page 9: Numerical Optimization 13: Sampling Plansqzhu/Teaching/Optimization/13... · 2020. 5. 20. · Numerical Optimization 13: Sampling Plans Qiang Zhu University of Nevada Las Vegas May

Quasi-Random Sequences

Quasi-Random Sequences

Quasi-random sequences are often used in the context of trying toapproximate an integral over a multidimensional space:∫

χf (x)dx ≈ v

m

m∑i=1

f (x i )

where each x i is sampled uniformly at random over the domain X and v isthe volume of χ .Quasi-random sequences are deterministic sequences that fill the space ina systematic manner so that the integral converges as fast as possible inthe number of points m. They are typically constructed for the unitn-dimensional hypercube with the following methods.

Additive Recurrence

Halton Sequence

Sobol Sequence

9 / 13

Page 10: Numerical Optimization 13: Sampling Plansqzhu/Teaching/Optimization/13... · 2020. 5. 20. · Numerical Optimization 13: Sampling Plans Qiang Zhu University of Nevada Las Vegas May

Quasi-Random Sequences

Additive Recurrence

Quasi-random sequences are often used in the context of trying toapproximate an integral over a multidimensional space:

xk+1 = xk + c ( mod 1)

produce space-filling sets provided that c is irrational. The value of cleading to

c = 1− Φ =

√5− 1

2= 0.618

where Φ is the golden ratio. We can construct a space-filling set over ndimensions using an additive recurrence sequence for each coordinate,each with its own value of c . The square roots of the primes are known tobe irrational, and can thus be used to obtain different sequences for eachcoordinate:

c1 =√

2, c2 =√

3, c3 =√

5, c4 =√

7, c5 =√

11,

10 / 13

Page 11: Numerical Optimization 13: Sampling Plansqzhu/Teaching/Optimization/13... · 2020. 5. 20. · Numerical Optimization 13: Sampling Plans Qiang Zhu University of Nevada Las Vegas May

Quasi-Random Sequences

Halton Sequence

Radical Inversion

i =M−1∑k=0

ak(i)bk

Ψb,C = (b−1, · · · , b−M)[C (a0(i), · · · , aM(i))T ]

where b is the base number, and C is the generator matrix. When C is theidentity matrix, it is called van der Corput sequences,

b = 2

X =

{1

2,

1

4,

3

4,

1

8,

5

8,

3

8,

7

8,

1

16, · · ·

}b = 5

X =

{1

5,

2

5,

3

5,

4

5,

1

25,

6

25,

11

25, · · ·

}Halton Sequence uses coprime numbers in order to be uncorrelated.

11 / 13

Page 12: Numerical Optimization 13: Sampling Plansqzhu/Teaching/Optimization/13... · 2020. 5. 20. · Numerical Optimization 13: Sampling Plans Qiang Zhu University of Nevada Las Vegas May

Quasi-Random Sequences

Sobol Sequence

In the Sobol sequence, each dimension uses the base 2 with different C .

12 / 13

Page 13: Numerical Optimization 13: Sampling Plansqzhu/Teaching/Optimization/13... · 2020. 5. 20. · Numerical Optimization 13: Sampling Plans Qiang Zhu University of Nevada Las Vegas May

Summary

Summary

Sampling plans are used to cover search spaces with a limited numberof points.

Full factorial sampling, which involves sampling at the vertices of auniformly discretized grid, requires a number of points exponential inthe number of dimensions.

Uniform projection plans,which project uniformly over eachdimension,can be efficiently generated and can be optimized to bespace filling.

Quasi-random sequences are deterministic procedures by whichspace-filling sampling plans can be generated.

13 / 13