Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology...

29
Florent Leclercq www.florent-leclercq.eu Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11 th , 2019 Alan Heavens, Andrew Jaffe, George Kyriacou, Arrykrishna Mootoovaloo, James Prideaux-Ghee (Imperial College), Jens Jasche (U. Stockhom), Guilhem Lavaux, Benjamin Wandelt (IAP), Wolfgang Enzi (MPA), Will Percival (U. Waterloo) and the Aquila Consortium www.aquila-consortium.org

Transcript of Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology...

Page 1: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

Florent Leclercq

www.florent-leclercq.eu

Imperial Centre for Inference and CosmologyImperial College London

Bayesian analyses of galaxy surveys

November 11th

, 2019

Alan Heavens, Andrew Jaffe, George Kyriacou,Arrykrishna Mootoovaloo, James Prideaux-Ghee (Imperial College),

Jens Jasche (U. Stockhom),Guilhem Lavaux, Benjamin Wandelt (IAP),

Wolfgang Enzi (MPA), Will Percival (U. Waterloo)

and the Aquila Consortiumwww.aquila-consortium.org

Page 2: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

The big picture: the Universe is highly structuredYou are here. Make the best of it…

2

Page 3: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

What we want to know from the large-scale structure

The LSS is a vast source of knowledge:

• Cosmology:• ΛCDM: cosmological parameters and tests against alternatives,

• Physical nature of the dark components,

• Neutrinos: number and masses,

• Geometry of the Universe,

• Tests of General Relativity,

• Initial conditions and link to high energy physics

• Astrophysics: galaxy formation and evolution as a function of their environment• Galaxy properties (colours, chemical composition, shapes),

• Intrinsic alignments, intrinsic size-magnitude correlations

3

Page 4: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

We have theoretical and computer models…

• Initial conditions:a Gaussian random field

Everything seems consistent with the simplest inflationary scenario, as tested by Planck.

• Structure formation:numerical solution of the Vlasov-Poisson system for dark matter dynamics

4

Page 5: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

• The challenge: non-linear evolution at

small scales and late times.

• The strategy:

• Pushing down the smallest scale usable for cosmological analysis

• Using a numerical model linking initial and final conditions

• Precise tests require many modes.

• In 3D galaxy surveys, the number of modes usable scales as .

5In other words: go beyond the linear and static analysis of the LSS.

… how do we test these models against survey data?

Redshift

range

Volume

(Gpc3)

kmax

(Mpc/h)-1Nmodes

0-1 50 0.15 107

1-2 140 0.5 5x108

2-3 160 1.3 1010

Page 6: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

• Inference of signals = ill-posed problem• Incomplete observations: finite resolution,

survey geometry, selection effects

• Noise, biases, systematic effects

• Cosmic variance

• Cox-Jaynes theorem: Any system to manipulate “plausibilities”, consistent with Cox’s desiderata, is isomorphic to

(Bayesian) probability theory

Why Bayesian inference?

No unique recovery is possible!

“What is the probability distribution of possible formation histories (signals) compatible with the observations?”

“What is the formation history of the Universe?”

Bayes’ theorem:

6

So how do we do that?

Page 7: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

All possible FCsAll possible ICs

Forward model

Observations

Bayesian forward modelling: the ideal scenario

7

Page 8: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

Bayesian forward modelling: the challenge

The (true) likelihoodlives in

8

d≈107

Page 9: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

Likelihood-based solution: BORGBayesian Origin Reconstruction from Galaxies

9

Data assimilation

Page 10: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

• Use classical mechanics to solve statistical problems!

• The potential:

• The Hamiltonian:

• HMC beats the curse of dimensionality by:

• Exploiting gradients

• Using conservation of the Hamiltonian

Hamiltonian (Hybrid) Monte Carlo

acceptance ratio unity

gradients of the pdf

10

Page 11: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

BORG at work: Bayesian chrono-cosmography

ObservationsFinal conditionsInitial conditions

Supergalactic plane

67,224 galaxies, ≈ 17 million parameters, 5 TB of primary data products, 10,000 samples,

≈ 500,000 forward and adjoint gradient data model evaluations, 1.5 million CPU-hours

11

Page 12: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

BORGPM density field: full non-linear dynamics

Mass profile of the Coma cluster, in agreement with gravitational lensing andX-ray observations down to a few Mpc.

Böötes void

Coma

Virgo

Perseus

PiscesNorma

Hydra-Centaurus

Shapleyconcentration

12

Page 13: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

Velocity field in the supergalactic plane

The gravitational infall of known structures can be observed.

13

Velocitypotential

Norm ofvorticity

( ),vorticity was a postdiction.

Thanks to BORGPM (full non-linear dynamics),

we have now actual measurements - with

uncertainties.

In earlier work

and velocitydispersion…

Page 14: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

Mapping the Universe: epilogue?

14

Page 15: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

Likelihood-free solution: SELFISimulator Expansion for Likelihood-Free Inference

Data assimilation Generative inference

15

Page 16: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

SELFI: Method

• Gaussian prior + Gaussian effective likelihood

• Linearisation of the black-box around an expansion point + finite differences:

• The posterior is Gaussian and analogous to a Wiener filter:

gradient of the black-boxcovariance of summaries

observed summaries

prior covariance

expansion point

Arbitrarystatisticalsummaries

Primordialpowerspectrum

16

can be evaluated through simulations only.and,

The number of required simulations is fixed a priori.

Page 17: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

A black-box: Simbelmynë

Survey simulation:Redshift-space distortions, galaxy

bias, selection effects, survey geometry, instrumental noise

Dark matter simulation with COLA

17

Publicly available code:

Initial conditions Final conditions (dark matter) Galaxies

I’m happy to explain the name later today…

Page 18: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

SELFI + Simbelmynë: Proof-of-concept

: 5 times more modes are used in the analysis

100 parameters are simultaneously inferred from a black-box data model

18

1 (Gpc/h)3 only! Much more potential for Euclid data…

Page 19: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

SELFI + Simbelmynë: Proof-of-concept

• Robust inference of cosmological parameters can be easily performed aposteriori once the linearised data model is learnt

19

Page 20: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

Dark energy and neutrino masses with SELFI

20

pyselfi is publicly available at

Page 21: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

The Future: Opportunities & Challenges

21

DESI, Euclid, LSST, WFIRST, and more…

Page 22: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

Large-scale structure surveys roadmap

22

2014 2016 2018 2020 2022 2024 2026 2028 2030 2032

BOSS eBOSS

Dark Energy Survey

HETDEX

Hyper Suprime Prime Focus Spectro.

DESI

Euclid

LSST

WFIRST

Stage III

Stage IV

spectroscopic

photometric

Page 23: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

Data-intensive scientific discovery from galaxy surveys

• Next-generation surveys will be dominated by systematics

• 80% of the total signal will come from non-linear structures

• Challenging data analysis questions and/or hints for new physics will first show up as tensions between measurements

• Can data analysts keep pace?

23

Page 24: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

Accounting for known and unknown systematics

• Some known foreground contaminants (11 in total)

• A procedure to marginalise over unknown foreground contaminations

24

Forward model introduced by

Robust likelihood introduced by

Map of patches on the sky… … extruded in 3D

Page 25: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

Application to SDSS-III/BOSS (LOWZ+CMASS)

25

BORG a posterioripower spectrum

Den

sity

fie

ld

Vel

oci

ty f

ield

State-of-the-art with backward-modellingtechnique (mode subtraction)

No apparentcontamination,

even well beyondthe turn-over

Page 26: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

The Imperial weak lensing inference framework

26

reconstruction variance

Joint inference of cosmic shear maps andpower spectra/cosmology from CFHTLenS

from lensing data alone

Bayesian hierarchical inference ofgalaxy redshift distributions n(z) with KV450

Results on S8 = σ8(Ωm/0.3)0.5

KV450 BHM re-analysis

KV450

DES Y1

HSC DR1

Planck+lensing+BAO+JLA

Page 27: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

The Aquila Consortium

• Created in 2016. Currently 22 members from the UK, France, Germany, Sweden, Denmark & Canada.

• Gathers people interested in developing the Bayesian pipelines and running analyses on cosmological data.

27www.aquila-consortium.orgVisit us at

Page 28: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

Concluding thoughts

• Bayesian analyses of galaxy surveys with fully non-linear numerical models is not an impossible task!

• A likelihood-based solution (BORG): general purpose

reconstruction of dark matter from galaxy clustering, providing new measurements and predictions

• A likelihood-free solution (SELFI): algorithm for targeted

questions, allowing the use of simulators including all relevant physical and observational effects

28

Data assimilation Generative inference

Page 29: Bayesian analyses of galaxy surveys...Florent Leclercq Imperial Centre for Inference and Cosmology Imperial College London Bayesian analyses of galaxy surveys November 11th, 2019 Alan

Concluding thoughts

• The future: great science

29

DESI, Euclid, LSST, WFIRST

Bayesian large-scale structure analyses

Cosmological measurements:• Cosmic expansion• Power spectrum (and governing

parameters)• Gaussianity tests of the initial

conditions• Dark energy from the growth of

structure

Predictive cosmology:• Velocity field• X-ray cluster emission• Gravitational lensing• CMB secondary effects• Dark matter?

Galaxy formation: bias model & likelihoodLarge volume, photometric redshifts

Instruments modelling

and challenges