A Statistical Study of Non-Linearity in the Leavitt Laws for...

1
A Statistical Study of Non-Linearity in the Leavitt Laws for Classical Cepheids Anupam Bhardwaj 1,2 , S. M. Kanbur 3 , L. M. Macri 4 , H. P. Singh 1 , C. C. Ngeow 5 , Emille E. O. Ishida 6 1. Department of Physics & Astrophysics, University of Delhi, Delhi 110007, India. 2. European Southern Observatory, Karl-Schwarzschild-Straße 2, 85748, Garching, Germany. 3. State University of New York, Oswego, NY 13126, USA. 4. Texas A&M University, College Station, TX 77843, USA. 5. Graduate Institute of Astronomy, National Central University, Jhongli 32001, Taiwan. 6. Max-Planck-Institut f¨ ur Astrophysik, Karl-Schwarzschild-Straße 1, 85748, Garching, Germany. Abstract We present a detailed statistical analysis of possible non-linearities in the Period- Luminosity (P-L), Period-Wesenheit (P-W) and Period-Color (P-C) relations for Cepheid variables in the LMC at optical (VI) and near-infrared (JHK s ) wave- lengths. We test for the presence of possible non-linearities and determine their statistical significance by applying a variety of robust statistical tests (F-test, Random-Walk, Testimator and the Davies test) to optical data from OGLE III and near-infrared data from LMC synoptic survey. We also use a global fit to Cepheid and Supernovae data using Chi-square minimization, a method used in determination of precise and accurate value of the Hubble constant in SH0ES project, to determine the impact of observed non-linearities in P-L relations on the distance scale. We do not find any significant difference in the slope of the global-fit solution using a linear or non-linear LMC P-L relation as calibrator, but the linear version provides a 2× better constraint on the slope and metallicity coefficient. The data Band No. of stars FU FO V 1849 1238 I 1842 1226 J 775 474 H 775 474 K s 775 474 Table 1 : The Cepheid data selected for the present analysis. The optical data is taken from OGLE-III [1] catalog while the near-infrared data is taken from LMC synoptic survey [2]. The mean magnitudes for Cepheids in the optical and near-infrared bands are corrected for reddening using the Haschke maps [3] together with the extinction law of Cardelli et al. [4]. The Statistical Tests F-test : We fit a single regression line over the entire period range under the null hypothesis while two-slope model is applied as an alternative hypothesis with an assumed break period. The F-statistic is calculated using equation: F = RSS R -RSS F RSS F ν F ν R -ν F . Here, RSS denotes the residual sums of squares, ν is the number of degrees of freedom, and subscripts R and F denote the one and two-slope models, respec- tively. We compare the observed value of F with the values obtained from the F-distribution under the null hypothesis at a given significance level. Random walk : The non-parametric random walk test generates the distribution of the residuals from the data. If r k are the residuals obtained from the linear fit to P-L relation, then partial sum of the residuals C j = j k=1 r k . If there is a departure from the linearity, then C j will not be a simple random walk. The amplitude R = max(C j ) - min(C j ), is taken as the test-statistics. If the partial sums are a random walk, R will be small. To determine the significance level for the R-values, we use the permutation of residuals such as it destroys any possible trends. A large number of permutations are done to determine the probability as the theoretical distribution of the test statistic of C j is not known. Testimator : The entire sample is divided into n different non-overlapping, and hence completely independent subsets. We fit a linear regression to the first subset and determine a slope ˆ β . This initial estimate (testimator) of the slope is taken as β 0 in the next subset, under the null hypothesis that the slope of the second subset is equal to the slope of the first subset, i.e. ˆ β = β 0 . The alternate hypothesis is that the two slopes are not equal. The t-statistic is used to compare the slopes and determine the probability that the initial guess of the testimator is true. If the null hypothesis is accepted, we derive the new testimator slope for the next subset using the previously determined β 0 s such that: β w = k ˆ β - (1 - k)β 0 , where, k =(|t obs |/t c ) . This value of the testimator is taken as β 0 for the next subset. This process of hypothesis testing is repeated n g times or until the value k>1, suggesting rejection of the null hypothesis. Segmented lines and the Davies test : This method first performs a linear piecewise regression considering the existence of the break. Thereafter, the Davies test is used to evaluate if the two segments are different enough to account for two separate linear behaviours. The method is implemented in the R package segmented. The regression is performed through the use of Generalized Linear Models (GLM). All the test-statistics are discussed in detail in Bhardwaj et al. [5]. We also test our data for the assumption homoskedasticity and normality of residuals (q-q plots). Results 0.5 1.0 1.5 log(P) 5 10 15 Magnitudes V I - 1 J - 2 H - 3 K s - 4 W V,I - 5 W J,H - 6 W J,K s - 7 W H,K s - 8 Fundamental mode Figure 1: Optical and near-infrared P-L and P-W relations for LMC FU mode Cepheids. 0.0 0.4 0.8 log(P) 5 10 15 Magnitudes V I - 1 J - 2 H - 3 K s - 4 W V,I - 5 W J,H - 6 W J,K s - 7 W H,K s - 8 First-overtone mode Figure 2: Optical and near-infrared P-L and P-W relations for LMC FO mode Cepheids. 0.5 1.0 1.5 log(P) -1 0 1 Color V - I (J - H) - 0.1 (J - K s ) - 0.5 (H - K s ) - 0.5 Fundamental mode Figure 3: Optical and near-infrared P-C relations for LMC FU mode Cepheids. 0.0 0.4 0.8 log(P) -1 0 1 Color V - I (J - H) - 0.2 (J - K s ) - 0.6 (H - K s ) - 0.6 First-overtone mode Figure 4: Optical and near-infrared P-C relations for LMC FO mode Cepheids. Band FU FO FT RW TM DT FT RW TM DT V Y N Y Y Y Y Y Y I Y N Y Y Y Y Y Y J Y Y Y Y Y N Y N H Y Y Y Y N N Y N K s Y Y Y Y N N Y N Table 2 : For each band P-L relations, Y/N represents the break/no break un- der each test statis- tics. Conclusions I For fundamental-mode Cepheids, we find that the optical P-L, P-W and P-C relations are non-linear at 10 days. I The near-infrared P-L and the SH0ES Wesenheit relations are non-linear around 18 days. This break is attributed to a distinct variation in mean Fourier amplitude parameters near 20 days for longer wavelengths as compared to optical bands [6]. I For first-overtone mode Cepheids, a significant change in the slope of P-L, P-W and P-C relations is found around 2.5 days only at optical wavelengths. I We determine a global slope of -3.212±0.013 for the SH0ES Wesenheit relation by combining our LMC data with observations of Cepheids in Supernovae host galaxies [7]. I We find this slope to be consistent with the corresponding LMC relation at short periods, and significantly different to the long-period value. I We do not find any significant difference in the slope derived from the global fit using linear or non-linear P-L relations, due to the large observational scatter in the distant Cepheid data. References [1] I. Soszynski, et al. Acta Astron., vol. 58, pp. 163–185, Sept. 2008. [2] L. M. Macri, et al. AJ, vol. 149, p. 117, Apr. 2015. [3] R. Haschke, et al. AJ, vol. 141, p. 158, May 2011. [4] J. A. Cardelli, et al. ApJ, vol. 345, pp. 245–256, Oct. 1989. [5] A. Bhardwaj, et al. MNRAS, vol. 457, pp. 1644–1665, Apr. 2016. [6] A. Bhardwaj, et al. MNRAS, vol. 447, pp. 3342–3360, Mar. 2015. [7] A. G. Riess, et al. ApJ, vol. 730, p. 119, Apr. 2011. email : [email protected] SCMA VI, 2016

Transcript of A Statistical Study of Non-Linearity in the Leavitt Laws for...

Page 1: A Statistical Study of Non-Linearity in the Leavitt Laws for ...cschafer/SCMA6/POSTERS/Bhardwaj.pdfk=1 r k. If there is a departure from the linearity, then C j will not be a simple

A Statistical Study of Non-Linearity in the Leavitt Laws for Classical CepheidsAnupam Bhardwaj1,2, S. M. Kanbur3, L. M. Macri4, H. P. Singh1, C. C. Ngeow5, Emille E. O. Ishida6

1. Department of Physics & Astrophysics, University of Delhi, Delhi 110007, India.2. European Southern Observatory, Karl-Schwarzschild-Straße 2, 85748, Garching, Germany.

3. State University of New York, Oswego, NY 13126, USA.4. Texas A&M University, College Station, TX 77843, USA.

5. Graduate Institute of Astronomy, National Central University, Jhongli 32001, Taiwan.6. Max-Planck-Institut fur Astrophysik, Karl-Schwarzschild-Straße 1, 85748, Garching, Germany.

Abstract

We present a detailed statistical analysis of possible non-linearities in the Period-Luminosity (P-L), Period-Wesenheit (P-W) and Period-Color (P-C) relations forCepheid variables in the LMC at optical (VI) and near-infrared (JHKs) wave-lengths. We test for the presence of possible non-linearities and determine theirstatistical significance by applying a variety of robust statistical tests (F-test,Random-Walk, Testimator and the Davies test) to optical data from OGLE IIIand near-infrared data from LMC synoptic survey. We also use a global fit toCepheid and Supernovae data using Chi-square minimization, a method used indetermination of precise and accurate value of the Hubble constant in SH0ESproject, to determine the impact of observed non-linearities in P-L relations onthe distance scale. We do not find any significant difference in the slope of theglobal-fit solution using a linear or non-linear LMC P-L relation as calibrator, butthe linear version provides a 2× better constraint on the slope and metallicitycoefficient.

The data

BandNo. of starsFU FO

V 1849 1238I 1842 1226J 775 474H 775 474Ks 775 474

Table 1 : The Cepheid data selected for the presentanalysis. The optical data is taken from OGLE-III [1]catalog while the near-infrared data is taken fromLMC synoptic survey [2]. The mean magnitudes forCepheids in the optical and near-infrared bands arecorrected for reddening using the Haschke maps [3]together with the extinction law of Cardelli et al. [4].

The Statistical Tests

F-test : We fit a single regression line over the entire period range under the nullhypothesis while two-slope model is applied as an alternative hypothesis with anassumed break period. The F-statistic is calculated using equation:

F =(RSSR−RSSF

RSSF

) (νF

νR−νF

).

Here, RSS denotes the residual sums of squares, ν is the number of degrees offreedom, and subscripts R and F denote the one and two-slope models, respec-tively. We compare the observed value of F with the values obtained from theF-distribution under the null hypothesis at a given significance level.

Random walk : The non-parametric random walk test generates the distributionof the residuals from the data. If rk are the residuals obtained from the linear fitto P-L relation, then partial sum of the residuals Cj =

∑jk=1 rk. If there is

a departure from the linearity, then Cj will not be a simple random walk. Theamplitude R = max(Cj) - min(Cj), is taken as the test-statistics. If the partialsums are a random walk, R will be small. To determine the significance level forthe R-values, we use the permutation of residuals such as it destroys any possibletrends. A large number of permutations are done to determine the probability asthe theoretical distribution of the test statistic of Cj is not known.

Testimator : The entire sample is divided into n different non-overlapping, andhence completely independent subsets. We fit a linear regression to the first subsetand determine a slope β. This initial estimate (testimator) of the slope is takenas β0 in the next subset, under the null hypothesis that the slope of the secondsubset is equal to the slope of the first subset, i.e. β = β0. The alternatehypothesis is that the two slopes are not equal. The t-statistic is used to comparethe slopes and determine the probability that the initial guess of the testimatoris true. If the null hypothesis is accepted, we derive the new testimator slope forthe next subset using the previously determined β′s such that:

βw = kβ − (1− k)β0, where, k = (|tobs|/tc) .

This value of the testimator is taken as β0 for the next subset. This process ofhypothesis testing is repeated ng times or until the value k>1, suggesting rejectionof the null hypothesis.

Segmented lines and the Davies test : This method first performs a linearpiecewise regression considering the existence of the break. Thereafter, the Daviestest is used to evaluate if the two segments are different enough to account fortwo separate linear behaviours. The method is implemented in the R packagesegmented. The regression is performed through the use of Generalized LinearModels (GLM).

All the test-statistics are discussed in detail in Bhardwaj et al. [5]. We also test ourdata for the assumption homoskedasticity and normality of residuals (q-q plots).

Results

0.5 1.0 1.5log(P)

5

10

15

Ma

gn

itu

de

s

VI - 1J - 2H - 3Ks - 4WV,I - 5WJ,H - 6WJ,Ks

- 7WH,Ks

- 8

Fundamental mode

Figure 1: Optical and near-infrared P-L and P-W

relations for LMC FU mode Cepheids.

0.0 0.4 0.8log(P)

5

10

15

Ma

gn

itu

de

s

VI - 1J - 2H - 3Ks - 4WV,I - 5WJ,H - 6WJ,Ks

- 7WH,Ks

- 8

First-overtone mode

Figure 2: Optical and near-infrared P-L and P-W

relations for LMC FO mode Cepheids.

0.5 1.0 1.5log(P)

-1

0

1

Co

lor

V - I(J - H) - 0.1(J - Ks) - 0.5(H - Ks) - 0.5Fundamental mode

Figure 3: Optical and near-infrared P-C relations

for LMC FU mode Cepheids.

0.0 0.4 0.8log(P)

-1

0

1

Co

lor

V - I(J - H) - 0.2(J - Ks) - 0.6(H - Ks) - 0.6First-overtone mode

Figure 4: Optical and near-infrared P-C relations

for LMC FO mode Cepheids.

BandFU FO

FT RW TM DT FT RW TM DTV Y N Y Y Y Y Y YI Y N Y Y Y Y Y YJ Y Y Y Y Y N Y NH Y Y Y Y N N Y NKs Y Y Y Y N N Y N

Table 2 : For eachband P-L relations,Y/N represents thebreak/no break un-der each test statis-tics.

Conclusions

I For fundamental-mode Cepheids, we find that the optical P-L, P-W and P-Crelations are non-linear at 10 days.

I The near-infrared P-L and the SH0ES Wesenheit relations are non-linear around18 days. This break is attributed to a distinct variation in mean Fourier amplitudeparameters near 20 days for longer wavelengths as compared to optical bands [6].

I For first-overtone mode Cepheids, a significant change in the slope of P-L, P-Wand P-C relations is found around 2.5 days only at optical wavelengths.

I We determine a global slope of -3.212±0.013 for the SH0ES Wesenheit relationby combining our LMC data with observations of Cepheids in Supernovae hostgalaxies [7].

I We find this slope to be consistent with the corresponding LMC relation at shortperiods, and significantly different to the long-period value.

I We do not find any significant difference in the slope derived from the global fitusing linear or non-linear P-L relations, due to the large observational scatter inthe distant Cepheid data.

References

[1] I. Soszynski, et al. Acta Astron., vol. 58, pp. 163–185, Sept. 2008.

[2] L. M. Macri, et al. AJ, vol. 149, p. 117, Apr. 2015.

[3] R. Haschke, et al. AJ, vol. 141, p. 158, May 2011.

[4] J. A. Cardelli, et al. ApJ, vol. 345, pp. 245–256, Oct. 1989.

[5] A. Bhardwaj, et al. MNRAS, vol. 457, pp. 1644–1665, Apr. 2016.

[6] A. Bhardwaj, et al. MNRAS, vol. 447, pp. 3342–3360, Mar. 2015.

[7] A. G. Riess, et al. ApJ, vol. 730, p. 119, Apr. 2011.

email : [email protected] SCMA VI, 2016