Regression Analysis of Coal Mining in Appalachia · Regression Analysis of Coal Mining in...

6
Regression Analysis of Coal Mining in Appalachia Andrew Gott, Dr. Buddhi Gyawali, Jeremy Sandifer, Kentucky State University Cumberland URISA Meeting 1-29-16

Transcript of Regression Analysis of Coal Mining in Appalachia · Regression Analysis of Coal Mining in...

Page 1: Regression Analysis of Coal Mining in Appalachia · Regression Analysis of Coal Mining in Appalachia. Andrew Gott, Dr. Buddhi Gyawali, Jeremy Sandifer, Kentucky State University Introduction

Regression Analysis of Coal Mining in Appalachia

Andrew Gott, Dr. Buddhi Gyawali, Jeremy Sandifer, Kentucky State University

Cumberland URISA Meeting

1-29-16

Page 2: Regression Analysis of Coal Mining in Appalachia · Regression Analysis of Coal Mining in Appalachia. Andrew Gott, Dr. Buddhi Gyawali, Jeremy Sandifer, Kentucky State University Introduction

Introduction Mining in Appalachia has been linked to:

• higher birth defect rates (Ahern et al., 2011)

• water quality degradation (Lindberg et al., 2011; Bernhardt et al. 2012)

• increased cardiovascular disease (Esch & Hendryx 2011)

• higher cancer rates (Hendryx, Wolfe, Luo & Webb, 2011)

• increased poverty levels (Partridge, Betz & Lobao, 2012).

The evidence is challenged in that:

• the research is too focused and does not see the larger picture (Donovan, 2011)

• the results are caused by something else in the region like the mountains, forests, or other health and wealth factors (Ellis, 2013).

Page 3: Regression Analysis of Coal Mining in Appalachia · Regression Analysis of Coal Mining in Appalachia. Andrew Gott, Dr. Buddhi Gyawali, Jeremy Sandifer, Kentucky State University Introduction

Methods

A) National Elevation Dataset (NED) tiles from 2013 mosaicked to the study region.

B) Mining production from 2000-2014 as provided by the MSHA.

C) Average slope by CBG.

D) The number of mines in the neighborhood of each CBG totaled for the years 2000-2014.

A B C D

Page 4: Regression Analysis of Coal Mining in Appalachia · Regression Analysis of Coal Mining in Appalachia. Andrew Gott, Dr. Buddhi Gyawali, Jeremy Sandifer, Kentucky State University Introduction

Results

These “top” variables still only minimally explained variance in the data along with small coefficients.

They did, however, remain statistically significant contributors (p < 0.000001) with the inclusion of terrain and other socioeconomic covariates (Figures 2-4).

• (Top). Multivariate Regression results with mine count, covariates, and ACS variable B19051E2: Percent with Earnings

• (Middle). Multivariate Regression results with mine count, covariates, and ACS variable B22010E7: Percent on Food Stamps

• (Bottom). Multivariate Regression results with mine count, covariates, and ACS variableB23025E4: Percent Employed

B19051E2 Coefficient StdError t-Statistic Probability Intercept -41.705 2.013 -20.721 0.000000*

% in Poverty 11.597 0.578 20.072 0.000000* Median Income (log) 7.263 0.195 37.170 0.000000*

Mine Count -1.5E-04 0.000 -15.748 0.000000* % Employed 0.667 0.006 110.206 0.000000*

B22010E7 Coefficient StdError t-Statistic Probability Intercept -123.791 1.992 -62.146 0.000000*

Median Income (log) 9.573 0.227 42.204 0.000000* Avg. Ed. Attainment 4.278 0.078 54.971 0.000000*

Mine Count -2.2E-04 0.000 -18.376 0.000000* % Employed 0.377 0.008 47.223 0.000000*

B23025E4 Coefficient StdError t-Statistic Probability Intercept -26.375 1.527 -17.278 0.000000*

Avg. Slope -0.332 0.017 -19.143 0.000000* % in Poverty -6.823 0.733 -9.309 0.000000*

Median Income (log) 5.663 0.132 43.018 0.000000* Avg. Ed. Attainment 1.670 0.068 24.482 0.000000*

% on SNAP -15.826 0.796 -19.880 0.000000* Mine Count -1.0E-04 0.000 -7.647 0.000000*

Page 5: Regression Analysis of Coal Mining in Appalachia · Regression Analysis of Coal Mining in Appalachia. Andrew Gott, Dr. Buddhi Gyawali, Jeremy Sandifer, Kentucky State University Introduction

Future work • Investigate the uniqueness of Central Appalachia which tends to be

among the highest or lowest percentages when looking at mining production, average slope, wealth, education, employment, those receiving benefits, and more.

• Investigate factors other than socioeconomic status. Health and environmental quality should be investigated with the same methodology.

Page 6: Regression Analysis of Coal Mining in Appalachia · Regression Analysis of Coal Mining in Appalachia. Andrew Gott, Dr. Buddhi Gyawali, Jeremy Sandifer, Kentucky State University Introduction

Mining in Appalachia has been linked to higher birth defect rates (Ahern et al., 2011), water quality degradation (Lindberg et al., 2011; Bernhardt et al. 2012), increased cardiovascular disease (Esch & Hendryx 2011), higher cancer rates (Hendryx, Wolfe, Luo & Webb, 2011), and increased poverty levels (Partridge, Betz & Lobao, 2012).

The evidence of detrimental coal mining impacts on the environment and local communities is robust and mounting. The evidence, though, is challenged in that the research is too focused and does not see the larger picture (Donovan, 2011) or that the results are caused by something else in the region like the mountains, forests, or other health and wealth factors (Ellis, 2013). There is then a substantial gap in the literature addressing those issues. This study seeks to fill that gap by using a regression model (opposed to a mining presence/non‐presence factor) both on a small and large geographic scale, across a large range of variables.

Parsing out the effects of coal mining versus the surrounding terrain is uniquely a spatial issue that cannot simply be addressed through table analysis, which brings in the power of geographic information. 

This study looks at the Appalachian region as defined by the Appalachian Regional Commission (ARC) to find correlations between mining and socioeconomic status.

1. Quantify the mountainous quality of a region2. Determine in coal mining contributes to socioeconomic models after the inclusion of terrain variables and other covariates

Regression Analysis of Coal Mining in AppalachiaAndrew Gott, Dr. Buddhi Gyawali, Jeremy Sandifer, Kentucky State University

Introduction

Objectives

Methodology

Abstract ResultsAppalachia has long been associated with coal mining and the impacts have not all been beneficial for supporting communities. Opposition to research presenting detrimental coal impacts, focuses on potential other contributors to these negative impacts. This research seeks to quantify potential other contributors, such as the mountainous terrain, and find if mining still contributes with the inclusion of these other factors. This was accomplished by using elevation data to find the range in elevation and average slope for each Census Block Group and using this information along with other covariates to find if mining still contributes to socioeconomic status with these other factors included. The results show that all covariates struggle to explain variance, but despite the large variation, mining still contributes slightly but significantly to factors such as source of income, employment, benefits received, and education level.

The regression analysis shows that while source of income, employment, and education levels consistently showed to correlate best with mining compared to 1700 other investigated variables, these “top” variables still only minimally explained variance in the data along with small coefficients. 

They did, however, remain statistically significant contributors (p < 0.000001) with the inclusion of terrain and other socioeconomic covariates (Figures 2‐4).

Figure 2.  Multivariate Regression results with mine count, covariates, and ACS variable B19051E2: Percent with Earnings

Figure 3.  Multivariate Regression results with mine count, covariates, and ACS variable B22010E7: Percent on Food Stamps

Figure 4.  Multivariate Regression results with mine count, covariates, and ACS variableB23025E4: Percent Employed

Figure 5. A) Percent Employed by CBG. B) Percent receiving SNAP benefits

Future work should approach two different angles. One is to investigate factors other than socioeconomic status. Health and environmental quality should be investigated with the same methodology.

The second approach involves investigating what makes Central Appalachia so unique. Eastern Kentucky is marked in black outline in Figures 1 and 5. Central Appalachia tends to be among the highest or lowest percentages when looking at mining production, average slope, wealth, education, employment, those receiving benefits and more. As the results showed that mining is not the main explaining factor, not the mountains themselves (viewing Figure 1 shows that both mining and the mountains extend all through Appalachia), some unknown factor or factors is/are creating this regional uniqueness.

The author would like to thank Maheteme Gebremedhin, Charlie Collins, and Jens Hannemann for their guidance and support. Gratitude is also due to Demetrio Zourarakis, and Neelam Poudyal for assistance with thesis direction, Ken Bates for GIS help and Aron Patrick for help with coal production data. Thanks are also due to immediate family, extended family, the Gillig family and Lauren Worth for their support. 

B19051E2 Coefficient  StdError t‐Statistic Probability Intercept ‐41.705 2.013 ‐20.721 0.000000*

% in Poverty 11.597 0.578 20.072 0.000000*Median Income (log) 7.263 0.195 37.170 0.000000*

Mine Count ‐1.5E‐04 0.000 ‐15.748 0.000000*% Employed 0.667 0.006 110.206 0.000000*

B22010E7 Coefficient  StdError t‐Statistic Probability Intercept ‐123.791 1.992 ‐62.146 0.000000*

Median Income (log) 9.573 0.227 42.204 0.000000*Avg. Ed. Attainment 4.278 0.078 54.971 0.000000*

Mine Count ‐2.2E‐04 0.000 ‐18.376 0.000000*% Employed 0.377 0.008 47.223 0.000000*

B23025E4 Coefficient  StdError t‐Statistic Probability Intercept ‐26.375 1.527 ‐17.278 0.000000*

Avg. Slope ‐0.332 0.017 ‐19.143 0.000000*% in Poverty ‐6.823 0.733 ‐9.309 0.000000*

Median Income (log) 5.663 0.132 43.018 0.000000*Avg. Ed. Attainment 1.670 0.068 24.482 0.000000*

% on SNAP ‐15.826 0.796 ‐19.880 0.000000*Mine Count ‐1.0E‐04 0.000 ‐7.647 0.000000*

National Elevation Dataset (NED) 2013 1/3 arc second DEMs were downloaded from USGS in 1 degree tiles. Using ArcGIS Pro, all tiles were merged using the Mosaic to New Raster Tool and clipped to the study site (Figure 1a). The elevation range, and average elevation were calculated for each Census Block Group (CBG) using the Zonal Statistics to Table tool. The Slope tool was then used, and the Zonal Statistics to Table tool repeated, finding the range and average slope for each CBG (Figure 1c).

Next, coal mining production data for the years 2000‐2014 were downloaded from the Mine Safety and Health Administration (MSHA), and XY data displayed (Figure 1b). The Summarize Within tool was then used, aggregating the count of mines, surface production, underground production, and total production found in each CBG neighborhood (52 km around each CBG centroid) for all 15 years (Figure 1d). 

ArcGIS Pro was then used to run the Exploratory Regression tool to find the model of best fit for American Community Survey socioeconomic variables using: ⋯

with y being the value of the ACS variable,  being the intercept (the predicted value of the ACS variable when all covariates are zero), being the coefficient, and  being the residual error.

Figure 1. A) National Elevation Dataset (NED) tiles from 2013 mosaicked to the study region. B) Mining production from 2000‐2014 as provided by the Mine Safety and Health Administration. C) Average slope by CBG. D) The number of mines in the neighborhood of each CBG totaled for the years 2000‐2014.

Literature CitedAhern, M., Hendryx, M., Conley, J., Fedorko, E., Ducatman, A., & Zullig, K. (2011). The  association between mountaintop mining and birth defects among live births in central Appalachia, 1996–2003. Environmental Research, 838‐846. doi:10.1016/j.envres.2011.05.019Bernhardt, E., Lutz, B., King, R., Fay, J., Carter, C., Helton, A., . . . Amos, J. (2012). How Many Mountains Can We Mine? Assessing the Regional Degradation of  Central Appalachian Rivers by Surface Coal Mining. Environmental Science & Technology Environ. Sci. Technol., 8115‐8122.Donovan, T. (2011, June 27). Mountaintop Removal Mining Birth Defects: New Study  Suggests Controversial Coal Operations Linked To Adverse Health Effects. Retrieved August 19, 2015. 

Ellis, R. (2013, March 16). Report: Health risks high for mountaintop removal areas. Retrieved August 19, 2015Esch, L., & Hendryx, M. (2011). Chronic Cardiovascular Disease Mortality in Mountaintop Mining Areas of Central Appalachian States. The Journal of Rural Health, 350‐357. doi:10.1111/j.1748‐0361.2011.00361.xLindberg, T., Bernhardt, E., Bier, R., Helton, A., Merola, R., Vengosh, A., & Giulio, R.  (2011). Cumulative impacts of mountaintop mining on an Appalachian watershed. Proceedings of the National Academy of Sciences, 20929‐20934.Partridge, M., Betz, M., & Lobao, L. (2013). Natural Resource Curse and Poverty in Appalachian America. American Journal of Agricultural Economics, 449‐456.

Acknowledgements