Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School...

22
Extracting Extracting Meaningful Data: Meaningful Data: Distinguishing Distinguishing Signal from Noise in Signal from Noise in Climate Change Climate Change Q. Steven Hu Q. Steven Hu School of Natural School of Natural Resources Resources University of Nebraska- University of Nebraska- Lincoln Lincoln

Transcript of Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School...

Page 1: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

Extracting Meaningful Extracting Meaningful Data: Distinguishing Data: Distinguishing Signal from Noise in Signal from Noise in

Climate ChangeClimate Change

Q. Steven HuQ. Steven Hu

School of Natural ResourcesSchool of Natural Resources

University of Nebraska-LincolnUniversity of Nebraska-Lincoln

Page 2: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

In general, noise is the part of information that we In general, noise is the part of information that we want to tease out and signal is the part that we want want to tease out and signal is the part that we want

to keep. to keep. Noise is “stubborn” and always present and interferes Noise is “stubborn” and always present and interferes with signal, forcing us to identify it and find ways to with signal, forcing us to identify it and find ways to

separate it in the data.separate it in the data.We don’t want to throw away everything (data) we We don’t want to throw away everything (data) we

have collected because of presence of noise, but we have collected because of presence of noise, but we cannot keep everything either. We want to keep the cannot keep everything either. We want to keep the signal (the baby) after clearing the noise (the bath signal (the baby) after clearing the noise (the bath

water)!water)!

Page 3: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

Except for some “absolute noise,” noise and Except for some “absolute noise,” noise and signal are relative and they are determined by signal are relative and they are determined by the interest of studies.the interest of studies.

Example 1Example 1: The atmosphere contains variations at : The atmosphere contains variations at rather wide ranges of frequencies and spatial scales. rather wide ranges of frequencies and spatial scales. All those variations are signals to the climate. Yet, if All those variations are signals to the climate. Yet, if we are interested in studying a particular frequency we are interested in studying a particular frequency variation, e.g., interannual variation – changes of variation, e.g., interannual variation – changes of rainfall from one summer to the next, or variation at a rainfall from one summer to the next, or variation at a specific spatial scale, e.g., the synoptic scale – in the specific spatial scale, e.g., the synoptic scale – in the order of 1000km, all the other signals become order of 1000km, all the other signals become “noises.” We must identify and remove them before “noises.” We must identify and remove them before we can examine variations at the interested we can examine variations at the interested frequency and scale and understand their behavior frequency and scale and understand their behavior and change.and change.

Page 4: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

Example 2Example 2: In phonological research, an : In phonological research, an overwhelming number of studies have overwhelming number of studies have examined long-term data on phonological examined long-term data on phonological patterns and life-events of animals and plants, patterns and life-events of animals and plants, and found earlier migration to breeding sites, and found earlier migration to breeding sites, birds laying eggs on earlier dates, and plants birds laying eggs on earlier dates, and plants flowering earlier. These rather diverse yet flowering earlier. These rather diverse yet consistent changes of phenology are, as consistent changes of phenology are, as believed, responses to a warming climate. believed, responses to a warming climate.

Page 5: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

This phenological signal of change is an independent source of This phenological signal of change is an independent source of information of climate change. To a great extent, this signal is information of climate change. To a great extent, this signal is

free of errors and noises resulting from gathering and free of errors and noises resulting from gathering and manipulating instrumentation records, thus providing a manipulating instrumentation records, thus providing a

independent check of this following result from instrumentation independent check of this following result from instrumentation data.data.

Page 6: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

However,However, the phonological change can the phonological change can only tell us the direction of climate only tell us the direction of climate change and cannot tell the rate or change and cannot tell the rate or magnitude of the change. magnitude of the change.

Can we know how may degrees the Can we know how may degrees the temperature may have increased from temperature may have increased from how many days earlier a bird or a how many days earlier a bird or a butterfly has migrated north to certain butterfly has migrated north to certain latitude?latitude?

Page 7: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

Not yet, because the correlations, which have Not yet, because the correlations, which have been exclusively used in connecting changes been exclusively used in connecting changes in evolution of life-events of animals and in evolution of life-events of animals and plants with environmental conditions, “do not plants with environmental conditions, “do not allow us to discern whether the earlier allow us to discern whether the earlier reproduction is a direct response to warmer reproduction is a direct response to warmer temperatures, or to other factors that may temperatures, or to other factors that may also vary with climate, such as reproductive also vary with climate, such as reproductive resources and inter- and intra-specific resources and inter- and intra-specific competition,” as elaborated in Post et al. competition,” as elaborated in Post et al. (2001, in (2001, in Proc. R. Soc. LondonProc. R. Soc. London, B)., B).

Page 8: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

When we try to tease theWhen we try to tease the information and information and single out those responses, the one organism single out those responses, the one organism

that biologically connects a species that biologically connects a species reproduction behavior with warmer reproduction behavior with warmer

temperature and dominates the other temperature and dominates the other organisms would be considered the signal, organisms would be considered the signal,

and the rest would be noises. and the rest would be noises. This signal-noise relationship can change in This signal-noise relationship can change in different analyses of varying aspects of the different analyses of varying aspects of the

problem. problem.

There are, of course, other ways to treat There are, of course, other ways to treat several major organisms simultaneously, and several major organisms simultaneously, and even include their nonlinear interactions (e.g., even include their nonlinear interactions (e.g.,

data mining).data mining).

Page 9: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

To summarize the previous To summarize the previous slides: slides:

There are “absolute” noises There are “absolute” noises and erroneous information in and erroneous information in

data, but more often the data, but more often the noises are relative to the noises are relative to the

signal we want to examine.signal we want to examine.

Page 10: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

Now, let’s examine what are the “absolute Now, let’s examine what are the “absolute noises” in the meteorological and climatic noises” in the meteorological and climatic

datadata► Instrumentation drift induced noise to data (ground Instrumentation drift induced noise to data (ground

sensor and satellite drift) sensor and satellite drift) ► Instrumentation upgrading induced changes in data Instrumentation upgrading induced changes in data

(sensor differences)(sensor differences)► Station’s geographical location change induced Station’s geographical location change induced

shift to the data (terrain and surface differences)shift to the data (terrain and surface differences)► Local surroundings change induced noise (a tree Local surroundings change induced noise (a tree

grows into full canopy and cools the surroundings of grows into full canopy and cools the surroundings of the station, inducing a cooling noise. Similarly the the station, inducing a cooling noise. Similarly the urban expansion may warm a previously rural area, urban expansion may warm a previously rural area, inducing a warming noise to the rural station inducing a warming noise to the rural station temperature data)temperature data)

► Different ways observers read the instrument (low Different ways observers read the instrument (low vs. high angle) vs. high angle)

► Observation time differences add noise in the data Observation time differences add noise in the data (for precipitation in some frequencies)(for precipitation in some frequencies)

Page 11: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

How well have we extracted climate signal?How well have we extracted climate signal?(How have we identified and treated the (How have we identified and treated the

noises?)noises?)Many of the noises have been identified and their Many of the noises have been identified and their effects on signals minimized.effects on signals minimized.

► Instrumentation drift induced noise to data (Instrumentation drift induced noise to data (satellite position drift have been satellite position drift have been calculated and included in retrieval schemes for surface temperature and calculated and included in retrieval schemes for surface temperature and precipitationprecipitation) )

► Instrumentation upgrading induced changes in data (may have Instrumentation upgrading induced changes in data (may have considered in developing temperature data series)considered in developing temperature data series)

► Observation time differences add noise in the data (Observation time differences add noise in the data (Observation time Observation time effects were also estimated and included in finalizing the station observed effects were also estimated and included in finalizing the station observed precipitationprecipitation))

Others remain to be specified and their effects Others remain to be specified and their effects removed in developing climate dataremoved in developing climate data (lacking station (lacking station history has made these following noises very difficult history has made these following noises very difficult to clarify).to clarify).

► Station’s geographical location change induced shift to the dataStation’s geographical location change induced shift to the data► Local surroundings change induced noiseLocal surroundings change induced noise► Different ways observers read the instrument (can never be certain)Different ways observers read the instrument (can never be certain)

Page 12: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

Those noises and biases in the climate Those noises and biases in the climate data have left uncertainties in the data have left uncertainties in the

results derived from the data.results derived from the data.

They have made it particularly difficult They have made it particularly difficult to examine local climate change – to examine local climate change –

because for large regions the noise and because for large regions the noise and biases may cancel each other and biases may cancel each other and reduce their effects on the results.reduce their effects on the results.

Page 13: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

A very brief discussion onA very brief discussion on

Noise in climate models and their Noise in climate models and their outputsoutputs

Page 14: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.
Page 15: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

Let me use an example to show the noise Let me use an example to show the noise resulting from numerical treatment of the resulting from numerical treatment of the governing equations of the atmospheric and governing equations of the atmospheric and oceanic motions.oceanic motions.

The generalized linear system of governing The generalized linear system of governing equations, describing a number of types of equations, describing a number of types of wave motions in the atmosphere and ocean, wave motions in the atmosphere and ocean, can be written as:can be written as:

dU

dti U , U U t ( ).

Page 16: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

In finite difference, this equation is written (at In finite difference, this equation is written (at time = ntime = n××ΔΔt)t)

So, the true solution to the equation should So, the true solution to the equation should have an invariant amplitude, U(0) = the initial have an invariant amplitude, U(0) = the initial

amplitude.amplitude.

U t U e i t( ) ( ) 0

U n t U e in t( ) ( ) 0

Page 17: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

In numerical models, various “finite differencing In numerical models, various “finite differencing schemes” are used to calculate the values of U at schemes” are used to calculate the values of U at time t and locations.time t and locations.These values are estimates of true U’s and contain These values are estimates of true U’s and contain noises intrinsic to those schemes. noises intrinsic to those schemes.

To evaluate the effect of those “numerical noises” on To evaluate the effect of those “numerical noises” on the solutions we use the von Neumann method. By the solutions we use the von Neumann method. By defining a variable defining a variable λλ (“distortion” of the solution from (“distortion” of the solution from the true one) the true one)

we can getwe can getU U en n in( ) ( ) 0

U Un n( ) ( ) 1 e i

Page 18: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

Let’s focus on the amplitude of the amplitude of the Let’s focus on the amplitude of the amplitude of the modeled solution, |modeled solution, |λλ||nn×U(0). It is different from the ×U(0). It is different from the analytic (true) solution, and the coefficient analytic (true) solution, and the coefficient ||λλ||nn measures how different the amplitude, U(nmeasures how different the amplitude, U(nΔΔt), at time t), at time step n is from the true solution. We have these step n is from the true solution. We have these possibilities:possibilities:

1. |1. |λλ|>1 |>1 the numerical noise grows every time step the numerical noise grows every time step and quickly overwhelms the signal (the solution of the and quickly overwhelms the signal (the solution of the equation);equation);2. |2. |λλ|=1 |=1 neutral solution, noise is minimal (good); neutral solution, noise is minimal (good); andand3. |3. |λλ|<1 |<1 damping solution, noise “erodes” the damping solution, noise “erodes” the

signal and it will be gone during the model signal and it will be gone during the model integration. integration.

Page 19: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

The value of The value of ||λλ| for some popular time | for some popular time differencing schemes is shown below.differencing schemes is shown below.

Page 20: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

To summarizeTo summarize: Although researchers have : Although researchers have strived to minimize the numerical noises strived to minimize the numerical noises

resulting from various numerical schemes resulting from various numerical schemes used in models, those noises remain and used in models, those noises remain and

make numerical models and their predictions make numerical models and their predictions of climate suffer uncertainties. of climate suffer uncertainties.

Page 21: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

Concluding remarks:Concluding remarks:► Except for the “absolute noise,” noise and signal are Except for the “absolute noise,” noise and signal are

relative and are determined by the nature of a problem.relative and are determined by the nature of a problem.► The sources of noise can be determined after the The sources of noise can be determined after the

research question is well defined.research question is well defined.► Various methods can be used to filter out or attenuate Various methods can be used to filter out or attenuate

the noises and minimize their effect on signal, although the noises and minimize their effect on signal, although such effect may always exist to varying magnitudes. such effect may always exist to varying magnitudes.

► Conventional climate data have many types of noises.Conventional climate data have many types of noises.► Phenology data of life-events of animals and plants Phenology data of life-events of animals and plants

provide an independent source of information to detect provide an independent source of information to detect climate and environmental change. A challenge for us to climate and environmental change. A challenge for us to use the data effectively is that the signal are biologically use the data effectively is that the signal are biologically and chemically intertwined with “noises.”and chemically intertwined with “noises.”

Page 22: Extracting Meaningful Data: Distinguishing Signal from Noise in Climate Change Q. Steven Hu School of Natural Resources University of Nebraska-Lincoln.

(“(“Sandhills Cranes in FlightSandhills Cranes in Flight” – photo by Michael ” – photo by Michael Forsberg)Forsberg)