Problems of Stellar Convection: Proceedings of the Colloquium Nr. 38 of the International...
Transcript of Problems of Stellar Convection: Proceedings of the Colloquium Nr. 38 of the International...
Lecture Notes in Physics Edited by'J. Ehlers, M0nchen, K. Hepp, ZLirich, R. Kippenhahn, MLinchen, H. A. WeidenmiJIler, Heidelberg, and 1 Zittartz, K61n Managing Editor: W. Beiglb6ck, Heidelberg
71
Problems of Stellar Convection Proceedings of the Colloquium Nr. 38 of the International Astronomical Union, Held in Nice, August 16-20, 1976
ml
Edited by E. A. Spiegel and J. P. Zahn
Springer-Verlag Berlin Heidelberg New York 1977
Editors
Edward A. Spiegel Astronomy Department Columbia University New York, New York 10027 /USA
Jean-Paul Zahn Observatoire de Nice Le Mont Gros 0 6 3 0 0 Nice/France
ISBN 3-540-08532-? Springer-Verlag Berlin Heidelberg New York ISBN 0-387-08532-7 Springer-Verlag New York Heidelberg Berlin
This work is subject to copyright. All rights are reeerved, whether the whole or part of the material is concerned, specifically those of translation, re- printing, re-use of illustrations, broadcasting, reproduction by photocopying machine or similar means, and storage in data banks.
Under § 54 of the German Copyright Law where copies are made for other than private use, a fee is payable to the publisher, the amount of the fee to be determined by agreement with the publisher. © by Springer-Verlag Berlin Heidelberg 1977 Printed in Germany Printing and binding: Beltz Offsetdruck, Hemsbach/Bergstr. 2153/3140-543210
PREFACE
This volume constitutes the proceedings of Colloquium N ° 38 of the
International Astronomical Union, held at the Nice Observatory during the week of
August 16-20, 1976.
The scientific organizing committee was composed of L. Biermann,
F.H. Busse, P. Ledoux, B° Paczynski, E.A. Spiegel (chairman), R. Van der Borght,
N.O. Weiss and J.P. Zahn, They decided to adopt a format of general reviews followed
by discussion and informal contributions, much more in the spirit of a workshop than
in that of a classical colloquium. For this reason, the number of participants was
limited to about fifty~ but particular care was taken to represent a wide range of
interests and ages. It was also agreed that papers submitted for publication in the
proceedings, other than the invited reviews, should be refereed.
The colloquium was funded by the Centre National de la Recherche
Scientifique, whose Directeur Scientifique J. Delhaye was of great help, by the Comit~
National Fran~ais d'Astronomle, the City of Nice and the Nice Observatory. The Inter-
national Astronomical Union provided travel grants for young astronomers. Most parti-
Cipants were accomodated at the Centre Artistique de Rencontres Internationales, thanks
to p. Oliver and Ch. de Saran.
The local organization lay in the competent hands of D. Benotto
and R. Petrini. The social events were highlighted by a visit of the Music Chagall
under the guidance of its curator P. Provoyeur~ and followed by a concert given by the
Trio de Freville whose violonist, M.E. Mclntyre, was also an active participant of the
colloquium. R. Zahn took care of the ladies' prograrmne.
These proceedings were put together by D. Benotto and R. Petrini,
and D,O. Cough carefully checked them in their final form.
To all those quoted above, to the many others who also contributed
to the success of the meeting and to the editors of the Springer Verlag, we express
our warm thanks.
Jean-Paul Zahn
CONTENTS
Intr___oductory Remarks
E.A. SPIEGEL ..................................................................
I. M____ixing-Length Theory
- "Historical Reminiscences of the Origins of Stellar Convection Theory",
L. BIERMANN ................................................................... 4
- "The Current State of Mixing-Length Theory",
D. GOUGH ...................................................................... ~5
- "On Taking Mixing-Length Theory Seriously",
9.~M~H and E.A. SPIEGF~ ..................................................... 57
- " Observations Bearing on the Theory of Stellar Convection",
E. BOHM-VITENSE ............................................................... 63
II.___ Linear Theory
- "Dynamical Instabilities in Stars",
P. LEDOUX ..................................................................... 87
lll____~.Observational Aspects
- "Observations Bearing on Convection",
K.H. BDHM .................................................................... 103
- "Evolution Pattern of the Exploding Granules" ,
O. NAMBA and R. VAN RIJSBERGEN ............................................... 119
- "Granulation Observations",
A. NESIS ..................................................................... 126
- "Some Aspects of Convection in Meteorology",
R.S. LINDZEN ................................................................. 128
IV_~_NNumerical Solutions
- "Nu merzcal Methods in Convection Theory",
N.O. WEISS ................................................................... 142 - ,i C °
Ompresslble Convection",
E. GRAHAM ........................................ ............................ 151
VI
~t Rotation and Magnetic Fields
- "Convection in Rotating Stars",
F.H. BUSSE ................................................................ 156
- "Magnetic Fields and Convection",
~ . ~ ~ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 176
- "Axisyn~etric Convection with a Magnetic Field"~
D.J. GALLOWAY .............................................................. 188
- " Convective Dynamos",
S. CHILDRESS .............................................................. 195
VI. Penetration
- "Penetrative Convection in Stars",
J.p, ~ ................................................................. 225
- "The BDundaries of a Convective Zone",
A. MAEDER ................................................................. 235
- "Convective Overshooting in the Solar Photosphere;
a Model Granular Velocity Field",
A. NORDLUND ................................................................ 237
VII. Special To~ics
- "Thermosolutal Convection",
H.E. HUPPERT .............................................................. 239
- "The URCA Convection",
G. SHAVIV . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 255
- "Photoconvec tlon",
E . A . ~ ............................................................... 267
- " Convection in the Helium Flash",
A.J. WICKETT ............................................................... 284
VIII. Waves
- "Wave Transport in Stratified, Rotating Fluids",
M.E. Mc INTYRE ...........................................................
- "Wave Generation and Pulsation in Stars with Convective Zones",
W~ UNNO ..................................................................
290
315
VII
IX. Turbulence
- "Fully Developed Turbulence, Intermittency and Magnetic Fields",
U. FRISCH .... • ............................................................... 325
- "Turbulence : Determinism and Chaos",
Y. POMEAU .................................................................. 337
X. Appendix
- "Stellar Convection",
D.O. GOUGH ................................................................. 349
INTRODUCTORY REMARKS
E.A. Spiegel
Astronomy Department
Columbia University
New York, N.Y. 10027 U.S.A.
As president of the organizing committee of this meeting I was granted the honor of
opening the conference. But despite appearances I was only a figurehead that Jean-Paul
Zahn somehow decided to set up, Whatever hismotivatlon was, his execution was excellent
and my first remark must be an expression of my admiration for the marvelous job that
he and his associates have done in providing all the spiritual, intellectual, and
material advantages that we found waiting for us in Nice. Let me assure you though,
that a token president is not without uses and I wish I had known this before accepting
the job. I spent the days before and during the conference running routine errands,
carrying luggage, and being reprimanded for some of the minor things that inevitably
must go wrong in many large gatherings, I was even scolded because the name of some-
one who had not said he was coming was omitted from the list of participants. And when
the time was running short during the meeting, I was obliged, in a statesmanlike gesture,
to cUt my scheduled one-hour talk to eight minutes. But rank has its privileges and mine
was to be informed of the guiding principles behind the organization of the conference.
~ermit me now to share these with you.
It happened that the first day of the conference coincided with that of a large po-
litlcal convention (in another place,happily), and that suggested a convenient meta-
phor for describing the divergence of viewpoints among the participants. Let us there-
fore discuss the politics of stellar convection theory.
At the extreme right of the convective political spectrum are those who want to
write down the full equations and solve them. The ultra-conservatives, as I shall call
them, have virtue but no results that apply directly to stars.
At the other extreme of the convection spectrum are the radicals who want to write
down an algorithm for computing stellar structure that contains adjustable parameters
which can be fit to well known cases. In an extreme version of this we would write :
R = II where R is the radius and R± is an adjustable parameter. If we fit the parameter
to the sun we get R = 7 x 10 ]I cm and the resulting formula turns out to describe a
large number of stars tolerably well. I think it is fair to say that no one at the con-
ference was this radical, but it would be hard to deny that there have been things in
the literature that have these overtones.
But let me come to the political views represented by the actual participants. I
cannot be too specific since many participants have sometimes yielded to expediency and
shifted ground shamelessly. Nigel Weiss is a case in point. This paragon of the right
has recently (with Gough) written a paper on stellar mixing-length theory in what must
be the greatest fall from grace in recent memory. Having viewed this behaviour with
alarm let me point with pride to the spectrum of opinion represented here.(The infrared
has been filtered out.)
In these proceedings we have a coverage of this spectrum from mixing-length theory to
computations on the full equations ( for limited parameter ranges ). Naturally, this
represents more than most astrophysicists need to know about convection. Some will merely
read this introduction, expecting to find out where the best current approach is des-
cribed, hoping that this will be consistent with the constraint that results are to be
found in a finite time ........... say months, I have,of course, anticipated this need,
but am not sure I can meet it.
Douglas Gough and I spent the summer in a Cambridge drought trying to prepare a
statement that will answer such a specific question. Naturally, I didn't have time to
give the results of our lucubrations in my spoken introductory remarks. Nor did Gough
manage to fit them into his lecture on standard mlxing-length theory. That does not
mean that we could not put it all into one of our manuscripts. But which paper should it
be in? The solution is that we have prepared a joint appendix which I am told will appear
somewhere below.
Our conclusion is that a non-local mixing-length theory seems to be the best that
one can do at present. Unfortunately, this is not a precise a~tement and we simply
give an outline of how such an approach might be made and try to give an indication of
the physical assumptions needed. There are other ways to go about this, and our aim is
merely to suggest the level of sophistication in mixing-length theory that we think may
be warranted in stellar models.
I have indicated the spread in the approaches to convection discussed below as a
kind of abscissa. There is also an ordinate which represents a spectrum of complications
that arise in convection theory in specific kinds of stars on stages of evolution, or
refer to effects that are usually presented but ignored in first approximation. If we
must look mostly to the left to get usable results for stellar structure theory, it is
equally true that we usually turn to the right for guidance about how to handle these
special effects. For even if the solutions of the conservatives are not directly usable
for stars, they can be extended to include compressibility, rotation, magnetic fields,
compositional inhomogeneities, penetration, and, if we would just take the trouble, coup-
ling to pulsation. The hope is that what a special effect does to a conservative's solu-
tion it will probably do to a radical's model. This half-truth in practical terms means
that by seeing what rotation does to Boussinesq convection in two-dimensional or modal
convection, you may build enough intuition to make a cogent argument about what it does
to stellar convection. For example, when stellar model-builders want to decide what to
do about semiconvection, let them read Huppert's article on thermohaline convection. No
doubt many astrophysicists will not care for this general viewpoint unless it happens
to lead to answers according with what they need to coax their models into agreement
with observations. Eric Graham's discussion is a good case in point.
Graham has numerical solutions for fullyc~mpressible three-dlmensional convection
in a layer several pressure (and density) scale heights thick. Apart from a charming
tendency to swirl aboutj his flows look startlingly llke Boussinesq convection, and
he detects no sign of scale heights influencing his dynamics. Radicals will probably
ignore this result. What else can they do?
Lest I seem to give too much credit to the conservatives let me point out their
main fault : they rarely include effects in their calculations that are motivated by
purely astrophysical convection problems, but rather study traditional effects. If they
want to prove me wrong about this let one or more of them do a proper Boussinesq calcu-
lation of the URCA convection problem suramarlzed here by Giora Shaviv. This example does
not have the double entendre of something like rotation that interests meteorologists
also. So much for the ordinate.
In these proceedings we shall also leave the phase I have been describing to have a
look at recent trends in turbulence theory. Those who have followed this subject at all
know that it too has something of a political spectrum and some of the extreme conser-
vatives of turbulence report here on current approaches. Urlel Frisch will translate the
right wing's latest credo, fractal dimensions, into terms the leftist can understand,
and Yves Pomeau will tell us about aperiodic oscillations. These both refer to forms of
mathematics that may help us to see what turbulence is. Pomeau's talk is concerned with
systems of o.d.e.~s that give periodic solutions except in certain parameter ranges
where they go into aperiodic, almost random hehaviour, The suspicion has been around
for many years that this behaviour may have the mathematical ingredients that give tur-
bulence its stochastic features and, lately, attempts to formulate this idea precisely
have been mounted. But even if this does not turn out to work, it does not hurt to know
about aperiodic oscillators in other contexts. The funny behaviour of the solar cycle
during the reign of Louis XIV may have been a manifestation of such an aperiodic
oscillator of interest to this audience.
This has been a lengthy introduction yet it has not told you the full range of topics
to be covered. I hope that it gives you a flavour of what to expect in looking over the
proceedings. I am told that all the contributed papers have been refereed and so the pro-
lixity stops here. There is not even a concluding oration to be reported. Of course, I
happen to have a manuscript called "Convection in Stars III..." that might have served,
but that is destined for other things. However, a brief summary of developments before
this meeting is in Gough's report for IAU Commission Mestel and it is reprinted here
with bibliography. Its adequae~ as a summary may be a measure either of the rate of pro-
gress in this subject or of Gough's perspicacity.
HISTORICAL REMINISCENCES OF THE ORIGINS OF
STELLAR CONVECTION TKEORY
(1930- 1945)
Ludwig Biermann
Max-Planck-Institut fur Physik und Astrophysik
Munich, Germany
To set the stage for the report to follow +) , let me start with a
quotation from A.S. Eddington's classical work "The Internal Con-
stitution of the Stars", published exactly half a century ago.
Eddington wrote: "We shall not enter further into the historic problem
of convective equilibrium since modern researches show that the hypo-
thesis is untenable. In stellar conditions the main process of trans-
fer of heat is by radiation and other modes of transfer may be
neglected." (paragraph 69, p. 98).
Only 19 years earlier Robert Emden had outlined for the sun a very
different picture; in paragraph 4, chapter 18, of "Gaskugeln" - also
a classical treatise which influenced research well into the thir-
ties - he had stated with equal conviction that the energy radiated
into space from the photosphere could be brought there almost ex-
clusively only by convection, and that the granulation depicts the
cross section of the ascending hotter and the descending darker
currents. Emden's manuscrip£ had been read in the proof stage by his
brother-in-law Karl Schwarzschild, who advised Emden on a number of
points. It seems worth noting that Schwarzschild's somewhat earlier
work on the radiative equilibrium of the sun's atmosphere was discussed
in Emden's monograph (in chapter 16, par. 13) ++) .
+) The text includes a number of points, which actually came up in the later discussions during the colloquium or, in one case, during the IAU assembly in Grenoble. The author is indebted to many colleagues, in particular to D. Gough, L. Mestel, M. Schwarzschild and N. Weiss for important comments.
++) Whether Schwarzschild, who had proven that radiative transport prevailed in the sun's photospheric layer and had formulated the quantitative criterion for the stability of such layers, was in com- plete agreement with Emden, is not quite clear, though Emden's text does convey this impression.
Part of the change of the scientific argument during the intervening
time (1907-1926) was of course due to Jean's and L~demann and Newa]]'s
discovery that at the pressures and temperatures prevailing in the
stellar interiors, all the molecules and atoms would be broken up into
almost bare atomic nuclei and electrons and that, as a consequence,
radiative transport was recognized to be most efficient, in Eddington's
scheme of 1926 almost too efficient. In retrospect it looks however
as if Eddington, in laying the foundations of the theory of radiative
equilibrium in stars, did apparently not fully appreciate the power
of even very slow turbulent convection in transporting energy in
stellar interiors; that he neglected the possible influence of the
surface conditions was perhaps related to the fact that they were,
at the time when Eddington wrote his book, essentially inapplicable
to most stars. +)
This review will be concerned mainly with three developments, which
took place between 1930 and 1946: I) the application to stellar in-
teriors of the convective transport equation developed in hydro-
dynamics, which led to the proof that the adiabatic temperature
gradient is, in the case of thermal instability, a very good appro-
ximation there; 2) the influence of the surface boundary condition in
determining the extent of outer convection zones, which in certain
circumstances may comprise the whole star; 3) the introduction of the
scale height as a measure of the mixing length used in the transport
equation. Their connection with some further developments which may
come up during the present conference can be sketched only very
briefly.
To begin with item I, we note first that observations and measure-
("turbulent) mass ments of transport processes by non-stationary • "
motions in the earth' atmosphere and in the oceans had led, between
1915 and 1925, to a reasonably successful theoretical scheme. G.I.
Taylor (1915) and W. Schmidt (1917) noted++)that the quotient of the
+) Concerning both points it is instructive to reread his discussion of the point source model (ICS, § 91). It should be added, however, that Eddington (as far as the present author was able to find out) fully accepted the change of position which occurred during the period under review in this report. ++) Apparently, as a result of the war conditions, independently, as can be judged from their papers of 1915 and 1917.
flux of some quantity - like the momentum, the salinity or heat - and
the gradient of the content per cm 3 or per gram of the same quantity
led to consistent values of this coefficient, for which W. Schmidt
introduced the term "Austausch" (gr cm-lsec-1). In the special case
of heat flow ("Scheinleitung") the excess of the actual temperature
gradient over its adiabatic value, A?T, multiplied with the specific
heat for constant pressure, has to be used, because only small pressure
differences arise if the Mach number is <<I, as is usually true in the
earth~atmosphere. For interpreting the observed va~s of the "Austausch"
concepts developed by G.%. Taylor and by L. Prandtl n in particular the
"mixing length" were most useful.(These had been introduced first for
the case of dynamical instability.)These concepts relied to some ex-
tent of considerations analogous to those of kinetic theory. The
analogue of the molecule of kinetic theory is an element of the fluid,
which (having detached from its surroundings as a consequence of the
given instability due either to a super-adiabatic temperature gra-
dient or to the dynamical situation, for instance shear) moves as a
whole over some distance until it mixes again with the surrounding
fluid, in Prandtl's original thought the mixing length i was determined
by the geometry of the situation, e.g. the distance from the nearest
boundary or the diameter of the unstable region, and this was used
in the first application to stellar interiors. The somewhat later idea,
to link the mixing length with the scale height, will be discussed
below.
TO determine the velocity it was considered that pressure equilibrium
is only slightly disturbed, whereas - the flow of heat by conduction
or radiation being usually much slower - the temperature of an ele-
ment of the fluid is determined by the adiabatic gradient, such that
in case of thermal instability a rising element is less dense and
hotter than the surrounding fluid, and as a consequence is accelerated.
These considerations led t~ an expression for the velocity (v) of the
moving elements, which can be written in the form (g = acceleration
of gravity)
2 AT g£2 A?T v ~ gi • --~ T
For the convective transport of heat (H K = erg/cm2sec, Cp = specific
heat per gr for constant pressure, p = density) wewrite
H K ~Cp •piv • AVT
v 2
~pv • ~ R
(p - pressure =(R /D)pT; R - gas constant, ~ - mean molecular weight),
suppressing again a constant of order unity.
In meteorology the largest scale for which the transport equation has
been used with at least qualitative success (by A. Defant, see
W. Schmidt 1925), is the meridional transport of heat from the equa-
torial to the polar regions, which raises the average temperatures
of our own latitudes quite noticeably, the meridional heat flux ex-
ceeding the "solar constant" by roughly two powers of ten (as it must).
In this case the elements are low pressure systems which, due to their
rotation, have a certain stability; it happens, that the mixing length
and the pressure, but of course not the temperature and the density,
are comparable to those in the outermost layers of the sun's hydrogen
convection zone.
The application to stellar interiors was contained in a G6ttingen thesis
of 1932 (L. Biermann 1933), which owed a great deal to Prandtl's ad-
vice. For the convective zone around the centre, due to the conversion
of hydrogen into helium (supposed to be highly temperature sensitive
as is true for the C-N cycle), the mixing length was taken to be of
the order of 10.OO0 km, and the flux to be transported assumed to be
of the order of 1012erg/cm2sec, about 15 times that on the sun's sur-
face. For the third power of the velocity a combination of the equa-
tions given above leads to
3 HKg£ R* v p
which was found to be or order 1011 (cm/sec) 3. This then leads to
IAVTI ~ 10 - 8 or ~IO-51VTI
Though there is some incertainty about the coefficients of order
unity contained in the equations given above, it is clear that the
result last stated, that a relatively very small excess over the adia-
batic gradient is sufficient to carry all the flux, is quite insensi-
tige to errors made by the phenomenological theory used here. Two
years later Thomas G. Cowling, who used a slightly different approach
and formalism, recovered the same result. Since 1933/34 it has become
standard practice to use, in theoretical models of star's interiors,
the adiabatic temperature gradient whenever that required for radiative
transport exceeds the adiabatic one +) . In this form the stability cri-
terion was formulated first by Karl Schwarzschild for the solar atmo-
sphere (1906); in meteorology an equivalent criterion had been in use
already many years earlier.
Concerning item 2, we note first, that during the years 1934-38, an
attempt was made to explore the question whether partially or wholly
convective stellar models (not only such with a central convection zone)
lead to a more complex picture of the overall constitution of the stars
than the one given by Eddington (as was finally found to be the case).
Such models would result from a higher luminosity than the radiative
ones, for the same radius and opacity if the luminosity could be re-
garded as an (effectively) free parameter. Near the surface two cir-
cumstances, the second of which was not at once fully appreciated have
to be taken into account: the existence, in all stars with not too high
surface temperature, of convective zones due to the partial ionization
of hydrogen and helium, which had first been investigated by Uns~id in
1930, and (second) the necessity of using in the photosphere of every
star the radiative transport equation together with that of hydro-
static equilibrium, which means that the pressure must be of order
g/c, < being the opacity (gr-lcm 2) of the photosphereic layers - a
relation which of course could be written down with better accuracy.
The importance of this boundary condition for the case under discussion
(L. Biermann 1935) was emphasized by T.G. Cowling in 1936, but its
application was held back for some years by the (then) poor knowledge
of the value of the photospheric opacity for all stars later than
spectral class A.
+) ~The first such model being Cowiing's point source model of 1935.
Shortly before the discovery by Rupert Wildt that the negative hydrogen
ion is the main source of the photospheric opacity in such stars, an
attempt was made (L. Biermann 1938) to use approximate values for the
photospheric opacity based on work of Pannekoek which fortunately led
to approximately correct answers. It was found (a result which was
soon confirmed on the basis of Wildt's pioneering work of 1939, and
subsequent work of others) that in the surface of the sun and similar
stars (with photospheric pressures of the order of IO5dyn/cm 2, such
that the radiation pressure is only ~ 10-5p) convection was likely
to be an efficient mechanism of transport of heat not only in the
main parts of the hydrogen convection zone, but also in its outer
layers; as a consequence the adiabatic gradient should at least appro-
ximately be established in the hydrogen convection zone up to its
outer boundary in the middle or deep photosphere.
In order to i l l u s t r a t e the importance of this result le t us look at a diagram taken from the author's paper of 1938. This diagram, with the
logarithm of the total pressure (P = p+pR ) and of radiation pressure
(pR) as coordinates, shows besides the adiabats ( f u l l l ines +) ) the
lines of constant ratio PR to P (Eddington I-8, weak lines), further-
more the "dominant" ionization potential ~++) and lines for fixed
ratio of ~ to kT, of which the one for ~ = 0 corresponds to the limit
of degeneracy ("Entartungsgrenze"). Near the bottom are shown the
photospheric values of the pressure (for given effective temperature)
for three values of the surface gravity, including that for the sun.
The crossed line marked
log ad
indicates the zone of rapid increase (inwards) of the opacity and as a
consequence of the radiative temperature gradient, and the incipient
decrease of the adiabatic gradient (inwards) due to the additional
degrees of freedom (using the terminology of the kinetic theory of the
+) The branching above logT ~ 5 corresponds to then existing uncertain- ties regarding the chemical composition, especially the value of Z (in the terminology in use now).
++) For which the degree of ionization is ~I/2 according to Saha's formula.
J. !
! ..
...
!
i I
J ...
. !
! |
,i
!
~r
,q
~ i.
.i
~
"V
%
11
specific heat) resulting from the increasing degree of ionization. Above
= 50 eV, the ratio of the radiative gradient to the adiabatic gra-
dient
(d log PR/d log P)R
(d log PR/d log P)ad
is essentially given by the value of 1-B, such that for the special
solution corresponding to Eddington's model, for given mass and lumi-
nosity, and for larger values of I-8, the radiative gradient is smaller
than the adiabatic one and Schwarzschild's stability criterion is ful-
filled. For smaller values of 1-8 the radiative gradient increases,
such that the instability limit is reached soon and the adiabatic
gradient (d log PR/d log P ~ 8/5 for stars of about the sun's mass,
for which I-8 <<I) is the smaller one and convection prevails. +)
It is thus see~)that the surface boundary condition effectively results
in such a relation between PR and P that the total pressure increases
inwards at first considerably faster than the radiation pressure; in
the regions with temperatures around some 104 degrees, I-B has a mini-
mum and the radiative gradient is much larger than the adiabatic one,
the opacity being approximately proportional (I-8) -1 according to
Kramers' law (in that temperature region aotually still larger), where-
as the adiabatic gradient is approximately constant (~ 8/5). With the
increase of I-8 along the adiabates inwards, the stability limit is
gradually approached and in the case of the sun finally reached at
T ~ 106 (with modern values of the chemical composition Eddington's
model would show I-8 ~IO -3 for the sun). It is therefore clear that
in the sun and in similar stars the outer convection zone must comprise
a substantial number of scale heights until the radiative temperature
gradient decreases below the adiabatic one, such that the inner boun-
dary of the hydrogen convection zone should be at a temperature
T ~> 106 and a depth of ~ 1OO OOO km. This result has been confirmed
by the much more accurate computations of recent years. Dwarf stars
of later spectral class should have still deeper convection zones,
+)The presentation attempts to retrace the steps, by which the complete stellar models with deep outer convection zones and the fully convective models were actually found. For a more detailed review see L. Biermann 1945.
12
and it was proposed in 1938 that late type giants might even be fully
coDvective, a result which was recovered many years later by Hayashi.
In retrospect, it is easily seen why all these possibilities had not
been noticed earlier: that convective transport could be efficient up
to almost photospheric layers, was a rather remote possibility on the
background of the earlier theory - though not on that of Emden - and
a quantitative discussion of the power of convective transport was
hindered by the lack of reliable knowledge of the photospheric pressure
(which had been highly underestimated before 1938/39).
The largest uncertainty in transferring the mixing length theory to
astrophysics - our item 3 - is evidently connected with the value of
to be used, under the different circumstances. For the work done in
1938 described above the observed size of the solar granulation had
been taken as a guide; this choice +) whioh fortunately did not intro-
duce serious errors was until 1943 replaced by the answer which has
been the basis of all subsequent work, and which is to use the local
scale height, defined either by the density gradient or that of the
pressure, as measure of ~, such that £ is given:
I I ~Av-~J or Iv--f~j"
Since there should be a nondimensional factor of order unity, which re-
quires separate d£scussion, the two expressions are under most circum-
stances equivalent. The idea behind this choice is that in any case the
largest elements should travel farthest and reach the highest velocity
but that an element of the fluid, after having travelled over a density
scale height, should have changed ~ts shape to such an extent that it
is likely to break up into smaller fragments and to mix with the sur-
rounding fluid. This approximation is of course precisely in the spirit
of Prandtl's original ideas on the subject, and had most probably been
discussed with him. On a slightly different background, an equivalent
proposal had been made by E. 0pik already in 1938~ +)
+)An at least approximate determination of the size distribution func- tion of the solar granulation became possible recently (J.W. Harvey, M. Schwarzschild, 1975), whereas for red giants the observational si- tuation is still less clear (M. Schwarzschild, 1975).
++) The work reported here under items 1) and 2) had remained unknown to ~pik until his work of 1938 was completed, cf. his "Note added in proof" (~pik 1938). For a more recent review of the general problem see M. Schwarzschild, 1961).
13
To conclude these reminiscences I would first like to mention that the
results discussed under item 2), led for the sun to a proposal concern-
ing the old question, why a sunspot is dark, the answer being, that the
strong magnetic fields observed in the umbra should inhibit the convec-
tive transport in the layers underneath the spot+); it has been pursued
by a number of authors up to the present and may come up again at this
colloquium.
Of the various formalisms suggested to improve upon the one given above
for the heat transport, the scheme proposed by E. B~hm-Vitense 1953, 1970
1958 became the most widely used. Much more recently R. Ulrich has pro-
posed still further refinements, which are particularly useful at rela-
tively low densities. All attempts to determine the exact relationship
between the mixing length and the scale height (and/or other parameters)
have not been really satisfactory so far, though comparisons with obser-
vational data on the integral properties of stars (including their
chemical composition) and on the position of the instability strip in
the Herzsprung-Russell Diagram have been used with some success; only
for the sun its known age provides an independent parameter and thereby
a check that the mixing length theory leads to essentially reliable re-
sults (D.O. Gough, N.O. Weiss 1976). It seems that only a deductive
theory of stellar convection would offer the chance to go beyond the
present essentially phenomenological approach used hitherto; at least
one contribution at the present colloquium will, I understand, deal
with this problem.
+) L. Biermann 1941.
14
References
Biermann, L., 1933, Z. Astrophys. ~, 117,
Biermann, L., 1935, Astr. Nachr. 257, 269.
Biermann, L., 1938, Astr. Nachr. 264 , 395.
Biermann, L., 1941, Vierteljahresschrift Astron. Ges. 76, 194.
Biermann, L., 1943, Z. Astroph. 2_22, 244.
B~hm-Vitense, E., 1958, Z. Astrophys. 46, 108.
Cowling, T.G., 1935, Mon.Not.R.astr. Soc. 96, 18.
Cowling, T.G., 1936, Astron. Nachr. 258, 133.
Eddington, A.S., 1926, Internal Constitution of Stars
Emden, R., 1907, "Gaskugeln"
Gough, D., Weiss, N.O., 1976, Mon.Not.R.astr.Soc. 176, 589.
Harvey, J.W., Schwarzschild, M., 1975, Ap.J. 196, 221.
Opik, E.J., 1938, Publ. Obs. astr. Univ. Tartu 30, No. 3.
Prandtl, L., 1925, Zeitschr. f. angew. Math. u. Mech. ~, 136.
Prandtl, L., 1932, Beitr. Physik freier Atmosphere 19, 188.
Schmidt, W., 1917, "Der Massenaustausch bei der ungeordneten Str6mung
in freier Luft und selnen Folgen", Kalserl. Akad. d. Wiss., Wien.
Schmidt, W., 1925, "Der Massenaustausch in freier Luft und verwandte
Erscheinungen", Verlag von Henri Grand, Hamburg.
Schwarzschild, K., 1906, Nachr. KSnigl. Ges. d. Wiss., G~ttingen, No.1.
Schwarzschild, M., 1961, Ap. J. 134, I.
Schwarzschild, M., 1975, Ap. J. 195, 137.
Taylor, G.I., 1915, Phil. Trans. R. Soc. Lond. A, 215, 1.
Taylor, G.~., 1932, Proe. R. Soc. Lond. A, 135, 685.
Ulrich, R., 1970, Astroph. & Space Science ~, 71.
Ulrich, R., 1970, Astroph. & Space Science ~, 183.
UnsOld, A., 1930, Z. f. Astroph. !, 138.
Vitense, E., 1953, Z. Astrophys. 32, 135.
Wild, R., 1939, Astroph. J. 9_O0, 611.
THE CURRENT STATE OF STELLAR MIXING-LENGTH THEORY
Douglas Gough
Institute of Astronomy & Department of Applied Mathematics
and Theoretical Physics, University of Cambridge
SUMMARY
The basic assumptions of the mixing-length formalism are described, and the
theory is developed with a view to representing convection in stars. Directions
in which the results might be improved and extended are indicated.
I. INTRODUCTION
Aside from some recent pioneering work by Latour, Spiegel, Toomre and Zahn
(1976 a,b), the mlxing-length formalism in one or other of its guises remains
the sole method for computing the stratification of convection zones in stellar
models. Little attention is usually paid to assessing the accuracy of the models,
partly because there is a general feeling that mixing-length theory is so un-
certain that the task would be fruitless, and partly, perhaps, because of an
optimism that the theory will soon be superseded by something better. There
appears to be no better convection theory emerging that migh= be applicable to
stars in the foreseeable future, however; the mixing-length is likely to stay
with us for some time. It is perhaps time, therefore, to take stock of the
situation, and to ask whether the methods currently employed can be made more
reliable.
The first stage of any enquiry of this kind must be a definition of the
physical model upon which the theory is based. What started as little more than
an order-of-magnitude estimate of turbulent transport processes has subsequently
16
been taken rather literally in some contexts. It is therefore important to
appreciate what the assumptions are, and where the uncertainties lie. Only after
that can there be some hope of improving the representation of the physics. And
as a byproduct, one might see how best the theory might he extended to describe
more general situations than those for which it is customarily employed in stellar
physics.
But perhaps most important of all is to appreciate the degree of contact
with reality. Astronomical verification of the mixing-length prescription is at a
very primitive level, and permits only a poor assessment of the validity of the
functional forms of the formulae describing the convective transport processes.
2. THE IDEAS BEHIND MIXING-LENGTH THEORIES
The mixing-length idea was introduced independently by Taylor (1915), Schmidt
(19|~ and Prandtl (|925) to provide a means of understanding the transport of vor-
ticity, heat and momentum in turbulent fluid. By analogy with gas kinetic theory the
fluid is considered to be composed of turbulent 'eddies', 'parcels' or 'elements'
which advect properties such as heat, in the case of thermal convection, and
vorticity or momentum, in the case of shear turbulence. An element arises as a
result of instability, with about the same properties as its immediate environment.
It travels with a characteristic speed ~ through a mean-free-path or mixing
length { , and finally breaks up because it becomes unstable itself, and merges
with its new surroundings. This breakup into smaller scales of motion is
considered to be instantaneous. It is the mixing-length description of the
beginning of the turbulent cascade; velocity components of the consequent small
scale motion and the associated temperature fluctuations are assumed to be un-
correlated so there is no contribution from them to the overall transport of heat
and transverse momentum. From such a description it is a straightforward matter
to estimate the mean heat flux or shear stress in terms of ( , ~ and the
structure of the mean environment. To complete the theory a procedure for
obtaining ~ and ~ must be found.
In the case of shear flow Prandtl (1925) assumed the turbulence to be more
or less isotropic and so equated the velocity ~'- perpendicular to the mean motion
to the velocity fluctuation in the mean flow direction induced by the shear.
Prandtl assumed the turbulent elements to be momentum conserving and obtained an
expression for the shear stress in the form of a product of the mean velocity
gradient and a turbulent transport or exchange coefficient ~u~ (Austausch
coefficient) where ~ is density. Thus in his form of the theory, turbulent shear
stresses (Reynolds stresses) behave like viscous stresses with the Austausch
coefficient being a sort of turbulent viscosity, a concept that had been discussed
previously by Boussinesq (1877). The Reynolds stresses take on a somewhat
17
different form if it is assumed that turbulent elements conserve vorticity (Taylor
1915, 1932) and sometimes this yields better agreement with experiment (e.g.
Prandtlp 1952). In either case it is only the mixing length ~ that remains un-
determined.
In the case of free convection there is no externally imposed velocity scale
as in shear flow, and it is necessary to consider the dynamics of the turbulent
elements in greater detail. This can be done only after the mixing-length model
is mere preclsely defined. During its existence a turbulent element is accelerated
by the imbalance between buoyancy forces, pressure gradients and nonlinear
advection processes. In addition it can gain or lose mass by entrainment or
erosion. As a result of ignoring different combinations of these processes,
approximating the remaining ones in slightly different ways and making slightly
different assumptions about the geometry of the flow, different physical models
have emerged. They all predict similar heat transports when the mean atmospheric
structure is time independent, which is hardly surprising because the formulae
can be obtained from barely more than dimensional reasoning. As a consequence,
the differences between the physical assumptions are not usually emphasized in
the astrophysical literature, perhaps because it is difficult to differentiate
astronomically between rather gross variations in the functional form of the
turbulent heat flux.
It was pointed out by Prandtl (1926) in a discussion of turbulent shear flow
that~in the absence of a driving forcejturbulent drag would cause an element of
characteristic size ~ to lose its kinetic energy after travelling a distance of
about ~ . This is simply because turbulent drag at high Reynolds number is
proportional to the square of the velocity, and hence also to the kinetic energy.
Thus if the mixing length represents both the element size and the mean-free-path
it is inm~aterial whether one postulates unimpeded motion followed by instantaneous
annihilation, as would be natural by direct analogy with gas kinetic theory, or
continuous momentum exchange between the element and its surroundings. This led
to the first and perhaps the simplest description of the dynamics of thermal
convection : namely an exact balance between buoyancy force and turbulent drag
(Prandtl, 1932). Convective elements are assumed to achieve this balance
instantaneously, which implies that their inertia is unimportant. They move
through a distance ( comparable with their own diameter, conserving their heat,
and then instantaneously mix their excess heat with the new surroundings. These
ideas were applied to stellar convection by Biermann (1932, 1937, 1943) and
Siedentopf (1933 a,b, 1935).
The model can be made more consistent by assuming interchange of heat between
the element and its surroundings to be continuous toop as was emphasized by ~pik
(1950). Then heat and momentum exchange are treated similarly. Since there is
always an exact balance between buoyancy force and turbulent drag, and between the
18
rate of increase of a temperature fluctuation and the diminution of that
fluctuation by heat exchange, it doesn't matter where the element came from, end
the mixing-length description of annihilation of elements can be dispensed with
entirely. This is not the case, however, when the star is pulsating (Unno 1967,
Gough 1977).
The alternative approach of considering elements to he accelerated
adiabatically from rest by buoyancy alone was adopted in the later papers by
Biermann (e.g. 1948 a,b). Pressure forces and turbulent drag can be incorporated
approximately into the dynamics without changing the functional forms of the
equations used, though they introduce different factors of order unity. In more
recent work that includes radiative heat exchange (e.g. Vitense 1953, BShm-Vitense
1958) it is common to ignore turbulent exchange during the life of an element and
invoke instantaneous breakup to account for all the nonlinearities that occur in
the equations governing the turbulent fluctuations.
3. EQUATIONS OF MOTION
To simplify the presentation attention will be restricted to a plane parallel
fluid layer. It will be assumed that horizontal averages, which will be denoted by
overbars, are independent of time and that there is no mean mass transport through
the convection zone. The horizontally averaged momentum and total energy equations
can then be written
dz
. . . . . ~ . 1 ' (3.2)
where z is the vertical co-ordinate of a Cartesian system (x, ~, z ), ~, ~,
are gas plus radiation pressure, density and specific enthalpy; ~ = ( uL, ~r u~r )
is the fluid velocity, ~t is the z,r component of the viscous stress [ and ~z
is the vertical component of the radiative energy flux F~ which will he assumed,
again for simplicity, to be given by ~e diffusion approximation
= - K ~ Z T , (3.3)
where T is temperature and K = ~.c'Ty3Xj~ , ~ being the radiation density
constant, c the speed of light and ~ the Rosseland mean opacity. Perturbations
in the gravitational acceleration ~ = ( O, o,-~ ) have been ignored. Equation
19
(3.3) can be replaced by a more general though local representation of radiative
transfer, such as the Eddlngton approximation (Unno & Spiegel 1966), without
adding unduly to the complexity of the analysis; adding a nuclear source term to
equation (3.2) or casting the problem in spherical geometry introduces no new
conceptual difficulties.
Equations (3. i) - (3.3) must be supplemented with a continuity equation and
an equation of state. The system is then completed with formulae for the
convective fluxes. It is these that the mixing-length theory must provide.
In studying the dynamics of convection it is usual to separate all quantities
into mean (horizontally averaged) and Eulerlan fluctuating parts, as in
p = ?(=) + ] ~ ' ( = , ~ , ~ . ~ ) , (3.4)
where t is time, and to subtract the mean equations from the full equations of
motion from which they were derived to obtain equations for the fluctuations. It
is at this point that serious assumptions are first introduced. Though it is
rarely stated explicitly, in almost all attempts to model stellar convection the
Boussinesq (1903) approximation is used~ this can be justified only when the
scale ~ of the motion is much less than the pressure and density scale heights
of the layer (Spiegel & Veronis 1960, Malkus 1964). In this approximation the
viscous terms and the kinetic energy flux ~?~-~; in equations (3.1) & (3.2)
are neglected which renders (3.2) indistinguishable from the mean thermal energy
equation. The equations for the fluctuations, in this approximation, are
= -- _ ~- ~ -XT?'-~ T ~ , (3.5)
d.iv ~ = O~ (3.6)
20
f = - ~ - T ~ , (3.8)
where,with the exception of fl, all mean quantities are considered to be constant
over the scale ~ of the motion. In these equations c~ is the specific heat at
constant pressure, ~ : - ( ~ [ . f l l i . T ) ~ and
az ./~G az . (3.9)
Moreover, the convective momentum and heat fluxes in equations (3.1) and (3.2)
simplify to
(3. I0)
~-- fh~r ~ rE? ~ T i . (3.11)
Quantities such as X and ~ are considered to be functions of the thermodyn~nical
state variables f and T whose fluctuations are related by (3.8). The pressure
fluctuation appears only in the momentum equation, and has no thermodynamical
significance. Indeed it can be eliminated by taking the double curl of equation
(3.5), the vertical component of which, after use of (3.6), becomes
V -~ V O~- ~, a~,,U - V,~T " = O , (3.12)
where
u - ( u . , u : , u . ) - ~ . v ~ - L~.v .~ , (3.~3)
~. ~/~ , ~ ~/~ ~d V;~V ~-~. The viscous stress has been omitted from the momentum equation (3.5), and
hence from (3.12). This is justifiable in stars because the Reynolds number
characteristic of the largest convective eddies, which are the only motions
21
treated explicitly in stellar mlxlng-length theories, is large. The continuity
equation (3.6) indicates that dynamically the fluid behaves as though it were
incompressible; thus the occurrence of acoustic waves is prohibited. Density
fluctuations that do arise serve only to provide buoyancy. Equation (3.7) is the
fluctuating thermal energy equation, and not the equation for fluctuations in
total energy.
It should be emphasized that the Boussinesq approximation is not an essentlal
component of the assumptions of mixing-length theory. Other less restrictive
though more complicated approximations to the equations of motion could be used,
such as the anelastlc approximation (Ogura and Phillips 1962, Gough 1969) which
holds for low Mach number convection in deep layers of gas. Like the Boussinesq
approximation it filters out the possibility of acoustic waves, whilst retaining
some of the features of compressibillty. No attempt to develop a mixing-length
theory with consistent use of such an approximation seems to have been made.
4. LOCAL MIXING-LENGTH FORMALISMS FOR A STATIONARY ENVELOPE
In the Boussinesq approximation~mean thermodynamical state variables are
considered to be constant over the assumed scale ~ of the motion. Another
approximation, common to the formulation of most mixing-length theories used in
stellar structure computationsp is to treat the superadiabatic lapse rata ~ in
the thermal energy equation as though it were constant. Though in practice it is
found that this approximation is poor, because at the top of the hydrogen
ionization zone in particular ~ varies on a scale much shorter than { , it is
usually retained because it brings great simplicity to the mixing-length formulae.
It permits the heat flux and Reynolds stress to be expressed at any level in the
envelope solely in terms of the mean conditions at that level. For this reason
the resulting theories are called local.
Formulation under the assumption of balance baleen buoyancy and turbulent drag
Most mixlng-length descriptions are £ormulated in terms of rising and falling
fluid parcels having typical radius ~ ~ and which at any instant can be
characterized by a single vertical velocity ~ and temperature fluctuation T"
In the early discussions by Prandtl, Biermann and Siedentopf the parcels were
presumed to travel at their terminal veloclty, buoyant driving being balanced
exactly by turbulent drag. Thus in the vertical component of the momentum
equation (3.5) the time derivative was essentially ignored and the nonlinear
advection terms were replaced by a drag force of the form u~/~{) . If the
pressure fluctuation is ignored the coupling between vertical and horizontal
motion is removed, and a relation between the vertical speed and the temperature
fluctuation results!
22
From here on overbars are omitted from mean quantities where no ambiguity results.
Initially it was common to consider the fluid motion to be adiabatic. Equation
(3.~), with the right hand side set to zero and
integrated along the trajectory of the parcel.
distance the parcel moves~one obtains
~.VT' ignored, can then be
Taking ~ to be constant over the
(4.2)
where ~ is the vertical displacement of the parcel above its point of origin.
The constant of integration is zero since T' is assumed to be zero at ~ = O. A
typical parcel at any instant might have travelled say half the distance I , so a
typical temperature fluctuation 8 and velocity um can be obtained from (4.1) and
(4.2) by setting ~ - ~. Noting furthermore Lhat parcels with T'>O rise
and parcels with Tk O fall, the heat flux ~ can be estimated by replacing
with ~-~ . Then
- - - ¢
The Reynolds stress may be estimated in a similar way to be
(4.4)
The numerical factors in front of these formulae vary from paper to paper,
because the precise definition of ~ and in particular the relation between parcel
size and mean-free-path is not universal, and because factors of order unity can
be introduced to account for effects of pressure fluctuation or imperfect
correlation between ~ and T" .
Kinetic theory of .acceleratin~ fluid elements
The alternative approach is to imagine the fluid parcel to accelerate from
rest. It is usual then to ignore the nonlinear terms in the momentum equation.
The influence of pressure fluctuations can be estimated by working from equation
(3.12), and introducing typical horizontal and vertical waven~bers by setting
V, =-k and z. = -- ~= This is perhaps not quite as crude an
approximation as one might first imagine, because these relationships are
satisfied by the convective eddies of linear stability theory whose visual
23
appearance is not wholly dissimilar to the eddies of intensely turbulent
convection. The linearized form of (3.12) then becomes
where
- = o , (4.5)
----- I "*' k,,/k, (4.6)
The only difference between equation (4.5) and what one would have obtained from
the linearized vertical component of (3.5) with the pressure fluctuations ignored
is the factor ~ The pressure fluctuations divert vertical motion into
horizontal flow, thereby decreasing the efficiency with which the motion might
otherwise have released potential energy. The effect in this approximation is
simply to increase the apparent inertia of the vertically moving fluid, without
changing the functional form of the equation of motion. In some derivations
equation (4.5) is obtained directly from (3.5), the factor ~ being introduced by
analogy with the virtual inertia of a body moving in a potential flow.
When integrating equation (4.5) it is usual to regard the temperature
fluctuation as a function of the parcel displacement ~ , and approximate it by
the leading term in its Taylor expansion. Of course for adiabatic motion
equation (4.2) indicates that the leading term is the only term present. The
operator ~ in equation (4.5) can be replaced by ~/~ without further
assumption~ since in linear theory there is no distinction between Eulerian and
Lagrangian time derivatives of perturbation quantities. The equation can then be
integrated to yield
For adiabatic motion, (4.7) together with (4.2) complete the description of the
dynamics. If typical velocity and temperature fluctuations defined by setting
= -~ are used as before to estimate ~ and p~ , t h e same equations (4.3)
and (4.4) are obtained, aside from factors involving ~ . Note that pnessure
fluctuations could have been incorporated into the original formulation of the
theory by dividing the right hand side of (4.1) by ~ .
Heat exchange between fluid 2arcels~dtheoenVironment
Heat exchange between fluid parcels and their surroundings is most simply
accounted for by treating equation (3.7) in an analagous way to the momentum
equation. Retaining only the leading term in the Taylor expansion of ~(~) in
the linearized version of (3.7) and integrating along the trajectory yields
24
-r' i f - (4.8)
where k ~ = k~* ~ When ~ =~ this is precisely the same relation that
one would obtain by neglecting the time derivative in (3.7) and replacing
~,.VT' - ~ by the estimate ~T'/(~ t) for the turbulent
heat exchange. In deriving this equation the fluctuating part of (3.3) was used;
fluctuations in ~ do not arise because, consistency wi~h assuming ~ constant,
the gradient of ~ is small compared with ~VT'I . Whereas for adiabatic motion
the wavenumbers entered only in their ratio~ in the nonadiabatic theory their
magnitudes are also required for estimating ~u F; . Taking the mixing
length t o be a measure of the vertical extent of the eddy suggests
Proceeding as in the adiabatic case, but with (4.8) replacing (4.2), one is led to
Fo = ¢ ~-"~'l" s-' [-¢',*R's) ''~- I ] ' Kp, ~,~.~o)
[(, ~-~5) '~- ¢ (~r~ / '0 t ~ , (4.~1)
where
~(S/T)p t"" 5 ~ (:K/,,,~)"" (4.1~)
is the product of the Prandtl number and a Rayleigh number based on ~ , and
(4.13)
is a geometrical factor of order unity.
When ~S >>I the convective motion is almost adiabatic and
(4.14) I=o ~, ~ " z ~ S ' / " 1<18 ,
25
which are the same as (4.3) and (4.4) aside from the factors involving ~ . In
the solar convection zone this condition is satisfied everywhere except in
extremely thin layers at the top and bottom~ the latter and sometimes even the
former being too thin to be resolved in normal stellar structure computations.
When 7"5<~ I,
Fo = ~ ~-'/'~' S" ~ , (4.16)
--( 0 ?
which are relevant in substantial regions of some red giant envelopes.
Other formulations
In essence, the derivation presented above in terms of accelerated and
subsequently annihilated fluid parcels is the same as that of Vitense (1953;
Behm-Vitense 1958) whose prescription has been the most widely used for computing
stellar models. Her 1958 formulae for the heat flux imply (4.10) with ~ = 2 and
= J~/~ Vitense also studied the case when fluid parcels are optically
thin, end adjusted numerical constants so that the optically thick and optically
thin formulae gave the same result at unit optical thickness. A smoother and
probably more accurate transition between the two limits can be obtained by using
the Eddington approximation to radiative transfer (Unno & Spiegel 1966, Unno 1967,
Gough 1977).
The derivation in terms of continuous turbulent exchange of heat and
momentum has been adopted by Unno (1967) and is similar to an earlier approach of
~pik (1950) in terms of convective cells. Opik's formula for ~ is
mathematically somewhat more complicated than (4.10)D but it takes similar values
if ~ and ~ are chosen suitably (Gough & Weiss, 1976). The differences between
the values of ~ predicted by Vitense and Unno arise mainly from the different
assumptions about flow geometry and the slightly different constants of order
unity appearing in the approximations to the equations of motion~ rather than from
apparent variances in the physical models.
5. REMARKS ON THE ASSUMED STRUCTURE OF THE CONVECTIVE FLOW
In order to complete the prescription for ~ and ~ the parameter ~ ,
26
which depends on the geometry of the flow, and the mixing length ~ must be
determined. The latter is one of the most difficult parts of the theory. Here
the criteria that might be most important in affecting a choice of ~ are
discussed; the mixing length is postponed to a later section.
What one might consider the most natural choice of ~ depends very much on
the mixing-length model that has been adopted. Thinking in terms of approximately
spherical parcels of fluid rising and sinking through the background medium
suggests adopting the formula for the virtual inertia of a sphere. The particle
is thus considered in isolation, and ~ = 3/2. Different values would be
obtained if the parcel were thought to be aspherical.
An alternative approach is to ass~ne the return flow around a moving parcel
is confined to the immediate environment by the interaction with neighbouring
eddies, so that a relatively compact convective eddy or cell is formed. To
determine ~ the shape of the eddy must be specified. For want of a more reliable
procedure, the marginal or unstable modes of linear theory might be used to
describe the flow in convective cells. This has the computational advantages of
being simple to calculate for local mixing-length theory, and of providing a basis
on which to generalize to nonlocal theories or theories that one might hope to
apply to convection in more complicated situations. Of course it is not clear
what boundary conditions are the best to adopt, but that is unlikely to be
crucial; it is expedient to choose those that yield the simplest solutions. Thus
for the relatively simple theory discussed in the previous section one might set
(cf. Chandrasekhar 1961)
(5.I)
in an obvious n o t a t i o n , where ~ and ~ are s i n u s o i d a l i n z and depend p o s s i b l y
on t, and the planform f depends only on x and y and satisfies
v , " f - - k: f - , I . (5.3)
Equation (4.5) was derived from (3.12) with a flow such as this in mind.
If the mixing-length model is one in which a statistically steady eddy is
maintained, the continual turbulent interchange of momentum and heat may be
regarded as being due to an eddy viscosity and conductivity. The convective
cell is thus like the marginal mode of linear stability theory, for which ~ = 3.
If, on the other hand, the model is one in which the eddy grows and subsequently
breaks up, Spiegelts (1963) suggestion of choosing the most unstable mode is more
27
appropriate. The rationale for this is that it is the most rapidly growing mode
that would dominate the flow. Its shape is approximated by (Gough, 1977)
k~'/k: = ± {I • [ 3 ~ l ( h - ~ ' ~ 5 -t ~) ' /~ ] ' /~ ' } , (s.4)
given that its vertical extent 7T/kz = ~ is fixed. Thus ~ varies monotonically
from 2 to i as S varies from 0 to
Other possibilities come to mind, such as choosing the mode that maximizes
F c , though that results in a shape not very different from the one implied by
(5.4), with ~ varying between 5/2 and I. One might consider averaging over an
ensemble of different eddies, but then one is faced with the problem of choosing
the distribution function.
The flow geometry also enters directly into the mean equation for the hydro-
static support of a stellar envelope. In a plane parallel layer only vertical
transport of vertical momentum matters, but in a star vertical support is provided
partly by horizontal stress. If the star is spherically symmetrical, the Reynolds
stress tensor is axisymmetrical about the vertical and has only two distinct
eigenvalues. Thus in spherical polar co-ordinates the components depend on only
two quantities, which may be taken to he T~ and ~ . The equation for hydro-
static support, which is nontrivlal only in the radial direction, may be written
a (5.5) * = o ,
where r is the radial co-ordinate. If the turbulence is isotropic, ~ = 3 and
the Reynolds stress behaves like a pressure of magnitude F~ .
6. FURTHER REMARKS ON THE DYNAMICS OF CONVECTIVE EDDIES
When the growth and subsequent annihilation of convective elements or eddies
is taken into account, mean transports are usually estimated by using
characteristic values of the velocity and temperature fluctuations. These values
are normally taken to be the actual fluctuations at ~ =~ , when the element
has moved half its mean-free-path; Vitense took a space average over an element's
trajectory, which in local theory is equivalent. The approximation implies
certain assumptions about the creation of eddies, which become apparent as soon
as an explicit attempt to average over all turbulent elements is made. The
discussion that follows is based on the ideas behind Splegel's (1963) formulation
of the theory, though the details are somewhat different from that approach.
A specific mixing-length model
Consider, for example, the evaluation of the heat flux ~ ~ rcl,~--~. It
is convenient to have a specific model in mind, and to this end a flow field of
28
the kind (5.1) - (5.3) will be assumed. Thus the flow is represented by a
conglomerate of cells or eddies that form, grow and subsequently break up. Though
it is envisaged that there is a degree of randomness in the positions at which
the eddies form, each is presumed to stay in the same place during its existence.
An eddy centred at height :~o that originated at time to is thus described by
until the moment of annihilation, the functions W and ~ being determined by the
linearized forms of equations (3.7) and (3.12). If ~(2o,~°~ ~) is the
probability that the eddy has not yet been destroyed, the average of any function
over all eddies at height z. is obtained by weighting that function with
and integrating over all possible times of creation. Thus if eddies have mass ~,t
and are created at a rate ~ per unit volume per unit time, the heat flux at
2 f 7-° is
Fo = ~ c T ! W ® ~ a ~ o . (6.3)
Here all the eddies have been assumed to be centred at z~ where they contribute
maximally to F~ • In principal an average over eddies centred in the range
(Zo-~ , Zo ÷ ~ ) ought really to be taken. In local theories this does no
more than multiply the right hand side of (6.3) by a constant factor of order
~ity.
The growth Of convective eddies
The velocity and temperature fluctuation amplitudes depend on the initial
conditions of the eddy. One of the assumptions of the mixing-length approach is
that turbulent fluid elements originate with the same properties on average as
their ium~diate environment, though in any individual eddy there must be some
deviation from the mean state for otherwise that eddy would have no identity. It
must be assumed that the eddy grew from a non-zero perturbation, but provided the
initial amplitude is much smaller then the average value, the precise details of
the initial conditions are unimportant. In this discussion the conditions that
lead to pure exponential time dependence of ~/ and ~ will be chosen, merely to
simplify the mathematics. Then
29
W ; W . ~
(6.4)
where V~o and ~o are the initial values of k~J and ~ , and are related by
The growth rates 0~ are given by
(6.s)
o-~ ~ ± J v ~ . ~ - - ~ = ~ , ~ - ' s - ' ~ " [ + ( ~ t ~ ' 5 ) ' I ~ - l ] , (6.6)
where ff~ m ~ ~ ~/~ T and ~ ~ kLK/lf%. Note that if W, and ~o are of
the same sign the eddy grows, but if they are of reverse sign it decays.
It is presumed that in the initial perturbations, which arise out of the
smaller scale turbulence resulting from the breakup of both the major heat
carrying eddies and possibly also from larger eddies that make lesser contributions
to the heat flux, the velocity and temperature fluctuations are uncorrelated. Thus
only half the eddies have VJ o and ~ of the same sign and subsequently grow. The
other half make an insignificant contribution to the heat flux and are ignored.
Eddy creation rate and initial qonditions
The expression for the heat flux depends explicitly on the eddy creation
rate n and the amplitudes 9~/o , ~.. These are governed by the background
turbulence. In a statistically steady state, however. ~ can be evaluated from
the statement that the entire fluid volume (or some constant fraction of it) is
occupied by eddies. Thus
%- (6.7)
It is much more difficult to specify the initial amplitudes, and at this point it
will be observed only that if the flow is to be controlled for most of the time by
the linearimed dynamics leading to (6.4), and not by the eddy breakup process.
then ~/. and ~. should be small compared with the average amplitudes. Thus
~x~ (~) >> | , where ~- . ~'+ and "~ is the mean lifetime of an eddy. If r is
30
estimated by integrating ~/ in (6.4) and setting t h e resulting displacement
equal to ~ , this condition becomes
x ~ ~{/Wo >> i . (6.8)
Eddy annihilation hypothesis
Finally it is necessary to obtain ~ , which depends on the disintegration of
eddies. The most natural interpretation of the mixing-length annihilation
hypothesis is that a fluid element is considered to break up as it is displaced
through &~ with probability a~/~ : (W/~)a~ . In other words the element has a
mean-free-path { , and the probability of its annihilation is proportional to the
shear in the eddy and is not explicitly dependent on the details of its past
history. It follows that
.h_ t - - I ~-,(l='-I:.)
: } { ' + oc,, t. ( 6 . 9 )
The tffrbulent fluxes
It is now straightforward to evaluate
terms are ignored, equation (6.7) for the eddy creation rate yields
" , '<~ = o 7 / ( t , < >- - y) ,
where ~ is Euleris constant. Equation (6.3) then becomes
~f c t o" w . e . i- X-'c <:"(': "':') } = .t.(t. 'x-y) I eq,{Z~(t:4:o) 4t:o
¢c~ T t_'-~ "~
- - s-' r(, , - f s ' ' - ? ' K ] [I/).(t,,>,-~,).] ~ '~7 '
F~ If terms 0(~-) of the leading
(6.zo)
(6.11)
(6.12)
the factor 2 in the denominator arising because only half the eddies have positive
growth rates. This expression has the same form as (4.10), and can be made equal
to it by setting
31
~.. s' ~ ~, = e _~ 15. (6.13)
The stress ~ can be computed similarly, and yields
?~ = ([.x-~) ' (6.14)
which reduces to (4.11) when (6.13) is again used for ~ .
Remarks
It is now evident that the rough estimates (4.10) and (4.11) imply a value
for % , which approximates the degree of amplification of the velocity and
temperature fluctuations during the lifetime of a convective eddy. Of course the
precise value (6.13) must not be taken seriously, especially since it was a
obtained by exponentiating/poorly estimated quantity, though one may be tempted to
take comfort in the fact that it is at least reasonably consistent with (6.8). It
would be preferable if some independent method of determining the initial
conditions could be found. It is worth noting, however, that (6.12) and the
corresponding expression for ~ depend only logarithmically on A , so the method
of estimating the initial conditions need not be very accurate. The constraint
(6.5), which was applied only to minimize algebraic complexity, is not an
essential part of the formulation; other relations between Wo and ~o lead to
expressions for ~ similar to (6.12). These also have multiplicative factors
that contain the logarithm of the amplification ~ in the denominator.
The theory can also be formulated in terms of rising and falling fluid
parcels, with the only difference that the integrals in (6.3), (6.7) and (6.9) are
now considered to be evaluated along the parcel trajectories. In the local
approximation the two approaches are identical.
The discussion in this section has not led to any modification in the
mixing-length formulae, lts purpose has been to highlight the role of the eddy
creation process in determining ~ and ~k .
7. THE CHOICE OF MIXING LENGTH
Expressions (4.10) and (4.11) for F= and ~ are both increasing functions
of ~ . Since the philosophy of the mixing-length approach is to concentrate on
only one scale of motion at any level in the fluid, namely the scale that
contributes most to the fluxes~ the largest value of ~ compatible with the
dynamics must be chosen.
32
When modelling laboratory flows the choice of [ seems straightforward. The
largest eddy that one can imagine is determined by the geometry of the vessel
col%raining the fluid. Thus at any point it is usual to take
= ~ x , (7.1)
where ~ is the distance to the nearest boundary of the container and ~ is a
constant scaling factor of order u~ity which is determined by comparison of theory
with experiment.
Of course the dynamics of the flow could be such as to prevent this largest
conceivable eddy from growing, so that ~ is determined mainly by other factors.
Thus von K~rm~n (1930) suggested that for turbulent shear flow the mixing length
should be taken to be a multiple of a scale length of the mean shear. It seems
that for most laboratory applications this yields results that agree with
experiment less well than (7. i).
A stellar convection zone is not bounded by a rigid container, and must
therefore determine its own length scales. But just how ~ should be specified is
not clear. It is most common to follow yon K~rm~n's philosophy and choose
(7 .2 ) I[ - , ~ H ,
where H is a scale height of the mean stratification, though some stellar models
have been Computed using the lesser of (7.1) and (7.2) (IIofmeister & Weigert 1964;
B~hm & S=~ckl 1967). 0plk (1938) took H to be a scale height of density. This
choice has been favoured also by Biermann (1943), and by Schwarzschild (1961) who
argued that it is the distortion of expanding or contracting fluid elements as
they experience substantial changes in mean density that limits the size of an
eddy. This reasoning is not Boussinesq, and introduces some representation of
the effect of compressibility into the prescription. Vitense (1953) set H to a
pressure scale height. This has been widely used since, presumably for reasons
of computational convenience.
It is unfortunate that the attempt to incorporate compressibility into the
local theory results in a choice of ~ that does not reduce continuously in any
natural way to the kind of value that is favoured for laboratory applications.
Stellar convection zones are often many scale heights deep, which is currently
unattainable in the laboratory, but it would have been encouraging had some
experimental verification of the theory been feasible, even if it were in a
parameter range inappropriate to astrophysics. Only astronomical checks are
available at present, but these appear to provide little support for the details
of the theory.
33
8. cALIBRATION OF THE HEAT FLUX FORMULA
Stellar models are usually computed using equation (4. I0) for the convective
heat flux, with (7.2) defining the mixing length. The Reynolds stress T~ is
rarely included. The constant ~ is calibrated by comparison with observation
and then taken to be a universal constant, though some authors do not bother with
this nicety on the grounds that the mlxing-length formulae are too unreliable for
the calibration to be meaningful. The determination of ~ can be affected either
by constructing a solar model with the correct luminosity and effective temperature
or by comparing the slope of a theoretical lower main sequence with observation.
The two methods give results in reasonable agreement with one another when H is a
pressure scale height, though they are subject to considerable uncertainty. In
particular, uncertainties in the solar composition are reflected in the
calibration, as are uncertainties in the opacities, equation of state and nuclear
reaction rates. The results depend also on assumptions concerning the mixing of
material that has been processed in the core. In addition to these, additional
uncertainty is introduced by inaccuracies in the numerical techniques, whose
existence is indicated by discrepances between the results of different
invest igato r s.
It should be noticed that the conclusion =< -~ I does not satisfy the
condition ~ << I upon which the Boussinesq approximation depends. The theory
is therefore not internally consistent. However ~ is not much greater than
unity, and the effects of compressibility may be insufficient to modify the heat
flux severely. More serious is the local approximation that regards ? as being
approximately constant over the length scale ~ It is one of the aims of non-
local mixing-length theories to rectify this flaw.
The calibration of ~ rationalizes the astronomical data, but it does not
provide a test of the mixlng-length theory. The reason is partly that convective
envelopes of solar type stare are approximately adiabatically stratified every-
where except in thin transition regions above the hydrogen ionization zone. The
sole function of the convection theory in determining the gross structure of the
star, therefore, is merely to prescribe which adiabat characterizes the bulk of
the convection zone. This depends on the jump in temperature across the
transition region, but that hardly depends on the detailed functional form of the
expression for Fc • Indeed if ($.10) is replaced by (4.16), even though qL5 >> I
throughout almost the entire convection zone, the solar calibration requires
o( = 1.35 x 10 -3 when ~ and ~ take the values implied by BShm-Vitense's (1958)
formulae (Gough & Weiss 1976). The resulting solar model is barely distinguishable
from that computed in the usual way. In red giants there are regions where
q~5 < J , but the envelope models are insensitive to details of how the two
asymptotic limits (4.14) and (4.16) meet.
34
The calibration of ~ does not even determine both asymptotic limits, since
there is considerable uncertainty in the geometry of the convective motion. The
discussion in §5 suggests that ~'/~'does not vary greatly, but the possible range
for ~ is very wide. This influences only the limit (4.16)~ which hardly matters
for solar type stars. One might anticipate, however, that a calibration of q
would be possible by comparing theoretical red giants with observation.
An investigation of the sensitivity of red giant evolution to changes in the
constants in the mixing-length theory has been reported by Henyey, Vardya and
Bodenheimer (1965), and interpreted in terms of the asymptotic limits (4.14) and
(4.16) by Gough and Weiss (1976). Plausible variations in q do induce
noticeable changes in the evolutionary tracks on the H-R diagram, but it appears
that other uncertainties in both the theory and observation at present prohibit
calibration of q by this method.
A potentially more sensitive test for ~ might have been a measure of the
maximum depth of penetration of the convection zone. In some red giants the
convection zone extends deep enough to mix the products of the nuclear reactions
to the surface. The extent to which the convection zone has penetrated in such a
star could be determined in principle from observations of 017/O 18 ratios in red
giant atmospheres (Dearborn & Eggleton 1976). Computations by D.S.P. Dearborn
and myself, however, have revealed that such a test would probably not provide the
required information, for at its maximum depth the convection zone of a red giant
envelope is adiabatically stratified almost throughout, and the heat flux is
determined by (4.14). But this does not necessarily imply that there is nothing
to be learn~. Conditions may be sufficiently different in red giants that a
recalibration of (4.14) would yield a different result. This might shed some
light on the variation of ~ , whether density or pressure scale heights are
appropriate, or whether (7.2) is even a useful assumption.
9. THE REYNOLDS STRESS
It is common practice to ignore the turbulent stress ~e = I~£ in the mean
momentum equation (3.1). One reason, perhaps, is that to justify the Boussinesq
equations upon which mixing-length theory is based the convective velocities must
always be substantially less than the sound speed. This implies ~e << ~ '
However, it is the derivative of ~ that appears in (3.1), and if la?~/arl is
evaluated in a stellar model that has been computed without that term, it can be
the case that it exceeds Id~/=[rl by a considerable degree in the transition
region at the top of t3ae convection zone, even though ~ might be small compared
with ~ . Another reason for ignoring Te is because to do so removes
singularities from the equations of stellar structure and thus makes them much
easier to solve.
35
The equations, in the plane layer approximation, are
(9.1)
and
aT
The heat flux and Reynolds stress are given by (4.10) and (4.11) in terms of ~ .
Equations (9.1), (9.2) and the equation (3.9) defining ~ can be rewritten as a
system of first order equations for ~e, ~ and T thus:
(9.3)
- ( 9 . 4 )
aT = l
(9.5)
Here ~ and ~ are regarded as functions of Tt The usual stellar structure
computations are governed by the spherical analogues of (9.1) and (9.2) with T~
ignored, which is of one order lower than the system considered here. Solutions
of (9.3) - (9.5) are to be sought satisfying T~ = O at the edges of the
convection zones, where F/K - ~$/c~ also vanishes.
To analyse the nature of the singularities it is sufficient to consider the
structure of equation (9.3) near a boundary of a convection zone. Since S -~ 0
as the boundary is approached, the asymptotic forms of (4.10) and (4.11)
(9.6) Fo ~ A t'~ )
?~ ~ t 6 ~ --, ( 9 , 7 )
may be used, where A and B are nonvanishlng functions. If =he origin of z is
taken to be at the base of the convection zone, and all the coefficients in (9.3)
are expanded in a Taylor series about the origin, only the leading terms being
38
retained, the resulting equation is found to have the structure
am (9.8)
where M, N and Q are positive constants. In deriving this it was assumed that the
mixing length does not vanish at the boundary of the convection zone. The
singularity introduced by the Tt 'f~ term is best analyzed by setting ~4, z -- ~ and
writing (9.8), with the small last term neglected, as a linear system in terms of
a new independent variable ~ :
~ (9.9)
The coe f f i c i en t s ma t r i x of the r i g h t hand side has eigenvalues
" N "- J ± N ~ + ~ M ~ (9.1o)
which are real and of opposite sign, indicating a saddle point at the origin.
A similar analysis may be performed at an upper boundary. The resulting
equation for ~i has the same structure as (9.8), except that now the coefficients
M and N arenegative. Provided N ~" >/ ~ IMJ , both eigenvalues ~,~are real and
of the same sign, indicating that the origin is a nodal point, though if N ~ < EIMJ
the solution is a spiral that cannot satisfy the condition that ~ vanishes at the
boundary of the zone. This latter situation might arise if too large a mixing
length is chosen.
I~ is because the upper singularity is either a node or does not permit a
physically acceptable solution that inward n~nerical integrations from the
atmosphere of a star cannot be successful. Stellingwerf (1976) has pointed out
that an outward integration might workD and has presented a solution to a simple
model problem. Realistic stellar envelopes can be computed in this way only if
the convection zone is thin; otherwise a more stable nm~erical procedure must be
adopted.
Attempts to include T~ in realistic stellar envelopes have been made by
Henyey, Vardya and Bodenheimer (1965) and by Travis and Matsushlma (1971). In
both cases the structure equations were simplified in a manner tantamount to
ignoring the heft hand side of equation (9.3)~ thereby reducing the order of the
differential system and removing the singularities. Henyey et el. anticipated
that this approximation was not serious. Unpublished computations by Baker,
Gough and Stellingwerf of RE Lyrae envelopes with shallow convection zones using
37
the full system of equations revealed that at least in those stars the effect of
~ is not profound. Its inclusion smooths out the region near the top of the
convection zone~ so that d~/ar remains smaller in magnitude than ~/~ ,
and has little influence on the remainder of the convection zone.
iO. REFINEMENTS AND'GENERALIZATIONS
The discussion in ~6 demonstrated how ~ and y~ depend on the growth rate
of convective eddies. This dependence was emphasized by Spiegel (1963), who
also showed how the expressions are modified when viscosity is considered.
The averaging procedure used to derive (6.11) and (6.14) does not depend on
the precise nature of the turbulent flow. The description of the breaking up of
eddies is not refined enough to distinguish between the different circumstances
to which the theory might be applied. Detailed descriptions of the dynamics is
confined to eddy growth, and is contained in the expression for ~ . It is to
this that refinements and generalizations are most easily made.
Transport by small-scale turbulence
As an illustration, an attempt will be made to incorporate into the dynamics
the exchange of heat and momentum by smaller scale turbulence that was ignored in
6. It will be assumed that turbulence on a scale smaller than the heat
carrying eddies is isotropic, so the transport might be roughly represented in
terms of a scalar eddy diffusivity
= k" . (lO.1)
where ~' is a characteristic velocity and k' a characteristic wavenumber of the
background turbulence. This diffusity will be taken to be the same for both
momentum and heat. Its value is related to the velocity and length scales of
the major eddies, whose disruption seeds the small scale motion, and may be
rewritten
-- ~(_~.~)" '~k- ' -- ~ k " ( ~ - ' O '#"
( / r )
(lO.2)
where ~ is o f order unity and depends on the spectrum of the turbulence. It is
likely that E is only weakly dependent on the amplitude of the convection and
can probably be safely asstuned constant. This expression can now be incorporated
into the expression for the growth rate of a disturbance in a viscous conducting
38
fluid (e.g. Spiegel 1963):
, r = - I I, (lO. ~)
where ~ and ~ are the effective thermal diffusivity and kinematic viscosity:
= ~ ~ K/~c? , ~ = ~. (10.4)
Equations (10.2) - (10.4) define a growth rate ~ which can be substituted into
(6.11) and (6.14) to obtain equations for ~ and ?~ The prescription is
algebraically more complicated than the previous formulation which led to (4.10)
and (4.11), though its effect can be approximated by simply multiplying the value
of ~ obtained previously by the factor {I ~ ~e~ (~ -~"/~"
It is perhaps not surprising that the modifications to the results hardly change
the functional dependence of ~ and ~ on S, because the two extreme approaches
discussed in ~ 4 led to the same formulae. The new results may be no better than
(4.10) and (4.11), because the attempted improvement to the representation of the
physics may be insignificant compared with the errors that remain. It should be
noted, however~ that the modifications cannot simply be absorbed into the
definition of ~ .
The small scale turbulence not only influences the dynamics of the larger
eddies but also contributes directly to the fluxes. The heat flux can be
accounted for by replacing K by fc~ in the equation for the radiative flux.
The relevant Reynolds stress component must be augmented by ~f ~; which can
be written as [?~ , where ~ is yet another undetermined parameter of order
unity that depends on the spectrum of the turbulence.
Other refinements can be included, such as a representation of entrainment
and erosion of eddies, or the generation of waves. The former has been considered
by Ulrich (1970a), who used the meteorologists' model of convection based on
rising thermals. D.W. Moore and Spiegel (unpublished) considered the influence of
acoustic generation by convective eddies, and found that this noticeably reduces the
turbulent velocities when the Mach number is of order unity. Generation of gravity
waves with wavelengths comparable with Z , which occurs at the boundaries of
convection zones, probably requires a nonlocal theory for an adequate description.
Further refinements are discussed by Spiegel (1971).
Convection in s lqEly rotating stars
Aside from suggesting improvements to the standard theory, this approach can
be used to formulate mixing-length theories for more general circumstances.
Rotation or a magnetic field, for example, can easily be incorporated into the
39
stability analysis that determines ~ . If the convection zone is rotating, the
maximally contributing eddies are rolls aligned with the horizontal component of
the rotation rate /~ (e.g. Chandrasekhar 1961). Their growth rate is determined
by
(~÷~)~- p~ ~'(~-,)n ~} ÷ ~ = o, (lO5)
where 11 is the vertical component of J~ . Only if _(I is small might one reasonably
hope to obtain meaningful results by just using this growth rate in the normal
mlxing-length formulae, since the effect of the rotation on eddy disruption has
been ignored. In that event the solution to (10.5) can be approximated by
-, ~ -' ~ '/~ (10.6)
and ~ is given in terms of it by (6.11). Note that J~ measures the local
rotation in the vicinity of the eddy, and should therefore be interpreted not as
the angular velocity but as half the vorticity of the mean flow.
It is more difficult to calculate the Reynolds stress. The rotation
introduces a degree of order to the turbulence that destroys the axisymmetry of
the stress tensor and rotates its principal axes. Provided -(I ~ << ~ the
effect is small and for the purposes of computing the hydrostatic structure of
the star can no doubt be safely ignored. Equation (6.14) can be used for ~
with (r determined by (10.6). But this approximation is not good enough for the
horizontal components of the mean momentum equation, since the relatively small
off-diagonal terms in the stress tensor generated by the rotation are important
for determining the angular momentum transport by the turbulence. It is
straightforward to construct a Reynolds stress tensor from the eigenfunctions of
linear stability theory, but in the absence of experimental tests it would be
most unwise to rely on it.
.Influence of a magnetic field
A magnetic field ~ can be treated similarly, provided its turbulent
distortion may be considered random and does not lead to organized concentrations
such as sunspots. Once again the turbulent motion is most efficient as rolls,
aligned with the horizontal field, and the growth rate is determined by the
equation obtained from (10.5) or (10.6) by replacing 4.~-'(~-0Jl ~ by
ir~/{ ~ , where ~ is the vertical component of ~ . The caveats
concerning the Reynolds stress mentioned in connection with rotation apply here
too.
40
Nuclear react!one and composition sradients
The interaction between nuclear reactions and convection is of particular
interest when reaction timescales are comparable with ~-'. This can be the
case in late stages of stellar evolution. The fluctuations in energy generation
rate induced by the convection influence the eddy dynamics through modifications
in both the temperature and chemical composition of fluid elements. The
convection influences the nuclear reactions not only via ~ and ~e , but also
by transporting the products of the reactions,
The mixing-length theory can be generalized as before. Variations =~ in
the abundances x t of elements £ must now be taken into account when calculating
both the buoyancy and,of course, the energy generation rate £(T, T, ~ per unit
mass that must be introduced into equation (3.7). The amplitudes W and ~ are
now determined by
~ w ( 'ST" 0 (lO.7)
(lO. 8)
' defined in a way analogous to Vv' and @ in where X~. is the amplitude of =¢~ ,
(6.1) and (6.2), and
The summation convention is being used. The abundances are determined by
where ~ measures the rate of production of ~ . The linearized fluctuation
equation derived from (10.9) can be combined with (IO.7) and (10.8) to yield the
following equation for e- :
-' ~ = 0 (lO.lO)
where
41
R 5 ~- Cf'f=j ~.r,'r ' 1Z"r "- 5W .~,,,~
and ~j is defined by
(O '~ , j - l~,j - T P . , . j . j / I ) S j~ = ~ , , -
where ~j is the Kroneeker delta.
Overbars on mean quantities have been omitted as usual. Once again Y~ and i~
are given in terms of 0-by (6.11) and (6.14).In addition one can easily derive
for the flux of =c 6 , which is in the m direction,
(10.11)
chemical The first term represents the transport that arises solely because~elements are
created or destroyed at different rates in upward and downward moving fluid. The
second term represents turbulent mixing, though that too is influenced by the
reactions and is not a simple scalar diffusion.
In the special ease when there are no reactions 5~3 ~ 0-" ~U '
and
F~ = - (~-f)-' T~ ~-Z ' ( l o . 1 4 )
which leads to a simple diffusion equation governing the mean abundance ==~ .
Convection in pulsating envelopes
Application of the mlxing-length formalism to stellar pulsation is somewhat
more complicated than the examples considered above, because now the time
dependence of the coefficients in the fluctuation equations must be taken into
account. Additional assumptions must also be made. 0nly a few brief remarks are
made here, since detailed discussion of this problem is to be found in Unno's
42
contribution to this volume.
The case of radial pulsations is the simplest to discuss, provided attention
is restricted to fundamental and low overtone modes ~hat vary on a length scale
greater than ~ . If Lagrangian co-ordlnates defined in terms of the mean flow
are used to describe the pulsations, but locally defined Eulerian co-ordinates
for the convection, the equations governing the convective fluctuations are
rather similar to those used for a static atmosphere, though additional terms
must be added to account for the mean dilation and ~ must be modified because
the co-ordinate frame is no longer inertial. A mixing-length theory can therefore
be developed by following one of the procedures outlined in ~# 4-6.
Unno (1967) formulated a theory by generalizing the model that assumes
continuous turbulent exchange of momentum and heat between a convective element
and its surroundings. Though the general growth of convective fluctuations
during the lifetime of an eddy is ignored in this approach, acceleration of a
convective element and modulations in its temperature induced by the pulsations
are taken into account. An eddy is presumed to maintain its identity, deforming
instantaneously with the mean environment. The theory now requires one to
recognize that the lifetime of an eddy is finite, for a turbulent eddy retains
some memory of the conditions at the time of its formation. Thus much of the
apparent simplicity enjoyed by this model when applied to a stationary stellar
envelope is lost. Alternatively, the discussion of ~ 6 can be adapted for a
pulsating star by introducing the appropriate time dependence into the equations
of motion (Gough 1977).
These approaches each require an explicit statement about how the initial
state of a convective element depends on conditions at the time of creation.
Since the mixing length, which determines both the destruction rate and the
initial dimensions of elements, is assumed to depend only on the mean (horizontally
averaged) state and not on convective fluctuations, it is perhaps most natural,
and certainly simplest, to assume that it has the same functional form as for a
stationary envelope (and thus does not depend explicitly on time derivatives) and
to make a similar assumption about all other aspects of creation. It must be
realized that this is yet another unverified assumption of the theory. It may not
be a good approximation, for although it is the mean stratification of the
convection zone that controls which eddies grow most rapidly, the level of
turbulence at the instant of creation presumably does have some influence on the
perturbations out of which those eddies grew.
A similar objection may be levelled at the assumption that the mixing length,
when it determines eddy annlhilation, depends only on the mean environment.
If breakup is determined by shear within the eddy, perhaps the current eddy
dimensions provide a more appropriate length scale. These depend on the history
of the eddy and not just on instantaneous conditions. Likewise turbulent drag
43
and heat exchange depends on the current eddy size, and also on the intensity of
the small scale turbulence which may not vary in phase with the larger eddies.
The different versions of mixing-length theory yield different formulae for
F= and Tk when applied to radially pulsating stars. This emphasizes the
uncertainties in the assumptions. The differences offer some hope of choosing
between them by observation.
The range of possible asstEaptions widens further when nonradial pulsations
are considered. One prescription has been offered by Gabriel, Scuflaire, Noels
& Boury (1974) who generalized Unno's approach in a natural way. Amongst the
approximations is the neglect of anisotropy in the turbulent flow, which
circumvents the complicated problem of determining how the changing shear
associated with the pulsations modifies the convective velocity field. Anlsotropy
appears to have a more complicated influence on the pulsational perturbations of
the heat flux and Reynolds stress in this case than it does for radial pulsations,
so the ass~m~ption may be critical. It would be useful to know how sensitive
pulsations of stellar models are to changes in this and other assumptions in
order to assess where effort to improve the theory might most profitably be
dlrected.
One of the motivations for developing a convection theory in a time-
dependent envelope is to study the pulsations of the cooler Cepheid and RE Lyrae
variables. Unpublished computations by N.H. Baker and myself of the linear
stability of such stars to radial pulsations, using a generalization of the
formulation in ~ 6, indicate that the modulation of ~ generally has a
stabilizing influence on the pulsations, and is responsible for determining the
red edge of the instability strip.The phase of the modulation of T~ is such as
to &rive the pulsations in some regions of the convection zone and damp them in
others. The driving is greater in the cooler stars, and may be a significant
factor in the excitation of the long period variables.
Commen t s
These examples illustrate how the basic ideas of mixing-length theory might
be applied to a variety of situations. The generalizations all concentrate on
describing the dynamics of the major eddies prior to breakup, and ignore the
more difficult issues concerning creation and annihilation. To do more would
require a more sophisticated study of the mechanisms of turbulence.
In particular, there is no prescription for determining the mixing length.
One could choose the same value as one believes is applicable to ordinary
convection. In that case the theory predicts, for example, that a vertical
component of rotation or magnetic field reduces the heat flux. It appears how-
ever that there can be circumstances where rotation increases the heat
44
flux through a convecting fluid (Rossby 1969, Sommerville & Lipps 1973, Baker &
Spiegel 1975), which shows that the mixing-length prescription hasn't even
predicted the correct sign of the change. Perhaps the influence of a small
composition gradient is more reliably described, because the perturbation is via a
scalar rather than a vector field, and influences the dynamics only by modifying
buoyancy. However this could be the case only when that modification is small,
for we know from experimental studies of thermoha!ine convection that once
composition gradients are sufficient to change the stability characteristics of
the mean stratification the gross structure of the flow suffers a qualitative
change (Turner 1973). Thus the theory is not immediately adaptable to semi-
convection.
ii. NONLOCAL THEORIES
One of the obvious inaccuracies in the theory developed above comes from
assuming velocity and temperature fluctuations to depend on only local properties
of the environment. This would be justified if ~ were much less than all
relevant scale heights, but the stellar calibration suggests that this is not the
case. In particular ~ can vary on a scale much shorter than [ . Nonlocal
treatments take some account of the finite extent of convective eddies, and lead
to prescriptions for F, and ]~ that involve averages over distances of order ~.
Thus sharp gradients in ~ no longer lead to rapid variations in the convective
transports. Moreover, the treatments aim at representing overshoot into adjacent
stable regions.
There are two nonlocal properties of eddies that can be represented in a
straightforward way. One is that an eddy centred at •, samples ~ over the
range Cz.- ~£ , z.-e ~£); the other is that ~(z) and ~(z) are
determined not only by eddies with z,= z , but by all the eddies centred between
zo-~ and zo ~-~e. These can be taken into account within the framework of
the Boussinesq approximation, which entails ignoring the variation of all other
variables over the scale of an eddy.
Avera~in~ over eddies
The only place ~ enters into the formulae (6.11) and (6.14) for ~ and ]%
is in the growth rate o- . As was noticed by Spiegel (1963), the linearised
equations of motion used to determine the eddy growth are the Euler equations of
a variational equation for o- , whose solution is (6.6) with ~ replaced by
45
[ ~Cz') w'(z', z) a.z.'
where ~;(z, Zo) is t_he vertical component of the velocity of an eddy centred at
~o and the range of the integrals is the vertical extent of the eddy. With the
introduction of (6.1) this becomes
z - i {
(11.1)
Taking account of contributions to ~ and F~ from eddies centred at
different heights leads to similar averages: the assumptions (6.1) and (6.2) imply
that both ~T' and u; ~ have a m dependence quadratic in cos [~z- Zo)/~ ] .
Thus if Fc ° (z) and ~ (z) are defined as the right hand sides of (6.11)
and (6.14) with # replaced bye>, the nonlocal formulae for the heat flux and
Reynolds stress may be written
Fo = z I F~oC~O) ~o~[.~(~-~.)/~]az. z-~
(11.2)
(11.3)
Use of these expressions converts the ordinary differential equations of stellar
structure that obtain from local mixing-length theory into integro-differential
equations.
The fluid element approach
The extent of the region over which ~o and ~o are averaged in (11.2) and
(11.3) depends on the mixing length in its role of being a measure of the eddy
size. The description of mixing-length theory in terms of rising and falling
46
fluid elements, howeverp averages over a mean-free-path~ and so depends on ~ as a
measure of the annihilation rate. The analysis will now be repeated for this more
commonly used picture, to illustrate how uncertain the fine details of the theory
are. The arguments are similar to those used by Spiegel (1963).
It is a straightforward matter to repeat the analysis of ~ 6 with the fluid
particle picture in mind. The mathematical structure is almost the same, with the
principal difference that the integrals in the equations leading to (6.9) and (6.ii)
are now to be considered as llne integrals along fluid element trajectorles. It
is simplest to use the vertical displacement ~ of a parcel from its initial
position to define an independent .variable S according to
As = a~/£ . (n.4)
The bottom and top of the convection zone are assumed to be at 5-o and
Then the contribution to ~ from rising elements is approximately
I ,S $ - So
F~,( I ~ s) * I o - ' ~ T { " v .......... a S o , , ~ ¢c, ~s ( s - s . ) e - o
5 ~ S I *
(n.s)
and that from sinking elements is
~e-s
I I s' o-'~ T{" - ~ ....... (11.6) : l,~, - 7 - - ( ~ . - , ) e a~o .
In obtaining these equations the ~emperature fluctuation of an element was taken
to be
T ' = o - ' / t / - r t s.) .......... ~ . . ( ~ - ,
which was estimated by integrating W in (6.4) and using this and (6.5) to
eliminate ~- t o and @j in the expression for ~ . Once again ~" is assumed
to be defined in terms of an average (~> such as (ii.I) to take account of both
the finite size of fluid elements and the fact that they traverse a finite
distance through their environment. Note that the creation rate ~t has been
taken to be the same as in the local theory. It has been assumed that the
47
motion is not necessarily vertical, as does Spiegel, the cosine of the angle made
by the velocity with the vertical being denoted by ~ • Thus Fee =[~ is the
flux due to elements moving in directions between ~ and ~, ~p when ~ ~ O
The total flux is obtained by integrating over ~ and yields
i &,
O
(11.7)
where ~x is an exponential integral. The expression for ~ is similar. These
averages are rather different from (11.2) and (11.3), the main weight coming from
5 - S. I = 0.6 rather than being concentrated near zero. The value of
defining the initial conditions is once again undetermined. If it is fixed by
insisting that (11.7) approaches (4.10) in the limit e ~ O, one finds
~-P'- d >, "" e ~ 7 . ( n , 8 )
The averaging procedure lnboth this formulationandthe eddy approach is
rather crude, and depends in particular on an assumed structure ~r the velocity
and temperat~e fluctuations based on local theory. Other versions of the theory
that pay more explicit attention to the motion of elements have been formulated,
not~ly by Faulkner, Griffins and Hoyle (1965), Ulri~ (1970a). ~avlv &
Salpeter (1973) and Maeder (1975). Nordlund (1976) has recently studied a model
based on rising and sinking columns. The differences in outcome between the
various procedures ~pears to derive mainly from variances in the rather a~itrary
~oices of scal~g factors.
Spiesel's theory
A major drawback to the methods described so far is that they require one to
solve the equations of motion for the eddies. This becomes especially awkward
when the theory is generalized ~r ~plication to more complicated circ~nstances,
such as pulsating stars. It may be poss~le to alleviate the difficulties by
working within he framework suggested by Spiegel (1963) who started from an
element conservation equation ~ phase space. Spiegel considered a plane parallel
atmosphere and set
{ , (n.9)
48
where ~ is the element distribution function and v is the magnitude of the
velocity. The term ~(g~)/~L , which depends on the dynamics of elements
and which would normally appear on the left hand side of a conservation equation,
has been absorbed into the source function O_ ° This equation can be formally
solved for ~ in terms of (A , as is sometimes done in radiative transfer theory,
and the heat flux and Reynolds stress computed by averaging appropriate moments
of ~ over ~ . In particular, the heat flux is
Fo = I F L'j'a = (ll.lO
where ~' is the specific enthalpy fluctuation in an element. Rather than
discuss the element dynamics explicitly, Spiegel simply assumed that I~'I6~(so)
is independent of 8 and then chose it to make (II.i0) reduce to (4.10) in the
limit ~ ~ O. The result is
I S,
F= (s) = o F~o(So') E ~ ( I s o - s l ) &So , (n.n)
with I given by (6.13). This result differs from (11.7) because of the
assumption about the functional form of l&'l~.
AR~roximations
Since integral equations are not readily incorporated into most stellar
structure programmes it is tempting tO approximate the equations for ~ and ~
with differential equations, Spiegel's approach now exhibits the advantage that
one can immediately draw on the techniques of radiative transfer theory. In
particular, Eddington's first approximation provides simple equations relating
and ~ to 4~> that are no doubt accurate enough. To obtain the equation
for ~ , for example, moment equations are first constructed by multiplying
(11.9) by ~'~ and by ~ and integrating with respect to ~ , remembering
that ~'@ o when ~ ~ 0 . This gives
dh__ _ 3" -- O , &s
(Ii.12)
49
&S (11.13)
where
!
7 " I ~" I "a~' • - i
Eddington' s approximation is t o t a k e
where ~, and ~_ are independent of ff .
_~ a~F~ _
This implies }~ -- ~ 3" , and hence
F~ = - F=o , (11.14)
where ~=~ (cf. Travis & Matsushima 1973). The equation for ~ is similar.
But there remains the problem of finding an approximate equation determining <18> .
Guidance may be found by attempting to rederive an equation of the type (Ii.14)
directly from the integral relation (11.11).
The approximation (11.14) is equivalent to replacing the kernel ~ (so-s) =
E-~(I 5o-5 I) in (ii.ii) by the simpler function
"K°CSo-S) = ~ I , ~ ' I ' ( - b l s o - s l ) ( n . 1 5 )
with b - a. Equation (ii.I) might therefore be approximated in a similar manner.
But how dues one best choose b? Equation (11,II) may be rewritten
I , F0(~) = }('oCS.-s) ~'(s.)aso ~ -X.C~o s)]~(s°)aSo . ,mm
- FJ °~ cs) -r F~ c'l (s) , 01.1~)
w h e r e
F~o(S) , 0 .< s .< s, .-~ C~) =
0 • $ < 0 , S.> S,
(11.17)
50
The limits of integration have formally been written as ~ ~ , and are meant to
denote positions well into the bounding stable regions where ~ is small. Obvious
adjustments must be made when two convection zones are close together, or if the
domain of integration includes the central regions of the star.
It is clear that b is best chosen in such a way as to minimize the magnitude
of ~o). This problem is of a kind that has been encountered in radiative
transfer theory (Monaghan 1970) and statistical mechanics (e.g. Barker &
Henderson 1976) and its solution depends on the features of (Ii.Ii) one wishes to
represent most accurately. Here, an approximation will be sought that roughly
represents the solution when the scale of variation [r of <o is not a great
deal less than ~ ; to find a representation approximately valid for all scales
~r would entail an analysis of the equations that determine Fco . Thus ~(5o)
is replaced by its Taylor series about s, and the leading terms of the expansion
of Y$'~ so generated are made to vanish. The first two terms are automatically
zero, and the third vanishes provided 5 =~. This result differs somewhat
from the value obtained from the Eddington approximation, which is a representation
that appears to be good at both extremes of ~r , at least for radiative transfer.
If equation (ii.I) is treated similarly one obtains
6~ ~> _ ~#> = _~ , (ii.18)
where b, which is calculated as before but now with ~Cc5) = Icos~7[5 , is
given by
If equations (11.2) and (11.3) are used to determine ~ and p~ , this value
must also replace a in (11.14) and the analagous equation for ~.
The differential equations determining the mean structure of the star, with
this approximation to nonlocal mixing--length theory, is of order five higher than
when local theory is used. Computing time is therefore increased. However the
singular points at the edges of the convection zone discussed in ~ 9, and the
numerical difficulties associated with them, are no longer present. Equation
(II.14), its analogue for ?~ and equation (11.18) should be solved subject to
the boundary conditions ~ -+ O, T~ -~ O and ?> -*~ as 5 -~ ~:
51
Comments
The factor of about 5 by which the values of b obtained from the two kernels
differ emphasizes the different roles played by the mixing length. Fluid crossing
the midplane of a convective eddy of diameter ~ is likely to have risen vertically
by perhaps about half the radius, which is only about ~ the mean-free-path of a
fluid element. The mean-free-path and the element or eddy diameter have been
arbitrarily set equal, but perhaps a factor of order unity should have been
introduced between them. Consequently the coefficients in (11.14) and (11.18) are
parameters that, like the mixing length itself, are not determined by the theory
but await calibration by comparing theoretical models with observation.
The most obvious testing ground is the top of the solar convection zone,
where overshooting into the stable regions can be studied, However it is in the
regions of overshoot that new uncertainties seem to enter. The integral equation
(11.16) for ~ , for example, explicitly assumes that the stable regions provide
no source of convective elements: ~= O outside the convection zone. The
decceleration of elements in the stably stratified regions is represented by the
averaging of ~ , but this does not adequately account for the possible
oscillation of elements and the generation of waves. Negative values of ~ would
he required, and Spiegel (1963) has suggested replacing ~ in the formula (6.11)
for ~o by its real part, presumably to account for the damping of those waves.
However, that does not account for possible propagation of energy by the waves,
and subsequent dissipation far from the site of generation. A more careful
analysis of the coupling between the convection and the waves must be undertaken
before one can have confidence in the procedure.
Calibration of nonlocal theories is at present in an unsatisfactory state.
Attempts are made to construct model solar atmospheres and to compare overshoot
velocities or limb darkening with observation, adjusting parameters where
necessary (e.g. Ulrich 1970b; Travis & Matsushima 1973; Nordlund 1974), with some
diversity in the conclusions. Indeed Spruit (1974) has fitted the limb darkening
function using a local mixing-length theory. Moreover, though the models are
constructed with an averaged ~ , p is not always averaged and ~ is ignored
entirely. Ulrich (1976) has recently investigated the sensitivity of solar type
model atmospheres to variations in the parameter b in the kernel (Ii.15), with
< ~ , and to the addition of a multiple of a delta function to that kernel.
The refinements end generalizations discussed in ~IO can easily he
incorporated into these nonlocal procedures. Nonlocal effects of small scale
shear turbulence have been discussed by Kraichnen (1962), using rather different
arguments which suggest that at very high values of 5 there is a qualitative
change in the functional dependence of ~ on S . Scalo and Ulrich (1973) have
incorporated nuclear reactions into a nonlocal theory.
S2
12. COMPRESSIBLE CONVECTION
Compressibility plays two kinds of role. First it influences the structure
of the convective flow discussed above, and secondly it introduces new phenomena
that are not represented by the Bousslnesq approximation.
Various studies of the structure of the linear eigenfunctions of convective
motion in a compressible atmosphere have been made, though no attempt has been
made to incorporate the results into a mixing-length theory. This is not
surprising. One reason is that the mathematical difficulties are rather greater
than for Boussinesq theories, but a more fundamental reason is that it is not at
all clear how the mixing-length hypothesis should be interpreted in these
circumstances. It should be recalled, however, that the assumption ~ == H is
based on arguments concerned with the structure of eddies in a compressible
stratified medium, and that in some sense, therefore, compressibility is
acknowledged. New phenomena that must he considered include the pressure
fluctuations in the equation of state, which not only modify the structure of the
eigenfunctions but also must be included in the formula for the heat flux. Viscous
dissipation must be included in the mean energy equation. Unno assesses the
importance of such mechanisms in his contribution to this volume.
13. CONCLUDING REMARKS
The mixing-length formulae derived in ~ 4 are based on very rough order-of-
magnitude estimates. The physical argt~nents supporting them are based on
imprecisely defined models. Moreover the observational evidence for the validity
of the formula for the heat flux is very weak; the Reynolds stress is ignored in
almost all stellar structure computations.
Even if it could be ascertained that the Boussinesq formulation outlined in
this article is sound, there would still be the difficulty of extrapolating the
theory to stellar conditions where compressibility is important. The major point
at which compressible arguments are invoked is in the choice of the mixing length
It is commonly believed that effective heat carrying eddies cannot extend
over much more than a scale height H of density or pressure~ and accordingly
is taken to be of order H • The solar calibration of the heat flux is not
inconsistent with this assumption, though it is inconsistent with the conditions
under which the Boussinesq approximation is justified. However~numerical
computations of compressible convection that either solve the equations of fluid
motion directly in two dimensions (Graham 1975) or three (Graham, these proceedings)
or represent the solutions in the single-mode approximation (Toomre, Zahnj Latour
& Spiegel 1976b; Van der Borght 1975) predict large eddies extending over the
entire convection zone that show little tendency to break up into smaller scales.
But perhaps the computations do not mimic solar conditions well enough, since they
lack the thin transition zone at the top of the convective region in which the
53
temperature gradient is very strongly superadiabatic. The convection just beneath
the photosphere is observed to have a characteristic length scale comparable with
H and it is no~ unlikely that the vertical scale is similar. Whether in the
region below~ the dominant scale of motion is always of order H, or whether it is
quite differen~is hardly relevant for most purposes, because any plausible
formula for ~ implies that beneath the first scale height the temperature
gradient is very close to being adiabatic. Moreover the detailed structure of the
transition zone doesn't influence the interior significantly, so any formula for
Fc with an adjustable factor multiplying it can serve to construct models of the
sun and solar type stars that have the correct luminosity and radius. Of course
the motion in the transition zone is important for determining the photospheric
velocity field, but here mixlng-length theory is currently inadequate for making
reliable predictions.
One must not conclude from these remarks that a good convection theory is
unnecessary to stellar evolution theory for modelling solar type stars. On the
contrary, though it is only the integrated properties of the transition zone that
are required to determine the adiabat deep down, a theory is required for extra-
polating from models of the sun to other solar type stars. Andj of course, as
soon as one wishes to discuss the structure of a stellar a~nosphere, a knowledge
of the subphotospheric velocity field is essential.
The structure of convective envelopes of red giants is more sensitive to ~ ,
but calibration is difficult because there are other uncertainties in both theory
and observation. The degree of overshooting and consequent material mixing at
the edges of convective cores is also of interest, but difficult to assess
observationally.
It is common practice to argue that because the mixing-length hypothesis, in
whatever guise it is to be used, is so uncertain, it is hardly worth the trouble
to calculate its consequences accurately. Indeed it is sometimes the case that
so coarse a mesh is used for the numerical solution of the stellar structure
equations that the solutions are not resolved in the convection zone, and that
the differential equations are therefore not adequately represented by the finite
difference equations. It is also coEmon, once a formula for ~ has been decided
upon~ not to calibrate the mixing length, nor even to report precisely the formula
that was used for Fc • Though it may be true that in our present state of
knowledge there is little reason to prefer, say, a red giant model computed with
a mixing-length formula that has been carefully calibrated on the main sequence
to a model computed with a similar formula that has not, there would be greater
hope of improving our understanding of stellar convection and its influence on
stellar structure if investigations were more meticulously carried out and
reported. The prospects of an imminent supersession of mixing-length theory by a
theory that is demonstrably more reliable for describing stellar convection zones
54
is bleak. Therefore it seems worthwhile to invest some effort into trying to
improve the theory we already have. Modern sophisticated mixing-length theories
have achieved some measure of success in describing turbulent flows in the
laboratory (e.g. Launder & Spalding, 1972), so there is some hope that the effort
would not be in vain.
E.A. Spiegel and I have recently been attempting to consolidate the theory by
synthesizing the ideas that have been severally used in the past. The approach is
based on a two-fluid model, one so-called fluid being an assembly of thermals and
the other being the background environment. Entrainment, erosion and turbulent
exchange of energy and momentum are represented in the equations of motion, using
laboratory calibrations where possible. The goal is to derive a set of equations
determining the heat flux and the Reynolds stresses that would be applicable to a
sufficiently wide variety of circumstances for a meaningful calibration to be
possible. The success or failure will be reported elsewhere.
It has been the aim of this article to clarify the ideas and assumptions
behind the simple mixing-length theories used in astrophysics, and so provide a
basis for the necessary improvement and generalization to circumstances more
complicated than those for which the theory was originally formulated. Some
indication of how this might be achieved has been given. Other measures that may
have to be taken include abandoning the idea that the flow can always be
described adequately in terms of a single length scale @ This may be necessary
for a theory of semiconvection, for example. It must be realized, however, that
many attempts to improve or generalize the theory involve additional physical
mechanisms, and co~Isequently the introduction of new parameters that must he
determined by observation.
55
REFERENCE S
Baker, L. and Spiegel, E.A., 1975, J. Atmos. Sci., 32, 1909
Barker, J.A. and Henderson, D., 1976, Key. Mod. Phys., 48, 587
Biermann, L., 1932, Zs f. Ap, 5, 117
Biermann, L., 1937, A.__~N., 26__4, 361
Biermann, L., 1943, Zs f. Ap, 2__2, 244
Biermann, L., 1948a, Zs f. Ap, 2_5, 135
Biermann, L,, 1948b, FIAT Review of German Scienqe (Astr. Ap Costa.), 161
BShm, K.-H. & StSckl, E., 1967, Zs f. Ap, 6__6, 487
B~hm-Vitense, E., 1958, Zs f. Ap, 46, 108
Boussinesq, J., 1877, M~n. div. savants Acad. Sci. Inst. France, 2__3, No I
Boussinesq, J., 1903, Th~orie analytique de la chaleur, Tome II (Paris, Gauthier-
Villars)
Chandrasekhar, S., 1961, Hydrodynamic and hYdrom@gnetic stability, (Oxford Univ.
Press)
Dearborn, D.S.P. and Eggleton, P.P., 1976, QJRAS, 1__7, 448
Faulkner, J., Griffiths, K. and Hoyle, F., 1965, MNRAS, 12__99, 363
Gabriel, M., Scuflaire, R., Noels, A. and Boury, A., 1974, Bull. Acad. roy
Belgique, C]. Sci, 6__0, 866
Cough, D.O., 1969, J. Atmos. Sci., 2__6, 448
Cough, D.O., 1977, Ap J., in press
Cough, D.O. and Weiss, N.0., 1976, MNRAS, !76, 589
Graham, E., 1975, JFM, 70, 689
Henyey, L., Vardya, M.S. and Bodenheimer, P., 1965, Ap J., 142, 841
Hofmeister, E. & Weigert, A., 1964, Zs f. Ap, 59, 119
Kraichnan, R.H., 1962, Phys. Fluids, 5, 1374
Latour, J., Spiegel, E.A., Toomre, J. and Zahn, J.-P., 1976a, Ap J.., 207, 233
Launder, B.E. and Spalding, D.B., 1972, Mathematical models of turbulence
(Academic Press)
Maeder, A., 1975, Astr. and Ap, 40, 303
Malkus, W.V.R., 1964, Ceophys. Fluid D~amics (Woods Hole Oceanographic
Institution) _I, i
Monaghan, J.J., 1970, MNRAS, 148, 353
Nordlund, ~., 1974, Astr. and Ap, 32, 407
Nordlund, ~., 1976, Astr. and Ap, 5_~0, 23
Ogura, Y. and Phillips, N.A., 1962, J. Atmos. Sci., 1__9, 173
~pik, E.J., 1938, Publ. Obs. astr. Univ. Tartu, 3__0, No 3
0pik, E.J., 1950, MNRAS, IIO, 559
Prandtl, L., 1925, Zs f. angew. Math. Mech., 5, 136
Prandtl, L., 1926, Verhandl. II intern. Kongr. tech. Mech., ZSrich, p 62
56
Prandtl, L., 1932, B eitr. z. Phys. d. frelen Arm., 19, 188
Prandtl, L., 1952, Essentials of Fluid dynamics, (London, Blackie)
Rossby, }{.T., 1969, JFM, 36, 309
Scalo, J.M. and Ulrich, R.K., 1973, A p J., 183, 151
Schmidt, W. 1917, Sitzun~sbgr. ~Kais. Ak. d. Wiss. We!n), Abt !!a, 126, 757
Shaviv, G. and Salpeter, E.E., 1973, Ap J., 184, 191
Schwarzschild, M., 1961, A2 J., 134 , i
Siedentopf, H., 1933a, A.N., 247, 297
Siedentopf, H., 19335, A.N., 249, 53
Siedentopf, H., 1935, A.N., 255, 157
Sonm~erville, R.C. and Lipps, F.B., 1973, J. Atmos. Sci., 30, 590
Spiegel, E.A. 1963, Ap J., 138, 216.
Spiegel, E.A., 1971, Ann. Re v. A. AP, 9, 323
Spiegel, E.A. and Veronis, G., 1960, Ap J., 131, 442 (correction: 135, 655)
Spruit, H.C., 1974, Solar Phys., 34, 277
Stellingwerf, R.F., 1976, Ap J., 206, 543
Taylor, G.I., 1915, Phil. Trans., A, 215, I
Taylor, G.I., 1932, Proe. Roy. Soc., A, 135, 685
Toomre, J., Zahn, J.-P., Latour, J. and Spiegel, E.A., 1976b, Ap J., 207, 545
Travis, L.D. and Matsushima, S., 1971, Scientiflc Report No 024, (Penn. State
Univ., Astr. Dept)
T ravis, L.D. and Matsushima, S., 1973, Ap J., 180, 975
Turner, J.S., 1973, Buoyancy effects in fluids, (Cambridge Univ. Press)
Ulrich, R.K., 1970a, Ap Sp. Sci., --7, 71
Ulrich, R.K., 1970b, Ap Sp. Sci., 7, 183
Ulrich, R.K., 1976, Ap J., 207_, 564
Unno, W., 1967, Publ. Astr. Soc. Jap~j 19, 140
Unno, W. and Spiegel, E.A., 1966, Publ. As tr. Soc. Japan, 18, 85
Van der Borght, R., 1975, MNRAS, 173, 85
Vitense, E., 1953, Zs f. Ap, 32, 135
yon K~rm~n, T., 1930, Proc. III inter. Contr. App. Mech. ~ Stockholm~ _I, 85
ON TAKING MIXING-LENGTH THEORY SERIOUSLY
D. O. Gough
Institute of Astronomy and Department of Applied Mathematics and Theoretical Physics, University of Cambridge
and
E. A. Splegel
Astronomy Department, Columbia University New York 10027, U.S.A.
There has been progress in convection theory in the past decade, mainly in
the problem of mild convection. Yet, we are still not able to cope with vigorous
convection such as we face in the envelopes of late-type stars. Most astrophysic-
ists therefore use mixing-length theory and get on with calculating their models.
As this sltuatlonmay continue for a while, it may be a good thing to consider
what mlxing-length theory really is and to see whether it can be taken seriously as
a physical model for stellar convection.
Different authors mean different things when they speak of mixlng-length
theory. Here, we interpret the theory in terms of the specific model in which a
star is composed of a background fluid through which discrete, well-deflned parcels
of fluid move. These parcels may be thought of as quasiparticles whose number
density is sufficiently high that they constitute a second fluid permeating the
background fluid. The convective model is therefore a two-fluld model loosely
resembling the composite of radiation and matter familiar in astrophysics, except
that the quasipartlcle fluid is more complicated than the photon gas.
In applying this model we must write down equations of motion for the
quaeiparticles. We have to specify the nature of the quaslpartlcles, and most
people, with varying degrees of explicitness, treat them as idealizations of the
buoyant thermals described by meteorologists. Fortunately, there is by now some
guidance provided by laboratory data on the motion of isolated thermals in both
laminar and turbulent fluids. Turner (1963, 1973) has described these experiments
and has outlined the simple theory which has been evolved to describe them. In
particular, he assumes that the thermals are small compared with any scale heights
so that gradients across them, both inside and Just outside, may be neglected. Only
in their vertical motion do they sense the presence of the ambient temperature
gradient.
Turner's description allows for turbulent exchange of heat, momentum~ and mass
between a quasiparticle and the ambient medium. With some slight modifications of
his discussions we may derive the following set of equations governing the motion
of quasiparticles. We display these Just to give some idea of their form:
58
dm ~ lu-Ul - Elv~ , (i) d--~ =~ i
du d m ~ = g(m-m) + ~ (m U)
+ P0 zllU-U-I (~-u)-&~(u-U) , (2)
dtd-~h = w d~_p dz ~ EllU-UI +-- q](h-6) , (3)
dx (~)
Here m, x = (x,y,z), ~ = (u,v,w) and h are the mass, position, velocity and
specific enthalpy of a thermal, p, ~, 6 and ~ are the local means of density,
pressure, enthalpy and velocity of the ambient medium at x. E 1 and E are cross
sections for entrainment and erosion (both of the order of the geometrical cross
section of the thermal), X is the ambient turbulent velocity at x, m is
ambient mass displaced by thermal, ~ is the acceleration of gravity corrected for
the hydrodynamic mass of the thermal, q-i is a thermal decay time allowing for
radiative and turbulent exchanges, and ~-i is a similar viscous decay time.
Evidently these formulae must contain some fudge factors to be obtained by
comparison with measurements, be they experimental, meteorological, or astrophysical.
In astrophysical treatments of convection many of the effects modelled in these
equations have been included. Turbulent exchange of momentum between fluid elements
and the ambient medium was included in the early theories (e.g. Prandtl 1932,
Biermann 1932, Siedentopf 1933) and ~pik (1950) allowed for turbulent exchange of
heat. Ulrich (197Oa,b) has adopted the formulation of Morton, Taylor, and Turner
(1956) in his studies. However, when astrophysicists use these equations of motion
they generally replace them by algebraic equations; that is they essentially replace
d/dr by w/Z where % is a length to be specified. This gives rise to the usual
local mixing-length treatment. Sometimes, some or all of these algebraic equations
are averaged over height with some arbitrary weigh= function to produce a nonlocal
extension of the theory (e.g. Ulrich 1976).
Such reductions of the dynamical equations for thermals have not been favoured
in the meteorological literature. Certainly, they are not suitable for use by
anyone interested in studying the interaction of stellar pulsations with convection.
An alternative procedure, first attempted by Priestley (1953, 1954, 1959) for
hydrostatic convective layers, is to solve the differential equations and use them
together with some hypotheses about the distribution of initial conditions of
quasiparticlea to compute the heat flux. This has also been attempted for linear
pulsation theory (Gough 1977). But in both instances one has to build in some
information about the number density of each kind of quasipartiele at each height,
generally by specifying creation rates. This becomes quite an undertaking for the
nonlinear pulsation problem and even the formulation of the calculation has not
been agreed upon. The manner of incorporating the dynamical equations into the
convection theory thus poses a major difficulty in applying this kind of model. As
we have hinted, it requires a prescription of the number of quasipartlcles for each
value of the parameters, and this distribution must be specified in a way that is
compatible with the dynamics.
Formulated in this way, the model resembles kinetic theory and, in an
attempt to capitalize on this, a transport equation was written down as if the
quasiparticles satisfied Hamiltonian dynamics (Spiegel 1963). Deviations from this
ideal behaviour were compensated for by introducing a source term in the transport
equation. Amodlfieatlon was suggested by Castor (unpublished manuscript) who
renounced the simple form provided by Hamiltonian dynamics and wrote a continuity
equation for the one-particle distribution in the phase space of the quaslparticles.
The phase space was enlarged over the usual six dimensional ~-space of position
and velocity to include the temperature of a single quasiparticle as a phase
parameter. In doing this one loses the volume-preserving feature of the phase fluid,
which raises questions about the meaning of the approach, especially when one
attempts coordinate transformations. Yet it seems to us a useful thing to write a
continuity equation for the phase space density of quasiparticles and, for the
present, ignore some of the niceties. We modify Castor's choice and use specific
enthalpy (~ather than temperature) of the quasiparticle as a variable and add an
additional phase parameter~ the quasiparticlemass. We have then an eight-
dimensional phase space in which the density of representative points is
f(x,~,h,m;t). The continuity equation satisfied by f is:
~f + ~ • ~ • ~ (~f)+ ~ (mf)= ~(-~-) ~-~ ~ (xif) + ~ (uif) + ~ ~m coll
(5)
where dots denote differentiation with respect to time and a collision term has
been introduced. The collision term is supposed to express the turbulent
destruction and creation of quasiparticles; through this term we may represent our
crude understanding of turbulence. It seems inadvisable to use a form llke the
Boltzmann collision integral since the interactions are probably not dominated by
two-body collisions. Instead, it is perhaps best to include a loss term like -f/~
to represent the destruction of quasiparticles, where T is a time required for
the quasiparticle to travel its own diameter. This term then embodies a basic idea
of mixlng-length theory. But what about the creation term?
The generation of new quasiparticles is not really understood, and to quantify
it, a specific model is needed. Often one imagines that quasiparticles grow from
small fluctuations because of the instability mechanism. However, in a turbulent
medium the fluctuations are not small. In the quasiparticle picture we think of
the new quasiparticles as decay products of the old ones to represent the turbulent
80
cascade process. Their development through instability is already included in the
dynamical equations.
The problem, of course, is that we do not know much about the decay products
following the destruction of the quasiparticles, and this is the first clear
difficulty that must be faced in completing the theory. It is becoming increasingly
clear in turbulence theory that the turbulent spectrum is strongly influenced by
the number of decay products in the breakup of a quasiparticle, and possible models
have been discussed which may provide guidance (cf. Frisch 1977). We shall not offer
any preferences in the present discussion. Our aim instead is to bring out the
points at which physical assumptions are needed to make the mixing-length model
cogent.
Once a form for (Sf/St)coll is decided, the remaining difficulties are
computational. This is not to belittle them; they are fierce and a moderately
reasonable approximation scheme is not immediately apparent. The computational
methods depend on the way one uses equation (5), and that has to be discussed next.
We believe that it would be sensible to try to construct moment equations
from equation (5). For example, multiplication of equation (5) by m followed by
integration over d5~ = d3u dm dh gives
3P m [/ ~ 8f [ --+v.F =|,~-f, mdsn- j~fds~ • (6) 3t "m #x--, coll
where
f Pm = J mfd5~
is the mass density of the gas of quasiparticles and
(7)
r (8) F m = J m-ufd5f~
is the mass flux of the gas. The last term on the right of equation (6)
represents the mass exchanged with the background fluid by entrainment and erosion.
One may compute other moment equations, but we shall not do that here. We
should however mention that the number of moments goes up faster than in ordinary
kinetic theory or transfer theory. Quantities like fluxes of enthalpy and mechanical energy arise and there is the all-important turbulent stress tensor:
Ti j = f mui~jfd5 ~ (9)
Once a hierarchy of moment equations has been written down [a skeleton version
has recently been studied by Stellingwerf (private connnunication)] the problem of
closing it off must be faced. A possible approach, resembling the moment method,
is to decide on an approximation for f and use that guess, for that is all it is
at present, to get approximate expressions for the higher moments in terms of the
lower moments. Once this is done, a last problem of principle remains. One must
81
still decide how to describe that part of the fluid that does not move in
quasipartieles. Should this be thought of as a zero fluctuation condensate of the
quasiparticle gas? Or should one describe the background as an ordinary laminar
fluid acted on by the stresses and so forth generated by the quasiparticles? The
latter course seems decidedly preferable to us, especially for treating penetrative
convection, where most of the matter may be in the background fluid. If that is
accepted, the next course of action is to write the dynamical equations for the
background fluid including the mass, momentum, and energy sources indicated by the
moment equations of the quaslparticle gas. Then, in principle, one has a complete
set of equations for the dynamics of a star with turbulent convection, but for the
present without rotation or magnetic field.
Now we have tO come to the kay question: is this what has to be done or are
we to be saved from it by a 'real theory' starting from the full fluid equations?
We think that the inediate prospects for a sound fluid dynamical approach are not
bright. And even the approximations to such an approach as are on the horizon
promise to be far more demanding computationally than the scheme summarized here.
At present, untold computing hours are being lavished on stellar models using
a mlxlng-length theory whose reliability is untested off the main sequence. It
seems to us that if this situation is to continue it would be well to take the
mixlng-length theory seriously. In particular, one should be clear on the turbulence
model one is using and not simply alter the standard formulae according to whim,
as is often done in the literature. We are not saying that alternative general
structures to that given here may not be preferable. Nor are the procedures we
outline meant to be rigid. The present version of a mixing-length procedure is a
synthesis of ingredients existing in the literature and we have done no more than
put it together to show that a cogent discussion of mlxing-length theory is
possible. We have especially tried to show where the physics is missing and to
indicate a framework for including it. The resulting equations are in principle
capable of dealing with many of the problems of current interest, such as the
nonlinear interaction of pulsation and convection. Those coping with such questiOns
are all too familiar with many of the problems we have raised. But they, as we
ourselves, have sometimes dealt with these problems piecemeal and have not tried to
put them into context by working with a concrete general model. We are claiming
here that the specification of such a model is possible and desirable and that if
one can be constructed, stellar convection theory may begin to seem more rational.
We thank the SRC and the NSF for supporting our work on this subject.
62
REFERENCES
Biermann, L. 1932, Zs f. Ap, 5, 117
Frlsch, U. 1977, these proceedings
Gough, D. O. 1977, Ap J., ~!~, 196
Morton, B. R., Taylor, G. I. and Turner, J. S. 1956, P~0q. Roy. Soc., A23~, 1
Oplk, E. J. 1950, MNRAS, !~, 559
Prandtl, L. 1932, Beitr. z. Phys. d. Freien Arm., !9, 188
Priestley, C. H. B. 1953, Austr. J. Phxs., 6, 279
Priestley, C. H. B. 1954, Austr. J. Phys., Z, 202
Priestley, c. H. B. 1959~ Turbulent transfer in the lower atmosphere (University
of Chicago Press)
Siedentopf~ H.
Spiegel, E. A.
Turner, J. S.
Turner, J. S.
Ulrich, R. K.
Ulrich, R. K.
Ulrich, R. K.
1933, A toN', 247, 297
1963, Ap J., !~, 216
1963, J. Fluid Mech., !~, 1
1973, Buoyancy effects in fluids (Cambridge University Press)
1970a, Ap Sp. Sci., !, 71
1970b, Ap Sp. Sci., !, 183
1976, AP a . , ~ Z , 56~
OBSERVATIONS BEARING ON THE THEORY OF STELLAR CONVECTION II
Erika B~hm-Vitense
University of Washington, Seattle, Wa.,U.S.A.
and
University of C~ttingen
Germany
SUMMARY
It is shown that the best way to get information about efficiency of convective
energy transport in the hydrogen convection zones in stars other than the sun is pre-
sently contained in continuous energy distributions of A and F stars. Scans show that
convective energy transport must be much more efficient than thought hitherto. The scans
indicate that rapid rotation enhances convective energy transport.
Figure I lists all the observations, we could think of, which are influenced by the
outer convection zone and which can therefore give us information for our convection
theory.
I. STELLAR STRUCTURE AND EVOLUTION
Already Hoyle and Sehwarzsehild (1955) pointed out that with increasing efficiency
of convection the radius of a star shrinks.
If we describe the efficiency of convection by the one parameter £, the characteris-
tic length or the mixing length of the convective flow, and if we further assume that
the ratio £/H = ~ is depth independent, where H is the pressure scale height - an assump-
tion that may not be valid, see for instance B~hm (1958), Schwarzschild (1974) - then
this one parameter ~ can in principle be determined from the observed radius of the star.
Unfortunately, the radius depends also on the initial abundances Z, Y and CNO (Iben 1963)
as well as on the age of the star. From Iben's study we see that for given T a ~Iog L e
0.3 is obtained for A log ~ ~ 0.5, but one also finds A log L % 0.3 for a change in
abundances Y and Z by a factor of 2. A similar change in log L is obtained if opacities
are computed with different approximations. Different authors therefore obtain different
values of ~ from the radii of the sun and the stars, ranging from ~ = 0.4 to e = 2.
From the position of the main sequence we can therefore presently only conclude that
log e = 0 ~ 0.5.
l m
ixin
g t
o &
fr
cQ
de
ep
la
ers
tim
e sc
ale
o
f e
vo
luti
on
on
M.S
. ~
- (b
lue
stragElers]
Li,
Be,
B
qe-
C,
N,
O,
J H
e
wh
ite
dw
arf
abu
nd
ance
s
CONVECTION
stratification i~
deeper l
ayers
stratification
(K o
f a
dia
ba
t)
in
surf
ac
e
lay
ers
rad
ius
%-
shap
e o
f th
e K
- tu
rn o
ff
tra
ck
po
siti
on
of
red
g
ian
t b
ran
ch
~-
Tcenter
white dwarfs
(conli.g time) ~
F v
(0
4-
on
set
of
con
vec
tio
n ~
-
B-V
gap
on
M.S
red
b
ou
nd
ary
of
cep
hei
d
stri
p
Tce
nte
r ~f
th
e su
n
ne
utr
ino
s)
sola
r v
elo
cit
y
fie
ld
dam
pin
g of
sola
r o
scil
lati
on
s (G
old
reic
h
+ K
eele
y)
ce
nte
r to
ll
mb
v
ari
ati
on
o
f su
n
ve
loc
ity
~
--
.~
field
T
4--
chro
mo
sph
eric
heating
I em
issi
on
CoII,Mgll
absorption
He 10830
stellar wind
(Vro
t)
curve of growth
osc
illa
tio
ns
--P
~'~
of sun
Vturb
or grad W
solar intensity
fluctuations
+ F,
(T)
->line profiles
llne intensities
va
ria
bil
ity
o
f red
sup
erg
ian
ts
"~
(Se
hw
arz
sch
ild
)
FIGORE !
Ways in wh
ich the effects of t
he outer convection zo
nes ap
pear
in astronomical observations and which
therefor~ can
give us
information ab
out convection theory.
65
The same difficulties apply to studies of the evolutionary tracks which again
depend on Z, Y and = and also C, N, 0 abundances. In fact one would like to know
in order to find the other parameters°
An additional difficulty is encountered when studying H.Ro diagrams:In order to
compare theoretical and observed tracks we have to relate the theoretical parameters
T e and g to the observed ones: color and m or M . The relation between the color and v v
Te, g depends not only on Z, and the CNO abundances, but also on the influence of con-
vection on the observed energy distribution, i.e. the changes of the T (T) relation in
the surface layers. This could possibly be important on the whole evolutionary tracks
for stars of spectral types F and later.
There are several indications that the colors are indeed influenced by surface
convection :
(a) Theoretical color Computations for giants in radiative equilibrium giving
U - B as a function of Z for a given value of B -V show a maximum of U - B for
Z ~ 0o3 ° Z (B~hm-Vitense and Szkody 1974). This maximum is not well seen in the oh- ® servations (Wallerstein et al. 1966).
(b) For intermediate Z values we find a discrepancy between the observed and com-
puted M 4 = (B - V) - (V - r) index (Mannery et al. 1968) for giants in radiative equi-
librium (B~hm-Vitense and Szkody 1974).
(c) Canterna (1976) finds similar problems ~or his metallicity index C - M.
All these discrepancies show that for a given energy distribution in the red there
is less energy observed in the blue and violet region than predicted by the radiative
equilibrium models. Scaled solar Bilderberg models, i.e. models with a decreased tem-
perature gradient in the deeper layers, would ease the problem. It might be emphasized
however, that the discrepancies only exist for giants with 0.I Ze ~ Z < Ze,not for very
metal poor stars. It could,therefore, also be related to an error in the line blanke-
ting computation, for instance a wrong value for the microturbulence.
While the decreased ultraviolet flux is in itself an interesting problem and might
well tell us something about convective overshoot in stars with different Z, it also
tells us that we have to study and understand this convective overshoot before we
can deduce the "observed" evolutionary tracks in the L, Te diagram and proceed to de-
re,mine Z, Y, C, N, O, the age t, and finally e .
We then conclude that the stellar evolution computations require a knowledge of
rather than provide one.
66
II. SURFACE PHENOMENA
A. DIRECT MEASUREMENT OF VELOCITY FIELDS
The most direct observations of convection are the velocity fields which for stars
can only be observed by broadened line profiles. If the velocity field changes over one
mean free path of a photon it will lead to a broadening of the line absorption coeffi-
cient thereby increasing the line width and the equivalent width. If the velocity chan-
ges only over much larger scales it will lead to a broadening of the intensity profile
only and not change the equivalent widths of the lines. According to the influence on
the equivalent width we describe the two effects as micro or macro-turbulence (or possi-
bly rotation). However, we have to keep in mind that an increase in the equivalent width
could also be due to other effects than small scale velocity fields. Also there is no
reason why we should have only small scale or large scale turbulence, in fact we expect
a continuous turbulence spectrum of all scales. (For isotropictu~ulence see the dis-
cussions by H. and U. Friseh 1975, by G. Traving 1975 and by E. Sedlmayr 1975). There-
fore, we have to be careful with the interpretation of the socalled microturhulence.
If we believe that the microturbulence as determined from the equivalent widths by means
of curve of growth analysis really is a measure for the small scale velocity field,
which could be either due to turbulence or to laminar velocity gradients, then we find
the picture first compiled by Wright 1955, see Figure 2.
Generally the microturbulence increases for decreasing densities in stellar atmos-
pheres as is expected for convective velocities: Since the convective flux F c Q. V 3
p = density, V = convective velocity, we expect for a given F ~ F, where F is the total C
energy flux, that V = p-I/3. We do however not observe the decreas£of the microturbu-
fence for hot stars for which F << F is expected. c
A compilation of more recent data especially for giants has been given and dis-
cussed by Reimers (|976).
For supergiants Rosendhal (1970) and van Paradijs (1973) have more. recent studies,
qualitatively confirming the results shown by Wright (]955).
For main sequence stars Baschek and Reimers (1969) did a detailed investigation
especially in order to study possible differences between A m and normal A stars. Chaffee
(1970) extended the study to cooler stars. Andersen (1973) repeated the determination
with the new Fel oscillator strengths (Garz and Kock 1969). The result is shown in Fi-
gure 3. An increase in Vturb is found when going from late F to early F stars in quan-
titative agreement with convective velocities obtained for ~ = |. However, for earlier
stars the expected decrease in velocities is not found. For later stars an increase in
Vturb is suggested - though not observed for the sun - also in contradiction to expec-
tations from convection theory.
67
I / , - -
1I - -
111 - -
I V - -
V - -
X X~&A =x~
A ~
O OO
AA & •
X
O 0 O •
Oo
• & 4,
1
o@
o o
@
• •
o
@
I I I i I I ! I ! 1 I I 05 B0 B2 B5 A0 A5 F 0 F 5 GO G 5 K 0 K 5 M 0
Veloci~ (km./~0
• 1-2
• 2-3
o 4-5
A 5 - 7
• 7 - 1 0
A 10-15
15-20
X 20-30
X > 3 0
FIGURE 2:
Microturbulent velocities for stars of different spectral types and luminosity classes as given by Wright (1955).
68
70
At• A ~ •
Q &
i % o16
! ...... I ......... I ..... I I f
O O ~ OO
• • ® @_..
I 1 I ,,I,, I 1 t 0.7 0.8 0,9 :~.o
FIGURE 3 :
Microturbulent velocities for main sequence stars of different effective temperatures according to Andersen (1973). The open symbols refer to previous investigations by Baschek and Reimers (1969) and Chaffee (1970) using old oscillator strengths, the filled symbols to Anderson's determination with the Garz and Kock (1969) oscillator strengths. V, ~ refer to A m stars.
69
Another puzzle is provided by the measurements of Allen and G~eenstein (1960) and
Wallersteln (1962) showing that in Pop. II dwarfs Vturb ~ 0, a result which is certain-
ly not expected from convection theory , but these studies will have to be repeated with
new Fel oscillator strengths in order to be sure. Reimers (1976) attributes the increase
of Vturb for late type stars to possible measuring errors. Baschek and Reimers (1969)
suggest that for the A stars the high Vturb is caused by a large number of pulsation
modes similar to the ones studied recently by Lucy (1976) for e Cyg.
In short, measured values of Vturb sometimes do and sometimes do not agree even
qualitatively with exceptations from convection theory, indicating either that our
expectations are sometimes quite wrong, or more likely, that the measured microturbu-
lence has quite often nothing to do with convective velocities. How then do we know
when they do and when they do not ?
Even more difficult is the judgement of the observed depth dependence of the micro-
turbulence (Huang and Struve 1952, Rosendahl 1970). In general there does not seem to
be any observed contradiction to the assumption that for other stars the depth depen-
dence of Vturb is similar to the one observed for the sun.
B. INDIRECT MEASUREMENTS OF VELOCITY FIELDS BY MEANS OF ATMOSPHERES
AND CORONAE
(a) Chromospheric emission :
It is general belief that for the formation of classical solar type chromospheres
a velocity field is a necessary condition. We do not know whether it is also a suffi-
cient condition. The different strengths and the age dependence of the Call K 2
emissions for otherwise similar stars show the importance of a second parameter, pro-
bably the magnetic field. The absence of chromospheric emission may therefore not be
proof of the absence of a velocity field, only, if for a given spectral type we never
find chromospheric emission, I would believe this to indicate the absence of efficient
convection.
Chromospheres in cooler stars are seen by means of CaII K 2 and MgII h and k
emission or by the ]0830 line of HeI in absorption. O.C. Wilson (1976) has made ex-
tensive studies of the CaII K 2 emission in G and K stars. His results are shown in
Figure 4. In the same graph I have also plotted the bluest stars that have been ob-
served to show Mgll emission and I0830 HeI absorption according to Zirin (1975). There
appears to be a line in the HR diagram on the blue side of which the chromospheric ac-
tivity seems to stop. In the low luminosity part is not quite clear to me whether the
Call K 2 emission stops for somewhat more red stars than the Mgll emission. If so, it
~uld be an effect of the larger abundance and ionization energy of Mgll. If they
stop at the same time itshould indicate a cause different from ionization.
In the same graph I have also plotted the reddest Pop. I Cepheids according to
Sandage and Tammann (1974).
70
:E
- 6
- 4
- 2
0
2
4
GO ' I ' ' ' I ~ ' ' l l l l l ' I ' ' " ............... i ......... ' I
• • o " °" ° ~ ° . " "" ":" " -
• " " ° ".i ea=e/7 ; ° , . . . . . . - / -
o ° o . . , . . . = ~ .
/ . : . x o . . . . ~ :. . . . . . '~4,~n,:.~.:
/ - : ~ ~ . . . .
. , . - " X ~ . " , ' . : : . , . . . , , , ,
'2.00 0.40 0.80 1.20 1.60 B-V
+ S T A R S W i T H M g l l E M I S S I O N x B L U E S T S T A R S W I T H H e A B S O R P T I O N L I N E S
• R E D C E P H E I D S
FIGURE 4:
The color magnitude diagram for G and K stars with CalI K 2 emission (ooee) taken from Wilson (1976). We have added an additional point for Procyon (FOIV). We have also added ++ for the bluest stars observed to have MgII h and k emission and xx for the bluest stars showing He absorption lines (Zirin 1975). Also shown are the positions for Cephelds close to the red boundary of the instability strip. The straight line roughly marks the boundary for stars with or without observed signs of solar type chromospheres.
For the higher luminosities the red boundary of the instability strip appears to
agree roughly with the boundary line for Call and Mgll emission. There is of course
Call K 2 emission observed for some Cepheids and also for @ Cyg but this is supposedly
due to shockwaves created by pulsation. The agreement of these two boundary lines is
not surprising since we see no other reason fo= the breakdown of the pulsational in-
stability but the onset of efficient convection which reduces Frad, thereby reducing
the driving force.
Since the theoretical line for the onset of efficient convective energy transport
depends on ~ we can check which e should be chosen to make the theoretical and observed
boundary lines agree. Assuming that ~ is the same for all stars in this region - an
assumption which has been criticized by Schwarzsehild (1974) - we found agreement for
=0.5, 2, 3 or 5. This can be seen from Figure 5 (B~hm-Vitense and Nelson 1976).
(If e should not be same for all stars, then £ = R 2 appears to be also a possibility
except for la supergiants.)
As already noted earlier, the extension of the instability strip boundary reaches
the main sequence at about FO or B - V ~ 0.3. So F stars would be expected to have
efficient convection, while A stars would not, but they appear in the extension of
the instability strip, as mentioned earlier.
(h) Stellar rotation:
It has been suggested that stellar rotation will be braked by means of the stellar
wind and the magnetic field. (See for instance Kippenhahn ]972; further references are
given there.) Since stellar winds for later type stars are due to the presence of co-
ronae which are linked to stellar convection zones,the decrease of the rotational ve-
locities for the F stars may also mark the onset of efflc~ant convection. In Figure 6
(B~hm-Vitense and canterna 1974) we show the dependence oi the rotational velocities
on B - V for main sequence stars for different clusters. For field stars there is a
drop in v sin i for B - V ~ .25~ a second drop seems to appear for B - V ~ 0.4. r
For some of the clusters the drop at B - V = 0.40 is the more pronounced one. Apparant-
ly the final drop in v sin i does not occur where we expect convection to set in but r
only for cooler stars. It seems to occur at temperatures where the hydrogen and helium
convection zones merge.
C. TEMPERATURE INHOMOGENEITIES
Convective temperature inhomogeneities are expected to be largest for F stars.
We might look for evidence in the integrated light.
In ~igure 7 we compare continuum energy distribution of stars whose surface is
assumed to be half covered with an atmosphere with T e ~ ~ 100 ° and half with T e = 8340°.
The average would be 7500 °. The resulting energy distrib ion would appear as that of
an atmosphere with T = 7750 °. The inhomogeneous atmos ~ere~ therefore~ would re- e
72
I I ' H I I, I I ' I ~ i
0 • J- , H~z
5
c:) o,°~, //o "(~-)
..o/ I T ,I , I
7 0 0 0 6 0 0 0 5 0 0 0
,°, °°*' / /
• .00a L 0
| , ~ l . - - ~ ! 6 0 0 0 5000 ?000
FIGURE 5 :
Taken from B6hm-Vitense and Nelson (1976) this figure shows a comparison of theoretical and observed (---- or - - ) boundary lines for efficient
convection in the luminosity T e diagram (T e = T'-400 +_ 150~. Different values for the ratio of the mixing length £ to the pressure scale height H were
assumed, symbols are given in the graph. To obtain the points in Figure a we required that £ -- < 21 D for a con-
sistent theory, where D is the thickness of the unstable layer. The theoreti- cal ( .... ) and observed boundary lines agree roughly for £ = H.
For the points in Figure b we assumed that ~ < DF, where D F is the ex- tent of the zone where F c > 10 -2 - F, where F is the total flux and F c is the convective flux. No agreement between observed and theoretical boundary line can be found for any value of </H.
In Figure c we required i= D I at the boundary, where D 1 is the extent of layer for which V - V' > O. I, with the usual notation. No agreement be- tween theoretical and observed boundary can be found.
In Figure d we required £ = D at the boundary, where D m is the extenc
of the layer for which V - V' = 0.~ (V - V')max, where (? - V')ma x is the maximum value of V - V' in the convection zone under consideration. No agree- ment can be found between observed and theoretical boundary lines.
For details of the derivation see B6hm-Vitense and Nelson (19761.
73
3 0 0
2 0 0
I00 ¢.l (g
3o °"
= 2 0 0 b~
I00
30 O"
2 0 0
I00
(0) o F IELD STARS
!%~', O e 0 0
"~ e • •O
(c) PRAESEPE
ee = e •
e ~ • e e ~ o
e • o e • •
• • •d l , e O N ,, ,* ~,
' (e ) PLEIADES "
• ; ; .
e
. o • •
o ~ ' = ' I " I t I I | ,
O 0 0.1 0 .2 0.5 0 .4 0 .5
J I ] I I
(b ) COMA BERENICES
e • • •
(d) HYADES-
• ..- .
"., ,,. ,- • e e • e o
• •
(f)
e
e
= PER
0
B-V
o• e
%
I ! [ ...... d ~ • I
O.I 0.2 0.3 0 .4 0.5 0.6
FIGURE 6 :
The rotational velocity for field stars and for stars in different
clusters. TWO discontinuities in the velocity distribution of field stars are
suggested, one at B - V ~ 0.25 and the other at B - V = 0.40. For the clusters
the decrease of vsin i at B - V % 0.40 is more pronounced. The figure is
based on measurements by (a) : Abt and Hunter 1962 and Slettiebak 1955; (b)
and (d): Kraft 1965; (e) : Dickens et al. 1968; (e) : Andersen et al. 1966;
(f) : Kraft 1967.
74
o
un
0~1 I
I0.0
10.5 -
II . 0 -
HOMOGENEOUS RADIATIVE T, .-' 7 6 0 0
I I I
T e ~ 8 0 0 0 (,% log Fv:+ 0 .12)
i I I I 3.6 3.7 3.8 3.9
log X
FIGURE 7 :
The continuum energy distribution of a star whose surface is half cover-
ed with an atmosphere with T e = 6800 ° and half with T e = 8340 ° is compared
with the continuum energy distribution of stars with T e = 8000 ° (shifted
by Alog F v = 0.12) and T e = 7600 ° . The energy distribution of the inhomo-
geneous star is indistinguishable from that of a homogeneous star with
T e ~ 7750 ° .
75
semble one with a temperature about 250 a higher than its actual T e
In Figure 8 (from B~hm-Vitense 1972) we compare line profiles of a homogeneous
star and an inhomogeneous one with AT = ~ ]000 °. The T e and the corresponding frac-
tions of the surface area on the inhomogeneous star were chosen in such a way that in
Hy the~lines agree for both the homogeneous and the inhomogeneous stars. The B - V
colors then turn out to agree also. One finds that, because of the larger contribution
from the hot component, the spectral lines for the inhomogeneoua star are generally
weaker by ~ ]0% than those for the homogeneous one. The effect is especially strong
for the Ca lines,which are reduced by about 30%. One is reminded of the A stars oc- m
curring in this temperature range which show weak Ca lines, however for the A m stars
the other metallic lines appear generally stronger.
D. THE INFLUENCE OF CONVECTIVE ENERGY TRANSPORT ON THE CONTINUOUS
ENERGY DISTRIBUTION
The major influence of convection on the stellar structure is due to the convec-
tive energy transport which reduces the temperature gradient. If ~is happens in visible
layers then we might be able to see changes in the continuum flux distribution. F stars
are especially promising since the instability sets in rather high regions of the at-
mosphere and we have a pronounced minimum in the continuous absorption coefficient K C
longward of X = 3647 ~, where we can see down into the atmosphere tot = 2.
In Figure 9 I have plotted temperature stratifications for various depth dependen-
ces of the radiative flux F r = F - F c. Also shown are the corresponding energy distri-
butions in the visual region for stars with T ~ 7900 °. e
The curve shifted upwards (separate scale) shows the depth dependence of the ra-
diative flux as computed with the mixing length theory of convection.
The model with the steepest decrease in F r is a scaled solar, the socalled Bilder-
berg model. In an earlier study we had found (Bbhm-Vitense 1970) that the UBV colors
of main sequence F stars could not be obtained for radiative equilibrium models but
that scaled Bilderherg models would do fine. The other models were computed in order
to see whether we could reproduce the observed energy distributions with less ex-
treme reductions of the radiative flux. Little change from radiative equilibrium F
is obtained for these models.
It is interesting to note the very sensitive dependence of the surface tempera-
tures on the depth dependence of the convective flux. A low surface temperature as
obtained for one of the models should be apparent in the line spectrum which should
possibly look generally like that of a metallic line star.
From mixing length theory we expect a rather abrupt onset of convection for
T e ~ 8000 a. When using scaled Bilderberg models for convective stars we find a color
change of A(B - V) = 0.07 (B~hm-Vitense 1971). A similar value was also obtained by
Matsushlma and Travis (1973) with their nonlocal theory of convection if they use
76
,,, .... ~ i ..... ~ ...... f, +, , , i ~ ~ " 7
,.o,~// ? .oo~// ? '~W/ ~ "°° ¢I I /,7--°? ..~,y-t,:oo~ it~oo t k oo.
, , , , , , ,4: : , t , , , , . , ,r!,:, ~ - , , , , . , , , . T . . , ~; , : 7
.<. ,&,oo~;;;~oooo ¢ A,oo-- ,X/,,o~t I x / , "" / .,[.__.J/,,oo, -Ti-"J,,oo, ~ ItD,oo,-; iJ/-,,oo, t
, I l l l ,~. I'~, 25, . . . . . , , , , , ,,,1" , f 9 ~,1", I , i l l , , , i l , , , ' l ..7,,o+jf ~,,ooo+;/~ ~ ~-, ,<>o_+'-, '~;t • ' 7 h , , - ,4/.-o • ,,~/~,,oo. t-J~,,°°, t__,Yk,,°°. :-__5/~,,°°. t " -,~--~,, ,. 9 < ~ : ~ ..... Ro:: ~ , ~ G , : - ,', , , :'9:1 O' .02 .04 ,06 .08 0 ,OZ .04 .06 .08 0 .02 .04 .06 .OB 0 .Oa .04 .06 .08 1.0
,,~,[~]
FIGURE 8 :
The line profiles of convective stars with AT e = ~ iooo ° are compared with those of a homogeneous (i.e. AT e = O) convective star and a star in radi- ative equilibrium, The inhomogenous star shows generally somewhat weaker lines especially of Calcium. The figure was taken from B6hm-Vitense 1970.
77
! o m *< t -
0 w !
0 K
16
/ °
_~~....__../ (o) • 6-
" I O ~ ! . 0 5 I O 4 , , - - P - " I . I I I I I i
5x lO "4 I0 " z 0.1 0 .67 3,5 21.1
-~ ,-~'
- ----"~-x~¢,,-- -
4 ,0" ,0~\ 10 (b) 2 I I I II I\ I m
3 ~lSJlO - 4 tO "2 0.1 0 .67 3.5 21.,
I.C ~ C _o ~
3.6 3.7 ~.e 3.9 log ),
FIGURE 9 :
In part (a) we give the temperature stratlfications for several models, in part (b) the corresponding depth dependences of the radiative flux and in part (c) the corresponding continuum energy distributions. The ooo in part (a) refer to the radiative equilibrium model for the stable layer and mixing length model for the convective layers. The corresponding depth dependence of the radiative flux is shown by the displaced curve (b) (with sparate scale). The long dashes refer to the scaled Bilderberg model. The other curves refer to trial models that were computed in order to see whether we could also ob- tain a reduction in the energy emitted Just longward of the Balmerjump with a smaller decrease of the radiative flux in layers ~ > I. This does not seem to be possible.
78
= | ( which however,leads to difficulties with the observed solar center to limb
variation). This abrupt color change should he observed as a gap in the observed B - V.
The presence of a gap at 0.2 < B - V < 0.3 for field stars was noticed by Mendoza
(1956). He also noticed that this gap is not present for Pleiades stars. Figure ]0
shows that this gap is indicated more or less pronounced in different clusters (B~hm-
Vitense and Can terna 1974), though its position changes slightly for different clusters.
We suspect that rotation my have influence on the onset of convection. If this inter-
pretation of the gap is correct, then field stars with B - V < 0.22 should be in ra-
diative equilibrium, those with B - V > 0.29 should be influenced by strong convec-
tion, leading to the color change.
Figure 11 shows the result of observations for field stars by Oke (1964), for
Hyades stars by eke and Conti (1965), and by Baschek and Oke (1965) for A m stars.
In the same Figure we compare these scans - corrected for the change of absolute
calibration (Oke and Schild 1970 and Hayes 1975) and for line absorption - with con-
tinuum energy distributions for radiative equilibrium models and for scaled Bilder-
berg models.
The right hand side of Figure 11 shows the result for the Hyades stars. Except
for the small deviations around 4000 ~ the scans show good agreement with radiative
equilibrium models (solid lines) for B - V < 0.2 and agreement with the scaled Bilder-
berg models (dashed curves)for B - V > 0.3 as we expected from the study of the colors.
The left hand side shows the results for the field stars, which display the gap
very clearly at B - V = 0.22. Unexpectedly we see the influence of convection already
for ~ Arl with B - V = 0.14.
Figure 12 shows the results for A stars. The cooler ones clearly fit convective m
models and not radiative ones,
We have made additional scanner observations of main sequence H~ades stars with
different rotational velocities (B~hm-Vitense and Johnson 1977). In Figure 13 we see
the results. Weather and instrumental problems reduced the accuracy of our Hyades
observations, but we can notice some interesting results: our bluest star is p Tau
with B - V = 0.24) i.e., at a B - V where the field stars show the gap. Unfortunately
we have only rather poor measurements for that star so our conclusions are somewhat
shaky but it seems this star shows effects of convective energy transport.
For 57 Tau we have plotted both Oke's and our 7all and spring measurements in
order to give an impression of the uncertainties in the observation which Conti and
eke estimate for their measurements to be of the order of om.02. For the Hyades our
uncertainties may be larger. We think that also for 57 Tau th~ ccnvective energy dis-
tribution fits better than the radiative one, though it is not quite conclusive.
For the field stars our scanner results are seen on the left hand side of Figure 13.
For stars with B - V ~ 0.3, all these stars clearly show the decrease in the UV as
given by scaled Bilderberg models in indicating very efficient convective energy
7 9
"°"o,° . t .... . 1
, , . , , . . . . . . . O if, , ., ' F ." .. • "..:" |2 FIELD . "" _ .
~.. , . . . . . . . ,.n 01-."41,:, ..J,M. LW.~.." ." [.,:.' m
| ~ i "1 f ~' . " " .|.
I- ": '" , ; o , L ~ . , , . ~ " : ' " " . " FIELD STARS J.M. ( 0 ) 1 l (b]" 02~ I I I IJ O ~ . | l ' I • l ,
• 0 0 .2 0 . 4 0.6 "" 0 0 .2 0 .4 0.6 8-V 8-V
I i -- x I I I .... J 0 . 8 1 , , ~' - , io I F I E L D . ' ~ -0.II" COMA ...C.
I '" :lit° el ';"II ~. . s.R " , ~ "." .
1.41____.I_-*,4.~ I " I I I 0 2 1 1 ... • , . , ( d 0 0,1 0 .2 0 .3 0 .4 " 0 0 .2 0.4 0,6
b -y B -V
"°i I , ,,,,,,,,,, , , , , . . . . . o J / , , , - - - } t = O~ HYADES . ~p,j,~ ? 0 P R A E S E P E =.:.j.",.~
':":": " "~!1 ' " "
o, , . : , . : . ' , f . , ' = o . ,~ , . , , , (e o , ," " , ('
0.2," 0 0.2 0 .4 0.6 " 0 0 .2 0 .4 0 .6 B-V B-V
= ".." .'.~ " % , 1 =, o~ .:, ".- . , , , ; . ' . . . : . ~ J ~ o., I -.-,: .-:. o°~r , " . , , - . , o~ . . . . ",, , •
' 0 0 .2 0 .4 0 ,6 0 0 ,2 0 .4 0 ,6 B - V B - V
FIGURE 10:
The two color diagrams for field stars and different star clusters. This
figure is taken from B6hm-Vitense and Canterna (1975) and is based on mea- surements by (a): Johnson and Morgan 1953; (b): Johnson et al. 1966; (c): Str~mgren and Perry 1965; (d): Johnson and Knuckles 1955; (e): Johnson |952;
(f): Johnson et al. 1962; (g): Johnson and Mitchell 1958; (h): Mitchell 1960. The gap for O.22 < B - V < 0.29 is very pronounced for field stars. It
is present in most of the clusters, though not visible for the Pleiades.
80
3.6
3.8
4.0
(.3 4 .2 +
~ 4.4 0
4.6 O4 I
4.8
5.0
5.2' 3 .6
- "e -
~ ' - [ ~ 8 7 0 0 q . h= A I , O . O I ~e( "r ~%= "~ . ~-..'~-~kq. ~ , ~-~ ~ )r "%~ ~ . K Teu. 7 7 . ~ O o o ~ i ' ~ , ° . . . . , ~ . - - : ~_ ~ . ~AT, 0,4 - ~ - . ~ , , o . ' ~ ~ ~ ' ~ ~8 ,ooo - ~ . ~ . _ - , ~ . ~ . . . . _
[ ~ 4 |QU t ~)~ ~-~_-~ -.~A,, 76 /1%°0, ~ ~A75.o ,6
_ "x. .XPsc, 7 o ..o~o ~ ~.o,0_
o. s . , . ~ ~ , .E ( o - s , , , . . . 7 , o o ° . . ~ l ' r ~ . . . ~Oo- FO, 0.33 ~ ~=-ru , v.~%_
" ' - AVERAGE" 7 6 Tou, 115 I I 1 I~ I I LINE 1 F 0 , 0 . 3 2 I
3.7 3.8 3.9 4.(J3.6 3.7 3.8 3.9 4.0 log k
FIGURE 11:
Shows scanner observations ( .... ) by Oke (1964) and by Baschek and Oke (1965) corrected for line blanketing and corrected for the new calibration by Oke and Schild (1970) and Hayes and Latham (1975). The points (-) and • de- monstrate the difference obtained for the continuum with different measured line blanketing corrections. Also shown are the computed continuum energy distributions for radiative equilibrium models ( ) and for scaled Bilderberg models ( .... ). For the Hyades stars, shown in the right hand column, radiative equilibrium models can represent the observed distribution rather well if B - V < 0.22. For larger values of B - V a reduction of the flux for
~ 4000 ~ is ~pparent indicating a flat temperature gradient in the layers T ~ i. For the field stars shown on the left hand side the violet flux is reduced already for 8Ari with B - V = O.14 and I Ps¢ with B - V = O.19. The energy distributions can be represented quite well with scaled Bilderberg models, i.e. models with an unexpectedly large convective energy transport in layers with ~ ~ I.
The number given beside the star name gives the rotational velocity vsin i, the number beside the spectral type gives B - V.
81
4,0
4"11 4.2
4.3
4.4
° 0 ÷ 4
~ 4.6 _o ~. 4.7 N
I 4 . 8
~e ~dO0, 60 Leo
900 15 " 0.20
m , , , . , . o
f.. 7~0 63, Tou 0.30
4,9 ~ -OC--" -- -- 1 _-- -. 5.0 ~q .
rUMo 5.1 r . 0.35
5 2 / I ! 3.6 3.7
log k
| I" 3.8 3.9
FIGURE 12 :
Shows the A m star scanner observations by Baschek and Oke (1965), cor-
rected for line blanketing and the new calibration, (notation as in Figure 11). The bluest star, 60 Leo, can in the average be well represented by a
radiative equilibrium model. (A discontinuity might be suggested at a wave- length, where a discontinuity in the OI continuous ~ occurs). In 15 Vul with
B - V = o.20 some convective energy transport may be present. For 63 Tau and T UMa the flux reduction in the violet is even stronger than predicted by the scaled Bilderberg model, however, the line corrections may be some-
what uncertain. For A m stars convection appears to become important for about the same
B - V as for normal stars.
82
transport in the top layers of the convection zone. For the field stars in the gap
(0.22 < B - V < 0°29) we can almost match the observed energy distribution with ra-
diative equilibrium models except for the sharp downturn just longward of the Balmer-
jump. It seems they try to have convection like Hyades stars but do not quite make it.
I do not understand this difference between the Hyades and the field stars.
In the last Figure 14 we have plotted the T , derived for the different stars e
by these comparisons of scans and computed energy distributions as a function of their
B - V given in the literature. Also given are the spectral types and the VrSin i. The
filled symbols indicate stars matching convective models, the open ones radiative
equilibrium energy distributions. Stars for which the decision could not be clearly
made are given in brackets. They are mostly the field stars in the gap. If we leave
out these uncertain ones, then we see two sequences, one for radiative and one for
convective energy distributions. The stars with high VrSin i occur exclusively on the
convective branch. I would interpret Figure 14 as telling us that generally convection
will become efficient for B - V > 0.22, however fast rotation will cause an earlier
onset of efficient convection leading to a reddening of the star by A(B - V) = 0.07
as given by the scaled Bilderberg models.
Could the decrease of the flux longward of 3647 ~ be a direct result of rapid ro-
tation without involving convection? Collins's results (1965) show that such an effect
can only be expected if the star rotates close to the Roche Limit and if at the same
time we look almost equator on, i.e.~ sin i ~ ]. For A stars this should lead to
v sin i ~ 350 km/sec, which is much larger than the observed values. r
Also indicated in Figure ]4 are the values for the A m stars. After correcting
B - V for the additional line blanketing - Baschek determined A(B - V) ~ 0.05-0.07 -
they fall on the same two sequences defined by the normal stars. There does not seem to
be any difference with respectto convectiono I might mention that we c~zmot reproduce
Baschek's and Oke's scanner measurements for any of the A m stars which we measured,
namely ]5 Vul, T UMa, 60 Leo, even though we do reproduce the energy distributions of
normal stars~ except for some minor discrepancies for 45 Tau. We are inclined to con-
clude that the A stars are all variable on time scales of the order of decades, a m
timeseale that reminds one of the solar cycle. We are presently checking on the
variability. This result is only preliminary.
III. SUMMARY
We have pointed out that stellar evolution computations presentl F need a good con-
vection theory rather than give us relevant information. The measured micro-and macro-
turbulent velocities may tell us something about convection, but we do not really know
when. Temperature inhomogeneities are hard to measure. The continuum energy distribution
in the UV for stars with B - V ~ 0.30 clearly shows the effect of a reduced temperature
83
5.6
5.;
4.8
o ~
L~ 4.4
l
4.0
, , , , ~ ' , i , , , ~ , , I , - ~__~a3oo~ ~...~-.~. ~o.o
~ooo. 5"~ CAq, ,s0 ~ "
;;'._;o~%~oo -%-~,<25 ~ o ; ~"',.. ,.u.~,.5 ~ ~ • /'.~ - . ~ , u - " " - - ' l = ~ F 2 0 . 5 6 " ~ - ' ~ " ~ " ~ A , ¢ , - , : ;n -'~OO.W'x%&.~STTou 100
.~mC_ .e,O, ' " ~00_ "~ ,~ ~'~ . . . . . . © " - " ~ A9 -FO" 0 2 8
/~ ~ _ _ ~ . r t u y g , D . " " " ~'~,~..=G'0. ~ e , A7 0,2 ~, _ . ~ H D 2 4 3 5 7 , 5 0 . • ~ - ~ % . F 4 , O . 3 8 ~ ~'~ln."n"n"n"n"n"~L • ' ~ ~ m "~' '-~61~uUr -x=~,FI 0 3 4
• *" ~ * ' - - ~ [ 0 ~ -~FO 0 2 6
,, ro-~oOc - ' _~-- '~=-~¢,~ "~FO, 0 , 2 6 ~; ,~- " 2o "~ ~'f '50 . - ¢ - - ¢ .00¢
/ . . . . r _ FO, 0 . 5 0 ~ . . . . . . ~6000~4, F2 0 .37 .." ..~oo~--?~'-.~ .. oo~~71ooc , ,
i l i I i i i ¢ ~ I i I ~ ' ' , , , ~ ' ~ , 3.6 3.7 3.8 3.9 4 .0 " 3 6 3 7 3,8 3,9 4 .0 '3.6 3,7 3,8 3,9 4 , 0
~og X
FIGURE 13:
Shows the scanner observations by B~hm-Vitense and P. Johnson (1977). For 57 Tau we compare 2 sets of measurements of B6hm-Vitense and Johnson with those of Baschek and Oke to give an impression of the uncertainties. ~Indieates stars for which we have only 1 or 2 usable scans. Otherwise nota- tion as in Figure 11.
The scans confirm our previous conclusions. The stars in the "gap" i.e. with 0.22 ~ B- V ~ 0.29 show evidence,of convection, the field stars as well as the Hyades stars. HD 27429, a rapidly rotating star, shows an especially large reduction of the blue and violet flux.
84
9000
8500
Te 8000
7500
7 0 0 0
0
OAI,118 I
• HYADES, CONV. 13 HYADES, RAD. 0 FIELD, RAD+ • FIELD, CONV.
I !
• A5, 76 B Ari
r + A7' 1600 + ( c } ~7,77 a.eAs,+-+P,,+)~
A6, 9 0 0 _ -A7, 70 +C
A7.5, 52D + ( r ) mmA8, t30
AT, 74 r'l(OA5 ' 175) (FO, 5 0 1 1 A 8 , 215
A9 ,92 rt ( (0 FO))
((FO,5OO})m A9, I00
lI FO, 60 FO
(FO0) FI,SC
FO, I I S I I ~ F 2 F 0 , 8 5 +/ c + I I F 2
• F 2 F 2 , 9 0 0
I I I O, I 0.2 0 3 0.4
B - V
150
FIGURE 14:
Gives the relation between T e and B - V as determined from the compari- son of computed and observed energy distributions. Filled symbols indicate matches with scaled Bilderberg models, i.e. convective models, open symbols
matches with radiative equilibrium models. Dubious cases are given in brackets, they always appear twice, once matched with a convective model and once with
a purely radiative model. The spectral types and the rotational velocities VrSin i are given for each star. Leaving out the dubious cases we find that
for B - V ~ 0.30 all stars are convective. For smaller B - V we find two branches: One for radiative equilibrium stars and one for convective ones.
The high VrSin i occur all at the convective branch, indicating that rapid rotation appears to enforce convective energy transport in the late A stars
rather than to inhibit it as was previously suggested (B6hm-Vitense and Canter-
na 1975).
This conclusion rests on the interpretation of the somewhat uncertain energy distributions of p Tau, 69 Tau and 57 Tau.
+r refers to radiative equilibirium A m stars, +c to convective A m stars. After correcting B - V for the additional line blanketing they fall on the same two branches as the non A m stars.
85
gradient in layers ~ ~ 2/3 indicating an unexpectedly large convective energy flux in
these layers. Stars with B - V < 0°2 and VrSin i < |00 km/sec mostly appear to be in
radiative equilibrium. Stars with 0.10 < B - V < 0.30 may be convective, they are al-
ways convective if they have large v sin i. r
The point B - V = 0.30 also marks the extension of the red boundary line of the
instability strip to the main sequence. We take this as an additional evidence that
this boundary line actually marks the onset of convection in the HR diagram. With local
mixing length theory I/H = ] leads to agreement between the theoretical and observed
boundary lines for convection, neither larger nor smaller values for I/H will do.
The author's research described in this review was made possible by an NSF grant,
which is gratefully acknowledged. I am also grateful for a U.S. senior scientist award
from the "Alexander yon Humboldt Stiftung".
REFERENCES:
Abt, H. and Hunter, J.: 1962, Ap. J. 136, 381 Allen, L.H., Greenstein, J.L.: 1960, Ap. J. Suppl. ~, 139 Andersen, P.: 1973, P.A.S.P. 85, 666 Andersen, C.M., Stoeckley, R. and Kraft,R.P.: 1966, Ap. J. 143, 299 Baschek, B. and Oke, J.B.: 1965, Ap. J. 141, 1404 Baschek, B. and Reimers, D.: 1969, Astron. and Astrophys. ~, 240 B6hm, K.H.: 1958, Zs. f. Ap. 46, 245 B6hm-Vitense, E.: 1970a, Astr~n. and Astrophys, 8, 283 B6hm-Vitense, E.: 197Ob, Astron. and Astrophys. 8, 209 B6hm-Vitense, E. and Szkody, P.: 1974, Ap. J~ 19~, 607 B6hm-Vitense, E. and Canterna, R.: 1975, Ap. J, 194, 629 B6hm-Vitense, E. and Nelson, G.: 1976, Ap. J. in press B6hm-Vitense, E. and Johnson, P.: 1977, in preparation Canterna, R.: 1976, private communication Chaffee, R.H.: 1970, Astron. and Astrophys. 4, 291 Chandrasekhar. ~. :1961>,Hydrodynamic and Hydromagnetic Stability (Oxford,
Clarendon Press) p. 135 Collins, G.W.: 1965, Ap. J. 142, 265 Dickens, R.J., Kraft, R.P.~ and Krzeminski, W.: 1968, A . J. 73, 6 Frisch, H. and U.: 1975, Physique des Mouvements dans les Atmospheres
Stellaires (Centre National de la Recherche Scientifique, Paris 1976)
Garz, T. and Kock, M.: 1969, Astron. and Astrophys. ~, 274 Hayes, D.S. and Lantham, D.W.: 1975, Ap. J. 197, 593 Hoyle, F. and Schwarzschild, M.: 1955, Ap. J. Suppl. 13 Huan9, S. and Struve, O.: 1952, Ap. J..116, 410 Iben, I.: 1963, Ap. J. 138, 452 Johnson, H.L.: 1952, Ap. J. 11_~6, 640 Johnson, H.L. and Morgan,W.W.: 1953, Ap. J. 117, 313 Johnson, H.L. and Knuckles, C.F.: 1955, Ap. J. 122, 209 Johnson, H.L. and Mitchell, R.I.: 1958, Ap. J. 128, 31 Johnson, H.L., Mitchell, R.I. and Iriarte, B.: 1962, Ap. J. 136, 75 Johnson, H.L., Mitchell, R.I., Iriarte, B.;and Wisniewski, W.: 1966, Comm.
Lunar and Planet. Lab. No. 63 Kippenhahn, R.: 1972, In Stellar Chromospheres, Proceedings of ~ASA Collo-
quium.
86
Slettebak, A.: 1955, Ap. J. 121, 653 Str~mgren, B. and Perry, C.: 1965, unpublished report, Institute of Advanced
Study, Princeton, N.J. Traving, G.: 1975, Physique des Mouvements dans les Atmospheres Stellaires
(Centre National de la Recherche Scientifique, Paris 1976) Wallerstein, G.: 1962, Ap. J. Suppl. ~, 407 Wallerstein, G. and Helfer, H.L.: 1966 Ap. J. 71, 350 Wilson, 0.C.: 1976, Ap. J. 205, 823 Wright, K.O.: 1955, Transactions of the IAU, IX, 739 Zirin, H .: 1975, Reprint, Hale Observatories (Carnegie Institution of
Washington, California Institute of Technology, BBSO No. O150)
Kraft, R.P.: 1965, Ap. J. 142, 681 Kraft, R.P.: 1967, Ap. J. 148, 129 Lucy, L.B.: 1976, Ap. J. 206, 499 Mannery, E.J., Wallerstein, G. and Welch, G.A.: 1968, Ap. J. 73, 548 Matsushima, S. and Travis, L.D.: 1973, Ap. J. 181, 387 Mendoza, E.E.: 1956, Ap. J. 123, 54 Mitchell, R.I.: 1960, Ap. J. 132, 68 Oke, J.B.: 1964, Ap. J. 140, 689 Oke, J.B. and Conti, P.S. : 1966, Ap. J. 143, 134 Oke, J.B. and Schild, R.E.: 1970, Ap. J. 161, 1015 Paradijs, J. van: 1973, Astron. and Astrophys. 23, 369 Reimers, D.: 1976,Physique des Mouvements dans les Atmospheres Stellaires.
(Centre National de la Recherche Scientifique, Paris 1976) Rosendhal, J.: 1910, Ap. J. 16__~0, 627 Sandage, A. and Tammann, G.A.: 1971, Ap. J. 167, 293 Schwarzschild, M.: 1975, Ap. J. 195, 137 Sedlmayr, E.: 1975, Physique des Mouvements dans les Atmospheres Stellaires.
(Centre National de la Recherche Scientifique, Paris 1976)
DYNAMICAL INSTABILITIES IN STARS
P.LEDOUX
Institut d'Astrophysique
Unlversit~ de Liege
SUMMARY
The linear dynamical instability at the origin of convec-
tion in stars is reviewed and shown to depend essentially on the sign
of
A = ] d__~p ! dp p dr rlp dr
which is the usual argument of convection criteria. The case of two
or more superadiabatic regions separated by subadiabatic ones might
well deserve more detailed attention.
Once this instability is partially removed by the setting
in of convection its effects must be balanced by dis ~pation terms
if a stationary state is to result. This yields the value of a Ray-
lelgh number.
If energy generation is included in the non-c0nservative
terms, possibilities are somewhat enriched including a case of dyna-
mical instability in presence of A<0 (usually stable) but very small
in absolute value.
I. THE GENERAL PROBLEM
In the context of this conference we are not interested in
dynamical instability towards purely radial mQdes since convection
cannot manifest itself through these modes. We are thus left with the
problem of the response of the star to non radial perturbatius which
88
we shall assume very small to allow a linear treatment.
Of course, non linear effects may be of very great interest
and importance and are at least partially included in some of the ap-
proaches to steady convection as, for instance, in the mixing length
theory or various numerical attempts usually with other slm~lifylng
hypothesis and simple geometry (of. Spiegel ]971, 1972 Nordlund, 1976).
One of the most recent and most direct attaeksin general stellar cir-
cumstances is due to Deupree (1974, 1975 a-b, 1976). As far as I am
aware it has not revealed new instabilities such as may occur for
instance in metastable situations. It has not either restricted the
domain of significant dynamical instabilities but it has yielded
interesting information on the development of these instabilities
such for instance as a strong asymmetry between upward and downward
motions.
The study of non radial stellar oscillations does not go
far back. If we exclude Lord Kelvin's discussion of the homogeneous
incompressible sphere (Thomson, ]863) and some more or less timid
references by Moulton (|909) and Shapley (1914), the first significant
paper is that of Pekeris (|938) in which he solved the problem of the
non radial perturbation of the homogeneous compressible s~he~e.
Pekeris used the usual separation in time and spherical
coordinates of the Euclidian perturbations f~(pt,0', T',~') and of
the radial component of the displacement ~r
f'(r,O,~,t) - f'(r)pm(cosS)eim~e i~t -£~m~£
He showed thatp for each value of the degree ~ of the sphe~cal har-
monic~ apart from a positive spectrum with an accumulation point at
infinity corresponding to the pressure modes (or p modes, a~), there
89
existed a negative spectrum corresponding to gravity modes (or g
modes, a~) with an accumulation point at zero, all these modes being
(2~+I) degenerate with respect to m. An illustration of the distribu-
tion of the ~2 as function of ~ and the order of the modes can be
found for instance in Ledoux (1974).
Of course~ in this case, all the g modes (o~<0) are unstable
but I don't think that the connection with convection was pointed out.
Note that Pekerls choice of dependent variable (~= dlv ~) led him to
miss the so called fundamental mode (or f mode or Kelvin mode) which,
in this case, is exactly the same as the unique mode (for a given £7
of the incompressible sphere (~=O).
The next important paper is that of CowllnE (1941) in which
he tackled the case of the general polytrope of index n(p = Kpn+l/n).
In this case, the general problem is of the fourth order as it does
not split into ~wo second order differential equations which can he
solved successively as for the homogeneous model. However Cowling
noted that, except for the lowest modes and lowest values of £, the
perturbation of the gravitational field can 5e neglected without
serious effects. With this approximation, the problem reduces again
to the second order and, as Cowling showed, can be assimilated to a
St=rm-Liouville problem for large enough u 2 (high p modes) or small
enough I c21 (high g modest, the f mode for each ~ falling la between
the corresponding p and g spectra.
Furthermore if the generalized ratio of specific heats F 1
(or ~ in a pure gas) satisfies the inequalities
n+l r1> n (I)
all the o 2 are positive (stable g modes : g+) while, if 8
90
r < n+l i n (2)
they are all negative leading to instability (unstable g modes : g ).
But the criterion for convective instability in terms of
A = l dp l ap (3) 0 dr FlP dr
can be written very generally, even in the relativistic case (Thorne,
1966, Kovetz, 1967, Islam, 1970)
A >0 (4)
and becomes in a polytrope, with the usual notation,
A = 1O dEd--~0 (n -n_+i)111 >0 ( 5 )
or since (dO/dE) is negative
n+l (6) rl < n
2 which is identical to condition (2). Thus all the Og are negative
(dynamical instability towards non radial perturbations) provided the
criterion for convective instability (4) or (5) be satisfied every-
where in the star.
The negative g spectrum in the homogeneous model may be
interpreted in the same way since in that case
A I dp >0 Flp dr
gl
But apparently, it was only slowly (Ledoux, 1949), at least
in astrophysics, that the connection between dynamically unstable g
modes and the sign of A became to be recognized and that convection
became to be considered as the end effect of this instability.
In the general case, it is not difficult to have A appear
explicitly in the equations which can be written
~P _ O' ~r dp -- - -- + .... d i v ~r (7)
p p p dr
0~i ~ rip div ~r = ~2~r- grad (@' + ) + ~ A P .....
= r3 -I ! grad o ( ~ - i div ~)' - i_~a di~) i~ p o P
p' + rip ( °' + A~r) ~ ~ p(~- !div ~)' o is p
(s)
(9)
F 2 p iaCp p
A~' = 4~Gp' (11)
where
I kPikA - + -- ~ iVk e =£N E P
represents the total heat liberated per second by nuclear reactions
and viscosity.
One may note that A is related to the Brunt-V~is~l~ frequency
N by
N 2 ,, - gA
02
and to
S = r2 - i I d1~ I dT (12) r 2 p dr T dr
by
a = 4 - 3____/~ s + ! d£ (13) 8 ~ dr
if 8 is the ratio of the gas pressure ~o the total pressure.
The adiabatic approximation (right hand members neglected
in (8), (9), (I0)) is sufficient to discuss dynamical stability and
it should enable us to understand the correlation noted above between
the sign of A and the stability of the g modes.
If we neglect ~ we can for i~stance write a second order
differential equation for ~- ~r/r :
where
9 d~ 6 i do I d
dr 2 ( ) dr
÷ ¢ { 020 _ t ( t + 1) o 2 A8 + 6
rip r 2 0 2 r 2
d dr (PI~)
+ 3 i do I d 0r 2 r ~ ~r'r + ( ) dr (Plp)) ( ) r~p d-7 < ) } - o (14)
_0r2 )
For high p modes,o 2 large, the equation simplifies very
much and shows the acoustic character of these modes. However, they
are without interest in the present context since, in realistic stel-
93
far conditions, all the o 2 are always positive. But in the same way P
if one considers high g modes, I ~21 small so that terms proportional
to 02 can be neglected, the only term left over which contains 02
is the second one in the coefficient of ~ in (14), i.e.
~(~+l) A_~
r2 ~2
and the problem is essentially of the Sturm-Liouvllle type with a
parameter %~ 1/o 2 . It is well known in that case that if A keeps
the same sign throughout, the elgenvalue (l=I/=2)~fll be of the oppo-
site sign. In other words~if A is positive (convective instability),
at least the small u 2 will be negative. But it has been accepted g
generally that all the 02 will have this same sign, because it is g
difficult to see how the sign of 02 could change in going from small
to larger values if the sign of its coefficient is constant. Anyway
fairly recently Grisvard, Souffrin and Zerner (1972) using the second
order system
dv =(~( ~ + 1) Pr2 i p2tr l d--{ c 2 - r l ~ ~ w = aw (15)
dw (s2+ Ag) P d--r = ~ v = by (16)
equivalent to (14) managed to prove without any assumption as to the
order of magnitude of ~2 that the latter are all real positive if A
is everywhere negative. Thus a necessary condition for dynamical in-
stability is that A be positive at least in some part of the star.
In that case, the authors succeeded in establishing an upper limit for
the modulus of any negative c 2 g
94
I o~t < Max (Ag) (17)
where the maximum is taken on the region where A>O. They showed also
that 1o21 increases with £, i.e. when the horizontal wave length g
decreases.
However if the investigation of Grisvard et al is free of
any asymptotic restriction, it still treats the perturbation of the
gravitational potential 4' as negligible. Would the previous results
hold if 4' is not neglected, especially for the lowest modes and har-
monics? Lebovltz (1965 a-h, 1966) tackled this general problem on the
basis of the variational principle established earlier by Chandrasek-
her (1964)~ which expresses o 2 as the extremum of integrals exten-
ded to the whole configuration. This enabled him to show first that if
A<0 everywhere (convective stability) all the o 2 are positive which
is thus a sufficient condition for dynamical stability towards non
radial perturbations. It is also a necessary condition as he showed
later, since the existence of a region however small with A>O entails
negative eigenvalues. This implies also that one can find solution~
of the differential equations with appreciable amplitudes in that re-
gion only.
In this respect~the work of Ledoux and Smeyers (1966) was
more or less complementary since they pointed out indeed that when A
changes sign in O~r~R the asymptotic form (02 small, g modes) of equa-
tion (14) has s turning point in A = 0 and that the g spectrum splits
then into two : one of positive eigenvalues (a~ 20) corresponding to
modes oscillating in space with appreciable amplitudes in the convec-
tively stable region (A<O) and decaying exponentially in the unstable
region (A>O) while the other spectrum of negative eigenvalues (o~ <0)
95
corresponds to modes oscillating in the convectlvel F unstable
region (A>O) and decaying in the stable one.
Using equations (15) and (16), Scuflaire (1974, cf. also
Osakl 1975) showed that these properties of the unstable modes sub-
sist even for the first few modes~ i.e. they oscillate only in the
unstable region. However~ it may happen that stable modes oscillate
in a slightly superadlabatic region, as for instance in a condensed
model where the stable g modes characteristic of the central stable
core may continue to oscillate in an external convection zone.
2. MULTIPLE UNSTABLE ZONES
The unstable modes are of greatest interest here and an
interesting problem arises as to their behaviour if there are two
or more regions with A>O separated by stable zones (A<O). Tassoul
and Tassoul (1968) in a discussion of asymptotic g modes suggested
that there should be as many distinct unstable g spectra as there
are unstable regions.
However, this is not obvious since even in the simplest
case of two turning points (for instance two convectlvely unstable
regions separated by a stable one) the usual analysis of a single
turning point in terms of Bessel functions cannot be simply repea-
ted (Langer, 1959) at each of the two turning points. A solution
should rather be sought in terms of Weber functions allowing to cross
the two turning points at once.A considerable amount of literature
exists on the subject and a very recent paper by Olver (1975) opens
the way to straightforward applications. They may bring to our
attention, at least in special cases~ unstable solutions whlch~In
the above case, may have large and comparable amplitudes in the two
unstable regions with a relatively minor reduction of this amplitude
96
across the stable region especially if the latter is fairly narrow
with a density gradient only slightly subadlabatlc. Such modes could
be particularly efficient in mixing the whole star. This conjecture
is somewhat supported by some simplified or numerical investigations.
For instance, Goosens and Smeyers (1974) have found more or less
"accidental resonances" between some of the stable g modes of two
stable regions separated by an unstable one (Just the opposite of
the case considered above) giving rise to a stable mode with large
amplitudes in both stable regions decreasing only moderately in bet-
ween in the unstable region.
Other examples have been treated by students in Liege
which lead to slmilar conclusions. Consider for instance, the case
of a heterogeneous incompressible model composed of superposed layers
of different densities presenting two unstable discontinuities,
(Pin - Pext )<0~ separated by a stable one. The behavlour of the
elgenfunctlons associated with the two negative eigenvalues can
depend drastically on the closeness of these eigenvalues. In general~
i.e. as long as those two o 2 are not very close~ each of the elgen-
functions has a sln~le maximum at the unstable interface with which
it is associated. Kowever when the parameters of the discontinuities
are varied~ one of these solutions may acquire a secondary strong ma-
ximum at the other discontinuity, the minimum between the two remai-
ning apprecla~le ~r1~n t~n~s elgenvalue becomes very close to that cor-
responding to the other discontinuity.
3. THE EFFECTS OF THE DYNAMICAL MOTIONS AND THE REDUCTION OF THE
SUPERADIABATIC GRADIENT
Many of the examples where dynamically unstable modes have
been found are artlfleiel because the superadiabaticity (A) has been
fixed a priori at a much larger value than is ever llkely to occur
97
in stars. In such cases, the time-scales of tke growing motions
which are proportional to (A) -I/2 are rather short. Of course in the
end, these violent motions lead to the establishment of a convective
zone through which A is reduced to a very small value Just sufficient
to allow the residual energy no longer transferred by radiation to
be carried by convective currents. This implies important readjust-
ments in the internal structure of the star including transfer of
mass to deeper layers and various feed-back effects which may in-
crease considerably the extent of the convection zone with respect
to the initial superadlabatlc region. For instance, in going from
an homogeneous compressible sphere to a polytrope n - 3/2(FI=5/3)~
the energy released, if the mass and the radius are those of the sun,
is of the order of 0.i GM~ /R G = 5.1047 ergs in a short time of the
order of one hour. Of course this is an extreme case, but even if the
energy release is reduced by a factor I0 I0 and the duration increa-
sed by a few orders of magnitude, it would still remain a fairly
spectacular phenomenon which was considered at one time (Biermann~
1939; Schatzman, 1946) significant for the interDretatlon of novae.
In any case~ I suppose that there are no serious doubts
that the readjustments contemplated above would lead in the end to
the same model as the one that could be built a priori using the
usual method of having a convective adiabatic zone initiated at the
point where the radiative gradient becomes exactly equal to the adia-
batic one (A ~ 0).
In fact things are a little more subtle as the reduction
of the superadlabatlcity must proceed only so far that the subsisting
excess provides the necessary buoyancy force to balance the energy
dissipated by viscosity and conduction (radiative or otherwise). Of
course, when A and 1o21 are still large, the effects of these dissi-
98
pation terms could Be evaluated hy a pertuT~atlon metkod (similar
to that used for vibrational stability when ~2>O) yleldlug a damping
coefficient ~' correcting the adiabatic time dependence By a factor
-olt e
But one knows how cumbersome (cf. Ledoux 1974), the ex-
pression of ~' is. yo illustrate the situatlo~, it seems Better to
revert to a simple case as, for instance, the plane layer with cons-
tant coefficients. Let ~ and n represent respectively the thermome-
tric conductivity a=d the kinematic coefficient of viscosity. Assu-
ming a time depe~d~nce e st, one finds for the g modes, if A is posi-
tive and larger than K,~ or <+n
s = gA) z2- 2 (lg) k 2
where k~ and k z (k 2 = k~ + k ) are respectively the horizontal and
the vertical wave number. In this case, dlssipatio~ simply hinders
a little convection hut does not affect it very much.
On the other hand, when convection is established, the
effects of the dissipation forces are of the same order as those of
the residual buoyancy and the above formula is no longer significant.
In that case and if one includes the rate of energy Ke~eratlon c
(significant in the deep interior), one gets an equation equivalent
to that of Defouw (1970) with
~)g E L T = - _ = T ' L0 O
where v represents the sensitivity of ~ to T,~ =(dlog~)0/dlo~ T.
After separating a secular root s 3 = -n k 2, the dispersion relation
for g modes, gives solutions
99
1 ~ (~ - I) ) s = - y~ (~+ ~)k 2 -CT
P
2
+ i --~ (19) -'{~ ((~ - n)k2 CpT (v- i) 12 k zk~l gA} I/2
which for vanishing E reduces to the ordinary Raylelgh solution of
the B~nard problem
i + I s -- - y (n + ~)k 2 -{~ (~ 2 l/2
~) 2k4 kH + ~ gA} (20)
which yields back (18) if (k2/k2)gA>>~2k 4 and n2k 4
On the other hand, if we approach the marginal case
2 2 (st0), then (kH/k)gA must take the appropriate value to make the
square root exactly equal to the first term, i.e.
2
kH gA ~ nk 4 nk2 C~ (v-l) k 2
If e is negligible~this condition becomes
~Ad 4 = k6d 4
~n 2 k H
where the depth d of the layer has been introduced. If k z =~/d and
a ~ kHd , one gets finally
~Ad 4 = ( 2+ a2)3
<~ 2 a
= R
which is the usual value of the Rayleigh number. The energy genera-
tion could reduce somewhat the value of R. In this marginal case,
100
the tlme-scale- !/s Becomes infinite but a circulation sets in with
a finite turn-over time.
Thus convection~although originating essentially in a dy-
namical instability (conservative terms) with a short time scale,initi
ares fast motions to modify the medium itself in such a way as to
reduce gA fIBal~y to the same order as the dissipative (non conser-
vative) forces.
As Defouw (1970) pointed out, the energy generation term
has a destabilizing influence (cf.19) and if
CeT (v-l)> (n+~)k 2 P
it contributes directly to the instaDillty. If A>O, this effect
simply relnforc~ the Buoyancy.
On the other hand if A<O, and such that
2
k 2 p
the effect would correspond to a case of vibrational instability (or
overstability) of stable g modes. But if
2 kH gA<~ (< - -V 2 c T (v-l)
P
it seems that a growing non-oscillatory motion would arise which
might lead to some kind of convection, although A<O.
101
REFERENCES
BIERMANN, L. 1939, Z. Astrophys., 18, 344
CHANDRASEKHAR, S. 1964, Astrophys. J., !39, 664
COWLING, T.G. 1941, Mo~. Not. Roy. Astr. Soe., I01, 367
DEFOUW, R.J. 1970, Astrophys. J., !60, 659
DEUPREE, R.G. 1974, Astrophys.J., 194, 393
DEUPREE, R.G. 1975a, Astrophys.J., 198, 419
DEUPREE, R.G. 19755~ Astrophys.J., 201, 183
DEUPREE, R.G. 1976, Astrophys.J., 205, 286
GOOSENS, M. and SMEYERS, P. 1974, Astrophys. Space Sel., 26, 137
GRISVARD, P., SOUFFRIN, P. and ZERNER, M. 1972, Astron. and Astro- phys., 17, 309
ISLAM, J.N. 1970, Mon. Not. Roy. Astr. Sot°, 150, 237
KOVETZ, A. 1967~ Z. Astrophys., 66, 446
LANGER, R.E. 1959, Tra~s. Amer. Math. Soe., 90, 113
LEBOVITZ, N.R. 1965a, Astrophys. J., 142, 229
LEBOVITZ, N.R. 1965~, Astrophys. J., 142, 1257
LEBOVlTE, N.R, 1966, Astrophys. J., 146, 946
LEDOUX, P. 1949, Contribution ~ l'~tude de la s~ructure interne des
~toiles et de leur stabilitY, M~m. Soc. Roy. Scl. Liege
Coll. in -8 °, 4°s~r. T IX, Ch. III, sect. 4, 5, 6
LEDOUX, P. et SMEYERS, P. 1966, Compt. Rend. Acad. Sci. Paris, S~r.B., 262, 841
LEDOUX, P. 1974, in P.Ledoux et al (ed.) Stellar Instability and
Evolution, IAU Symposium n=59, Part Vl, p. 135
MOULTON, F.R. 1909, Astrophys. J., 29, 257 0
NORDLUND,A. 1976, Astron. and Astrophys., 50, 23
OLVER, F.W.J. 1975, Phil. Trans. Roy. Soc. London A, 278, 137
OSAKI, Y. 1975, Publ. Astron. Soc. Japan, 27, 237
PEKERIS, C.L. 1938, Astrophys. J., 88, 189
102
SCHATZMAN, E. 1946, Ann. Astrophys., 9, 199
SCUFLAIRE, R. 1974, Astro~. and Astrophys., 34, 449
SHAPLEY, H. 1914, Astrophys. J., 40, 448
SPIEGEL, E.A. 1971, Ann. Rev. Astron. Astrophys. ~, 323
SPIEGEL, E.A. 1972, Aun. Rev. Astron. Astrophys., IO, 261
TASSOUL M. a~d TASSOUL J.L. 1968, Ann. Astrophys., 31, 251
THOMSON, W. 1863, Phil. Traus. Roy. Soc. London, 153, 612
THORNE, K.S. 1966, AsCrophys. J., 144, 201
OBSERVATIONS BEARING ON CONVECTION
K.H. BbHM
University of Washington~ Seattle, WA., U.S.A.
SUMMARY
I. Solar observations contain a considerable amount of information on the hydro-
dynamics of stellar convection. We emphasize and discuss especially
(a) the existence of two very different cell sizes,
(b) the unexpectedly high cross-correlation between vertical velocities and tempera-
ture fluctuations in the granulation,
(c) the fast "downdrafts" in intergranular regions,
(d) the existence of cells much larger than the scale height,
(e) thes~mnge behavior of the temperature fluctuations in the supergranulation, and
(f) the importance of convective overshoot.
2. The Li-Be problem and its possible relevance as an indicator of convective
overshoot is briefly summarized.
3. Convection may have a stronger influence on the observable properties of He-
rich ("non-DA") white dwarfs than of most other stars. We discuss especially
(a) the persistence of outer convection zones through a very wide range of effective
temperatures,
(b) the occurence of convection in high layers of the atmosphere,
(c) the relatively high efficiency of convection in white dwarf atmospheres, and
(d) the relevance of convection to the cooling problem.
! • INTRODUCTION
The main ~£iculty of the present topic is due to the fact that almost all
spectroscopic and color observations of cooler stars are somewhat related to the
convection problem but that there are so few observations which seem to be really
crucial in this context. Almost all of the really decisive observations seem to refer
to the sun. Though in the stellar case the conclusions concerning convection can be
extremely interesting, the way to reach them is often indirect and some doubt is
usually possible.
Consequently we fee~ chat the topics of our discussion are quite obvious as long as
we look at evidence from the Sun, but that our selection of topics in the stellar
cases will probably be somewhat subjective.
104
I shall first report on the solar evidence. After that I shall go on to discuss
some observational evidence for convective mixing processes in stars.
Finallywe shall consider a class of stellar atmospheres whose average temperature
structure is strong, changed by the presence of convection in rather high atmospheric
layers.
II. OBSERVATIONS OF SOLAR CONVECTION
A. GENERAL REMARKS
As we all know there are at least two groups of phenomena which are thought
to be direct indications of the presence of convective motion (including overshoot)
namely granulation and supergranulatlon. In the case of granulation it is more or
less generally accepted that we see the direct effects of convective motion, in the
case of supergranulation some slight doubts may be possible. Nevertheless most
astronomers believe that supergranulation is the manifestitation of the penetration
of large cell convection.
In addition to these direct convective effects there are indirect effects which give
us some information about the structure of the convection zone. There are firstly the
observed eigenmodes of the so-called five-minute oscillation. I refer to the work
of Ando and Osaki (1975), Deubner (1975), the review given last year by. J.P. Zahn
as well as the very interesting comment by Mclntyre (1975). Secondly, there are
observed mixing effects in the sun which may or may not be due to convective mixing.
I refer specifically to the very low abundance of Li 7 and the apparent absence of Li 6.
I shall first discuss the direct evidence for convection from the observations of
granulations and supergranulation.
One of the most interesting observed features of solar convection is that there are
two scales of motion present near the surface of the convection zone, namely the
granular scale of about 2000 km (maximum of the A I power spectrum) and the super-
granular scale of about 32000 km. Both numbers correspond only to the relatively
flat peaks of broad power spectra. This statement is of course not new. However,
it is surprising how few theoretical attempts to understand this fact have been made.
However, there are some notable exceptions including especially the paper by Simon
and weiss (1968).
Let me now try to summarize briefly our knowledge of granulation and then of the
supergranulation.
B. GRANULATION
How do we describe the observational information on granulation ? There
are essentially two possibilities. Either we use a description based on the autocor-
relation function and power spectrum or we look at single granules and, maybe, derive
from them the properties of an average (or typical) granule.
105
For some time it seemed that a description using correlation functions and power
spectra would be the only useful method. However, more recently studies of properties
of individual granules have found great interest. The reason for this is of course that
a description is needed which permits a separate study of the granules on the one
hand side and the intergranular lanes on the other side.
In studying granulation we should like (ideally) to obtain a three-dimensional picture
of the velocity and the temperature fluctuations. This requires observations of very
high spatial resolution, observations in a number of different lines (which are formed
at different depths) and center-to-limb observations. One of the fundamental diffi-
culties is, as we all know, the separation of the granular and the oscillatory velocity
fields.
One way to do this is to identify the region in the k-~ plane which corresponds to
granular motion. A simpler method is to assume that all horizontal scales smaller
than'b 4000 km correspond to granulation whereas all motions with a larger horizontal
scale are due to oscillations (cf. Mattig, Mehltretter and Nesis ]969, Beckers
and Canfield 1976).
In addition, one finds that the observed velocities first decrease with increasing
height and then increase again (cf. Canfield and Mehltretter 1973, Mattig and Nesis
1974). This fact can also be used for a separation of the two velocity fields.
The component which decreases outward is usually identified with convection and
convective overshoot whereas the increasing (or constant) component corresponds
to the oscillations.
Even today the problem of correcting the observations for the effects of the
contrast transmission function of the telescope and of the atmospheric seeing seems
to be a very difficult one. This is true for the determination of the intensity
and the corresponding temperature fluctuations but even more so for the velocity
determination. Consequently we find a rather large scatter of the observational
results which is certainly to a considerable part due to seeing differences. Another
difficulty is due to the fact that different authors use different spatial domains
and use lines which are formed at different depths in the atmosphere. Consequently
it does not make much sense to take averages of measurement by different authors.
Rather, we shall take the detailed high resolution observations by Canfield and
Mehltretter (1973) as a typical example of modern results. These authors find a definite
decrease of the r.m.s, velocity outward before it starts to rise again. (This
result has been confirmed by Mattig and Nesis 1974, Mattig and Schlebbe ]974,
Musman 1974, and others).
The largest r.m.s, velocity is found for the line formed at the relatively
largest depth. The relatively faint Fel line 5178 shows a formation depth of about
40 Pan above the zero level of HSRA. Canfield and Mehltretter find a r.m.s, velocity
of .54 km/s for this line if corrected only for instrumental effects and of .73 km/s
if a reasonable correction for seeing effects is applied. By extrapolation the
authors find a r.m.s, velocity of .8km/s at the height of continuum formation.
106
It seems that these velocities are very roughly compatible with predictions of the
simple mixing length theory.However, this type of theory (without convective overshoot)
predicts a much steeper variation of the r.m.s, velocity than is observed.(See ~ig.l.)
Any theory which includes overshooting (even linear mode calculations, el. B~hm 1963 a)
can reproduce the observational results in a qualitative way.
0 .8
.~ 0.6 E
> 0 . 4
0 . 2
~l I i "" I ' I ' I I - -MIXING LENGTH WITHOUT
_'~ O V E R S H O O T
~ Canfield, ~\ X,~ Mehltretter ~\ ( 1 9 7 3 ~
\ \ " . ,~ ,~o . . ~ - \ \
\ "- \ , ~ ~" -. ~X"3130 km
1 5 6 0 k i n " I , I " ,L I I
0 2 0 O 4 0 O h (kin)
6 0 0
F I G U R E l
Comparison of the observed (vertical) velocity stratification (corrected for finite telescope aperture and for finite slit width hut not for atmospheric seeing~ Canfield and Mehltretter |973) with some simple theoretical results. The filled circles represent the observations by Canfieldand Mehltretter (1973), the (long) broken line is the corresponding interpolation curve. The solid line shows the results for the standard (local) mixing length theory with I = H. The vertical velocity distribution for two linear modes with horizontal wavelength calculated for a detailed model of the solar convection zone (B~hm 1963)is given by the (short) broken curves~ Note that the linear modes contain an arbitrary amplitude factor.
107
.-2 ~.oo c
,4 t,=
0 . 7 5 n~ LU
0 a. 0 . 5 0 LU >
0 . 2 5 ..J W n,-
%
' I ' I ' I ' I ' -
-- ~\\ / , I X5171
I '~
/ / \ \ " --_rx5,64 -1 ", \ \ t xs,za /
, I i I ,- "" 1" J I ~
2 0 4 0 6 0 8 0 I 0 0 k x 10 - 4 km -I
FIGURE 2
Comparison of the velocity power spectra for the lines Fel ~ 5171 and Fel /% 5164, 5178, 5180 and the power spectrum of the intensity fluctuations in the continuum. The diagram is based on the data given by Canfield and Mehltretter (]973). The maxima of the contribution functions for the lines 5178, 5164 and 5180A occur at a height between - 20 km ands70 km in the Harward-Smlthsonian Reference Atmosphere, the maximum for the line ~ 5171 lies at h% + 470 km i~ the HSRA. Note that the velocity power spectrum for the lines formed near the depth of formation of the continuum (~ 5178, A 5164, ~ 5180) looks very similar to the intensity fluctuation power spectrum for the continuum.
108
The continuum r.m.s, intensity fluctuations are~ 8.2% with a reasonable seeing
correction (2% T-fluctuation) which (if naively interpreted) is lower than the
mixing length predictions (~20%).
However, Canfield and Mehltretter believe that the higher value of ~ I ~ 12%
derived earlier by Mehltretter (19711 from his observations may also be correct.
The spatial power spectra of the intensity fluctuations show (after correction
for seeing effects) a rather flat maximum at a horizontal wave number between
3 and 5 x |0 - 3km -I (~ between 2000 and 1250 km).
Itshould be noted that a number of investigators find the peak of the power
spectrum at considerably longer wavelength (cf. Mattig and Nesis 1974). We assume
that this is a consequence of seeing effects.
Power spectra ~f the velocity fluctuations for lines which are formed relatively
deep in the photosphere seem to approach a power spectrum similar in shape to
the A I power spectrum (Canfield and Mehltretter 19731 indicating that
T-fluctuations and velocity field are really related below T 5000 ,~ ,4.(Fig.2).
Earlier, Frazier (1968) had found a comparably high correlation between continuum
intensity and the velocity in the Sill 6371 llne which is formed very deep. (The
peak of the contribution function is at • ~a ,65,)
It is really astonishing how high the v - ~ I correlation is. One has to
remember two things to appreciate this fact. Firstly, the velocity and the
continuum intensity which are beeing cross-correlated do not refer to the same
layers. Secondly, there is a possibility that for turbulent convection even at
one given point the cross-correlation could be considerably lower than 1
(cf. Spiegel 1966 a).
It is also interesting to note that there seems to be roughly a 30 second phase lag
in the sense that the maximum granular continuum follows the velocity (Frazier 1968,
Edmonds and Webb 1972, Musman 1974, Beckers and Canfield 19761.
We shall now proceed to the description of individual granules which, as mentioned
earlier, turns out to be useful and very interesting from a hydrodynamic point
of view. A very fundamental study of this type has been carried out by Beckers and
Morrison (!9701. They observed a large number of granules at ~ = .84, .70, .60
and determined the velocity field of an average granule from these observations.
By making use of the different positions on the solar disk (different~)these
authors derive the vertical as well as the horizontal velocity field for the average
granulum. Since not all granula have the same size and shape we can not expect
to get a really quantitative picture of the hydrodynamics of a granulum by this
procedure. Since in the reduction process the granules are positioned such that their
centers fall on the same point the upward velocities at the center of the granule are
enhanced in the averaging process whereas horizontal and downward velocities (occu-
ring in the outer parts of the granulum) are reduced. Nevertheless the "average"
granulum shows very clearly the basic structure of the velocity field in a granulum.
(Fig. 3).
109
| i I ,
Z ~ k m I 0 ~
200 m / ~
I I I I iooo looounl
J
FIGURE 3
The vertical cross-section of the velocity field of an "average" granule. (See text for a brief description of the averaging procedure). From Beckers and Morrison (1970). By permission of Reidel Publi. Co., Dordrecht.
Of course, we have to keep in mind that all modes of smaller scale are smoothed out by
atmospheric effects and by the averaging process.
The importance of such investigations lies in the detection of an ordered horizontal
outflow from the granulum. This is, of course, not too surprising, but it is certainly
an observational fact which is not emphasized in the mixing length theory.
The investigation of individual granules has lead to the discovery of another interes-
ting phenomenon namely the "exploding granules". (Carlier, Chauveau, Gugan and
RDsch 1968, Musman 1972, Beckers and Canfield 1975).
The phenomenon is shown (in a rather schematic way) in Fig. 4. Musman (|972) has
interpreted this sequence of events as being due to the interaction Of a thermal
with the stable layers above the convection zone. Ha has carried out a laboratory
experiment in order to confirm this point of view.
110
4 MINUTES B MINUTES LATER LATER
APPARENT EXPANSION VELOC ITY
I.Skm/s
o 0 SMALL UNUSUALLY
BRIGHT GRANULES
HAS EXPANDED; SHOWS
SLIGHT DARKENING AT
CENTER
FURTHER EXPANSION
WITH DARK REGION AT
CENTER
GRANULE (RING) FRAGMENTS AND FADES
FIGURE 4
Schematic representation of the development of an "exploding" granule (after Musman 1972, see text). Note that the bright granular region is drawn dark here.
Another important group of investigation is concerned with phenomena occuring in the
intergranular regions. It turns out that observers find a number of unexpected pheno-
mena and that these observations change our idea about solar convection considerably.
According to Deubner (]975) observations show that very large downward velocities can
be observed fairly often in the dark intergranular lanes. He quotes that observations
which have been made under excellent conditions show often downward velocities (uncor-
rected for seeing effects) of 2 km/s. He argues that these observations lead to correc-
ted downward velocities of about 4 km/s or possibly to even somewhat higher values.
These results indicate a very large asymmetry between upward and downward motion and
should be of great relevance to an understanding of the hydrodynamics of solar convec-
tion. It is not surprising that this effect is not visible in the results of Beckers
and Morrison who take average over many granules of different sizes.
111
It should be emphasized that it is not clear how large a fraction of the intergranular
lanes really show these great downward velocities. Deubner discusses these problems
in connection with the occurence of magnetic flux ropes in the photospheric network
as observed by Stenflo (1973) but he definitely considers the possibility that the
downward velocity is very high in all intergranular regions.
B. SUPERGRANLILATION
We all know that the supergranular velocity field pattern is strongly correla-
ted with the chromospheric network. The average diameter of supergranulation cells is
about 32000 km. Cells with diameters in the whole range from about l0000 km to 60000 km
are present. The typical horizontal (outflow) velocities are 0.3 - 0.4 km/s (Simon and
Leighton 1964, Deubner 1971). Vertical downward motions specifically in the magnetic
regions at the borders of the cell are in the range 0.] - 0.2 km/s (cf. Simon and
Leigthon 1964, Frazier 1970, Musman and Rust 1970, Musman 1971, Deubner 1971, Worden
1975). Supergranular motions have been detected also deep in the photosphere as
indicated by Deubner's (1971) measurement of CI 5380 line (X-v 7.7 eV ).
It is generally agreed that the main downdrafts occur in the rather localized
magnetic regions at the cell boundary.
Upward motions at the center of supergranulum of the order of 50 m/s may be present
but have not yet been confirmed. (Worden and Simon 1976).
In order to judge whether supergranulation is really a convective motion a study of
the corresponding temperature fluctuations is of great importance. The results of
such studies are somewhat confusing though a clarification may now be in sight.
Somewhat surprisingly one finds normally a slight temperature increase at the cell
boundaries (~ 2.5°K) whereas according to the usual picture of a simple convective
cell a temperature decrease would be expected (Beckers 1968, Frazier ]970). It is
usually assumed that these very small temperature increases are due to the increase
of the magnetic field and are not directly related to the convective flow (Liu 1974).
One might hope to find the expected velocity -- ~ T correlation more easily in deeper
layers. Recently Worden (1975) has studied the supergranulation structure at 1.64
which represents the deepest observable layer in the solar photosphere. Using the
HSRA (Gingerlch et al. 1971) we find that at this wavelength we look down to about
T qu 1.8 where the temperature is ~ 7000°K. Worden finds an 0.7~intensity increase
near the cell boundaries. This has to be compared to an 0.4% intensity increase in the
higher photospheric layers. These results can be translatedinto a temperature diffe-
rence ofabout 50 ° in the deep photospheric layers. This rather small difference indicates
that the supergranulation (if it is convection) contributes only very little to the
total convective energy flux in the visible layers. Nevertheless, the fact that such
large convection cells do existis very important from a theoretical point of view.
I do not think that this conclusion is very much influenced if supergranulation is an
overshoot phenomenon or if the supergranulation elemen~ are counter cells (of. Spiegel
1966 b).
112
Some of the i~ediate theoretical implications of the existence of convection cells
of supergranular size have been discussed by Simon and Weiss(1968 a, 1968 b).
They point out among other things that we should expect that the depth of cell is
not very different from about one fourth of i~s diameter. This leads to a cell
depth of about ]0000km which is clearly much larger than a scale height. Simon and
Weiss are to some extent guided by the results known for polytropic atmospheres in
which the scale height is of the same order of magnitude as the distance to the
surface of the convection zone. The possibility that cells of supergranular size will
transport a large fraction of the convective energy flux in deep layers is of great
importance. Simon and Weiss also predict giant cells which are comparable in size to
the total thickness of the solar convection zone and which in the meantime have also
been detected observationally.
We close this chapter by listing briefly some of the interesting and hydrodynamically
relevant properties of solar convection in table I.
TABLE I
SOME INTERESTING PROPERTIES
OF SOLAR CONVECTION
| . GENERAL
A. Existence of two completely different cell sizes (2000 km and 32000 km)
2. GRANULATION
A. Very high correlation between ~J and A T (more than 80%)
B. Flat maximum of ATpower spectra between ~ = 1200 and ~ = 2000 km
C. Very fast down-drafts (~u 4 km/s) in intergranular regions
D. "Cell" structure
E. Convective overshoot
3. SUPERGRANULATION
A. Cell sizes much larger than scale height
B. Concentration of magnetic fields at cell boundaries
C. Positive ~ T in high layers, negative ~ T in deep photospheric layers
of cell boundaries, ~ T unexpectedly small.
113
I I I CONVECTIVE MIXING AND THE Li-Be PROBLEM
When we discuss convection in stars it is quite clear that the observational
evidence concerning convection has to be rather indirect. Our conclusions are usually
based on a mixture of observations and theoretical predictions (which often use a very
crude form of theory), Sometimes we even forget that theoretical assumptions are used
and claim that the result is purely observational though it is not.
Problems of convective mixing definltely belong to this category. In many cases
we certainly see effects of mixing but we can not be absolutely certain that this
mixing is really due to convection. Even if it is convective mixing it can have
happened rather recently or during an earlier phase of evolution. Probably~ the
best known example of such situation is the lithium-beryllium problem. As is well
known, the solar atmosphere contains no or almost no Li 6 and in contradistinction to
other old material in the solar system (chondrites) very little Li 7. In order to
explain this it is usually assumed that surface material has been mixed into layers
of 2.5 x 106K where the lithium is destroyed by (p, ~ ) reactions. A progressive
decrease of the lithium content from G 2 V to K O V stars (Wallerstein, Herhig and Conti
1965, Zappala 1972) has been attributed to the effects of pre-main sequence convection.
(Bodenheimer |965, |966). In addition amain sequence depletion time scale of about
1.5. x 109 years has been found (Herbig 1965, Zappala 1972). It is not yet completely
clear whether very slow convective overshoot (cf. B~hm 1963, 3966, Weymann and Sears
1965, Spiegel |968, Strau~, Blake and Schramm 1976) in combination with the relevant
nuclear rates can produce this time scale. However, Strauss,Blake and Schramm (1976)
argue very strongly that convective overshoot can explain the observed lithium abun-
dances. They emplasize the fact that the enersetically possible convective overshoot
theoretically decreases with decreasing mass of the star and that this is just the
feature which is required in order to explain the observations. If this result should
be confirmed the investigations of the Li problem would give us an excellent opportunity
to study convective overshoot below outer stella r convection zones. It seems that the
lithium problem is more useful in the study of stellar convection than many other mixing
problems since the time scale is known well and since it refers to main sequence stars
which are better understood than the advanced phases of stellar evolution in which most
other mixing processes seem to occur. The fact that it is probably related to convective
overshoot makes it especially interesting.
114
IV INFLUENCE OF CONVECTION ON ATMOSPHERE STRUCTURE AND COOLING TIMES IN COOL
WHITE DWARFS.
After these brief remarks about the Li problem I finally would llke to talk
about a group of convection problems which are also closely related to the above ques-
tions and in which I have been strongly interested in recent years. This is the field
of convection in white dwarf stars. I personally feel that there may be more drastic
effects of convection on observable properties in the ease of white dwarfs than in most
other stars for the following reasons (of. Bohm ]968, 1970, Van Horn 1970, Wiekramasinghe
and Strittmatter 1970, B~hm and Cassinelli 197], Wagner 1972, Grenfell ]974, Fontaine
et al. 1974, Fontaine and Van Horn 1976, Muchmore and B~hm 1976).
]. Atmospheric convection persists over a wider effective temperature range
in white dwarfs than in any other stars. This is especially true for the roughly 30 %
of white dwarfs which are called non-DA's and which have very He-rlch atmospheres. In
these objects outer convection zones are present in the effective temperature range from
about 25OO0°K down the lowest effective temperatures.
2. Convective instability sets in very high in cool. white dwarf atmospheres.
In a number of models this occurs at optical depths higher than O.O1. Consequently the
atmospheric structure is strongly influenced by convection. This is especially true for
white dwarfs with) say, Tel f < 8000 ° in which the high atmospheric density leads to
relatively high effectivity of convection and consequently to a temperature stratifica-
tion which differs considerably from a radiative equilibrium stratification (cf. Wagner
1972, Grenfell }972). In extreme cases (non-DAs with 3000°K < Tel f ~ 5000 °K) the convec-
tive flux reaches values of about 60 % of the total flux at ~ ~ ].0 (B~hm, Carson)
Fontaine and Van Hors |976). So, we should be able to detect the presence of convection
without difficulty by studying the line spectra Of cool non-DA white dwarfs like
Van Maanen 2 and Ross 640. (However, it should Be emphasized convection in cool white
dwarfs has to be studied through its influence on the mean stratification of the atmos-
phere. The velocities in these high density atmospheres are much too small to he detec-
ted directly.~
3. The existence of two different types of white dwarfs atmospheres, one consis-
ting of almost pure helium (non-DA), the other one of almost pure hydrogen (DA) has lead
to the suggestion that convective mixing processes in combination with gravitational
separation and/or accretion may be important in determining the chemical composition of
white dwarf atmospheres (cf. Strittmatter and Wickramasinghe }97}, Shipman 1972, Baglin
and Vauclalr ]973). However, the subject is controversial and simple convective
115
mixing is insufficient to explain the non-DAs (Koester 1976). Nevertheless, some
interesting convective mixing effects are to be expected since it is clear that
white dwarfs have a very ~hin envelope of hydrogen or helium surrounding the main
body of the star which consists of C and O (el. Weidemann |975).
6.0- I--
_.o
5.5-
!
0 -5 -10
I I
. . . oooo °
,,, ! ! . . . . . . . .
/ . _ - - BOO0 °
. . - - - - . . . - I o o o =
. . . . - 6500 °
5 . 0
6.5-
-20
FIGURE 5
Temperature stratification in the outer layers of cool He-rich (non-DA) white dwarfs. The diagram is based on calculations by Muchmore and B~hm (1976). We have plotted the logarithm of the temperature as a function of the local degeneracy parameter = = - ~ /kT (with ~ = chemical potential). Tel f is the parameter. The solid parts of the curves correspond to convection zones, the broken lines to the radiatlve-conductive zones below. The diagram shows how the temperature in cool He white dwarfs approaches its asymptotic core value T c and how large a fraction of the temperature rise towards T c occurs in the (geometrically thin) convection zone. This gives some indication of the importance of convection for the determination of the T e and Tel f relation (see text).
116
4. In the case of white dwarfs the evolutionary time scale is strongly
influenced by the time it takes for energy flux to go through the outer non-
degenerate and partially degenerate layers. When convection breaks throngh this
"insulating" layer the cooling becomes considerably faster. Consequently, the
cooling time and the luminosity function of white dwarfs is influenced by the
presence of convection (cf. Bbhm 1968, Ostr~ker 1971, Lamb and Van Horn 1975).
Eventually we shall he able to derive information about convection from all these
different effects. Some indication of the importance of convection for the deter-
mination of the relation between core temperature and Tel f is given in Fig. 5.
Finally it should be emphasized that the comparison of theoretical predictions of
white dwarf convection with observations involves one complication which
is not present in most other stars. The calculation of the equation of state, the
adiabatic gradient and the specific heat is made rather difficult by the presence
of partial degeneracy, pressure ionization and electrostatic interactions between the
particles. (In non-DAs of Teff < 3800 ~ we face some fascinating problems because
drastic complications of the equation of state, including partial degeneracy, occur
already within the atmosphere, see B~hm, Carson, Fontaine and Van Horn ]g76).
However, we hope that the problems related to the equation of state will be overcome in
the foreseeable future.
ACKNOWLEDGEMENT S
This work has been in part supported by NSF grant AST 74 - 24343 A 0].
A part of this paper was written while the author held the Gauss Professor-
ship of the Gbttinger Akademie der Wissensehaften. I am grateful to the
Akademieand to Professor H.H. Voigt and the other astronomers of the Gbttingen
University Observatory for their kind hospitality.
t17
REFERENCES
ALTROCK, R.C, and MUSMAN, S. 2976, Ap J. 203, 533
ANDO, H. and 0SAKI, Y. ]975, P.A.S.J. 27, 58]
BAGLIN, A. and VAUCLAIR, G. 1973, Astron. Astrophys. 27, 307
BECKERS, J.M. 1968, Solar Phys. 5, ]08
BECKERS, J.M. and CANFIELD, R.C. 1976, in Physique des Mouvements dans les Atmospheres Stellaires (ed. R. CAYREL and M. STEINBERG) Editions du C.N.R.S., p. 207
BECKERS, J.M. and MOKRISON, R.A. 2970, Solar Phys. 14, 280
BODENHEIMER, P. 2965, Ap J. 142, 451
BODENHEIMER, P. 1966, Ap J. 24_~4, 103
B~HM, K.H. 1963, Ap J. 138, 297.
B~HM, K.H. 2966, Z.f. Naturforschung 2l_~a, ]107
BOHM, K.H. 1968, Astrophys. Space Sci. 2, 375
BOHM, K.H. 2970, Ap J. 162, 919
B~HM, K.H. and CASSINELLI, J.C. 197], Astron. Astrophys. 12, 21
BSHM, K.H., CARSON, T.R., FONTAINE, G. and Van HORN, H.M., 1976, in preparation
CANFIELD, R.C. and MEHLTRETTER, J.p. 1973, Solar Phys. 33, 33
CARLIER, A., CHAUVEAU, F., HUGON, H. and R~SCH, J. ]968, Comptes Rendus 266, 199
DEUBNER, F.L. 2971, Solar Phys. 17, 6
DEUBNER, F.L. I975, Astron. Astrophys. 44, 371
DEUBNER, F,L, 2976, Astron. Astrophys. 47, 475
EDMONDS, F.N. and WEBB, C.J. 2972, Solar Phys. 25, 44
FONTAINE, G. and Van HORN, H.M. 1976, Ap J. Suppl. 3]_, 467
FONTAINE, G., Van HORN, H.M., BOHM, K.H. and GRENFELL, T.C. ]974, Ap J. 193. 205
FRAZIER, E.N. 1968, Ap J. 152, 557
FRAZIER, E.N. 1970, Solar Phys. 24, 89
GINGERICH, O., NOYES, R.W., KALKOFEN, W. and CIINY, Y. 1971, Solar Phys. 28, 347
GRENFELL, T.C. 1972, Astron. Astrophys. 20, 293
GRENFELL, '£.C. 2974, Astron. Astrophys. 3]_ , 303
RERBIG, GoH. ]965, Ap J. ]4], 588
KOESTER, D. 2976, Konvektive Durchmischung und Accretion be] weissen Zwergen Habilitationsschrift Kiel
LAMB, D.Q. and Van HORN, H.M. 2975, Ap J. 200, 306
LIU, B.Y. 1974, Ap J. 189, 359
MATTIG, W., MEHLTRETTER, J.P. and NESIS, A. 1969, Solar Phys. 10, 254
MATTIG, W. and NESIS, A. 1974, Solar Phys. 36, 3
MATTIG, W. and SCHLEBBE, H. 1974, Solar Phys. 34, 299
McINTYRE, M.E. 1976 in Physique des Mouvements dans les Atmosphares Stellaires (ed. R. CAYREL and M. STEINBERG) Editions du C.N.R.S., p, 262
MEHLTRETTER, J.P. 2971, Solar Phys. 16s 253
118
MUCHMORE, D.0. and BDHM, K.H. 1977, Astron. Astrophys. 54, 499
MUSMAN, S. 1971, I.A.U Symposium n ° . 43, p. 289
MUSMAN, S. 1972, Solar Phys. 26, 290
MUSMAN, S. 1974, Solar Phys. 36, 313
MUSMAN, S. and RUST, D.M. 1970, Solar Phys. 13, 261
OSTRIKER,J.P. 1971 ~nn. Rev. Astron. Astrophys. 9
SHIPMAN, H.L. 1972, Ap J. | 7 7 , 723
SIMON, G.W. and LEIGHTON, R.B. 1964, Ap J. 140, |]20
SIMON, G.W. and WEISS, N.O. 1968 a, Z. Astrophysik 69, 435
SIMON, G.M. and WEISS, N.0. 1968b, in Structure and Development of Solar Active Regions (ed. K.O. KIEPENHEUER) D. Reidel, Dordrecht-Holland, P.]08
SPIEGEL, E.A. 1966 a, in Stellar Evolution (ed. R.F. STEIN and A.G.W. CAMERON) New-York: Plenum Press, p. 143
SPIEGEL, E.A. ]966 b~ I.A.U. Transactions ]2 B, 539
SPIEGEL, E.A. 1968, Highlights in Astronomy (ed. L. PEREK) Dordrecht: Reidel Publ. Co.
STENFLO, J.O. 1973~ Solar Phys. 32, 41
STRAUSS, J.M., BLAKE, J.B. and SCHRAMM, D.N. 1976, Ap J. 204, 481
STRITTMATTER, P.A. and WICKRAMASINGHE, D.T. 197], M.N.R.A.S. 152, 47
VAN HORN, H.M. 1970, Ap J. (letters) ]60, L 53
WALLERSTEIN, G.W., HERBIG, G.H. and CONTI, P.S. ]965, Ap. J. 141, 610
WEGNER, G. 1972, Ap J. 172, 451
WEIDEMANN, V. 1975, in Problems in Stellar Atmospheres and Envelopes, (ed. B. BASCHEK, W.H. KEGEL and G. TRAVING) Heidelberg - New-York: Springer Verlag, p. 173
WEYMANN, R. and Sears, R.L. ]965, Ap J. ]42, ]74
WICKRAMASINGHE, D.T. and STRITTMATTER, P.A. 1970, M.N.R.A.S. ]50, 435
WORDEN, S.P. 1975, Solar Phys. 45, 52]
WORDEN, P.S. and SIMON, G.W. 1976, Solar Phys. (in press) abstract: Solar Phys. 45, 547 (1975)
ZAPPALA, R.R. 1972, Ap J. 172, 57
EVOLUTION PATTERN OF THE EXPLODING GRANULES
O. Namba and R. van Rijsbergen
Astronomical Institute, Utrecht
SUMMARY
The evolution pattern of the so-called exploding granules has been studied on
the basis of a time sequence from Princeton Stratoscope pictures of the solar granu-
lation. Some preliminary results are presented (section3). A new interpretation of
the phenomenon is suggested (section 4).
1. INTRODUCTION
During the morphological study of the solar granulation (Namba and Diemel,1969)
we were puzzled by some large granules which have a round darkening in their centre
often with several dark canals radiating from it toward the boundary. In 1967 we began
a study of this remarkable class of granules, which are now referred to as "exploding
granules" though the term is somewhat misleading. The exploding granule phenomenon was
discovered by Roseh and his co-workers (Carlier et al., 1968) in moving pictures ob-
tained at the Pic du Midi Observatory, an example of which was shown during the 1967
I.A.U. Meeting at Prague, Musman (1972) has proposed the first theoretical model of
the phenomenon on the basis of his laboratory experiment and observations obtained at
the Sacramento Peak Observatory. In this paper we present some observational features
of exploding granules, which may be of some importance for the theory of non-stationary
convection.
2. OBSERVATIONAL MATERIAL
The observational material is a time sequence of high-definition photographs of
the granulation obtained on the Stratoscope flight of August 17, 1959. The duplicate
negatives on 35-mm film were kindly lent us by the Princeton University Observatory.
The photographs were taken in the green-yellow region with an exposure time of 0.0015
sec at a rate of a frame every 0.929 see; the highest spatial resolution attained is
0.112" or 271 fun on the Sun.
The time sequence lasts about ]6min (Frame Nos. 2145 - 3200) and refers to a
quiet region near the disk centre; some of the frames are illustrated in Bahng and
Schwarzschild (196;), where the observational data are also given, The search for ex-
ploding granules and the study of their evolution have been done on positive enlarge-
120
a
2 1 5 3
Ore06 s
2413
4m07 s
i b 2 1 8 8
0 m 3 8 s
C
2 2 5 9
l m 4 4 s
g 2 5 5 0
6m14 s
h
2 7 0 6
8 m 3 9 s
J
|
d 2311
2m:
2 8 0 3
10 m 0 9 s
e "~- > 1 0 m 5 7 s
2 3 5 3
3 m l l s 0 5 10" I . . . . I , , , , I
Fig. I. Evolution of a typical exploding granule (centre) - a time sequence from Stratoscope granulation photographs obtained on August I?, 1959. Indicated are frane numbers, relative times, the lifetime, and the distance scale (I" = 725 km). Note also remarkable stability of surrounding granules. (Small specks and dots in some pictures are blemishes.)
121
mants of about 50 selected frames (the image factor: l"= 4 mm for all and 1 ''= 6 mm
for portions including the granules treated here). The work was troublesome, because
the focussing of the image was widely variable and high-resolution pictures are not
uniformly distributed in the sequence. Furthermore, besides small shifts of the granu-
lation field in the frame, there is a drift of the solar image that divided the time
sequence into two parts, one lasting about 10 min and the other 6 min, with some spa-
tial overlapping (cf. Fig.2 below).
In this study seven exploding granules were examined. Fig. l shows a series of pic-
tures for a typical example. Although the picture quality is somewhat variable, the
series illustrates nicely its evolution (cf. Carlier et al.,196g and Musman, 1972).
On the positive prints we measured the area of granules as a whole and expressed them
in terms of the average diameter (D). For the pictures shown in Fig. l microphotometry
has been carried out with a Joyce Isodensitracer.
3. RESULTS
Although individual exploding granules behave rather differently, their evolution
pattern may be summarized schematically as follows. A granule appears as a bright spot,
which grows quickly. As the size increases, a vague shade forms over the central part
of the bright granule; it soon becomes a round dark hole - see Fig. l. As the central
darkening develops a few dark canals radiate from it toward the boundary. This may be
regarded as the primary splitting of the granule, usually into two or three smaller
granules. Around the time where the granule reaches its maximum extension the split
granules break further into several parts (the secondary splitting?), provided the gra-
nule is large. So, one can count from several to more than ten granules, split from a
single granule. The evolution after the fragmentation depends upon the granule. One of
the split granules may become a new exploding granule while others fade away. Carlier
et al. (1968) showed an example that developed three "generations" of exploding gra-
nule from a single entity. We should mention that the splitting is a very common pheno-
menon of granules: even for small granules (D < I") it is rather difficult to find such
granules that do not split.
From the study of seven exploding granules the following characteristic features
have been derived.
(a) Measured diameters are plotted in Fig.2 as a function of time. The maximum size
reached during the evolution was from 3.3" (2200 km) to 5.4" (4000 km) in diameter. The
expansion rate AD/At is more or less linear with a speed of ].3 to 3.3km/s. Hence, the
radial expansion rate AR/At ranges from 0.7 to 1.7 km/s, in ~greement with the earlier
estimates of 1.5 to 2 km/s (Carlier et ai.,1968) and of 1.8 km/s (Musman, ]972). These
values are only a fraction of the sound speed in the photosphere (~8 km/s), but they
exceed the horizontal flow velocity of 0.34 km/s (maximum) at the top of granules,
measured by Beckers and Morrison (1970).
Furthermore,there is a tendency that the expansion speed is proportional to the ma-
122
6'
5
t n,,
LLJ
F,3
I I I I I 2200 2400 2600 2800 3000 3200
TIME (Frame N o) ,,
Fig. 2. Growth of exploding granules: average diameter (I" = 725 km) vs, time (a fr~ne every 0.929 eec), The granulej illustrated in Fig. 1, is the third from the top.
x 1.0
E
",~ .s i .4
,2
I ! . . . . . I I I I I I I
Dmax I
1 i i i i i i i i i O. 0 -8 -6 -4 -2 0 +2 +4 .6 *8 +I0
TIME (minutes)
Fig. 3. Relative growth of the exploding granules, derived from Fig. 2. The ~verage rate of growth &In D/At ~ O.045/min.
I23
ximumdiameter reached. In Fig.3, where the mean diameter relative to the maximum dia-
meter is plotted versus time, all the growing branches cluster around a line (not drawn)
with a slope
A in D O.045/min. (I)
At
(b) The above relation suggests that in the sub-photospheric layer the upper part of
the granules may look like huge cones, as sketched in Fig.4.
(c) The central darkening occurs only when the granule is roundish, uniformly bright
and exceeds a certain critical diameter D c ~2.3" (1600 km). It does not show up in
large but elongated granules with a width smaller than this value.
The isophotometry of the pictures shown in Fig.] yielded interesting data about the
development of the central darkening for this particular granule. In the interval from
picture b to picture e, the central darkness increases linearly with time from 13 to
25% of the surrounding brightness and the diameter at half the central depression grows
from 700 to II00 km at a linear rate of 3.0 km/s while the granule expands at a rate of
3.2 km/s in the same period.
(d) The lifetime for the individual granules has been found to be much longer than
the"correlation"lifetime of 8.6 min measured by Bahng and Schwarzschild (1961). The
time sequence allowed us to determine only a lower limit of 10 - 11 min. But Fig.3
suggests'a lifetime of, say, half an hour. In Fig. I also the remarkable stability of
normal granules surrounding the exploding granule is apparent: many persist through
the |0-min time. This is the case also for small granules in the range of 0.8" ~ D
< ].7" (Namba and P. Provoost, ]973, unpublished).
(e) In the time sequence we counted about 20 certain and ]0 suspected exploding gra-
nules with the characteristic features (central darkening with or without radial canals)
over an area of roughly 70" x 90" at any given moment; they cover 2 to 3% of the total
area. In addition, thanks to the time sequence, we found about 10 granules, which be-
came exploding ones, and nearly twice as many already "exploded" granules.
The spatial distribution of exploding granules does not seem to be random, but
we lack further observational material to investigate whether there is any correlation
between the locations of exploding granules and the supergranulatlon pattern. Allen and
Musmen (1973) found no such correlation in their observations.
4. A POSSIBLE INTERPRETATION OF THE EXPLODING GRANULE PHENOMENON
Musman (]972) interpreted the phenomenon as follows. When a rising granule pene-
trates into the overlying stable region, the internal g=anular motions and the conser-
vation of angular momentum act to change the form of the granule into a vortex ring,
which is stretched out horizontally and breaks. This"smoke rlng model, however, meets
difficulties in explaining some observed facts, for example, why some granules are of
the exploding type while man~ others are not, and why a part of the broken ring may
124
a photospheric
level-h . . . . - - . . . ~
b = 8 0 0 km
C "iO Fig. 4. A possible interpretation of the exploding granule phenomenon -
development of the central darkening (vertical cross section). Cf. Fig. I. a: as the granule rises the top layer is cooled instantly; the cold-matter flows outj b: since the granule grows fast and large (D >~ 1800 kin) the cold matt'er is left behind the expanding granule boundaryj and the central darkening begins to form. It develops parallel to the growth of the granule~ c: when the mass of the cold matter in the centre becomes too heavy,-~t sinks, breaking through the granule. The development is accompanied with splitting of the granule. Borne of the split granules may grow further while others fade away.
125
become a new exploding granule.
The evolution pattern reported here suggests an alternative aualitative interpre-
tation which is illustrated in Pig. 4. As soon as a hot granule reaches the photospheric
level its top is cooled. At first the cooled matter flows horizontally (with a velocity
of ~ 0.5 km/s) and then downward to join the surrounding intergranular region (Fig.4a).
The upper layer of the steadily growing granule is removed continuously in this way.
This process goes smoothly as long as the granule size does not reach the critical dia-
meter mentioned above.
However, when the granule expands with a speed faster than the horizontal flow velo-
city, the cold matter can never reach the granule boundary and is left behind, and con-
sequently the central darkening begins to form (Fig.4b). By this time the granule dia-
meter may have reached the critical value D c. The central darkening develops parallel
to the growth of the granule (paragraph 3c above).
The mass of the cold matter accumulated in the centre of the granule is supported
by the buoyancy and the upward motion in the granule, until its total weight becomes
too large. Then the balance is lost and the cold mass sinks, breaking through the gra-
nule (Fig.4c).
The development of the central darkening may be accompanied with the fragmentation
of the granule. The onset of the downward streaming at the centre might induce the
secondary splitting.
Our interpretation predicts a downward motion in the central darkening after some
moment. It is of particular interest to determine the Doppler velocity in the central
darkening and its change during the evolution of exploding granules.
ACKNOWLEDGEMENTS
Our thanks are due to the late Dr R.E. Danielson of the Princeton University Ob-
servatory, who kindly supplied us with the valuable Stratoscope material. The Project
Stratoscope I was sponsored by NSF, ONR, and NASA, U.S.A. We thank Mr J.L.A. van
Hensbergen for his assistance in photographic work. We acknowledge Drs A.G. Hearn,
G.D. Nelson and C. Zwaan for stimulating discussions.
REFERENCES
Allen, M.S. and Musman, S., 1973, Solar Phys., 32, 3]]
Bahng, J. and Schwarzschild, M., 1961, Astrophys. J,, 134, 3]2
Beckers, J.M. and Morrison, R.A., 1970, Solar Phys., ]4, 280
Carlier, A., Chauveau, F., Hugon, M. and Rosch, J., 1968, C.R.Acad.Se.Paris,226,199
Musman, S., 1972~ Solar Physo, 26, 290
Namba, O. and Diemel, W.E., ]969, Solar Phys., !, 167
GRANULATION OBSERVATIONS
A. Nesis
Fraunhofer-lnstitut
F r e i b u r g i~Br., FaG
The a n a l y s i s o f h i g h r e s o l u t i o n s p e c t r o g r a m s t h a t were o b t a i n e d in
Capr i w i t h t h e Coud6 r e f r a c t o r gave t h e f o l l o w i n g r e s u l t s r a g a r d i n g
the g r a n u l a r f i e l d s ,
a) The rms v e l o c i t y of t he g r a n u l a t i o n was 300 m / s e c . Th i s v a l u e r e -
p r e s e n t s t h e raw d a t a and s h o u l d be s u b s t a n t i a l l y i n c r e a s e d to a c c o u n t
f o r i n s t r u m e n t a l and s e e i n g e f f e c t s (W. M a t t i g e t a l . , 1969) . Note
t h a t t h e o s c i l l a t o r y component of t he t o t a l v e l o c i t y f i e l d has been
e l i m i n a t e d by a n u m e r i c a l f i l t e r °
b) In the photosphere the intensity and velocity fluctuations show a
similar structure. The size distributions of the observed Doppler
shifts and intensity variations have been studied by means of power
spectrum analysis (Fig.l, ~. Mattig and A. Nesis, 197~)o These spectra
are not corrected for finite instrumental resolution or for atmospheric
seeing.
The question that we put to the theorists is which of the current
models of convection is able to interpret these observations?
Should one resort to the anelastic approximation for convection to
study our data, or must one work with the thermals picture, t o limit
oneself to just two examples?
Since we can arrive at an understanding of the granulation only by
careful and systematic observations, I believe that a practically
orientated dialogue between theory and observations is indispensable.
References
Mattig, W., Mehltre%ter, J.P., and Nesis, A.: 1969, Solar Phys. I__0, 25~
Mattig, W., and Nesis, A.: 1974, Solar Phys. 36, 3
127
Io 9 P{K}
3-
LM -\ \ \
k , , r - " - ~
13."8 3."8 1:9 1"2 0~85 0."75 i i i I I ' i
23 46 k 69 92 115 x lO'"km "~
Fig. J. The power s p e c t r u m of t h e i n t e n s i t y I ( x ) and o f t h e v e l o c i t y
v ( x )
LoK.: c o n t i n u u m l e v e l s ~ L, Mo: Line c e n t e r
SOME ASPECTS OF CONVECTION IN METEOROLOGY
R. S. Lindzen
Center for Earth and Planetary Physics
Harvard University
Cambridge, Massachusetts 02138
ABSTRACT
Various aspects of convection in meteorology which may have some relevance for astro-
physics are discussed. In particular the role of convection in determining the gross
thermal structure of the atmosphere, the treatment of convective turbulence in the boun-
dary layer, and the larger scale organization of convection are dealt with. ~
I. INTRODUCTION
As noted by Prof. Biermann during this conference, a number of seminal approaches
to convection in astrophysics such as convective adjustment and mixing-length theory
originated in meteorology. The purpose of this paper is to briefly review the current
status of such notions in meteorology as well as to report some relatively recent approa-
ches to meteorological convection which may prove useful in astrophysics.
Since most of what I discussed at the conference has appeared in the meteorological
literature 9 1 will tend to use this written version of my lecture as a selective anno-
tated directory to this literature rather than a complete version of the lecture. No-
thing remotely approximating a complete review of convection in the meteorological li-
terature will be attempted.
In section 2, I will describe the observed thermal structure of earth's atmosphere
and explain why simple radiative models with convective adjustment prove inadequate --
qualitatively and quantitatively. I will also outline our current understanding of
the observed structure -- though this understanding is by no means complete.
In section 3, I will introduce a current phenomenological approach to penetrative
convection in the atmosphere which may prove a more consistent alternative to convective
adjustment in astrophysics. In both sections 2 and 3~allusions will be made to the
fact tha~in the atmosphere, convection often occurs in relatively narrow plumes, and
that such convection is generally associated with local static stability. The point
which may be relevant to astrophysics is that convection is not necessarily related
to local temperature gradients in a simple way.
129
Section 4 deals with a particular observed feature of atmospheric convection :
namely, when broad regions (for example the maritime tropics equatorwards of 20-30 =)
are uniformly unstable (or conditionally unstable) convection does not occur in a uni-
formly distributed manner. Rather, the convecting system itself appears to be unstable
to so-called mesoscale systems (physically akin to internal gravity waves) which in turn
organize the convection on scales much larger than the scale of the convective elements.
The mechanism for this organization appears to be different when the convection is deep,
extending almost from the ground to the tropopause (cumulonimbus convection), and when
convection occurs within the middle levels of the troposphere; both are discussed in
section 4. The point, however, is that in the earth's atmosphere convection rarely if
ever manifests itself in the appearance of a pattern characterized by the scale of the
convective elements alone; larger scales of organization and motion almost invariably
appear. This may offer some insight into such phenomena as supergranules on the sun.
2. THE LARGE SCALE THERMAL STRUCTURE OF THE EARTH'S ATMOSPHERE
In Figure ! a somewhat smoothed picture of the height and latitude structure of
longitudinally averaged temperature is presented (only heights up to |6 km are shown).
In Figure 2 several height profiles of temperature appropriate to various latitudes are
shown. From Figure 2 we may note a certain similarity of profiles from various latitudes.
In most cases temperature decreases with height above the ground with a gradient of
about 6°/km until some level (known as the tropopause) where the temperature gradient ~T
goes to zero or becomes positive. The relatively uniform lapse rate (- ~-~ ) of the
lower atmosphere as well as the observation that radiative equilibrium profiles for the
lawer atmosphere are statically unstable led early on to the use of a convective adjust-
ment model to explain the thermal structure of the lower atmosphere (Gold, 1909; Emden,
1913; Goody, 1949). Several important difficulties have, however, arisen in connection
with such models (some were recognized almost immediately) :
i) Convective adjustment would lead to the adiabatic lapse rate in the lower atmos-
phere. This is 9.g°/km not 6=/km. It is sometimes suggested that the atmosphere adjusts
instead to the moist saturated adiabatic lapse rate. While this lapse rate is on the
order of 6°/km near the ground, by 4km the atmosphere, because of its low temperature,
saturates with very small amounts of water vapor and the saturated lapse rate differs
little from the dry value. Thus, the use of the saturated lapse rate is no solution to
this problem -- even ignoring the fact that the atmosphere is rarely saturated. It is
interesting to note that the application of convective adjustment to the earth's atmos-
phere leads to errors on the order of 30=K or 10% in absolute temperature.
ii) Convective adjustment in no way can account for the abrupt change in tropopause
height near 30 = latitude: equatorwards it is near |6 km, polewards it is near 12 km.
1 3 0
iii) Radiative-convective equilibrium does not account for the fact that in the
nelghbourhood of the tropopause, a temperature minimum exists at the equator.
iv) Finally and revealingly, close scrutiny of Figures ] and 2 indicates that the
lapse rate of the tropopause is not really uniform,especially at high latitudes.
MB - : |k in . . . . . . . . . . . ; 6
~ 0 0 ~ . . . . . . . L - - k . . . r - - - - o - ~ . . . . . . . . . . . - - - , ~ - ' ~ ~ *-~-45 . . . . . 4 - 1 2
-~5_b--~. . ,> '~ I . - L ~ ~ " - ' ' : ~ ' ~
. . . . ~.~ ,-~. ~ - - - . " - ~ 7 ~ . : - - - - - C " ---------_ ~ - 4 ~ ~ ~ 8
500 4
7 0 0 . - - 2 5 " - - ' " . . . . ;0 . . . .
~O~N ~ 0 o 3 0 ° 0 o S~ME.
( 0 ) H E'~I~jTEH'~R E JANUARY HEMISPHERE
100
20O
3 0 0
5 0 0
7 0 O M B
1 0 0 0 9 0 ° N
( b }
6 0 ° 3 0 ° 0 = 3 0 o 6 0 == SUMMER WINTER
HEMISPHERE HEMISPHERE J U L Y
12
8
4
0 9 0 °
Figure ]. Zonally (longitudinally) averaged temperature as a function of height
and latitude. Contours are lines of constant temperature (°C). After Palmen and
Newton (1969).
131
4C . . . . . . . .
~ 5 C - - - -
E
b.,,I C~ :::) 20 l - - i I-- .-I
1 0
180 5 ~
I i 45t /~1~..,,,,P I JANUARY AND
60£~.. ~/~./~/i/. 2" ~ SPRING/FALL,
,~/j -~---~ . . . . . . . . . ~- T
I / o |
\ h I
i -
"
200 220 2 4 0 260 2 8 0 TEMPERATURE (OK)
Figure 2. Zonally averaged temperature as a function of height for various lati-
tudes. After U.S. Standard Atmosphere Supplements (1966).
Our current understanding of the atmosphere's structure suggests no uniform expla-
nation for the whole globe. Recent work (Schneider and Lindzen, 1976; Schneider, 1976)
shows that within a certain neighbourhood of the equator (extending to about 30 ° lati-
tude) the atmosphere cannot sustain significant horizontal temperature gradients (in
many respects this region is similar to a spherically symmetric atmosphere where rota-
tion is not of great importance). Large scale dynamic effects in this region serve
primarily to homogenize (horizontally) the temperature in this region, and as a result
the vertical temperature structure of this region is indeed describable in terms of
radiative-convective equilibrium. However, because the convection occurs in relatively
narrow cumulonimbus towers, it leads to finite stability rather than neutral lapse rates.
How this occurs is outlined in Appendix ]. From about 30-70 ° latitude, horizontal tem-
perature gradients are significant and rotation is of basic importance. It is generally
believed that convection in this region is due to baroclinic eddies whose energy is
drawn from horizontal temperature gradients. These eddies tend to carry heat upwards,
132
and the rate at which these eddies stabilize the atmosphere is much greater than the
rate at which radiation acts to destabilize the atmosphere~so the question of convec-
tive adjustment does not arise. The stability achieved in this region is primarily
r~lated to the north-south temperature difference, and at the moment there does not
appear to be any basic reason why temperature lapse rates at middle latitudes should
be the same as they are in the tropics. A discussion of how baroelinio eddies act to
establish the lapse rate in middle latitudes may be found in Stone (1972, 1973). The
relevance of this process to astrophysics is not at all clear. Finally, the arctic-
antarctic ice and snow cover lead to high surface albedos and radiation tends to stabi-
lize rather than destabilize the atmosphere. This, in turn, tends to suppress baro-
clinic eddies. A comprehensive discussion of terrestrial atmospheric stability based
on numerical simulation may be found in Reid (1976).
3. PENETRATIVE CONVECTION AND MIXED LAYERS
One may reasonably ask, at this stage, whether convection in the earth's atmosphere
ever leads to a neutral lapse rate. The answer is almost certainly yes, but it is not
clear that even in these instances, convective adjustment is the correct approach.
We shall, in this section, look at one of the more extensively studied examples of
convective mixing: namely the convective mixing of the air near the ground where the
convection is forced by solar heating of the surface. A substantial number of pheno-
menological theories exist for this process and there is still a measure of controversy
surrounding them. I will sketch one typical example of such theories due to Tennekes
(1973). The geometry of the situation is shown in Figure 3 where profiles of both po-
tential temperature and convective heat flux are presented. At the bottom of the mixed
layer there is a thin superadiabatic layer dominated by mechanical turbulence. The
nature of this layer is ignored except insofar as it delivers a heat flux (O-W)o to the
interior; this heating forces the convective mixing which proceeds over a finite layer
of thickness, h, topped by an inversion layer with temperature jump, A. The region dO
above this jump is stably stratified with~-~z= ~. As heating continues, h increases
with time -- whence the name "penetrative convection". The picture thus far is reaso-
nably well observed over land in middle latitudes. As the mixed layer rises into a
warmer environment, the cooling of the entrained warmer air must give rise to a nega-
tive flux (~)i beneath the inversion. This is mathematically expressed as follows :
dh (3. I) (Ow)i = ~ d--f •
An equation may be written for the time evolution of A, on noting that the penetration of
the mixed layer into the stable interior tends to increase A, while the heating of the
mixed layer tends to decrease A :
dA _ dh ~O d-{- Y ~ " - ('~t') b . l . (3.2)
133
we ignore radiative processes, Ob.l. satisfies a simple budget :
__ Cp0( )b.]. = - 3--~ ( Cp 08w),
orjintegratlng over the mixed layer, we find
(3.3)
(~w) o - (e-~) i 38 (TC)b. i . : h
(3.4)
Substitution of (3,4) into (3.2) yields
dA dh (E~;)o (@w)i dt ? dt h + h
• (3.5)
and (3.]) together with (3.5) are generally taken as the basic equations for the sys-
tem. (~)o is given, and (3.1) and (3.5) then form 2 equations in 3 unknowns : A, h
and (ew) i . Clearly another relation is needed (and it is at this point that the bulk
of the controversy is engendered). Tennekes (1974) first considers the turbulent
energy budget near the inversion:
# - q2w) ÷ * , (36) o o
where q is the magnitude of turbulent velocity fluctuations and ~ is a dissipation
rate which is empirically found to be negligible near the inversion. (T o is a mean
temperature.) Thus (3.6) suggests that the kinetic energy generated by buoyancy is
consumed in bringing heat down through the inversion. Since buoyancy tends
to generate vertical velocity, and buoyancy acts th~Dughout the mixed layer,
(-~ q2w) ought to scale as follows:
3 ~z (2 q2w) ~ - 0 (-~), (3.7)
where o W
is the vertical velocity variance, and
T 3 -(~)i ~ o ~ (3.8)
g h
In addition since o~ is generated by (~)o
G 2
O,.v * gh 0
we have
and ow 3 ,x, gh ( ~ ) o (3 .9 ) T
0
134
Combining (3.8) and (3.9) we have
- (~)w)i = k (SW) o : (3.10)
k is a constant which is empirically found to be about 0.2. Equation (3,10) closes the
system described by (3.]) and (3.5). The resulting equations have been used (with
moderate success) to describe a variety of convective boundary layers, For the diur-
nal boundary layer, surface heating during the day causes h and Qb.|. to increase;
the heat thus deposited is carried away by radiation during the night when (~)o is
zero. This incidentally explains how there can be a turbulent heat flux into the atmos-
phere in the mean even though the mean stability may be positive.
d8
d--fiY Z Z
t+dt
~e 8W
Figure 3. The vertical distributions of potential temperature and turbulent heat
flux in and above a convective boundary layerl after Tennekes (]973),
To be sure, the concept of a diurnal boundary layer is hardly applicable in astro-
physics. However, the above approach has also successfully accounted for the semi-
permanent mixed layer of the tropical maritime atmosphere (Sarachik, 1974). In that dh particular case there exists a between-cloud subsidence which causes ~ in equ, (3.])
dh and (3.5) to be replaced by (~ - w) and an equilibrium solution exists wherein
135
dh d-~ = O. More germain to astrophysics would be the inclusion of radiation in the above
picture. Equ. (3.]), (3.3), (3.6) and (3.7) would all need modification since radia-
tion would not only alter the gross budgets but would also act to dissipate buoyancy.
1% is also conceivable that convection, if it were to occur in plumes~would not lead
to an adiabatic lapse (well mixed potential temperature) (see Appeudix ]). This might
affect the validity of (3.8) and (3.9) since the mean stability would inhibit buoyant
acceleration. The above, of course, all remains to be done, but it might conceivably
form a more satisfactory alternative to convective adjustment. The possibility of con-
vection leading to inversion "discontinuities" etc. might have significant implications
as well.
4. MESOSCALE ORGANIZATION OF COI'~ECTION
We turn now to a last and somewhat different aspect of atmospheric convection.
Even when rather broad regions of the atmosphere are relatively uniformly unstable
(or more typically conditionally unstable with respect to moist processes), convec-
tion (in the form of cumulus clouds) rarely if ever occurs in a uniformly distribu-
ted manner. Instead, the convection is almost always organized into systems whose
scale is typically I-2 orders of magnitude larger than the scale of the cumulus
clouds themselves. The larger scale (100-4001~) is referred to in meteorology as
the mesoscale. Cloud clusters and squall systems are examples of mesoscale systems.
Mesoscale organization appears to be an intrinsic feature of atmospheric convection.
For certain types of atmospheric convection the relation to mesoscale organization
seems reasonably clear. In these cases moisture is concentrated near the surface
(in the first 2 kilometers of the atmosphere typically) and virtually the entire
depth of the troposphere is conditionally unstable. Such situations tend to be
characterized by intense cumulonimbus convection. The rainfall in such situations
tends to satisfy a simple moisture budget where the rainfall (and hence latent heat
release) is proportional to the convergence of moisture (plus evaporation where this
is relevant). Moreover~ since the moisture tends to he confined to Z < Z T (where Z T
is typically 2 km), the convergence of moisture tends to be proportional to the ver-
tical velocity at Z T. Finally the latent heat release is significant for the larger
scale motions. In the presence of an internal wave perturbation (which produces
convergence) one can imagine an interaction of the sort indicated below:
Latent Heating
J \ Surface convergence ~-" -- Internal waves
136
If the internal waves produced by latent heating produce more surface convergence
(in the proper phase) than is needed to maintain the wave, the system will be un-
stable. This mechanism is referred to as wave CISK (conditional instability of the
second kind), and is described in greater detail in Lindzen (1974). CISK is used
to describe any collective instability of cumulonimbus convection and larger scale
motions. The concept was introduced by Charney and Eliassen (1964) in connection
with hurricane generation. The mathematical problem in the present instance con-
sists simply in the solution of the equation for thermally forced internal gravity
waves which takes approximately the following form:
d2w+ ~2w = Q(z) (4.1)
dz 2
where are proportional to elk(x-ct).'" w is the vertical velocity, Q is all fields
proportional to heating, x is a harizontal coordinate, k is a horizontal wavenumber,
and c is a horizontal phase speed which may be complex (for unstable solutions).
For our purposes N2
~2 ~ C 2 (4.2)
where N is the Brunt-Vaisala frequency. Now it is an easy matter to write the solu-
tion for w (satisfying suitable boundary conditions) as functional of Q(z):
w ffi F c [Q] ( 4 . 3 )
where w depends on c (and z) as well as Q, But Q is proportional to W(ZT), and (4.3)
becomes
w(Z) = F c [q(Z')w(ZT)] (4.4)
where q(Z) is a specified function. At Z = Z T (4.4) becomes
w(Z T) = F c [q(Z')W(ZT)] (4.5)
which proves to be possible only for certain values of c--one of which is typically
associated with the greatest degree of instability. Current calculations indicate
that the imaginary part of c is much smaller than the real part and that for common
terrestrial situations Re(c) ~ 15m/s. Since solutions are of the form e Ik(x-ct),
growth rates are equml to k x Im(c) and one might infer that maximum growth rates
are achieved as k ÷ ~(and as the frequency k Rec ÷ ~ also). This, however, is in-
consistent with the fundamental premise of C!SK: namely, that convection is organ-
ized by large scale convergence. Clearly such organizatiun cannot be achieved on
time scales shorter than characteristic development times for the clouds. For ex-
ample in the tropics cumulonimbus clouds have a characterlstic time scale of about
i hour, which suggests a maximum frequency, ~, of about (I hour) -I. Now
1 ~ k x 15 mls. ~ kc ~ 3600 see
137
1 Hence k
3600 x 15 m 2~
and horizontal wavelength ~-~ 2~ x 3600 x 15 m ~ 339~. (4.6;
In fact, both this wavelength, and the predicted phase speed are characteristic of
tropical mesoscale disturbances, implying that the maximum frequency suggested above
is, in fact, what is realized. A similar approach has been used by Raymond (1975)
to account for the structure and evolution of intense co=vective storms in the mid-
western United States.
The relevance of wave-CISK for astrophysics is questlo=able since there appears to
be no astrophysical counterpart to rainfall. However, it is also observed in the
earth's atmosphere that cumulus convection which is restmicted to relatively shallow
layers within the middle of the troposphere and which is associated with little
(and sometimes no) rainfall is also organized into mesoscale patterns. Latent heat
does not appear at first sight, to play a major role in forcing these mesoscale sys-
tems. In a recent paper, Lindzen and Tung (1976) have shown that the near neutral
Stability created by mid-level cumulus activity helps trap internal gravity waves in
the stable region below the clouds~ creating a duct wherein wave modes may exist
without significant forcing. The phase speeds of these ducted modes (determined pri-
marily by the thickness of the stable region below the clouds) are in good agreement
with observations. Furthermore, observed periods appear to satisfy the relation
2w (4.7) T~ave rcloud
JUSt as in the case of wave CISK disturbances. Given a duct phase speed, c, and a
characteristic cloud time scale Tclo~d, the mesoscale wavelength is again
wavelength ~ 2~ c Tclou d (4.8)
The means for interaction between the waves and the cloud field are not entirely
dear in this case. However, the period given by (4.7) is still the shortest period
on which any interaction could take place. Moreover, the well known degeneracy of
such features of convection as its plan form suggests that the organization of con-
vection might be responsive to relatively weak perturbations. Similary, the waves,
being dueted, call for only small forcing.
At this point it is worth noting that the earth's atmosphere can sustain a class
of free oscillations (Lamb waves) which do not require explicit ducting. These
waves are, essentlally, horizontally propagating acoustic waves %~th c ~ 319 m/s.
By the above arguments we ought to expect organization of convection with wavelengths
given by (4.8) based on the speed of sound and Tclou d. There is no clear cut ohser-
vatlonal evidence available for this suggestion. However, the wavelengths obtained
are on the order of several thousand kilometers, and on the earth, regions on this
scale with relatively uniform conditional instability are rare. The situation ap-
pears somewhat more congenial on the sun where a convective layer exists over the
entire star. Identifying the convective elements with granules for which T ~ 5 min-
138
utes and taking c ~ I0 km/sec one obtains from e~. (4.8) that the dominant wave-
length ought to be 40,000 km. Whether it is purely an accident that this is also
the scale of supergranules remains to be seen. Less arguably, the above discussion
demonstrates rather clearly that the appearance of structures of a given horizontal
scale need not imply vertical scales of the same order. Similarly, terrestrial ex-
perience suggests that convectio~ rarely involves merely a single horizontal scale.
ACKNOWLEDGEMENTS
The author wishes to thank E, Spiegel for encouraging the preparation of this
manuscript 9 and the National Science Foundation for its support under Grant ATM-75-
20156.
REFERENCES
Arakawa, A. and W.H. Schubert, 1974 : Interaction of a cumulus cloud ensemble with the large scale environment. J. Atmos, Sci., 3_[], 674.
Charney) J. and A. Eliassen, ]964: on the growth of the hurricane depression. J. Atmos. Sci., 2_[I, 68
Emden, R., 1913: Uber Strahlungsgleichgewicht und atmosph~rische Strahlung. Sitz. d. Bayerische Akad. d. Wiss., Math. Phys. KlasRe, p. 55.
Gold, E., 1909: The isothermal layer of the atmosphere and atmospheric radiation. Proc. Roy. Soc. A, 82, 43.
Goody, R.M., 1949 : The thermal equilibrium at the tropopause and the temperature of the lower stratosphere~ Proc. Rcy. Soc. A, ]97, 487.
Held, I.M., 1976 : The Tropospheric Lapse Rate and Climate Sensitivity, Ph.D. The- sis, Princeton University, 2]7 pp.
Herman, G., and R.M. Goody, ]976: formation and persistence of summertime arctic stratus clouds. J. Atmos. Sci., 33, ]537-1553.
Lindzen, RoS., ]974: Wave-CISK in the tropics. J. Atmos. Sci., 3~], ]56.
Lindzen, R.S., and K.-K. Tung, 1976: Banded convective activity and ducted gravity waves. Mon. Wea. Rev., 104, in press.
Palmen, E. and C. W. Newton, ]969: Atmospheric Circulation Systems, Academic Press, New-York, 603 pp.
Raymond, D. J., ]975: A model for predicting the movement of continuously propaga- ting convective storms, J~ Atmos. Sci.~ 32, 1308.
Sarachik, E.S., ]974: the tropical mixed layer and cumulus parameterization. J. Atmos. Sci,, 31 , 2225.
139
Schneider, E.K., 1976: Axially sy~netrie steady state models of the basic state for instability and climate studies. Part II: Nonlinear calculations. J. Atmos. Sci., 3_33, in press.
Schneider E.K., and R.S. Lindzen, ]976: Axially symmetric steady state models of the basic state for instability and climate studies. Part. I; linear calculations. J. Atmos. Sci., 33, in press.
Stone, P.H., ]972: A simplified radiative-dynamical model for the state stability of rotating atmospheres. J. Atmos. Sci., 29, 405.
Stone, P.H., ]973: The effect of large scale eddies on climatic change. J. Atmos. Sci., 3__00, 521.
Tennekes, H., ]973: A model of the dynamics of the inversion above a convective boundary layer. J. Atmos. Sci., 30, 558.
U.S. Standard atmosphere supplements, ]966: available Superintendant of Documents U.S. Government Printing office, Washington, D.C. 20402.
APPENDIX ]. HEAT TRANSFER BY THIN PLUMES
The following discussion is based on work by Arakawa and Schubert (]974) concerning
cumulonimbus clouds. The present discussion, however, ignores moisture (both for sim-
plicity and because of its irrelevance to astrophysical problems). We shall consider
convection which occurs in plumes which occupy a small fraction of the total horizon-
tal area and which despite their small area contribute significantly to the mean ver-
tical mass flow. By "mean" we shall always refer to an average over an area large
compared to the cross-sectional area of plumes, but small compared to any large scale
flow. Our aim will be to parameterize the effect of plumes on this large scale flow.
Means will be indicated by overbars. The approach will be analogous to the use of
Reynold's averaging where the eddies will be convective plumes.
We will first partition the mean vertical mass flux into plume and environmental
(non plume) contributions: %
p-'w = Mp + M, (A.])
where p = density, Mp = plume mass flux and M = environmental mass flux.
poses the following quasi-Boussinesq continuity equation will suffice:
V o (pl) + ~ (~-6) : 0
For our pur-
(A.2)
(V • ~) will here refer to horizontal divergence of q.
consider an ensemble of plumes where
It will also prove useful to
Mp = EMi0 (A.3) i
140
Each plume may either be entraining mass from its environment in which case
,~Mi ~oi, ~Mi 8oi Ei = i--~z + P =~t )' -~z + ~ > 0 (A.4a)
or detraining into the environment in which case
,~Mi "-~)~°i'' ~Mi i D i = - t--~ + --~ +--~ < O; (A.4h)
o. is the fractional area occupied by the i th 1
p ied by a l l plumes i s
plume and the fractional area occu-
= Z o.. (A.5) p .l i
Mp satisfies the following budget :
~Mp Bop E - D = BT + P ' - - ~ , (A.6)
where
The static energy:
= E Ei entraining
plumes
= S Di
detraining plumes
s = c T + gZ (A.7) P
is conserved during adiabatic processes. The budget for s in the environment is given
by
op)psJ " ~ (M s) + QR (A.8) ~--- [(] - ~ = - V ~ (PVs) - Es + S DiSDi - --Bz , ~t d,p,
where l refers to a sum over detraining plumes and SDi is the static energy of
the d.p, i th detraining plume; QR represents radiative heating in the environment.
Using equ. (A.]), (A.2) and (A. 6) we may easily transform (A,8) to the following:
+d~p. Di (SDi - ~) + Mp ~- + Qr . (A.9)
We will now assume the following to be adequate approximations:
v • pv ~ V (~-v) , (A.!Oa)
v ' pvs ~V • (p v s) . (A.]0b)
141
Also, for P
and
<< 1, it is readily shown that (] - ~p) ~ 1,
s,
~r ~
Equ. (A.9) then becomes
27 - 37 27 p -~ + 0 v • 7 ~ + p~z = Mp ~ + dip. Di(SDi - ~) + Qr. (A. II)
Let us finally assume each plume detrains at precisely that level where its static
energy equals that of the environment (i.e., where it looses buoyancy: this is con-
sistent with the known instability of decelerating jets). Then
Z Di(Ssi - ~) = 0 d.p.
and (A. II) becomes
~-~ + p ~ ~ . V ~ + pw ~ z = Mp ~ + ~. (A.12)
We see from (A.]2) that the primary effect of convective plumes on the environment 37
is to introduce a heating term Mp ~z " This term is easily interpreted: a portion
of pw (i.e., Mp) which rises in plumes does not give rise to adiabatic cooling in
the environment - and appears, therefore, as a heating term. If we ignore p-w, Mp must
be compensated by equal subsidence which does, in fact, lead to eompressional heating
in the environment. A crucial point which may be made from (A.12) is that if convec-
tive plumes are to supply heat which is then carried off by radiation, ~z must be
positive!
NUMERICAL METHODS IN CONVECTION THEORY
N. O. Weiss
Department of Applied Mathematics and Theoretical Physics
University of Cambridge
SUMMARY
Two and three-dimenslonal computations have enlarged our understanding of non-
linear convection, particularly in Boussinesq fluids. However, we cannot
adequately predict the relationship between convective heat transport and the super-
adiabatic temperature gradient. Nor is there any indication of a preferred length
scale, other than the depth of the convecting layer, in a compressible fluid.
I. INTRODUCTION
The standard procedure for calculating the structure of stellar convection
zones is to use mixing length theory, calibrated to fit the sun and neighbouring
stars on the main sequence. Mixing length theory is based on plausible physical
assumptions and seems to provide qualitatively acceptable results but, as Dr. Gough
has emphasized, it lacks any firm theoretical basis. The principal need in astro-
physical convection is for a soundly based theory that can confirm, or replace , the
procedure now adopted. In particular, we would like to establish the functional
relationship between the convective heat transport and the superadiabatic temperature
gradient, and to determine the preferred scale of convective motion. Spiegel's
(1971b, 1972) excellent review of astrophysical convection contains a thorough dis-
cussion of the basic fluid dynamical problem and includes a full list of references,
which has been brought up to date by Gough (1976). go I shall limit myself to
describing recent progress towards understanding nonlinear convection by solving
model problems numerically on a computer.
Direct observation of solar convection reveals cellular patteraa. Hot gas
rises, cold gas sinks and the lifetime of an individual cell is of the same order as
the time taken for fluid to turn over in it. The photospheric granulation has a
horizontal scale similar to the local density or pressure scale height, and comparable
with the thickness of the strongly superadiabatic layer at the top of the convective
zone. Supergranulea have diameters about 15 times larger; their relationship to
features with strong magnetic fields implies that they correspond to more deep-
seated convection. There are also suggestions of motion on a scale comparable to
the depth of the convection zone, while speckle photometry indlcates that there may be
large scale convective cells in the outer layers of red giants.
Observations provide few constraints on the relationship between heat flux and
temperature gradient. Nor can the parameters appropriate for astrophysical convection
be modelled in laboratory experiments. Hence we must attempt to solve the governing
equations which, since they are nonlinear, have to be tackled on a computer. The
143
full problem is still too difficult. So it is necessary to make various simplifying
geometrical and fluid dynamical approximations. We hope that a better description
of astrophysical convection will eventually emerge from the results of a sequence
of idealized numerical experiments.
2. THE IDEALIZED PROBLEM
Let us consider convection in a horizontal layer, heated uniformly from below
and confined between the planes z = O, d, where the z-axis points vertically
upwards. In the absence of motion the superadiabatic temperature gradient
~ ]
where T is the temperature and the adiabatic gradient
Cp Here g is the gravitational acceleration, Cp the specific heat at constant pressure
and = the coefficient of thermal'expansion (for a perfect gas = = l/T). In the
Boussinesq approximation we assume that the layer depth d is much smaller than the
temperature scale height Cp/g~ and that the Mach number U/e s <<I (where U is a
typical velocity and c s is the velocity of sound): then the velocity ~ and density
satisfy the equations
where %, T O are constant, and the configuration is described by two dimensionless
parameters, the Rayleigh number
R =
and the Prandtl number ~ = v/X , where ~, ~ are the thermal and viscous diffusivl-
ties. The heat flux can be expressed in terms of the dimensionless number
where the total heat flux is Cp~ F. For an infinite layer N is a function of R
and ~ only.
In most laboratory experiments the convecting fluid is confined between rigid
boundaries at which ~ vanishes, and the resulting flow is dominated by viscous
boundary layers. These boundary conditions are inappropriate for stars and it
usual to assume that the tangential stress and normal velocity vanish at the
surfaces z = O, d, which are held at fixed temperatures T O + ~d, T O . These "free"
boundary conditions are dynamically fairly passive and mathematically convenient.
Nevertheless, any technique or theory should be capable of describing experimental
results correctly before it is applied to astrophysical convection.
A bewildering array of power laws has been put forward for the function
N(R, ~). For R >> 1 an asymptotic upper bound with N ~ R ½ has been established
(Howard 1963, Busse 1969). At high Prandtl numbers NNR I/3 for free boundaries
but the radiative conductivity is high in stars and the Prandtl number is therefore
small~ For ~ << I we might expect that the energy flux should not depend explicitly
144
on the viscosity W , so that N = N(S), whore S = ~R. In the sun, ~m 10 -9 but
S is typically of order 1012 . Arguments can be found for suggesting asymptotic
power laws of the form N ~ S r with r = 2, ½, 1/3, 1/4, 1/5 (Spiegel 1971 a,b;
Gough and Weiss 1976; Jones et al. 1976; Gough et al. 1975) but it is not obvious
which, if any, of these exponents is correct.
3. BOUSSINESQ CONVECTION
In the Boussinesq approximation the pressure can be eliminated by taking the
curl of the equation of motion. The time-dependent equations then become
and ~' : V^(~^~ - ~T^ 9 * ~V~
together with V.H = O, where the vorticlty ~ : V^~ and e, is a unit vector in the
z-direction. In two dimensions, with motion confined to the xz plane and indepen-
dent of the y co-ordlnate, the vnrticity has only a y-component and the velocity
can be expressed in terms of a stream function ~ such that
where @~ is a unit vector in the y-direction.
The vorticity equation then reduces to
~ _ ~ ~ ~ V~ .
3.1 Rigid boundaries
Convection sets in at the critical Rayleigh number R c and two-dimensional
solutions for R ~ I000 R have been available for some time (e.g. Fromm 1965, c
Schneck and Veronis 1967, Plows 1968). Busso (1967) first showed that two-dimenslonal
solutions at infinite Prandtl number may be unstable to three-dimensional pertur-
bations and the development of rolls into three-dimensional and time-dependent regimes
has been studied experimentally (e.g. Busse and Whitehead 1974) for fluids with high
Prandtl numbers. The stability of two-dimensional rolls was systematically inves-
tigated by Clever and Busse (1974): for Prandtl numbers of order unity, the rolls
develop a wavelike oscillatory instability when R ~ 3.5 R c. The most thoroughly
investigated case is convection in air ( ~ = 0.7) for which Veltishchev and Zelnin
(1975) and Lipps (1976) have computed three-dlmensional solutions with R 4 15 R c.
At low Rayleigh numbers Lipps' numerical experiments show the development of rolls
whose preferred width differs from that which maximizes the heat transport. As R
is increased, the oscillatory instability appears and solutions become time-
dependent. For R ~ 15 Rc, motion is three-dimensional and aperiodic. However the
change from two to three dimensional convection does not greatly affect the time-
averaged Nusselt number. These numerical results are all supported by experiments
(Willis and Deardorff 1967, 1970; Krishnamurti 1970a,b, 1973; Brown 1973).
Unfortunately, apart from the experiments by Rossby (1969), few results are available
for low Prandtl number convection.
145
3.2 Free boundaries
Convection in two-dimensional rolls has been studied in numerical experiments
with R 6 IO00 Rc (Fromm 1965, Veronis 1966, Moore and Weiss 1973). For high Prandtl
number ( V >> R ~) there are steady solutions with N~R I/s, which ere apparently
stable (Straus 1972). For V << i, the Nusselt number depends only on R and for R>>]
N ~ R 0"36 (Moore and Weiss 1973). In these laminar solutions the vorticity ~ is
nearly constant on the streamlines. The nonlinear term in the vorticity equation
remains small even when the Reynolds number is large: the rolls behave like flywheels
and are slowly accelerated until, after they have turned over many times, the buoyancy
torque is eventually balanced by friction. However, the oscillatory instability sets
in near the critical Raylelgh number for v << I (Busse 1972) and the rolls should
break down into three-dimensional cells.
In the two-dlmensional solutions, rising and falling plumes are exactly symmetrical
but this syrmmetry is no longer present in, say, a hexagonal cell where fluid can rise
in a central column and sink around the perimeter of the cell. It was conjectured that
this geometrical change might affect the physics so that N could depend on S for
<< i. So we investigated axisymmetric convection in a cylindrical cell (Jones et al.
1976). This idealized model is mathematically two-dlmensional but geometrically three-
dimensionalj though the cells cannot be packed together to fill a plane. Referred to
%ylindrical polar co-ordinates (r,~,z) the velocity is given by a Stokes stream
function ~ (r,z) such that
where e~ is a unit vector in the ~-dlrectlon, and the vortlcity
= r~l _e e ,
where
~'-~ - r ~ '7
We found, however, that the convective flux was similar to that for two-dimensional
rolls. For high Prandtl numbers N~R I/3 again, while for f<< i N ~R 0"4 approximately.
So the Nusselt number is still independent of • and approaches closer to the upper
bound. The form of the solutions is displayed in Fig. i, which shows streamlines,
isotherms and profiles of the modified vorticity~ for a steady solution with
= O.O1, R = IOO R . Vortex tubes are stretched as they move away from the axis with c
the fluid and~l is nearly constant along streamlines. So flywheel solutions exist in
a cylinder and would also, presumably, appear in hexagons.
Are these solutions stable? Jones and Moore (1977) have recently shown that the
axis~mmetric flow is Unstable to non-axisymmetric perturbations. Thus a cylindrical
cell can fragment into sectors, like unstable vortex rings in laboratory experiments
(Widnall and Sullivan 1973, Widnall, 1975). As the Reynolds number increases, three-
dimensional convection cells should therefore become unstable and split up. Such e
phenomenon is observed in the sun: large granules explode and break up into smaller
146
TZ
F (a)
F
(b)
Figure i. Axisymmetric convection in a cylindrical cell. Results for R = i00 Rc, p = 0.O1. (a) Isotherms and streamlines: equally spaced contours of T(left) and ~ . (b) Profiles of the modified vorticlty~, which is nearly constant along the stream- lines.
147
cells (Musman 1972). In a tesselated convection pattern there may also be collective
instabilities which allow vortex rings to reconnect, so that cells are swallowed up
and disappear.
3.3 The modal approximation
An alternative approach to three-dimensional convection has been to adopt a
truncated modal expansion. For example, the vertical velocity w can be expanded in
eigenfunctions of the two-dimensional Laplacian operator:
,
The single mode expansion, normalized so that ~= O, f2 = i, 7 ~ 2C (where the bars
denote horizontal averages) has been studied in great detail (Gough et el. 1975;
Toomre et ai. 1977) and numerical solutions have been obtained for Rayleigh numbers
up to 1025. In this approximation the plan form of a convection cell is prescribed
by the linear eigenfunction, and enters the equations through the parameter C. For
two-dimensional rolls C = 0 and the equations reduce to the mean field approximation;
for cylinders C = 0.18 and for hexagons C = 0.41.
With rigid boundaries the results for a single mode agree quite well with experi-
ments; with free boundaries N~R I/3 when ~ >> I but N N (S in S) }/5 for R -| << ~ << |.
The imposed plan form generates a large nonlinear term in the vorticity equation. If
the flow is constrained only to be laminar and steady then it can adjust its plan
form to make ~^(~^~) very small and the effective dissipation can therefore be
reduced.
The modal expansion and the flywheel solutions are two extremes. We might expect
that instabilities would limit the lifetimes of three dimensional convection cells,
so that they are comparable with the turnover time and laminar flywheel solutions
cannot be attained. Then N should depend on S, though it is not clear what power law
would hold. This problem will not be resolved until the results of fully three-
dimensional computations have become available. Meanwhile, the power law derived
from mixing length theory (NNS ½ for S >> i) remains as good as any other.
4. COMPRESSIBLE CONVECTION
The Boussinesq approximation is manifestly inadequate for stellar atmospheres
that extend over many scale heights (the density increases by a factor of 106 in the
solar convection zone). It i8 commonly supposed that the dimensions of convection
cells should be of the same order as the local density or pressure scale height.
This assumption fits the photospheric granulation and some physical arguments can be
adduced to support it (Schwarzschild 1961; Weiss 1976). In mixing length theory
(which is essentially Boussinesq) the mixing length is generally set equal to some
multiple of the local pressure scale height. It would be comforting to have some
theoretical justification for this choice of length scale.
Linear theory gives no help: in a polytropic atmosphere convection sets in with
a horizontal scale that is comparable with the layer depth, even for the complete
148
atmosphere where the scale height shrinks to zero at the upper boundary (Spiegel
1965; Gough et al. 1976; Graham and Moore 1977). At supercritical Rayleigh numbers
modes with smaller horizontal scales have higher growth rates, at least when dis-
sipation is ignored (see Spiegel 1972). B~hm (1967) discussed the growth rates of
linear modes in a model of the solar convection zone, neglecting turbulent viscosity,
and found that the growth rate increased monotonically with the horizontal wavenumber.
Vandakurov (1975a,b) has included the effects of an eddy viscosity and found a
maximum growth rate for cells with a horizontal scale intermediate between those of
granules and supergranules. In these gravest modes there is no reversal
of the velocity, though the amplitude is strongly peaked near the surface. A
preliminary study of the marginal stability problem (Bbq~m 1975) indicates that there
may be internal nodes but their interpretation is obscure. (The reversal in the
temperature perturbation reported by Vickers (1971) is apparently due to
numerical error.) Smaller length scales seem to be produced not by the density
variation but by the strongly superadiabatic gradient, coupled with ionization, near
the top of the convective zone.
In nonlinear studies sound waves can be filtered out by using the anelastie
approximation (Gough 1969) which is valid provided the Mach number remains small.
This has been applied, using the modal approximations to study (inefficient) con-
vection in A-type stars (Latour et al. 1976; Toomre et al. 1976). However, no
careful study of the transition from Bousslnesq to compressible convection has yet
been carried out.
Dr. Graham will describe his numerical experiments on fully compressible non-
linear convection in two and three dimensions. For steady convection in two-
dimensional rolls the eye of an eddy is no longer at the centre of the cell but is
displaced downwards and towards the sinking plume (Graham 1975). This asymmetry is
observed in solar granules, which show a broad column of hot gas, rising at their
centres, surrounded by narrower, more rapidly sinking ring of cold material (Kirk
and Livingston 1968; Deubner 1976). Graham finds no evidence for small scale motion;
convective cells extend across the entire layer, even when the density varies by a
factor of 50. In studying compressible convection it is most straightforward to
assume that the dynamic viscosity ?g is uniform. Then the viscous term dominates
the equation of motion near the upper boundary, where the density is small (Gough
et al. 1976). If the aim is to represent turbulent dissipation by an eddy viscosity,
then the kinematic viscosity ~ can be obtained from a model of the convection zone.
For the sun~ ~ is roughly constant (Cocke 1967, B~hm 1975). However, Graham finds
that the cell size is not altered by setting ~ constant across the convecting layer.
So fare the only suggestion of small scale motion has come from some nonlinear
calculations by Deupree (1976), whose resolution is too coarse for the results to
be credible. Unless further computations on compressible convection reveal some
new pattern of behaviour, we shall have to suppose that the observed scales of con-
vection in granules and supergranules are caused by boundary layers near the surface
149
of the sun, rather than by the changing density scale height. If so, the mixing
length cannot be locally determined and mixing length theory is, at best, reliable
only near the surfaces of main sequence stars.
REFERENCES
B~hm, K.-H., 1967. Aerodynamic phenpmena in stellar atmospheres (IAU Symp. No. 28) ed. R. N. Thomas, p. 366j Academic Press, London.
g~hm, K.-H., 1975. Physique des mouyements dans les atmosph_~r@s st ellaires , ed. R. Cayrel and M' Steinberg, p. 57, CNRS, Paris.
Brown, W., 1973. J. Fluid Mech. 60, 539.
Busse, F. H., 1967. J. Math. Phzs. 46, 140.
Busse, F, H., 1969. J. Fluid Mech. 37, 457.
Busse, F. H., 1972. J. Fluid Mech. 52, 97.
Busse, F. H. and Whitehead, J. A., 1974. J. Fluid Mech. 66, 67.
Clever, R. M. and Busse, F. H., 1974. J. Fluid Mech. 65, 625.
Cocke, W. J., 1967. A strophys. J. 150, 1041.
Deubner, F. L., 1976. Astr. Ast[ophys. 47, 475.
Frorm, J. E., 1965. Phys. Fluid s 8, 1757.
Gough, D. 0., 1969. J. Atmos. Sci. 26, 448.
Gough, D. O., 1976. Trans. IAU 16A Part 2, 169.
Gough, D. O., Moore, D. R., Spiegel, E. A. and Weiss, N. 0., 1976. Astrophys. J. 206, 536.
Gongh, D. O., Spiegel, E. A. and Toomre, J., 1975. J. Fluid Mech. 68, 695.
Gough, D. O. and Weiss, N. 0., 1976. Men. Not. R. Astr. Soc. ll~76l, 589.
Graham, E., 1975. J. Fluid Mech. 7qO, 689.
Graham, E. and Moore, D. R. 1977. In preparation.
Howard, L. N., 1963. J. Fluid Mech. 17, 405.
Jones, C. A., Moore, D. R. and Weiss, N. O., 1976. J. Fluid Mech. 73, 353.
Jones, C. A. and Moore, D. R., 1977. In preparation.
Kirk, J. G. and Livingston, W., 1968. Solar Phys. 3, 510.
Krishnamurtl, R., 1970a. J. Fluid Mech. 42, 295.
Krishnamurti, R., 19705. J. Fluid Mech. 42, 309.
Krishnamurti, R., 1973. J. Fluid Mech. 60, 285.
Latour, J., Spiegel, E. A., Toomre, J. and Zahn, J.-P., 1976. As troph~s. J. 207, 233.
150
Lipps, F. B., 1976. J. Fluid Mech. 75, 113.
Moore, D. R. and Weiss, N. 0., 1973. J. Fluid Mech. 58, 289.
Musman, S., 1972. Solar Phys. 26, 290.
Plows, W., 1968. Phys. Fluids II, 1593.
Rosshy, H. T., 1969. J. Fluid Mech. 36, 309.
Schneck, P. and Veronis, G., 1967. Phys. Fluids iO, 927.
Schwarzschild, M., 1961. Astrophys. ~.t 134, I.
Spiegel, E. A., 1965. Astroph~s. J. 141, 1068.
Spiegel, E. A., 1971a. Comm. Astrophys. Space Phys. 3, 53.
Spiegel, E. A., 1971b. Ann. Rev. Astr. Astrophxs. ~, 323.
Spiegel, E. A., 1972. Ann. Rev. Astr. Astrophys. iO, 261.
Straus, J., 1972. J. Fluid Mech. 56, 353.
Toomre, J., Gough, D. O. and Spiegel, E. A., 1977. J. Fluid Mech. 79, i.
Toomre, J., Zahn, J.-P., Latour, J. and Spiegel, E. A., 1976, Astrophys. J. 207, 545.
Vandakurov, Yu. V., 1975a. Solar Phys. 40, 3.
Vandakurov, Yu. V., 1975b. Solar Phy.~. 45, 501.
Veltishohev, N. F. and ~elnin, A. A., 1975. J. Fluid Mech. 68, 353.
Veronis, G., 1966. J. Fluid Mech. 26, 49.
Vickers, G. T., 1971. Astrophy s. J. 163, 363.
Weiss, N. 0., 1976. Basic mechanisms of solar activity (IAU Symp. No. 71), ed. V. Bumba and J. Kleczek, p.ZZ9, Reidel, Dordrecht.
Widnall, S., 1975. Ann. Rev. Fluid Mech. ~, 141.
Widnall, S. and Sullivan, J., 1973. Proc. ROy. So~. A 332, 335.
Willis~ G. E. and Deardorff, J. W., 1967. phys. Fluids IO, 931.
Willis, G. E. and Deardorff~ J. W., 1970. J. Fluid Mech. 44, 661.
COMPRESSIBLE CONVECTION
Eric Graham
Department of Applied Mathematics and Theoretical Physics
University of Cambridge, England*
I . INTRODUCTION
Stellar convection zones often extend over several pressure scale heights and
convective velocities can be comparable to the local sound speed. Neither laboratory
convection experiments~or analytic solution of the non-linear equations are feasible
in such regimes. In order to gain insight into the details of s te l lar convection we
are obliged to use numerical simulations. At the present time, even this approach
cannot be applied to the parameter range typical of s te l lar interiors~ however sol-
utions can be obtained which extend over many scale heights and have non-negligible
Mach numbers. Under these conditions i t is necessary to employ the fu l l compressible
equations rather than the anelastic approximation (Gough [1] ) or the Boussinesq app-
roximation (Spiegel & Veronis [~ ).
2. THE PROBLEM
Rather than attempting to model a complete star, we w i l l employ a simplified
geometry. In this way we can fac i l i ta te the numerical calculation, while avoiding
the complexities of treating the transition between the convective zone and the opt-
ica l ly thin region. As a standard problem, we consider a gas confined in a rectangular box with slipp-
ery walls. The upper and lower faces are maintained at fixed but different temperat-
ures, T u and T I . The side walls are thermally insulating, A constant gravitational
f ie ld is imposed which has suff ic ient magnitude to produce a signif icant density var-
iation with height. The equation governing the problem are
and
Present address: National Center for Atmospheric Research, High Alt i tude Observatory,
P.O. Box 3000, Boulder, C'olorado 80303, USA.
t52
where ~ j = ~ ( ~ H- ~)X; -- ~- ~Xs" ,
S is the specific entropy,~} is the coefficient of viscosity, K is the thermal conduct-
i v i t y and al l the other symbols have their usual meaning.
Even i f we prescribe the equation of state, the functional forms of the conduct-
i v i t y and the viscosity and the aspect ratios of the box, we s t i l l have f ive degrees
of freedom in setting up the problem (see Graham E~). In addition we can choose our
i n i t i a l velocity, density and temperature distribution.
Numerical solutions for the two-dimensional problem have been presented by Graham
[3]. The most important parameters are found to be the Rayleigh number, R, the Prandtl
number,Or , and the layer depth parameter, Z, given by
= o p t / K , .
I t is sometimes useful to have solutions of the linear equations for the onset of
convection. These equations have been treated by Spiegel[4], Gough et al[5] and by
Graham and Moore[61. A relative Rayleigh number,~ , can be defined by scaling R by
the cr i t ical Rayleigh number of the linear problem.
3. NUMERICAL METHODS
A variety of numerical methods have been used for compressible convection. The
f i r s t solutions were obtained for two-dimensional motions using a modified Lax-Wendroff
f in i te difference scheme. This method is described by Graham [31. The scheme has
been generalised to three-dimensional flows. An alternative to the f in i te difference
method is the pseudo-spectral or collocation method. This has been successfully used for compressible convection equatiomby Graham (unpublished). Each dependent variable
is approximated by a truncated Chebyshev series in two or three space dimensions. The
series are substituted into the dif ferential equations. The time derivatives of the
coefficients of the series are determined by requiringthat the differential equations
be satisfied at selected collocation points. Because Chebyshev transforms can be cal-
culated using fast Fourier techniques, the method is economical. Both the Chebyshev
scheme and the Lax-Wendroff scheme suffer from numerical s tab i l i ty problems for low
Prandti numbers and large values of the Rayleigh number. Solutions have been obtained
withX =100,c=0.I and Z=IO. Current work is directed at developing an alternating
direction implici t f in i te difference scheme.
153
4. THE RESULTS
Because of the computational labour involved in obtaining three-dimensional sol-
utions, most of the calculations are restricted to two dimensional flows. The calcul- ations reported by Graham [3] relate to a perfect gas law and constant K and n~
A number of general results were found. 1. Two-dimensional solutions evolve to steady state flows. The time taken to reach a
steady state increases with increasing horizontal box dimension and decreasing or. This suggests that for more extreme configurations, there may be no steady solution.
2 '
Uu PP er/u lower
_ ~ t a n t V
constant
I I0 I00
Udownwa rd/uu pwa rd
, , J
1 I0 I00
Figure la Figure lb
0
0. :
Uupper/Ulower
const
c o n s t a n t "r I
1 10
Udownward/Uupward
constant
constant -~ " -
0 ,,,, ,, , I
0.I I
O-
10
Figure Ic Figure ld
154
2. There is an asymmetry between upward and downward velocit ies, downward velocities
usually being larger. Horizontal velocities are similar at the upper and lower surfaces,
with the lower velocity often being s l ight ly larger. This is a surprising result, par-
t icu lar ly for large values of Z, because continuity arguments have been proposed to su-
ggest that convective velocities are larger in low density regions.
3. When the horizontal box dimension is large enough to permit several convective ro l ls ,
the horizontal wavelength differs signi f icant ly from that which would maximise heat
f lux.
4. Convective cells extend over several pressure scale heights in the vertical direction.
No cases were found where the flow breaks up into several rol ls in the vert ical.
Further calculations have been performed with a constant kinematic v iscosi ty , ] / ,
rather than a constant dynamic v iscosi ty ,~. I t had been conjectured that the increase
of l / near the surface reduced the upper horizontal velocity. Figure la shows the ratio
of upper to lower velocity as a function of ~ with Z=10. We see that i t is only for
small values of ~ that the upper velocity is enhanced. Figure lb shows the correspond-
ing behaviour of the ratio of downward to upward velocit ies. The variations with Pran-
dtl number is shown in figures lc and Id. The general conclusion is that the solutions
are insensitive to the form of the viscosity law in the cases of large R and small G ,
which is the regime found in ste l lar convection zones.
1 \(
I I
j7
_ ::):
i
; J ,/
J /
Figure 2
155
Relatively few three-dimensional calculations have been performed. I f the horizontal
box size is comparable to the vertical size, two-dimensional flow patterns are found.
As the horizontal size is increased, the flow pattern becomes time dependent even for
modest values of ~ . Figure 2 shows a velocity f ie ld for Z=I and~ lO . In this per-
spective picture, a rectangular bite has been removed from one corner of the box to
reveal the interior. The arrows represent velocity components parallel to the faces.
The arrows are distributed at random with a probability proportional to the density.
The cut away portion shows that the f lu id has significant vertical vor t ic i ty . Such
regions are observed to be short lived, being dissipated and then reforming in a new
position.
5. CONCLUSIONS
Numerical simulat ion of compressible convection provides a way of obtaining a
detai led picture of s t e l l a r convection. At the present time, solut ions are s t i l l far
from the parameter range found in s t e l l a r i n te r io rs . However the solut ions are well
removed from the Boussinesq l i m i t of laboratory convection experiments. I t is to be
hoped that future developments in the form of more e f f i c i e n t algorithms for computat-
ional f l u i d dynamics, turbulence theories for handling the f ine scale features of the
flow and increases in the available computing resources wi l l al l help in attempts to
construct more real ist ic models of ste l lar convection zones.
REFERENCES
1 Gough, D.O., The anelastic approximation for thermal convection, J. Atmospheric
Sciences 26 (1969) pp. 448-456 2 Spiegel, E.A. & Veronis, G., On the Boussinesq approximation for a compressible
f luid, Astroph~s. J. 131 (1960) pp.442-447 3 Graham, E., Numerical simulation of two-dimensional compressible convection, J .
Fluid Mech. 70 (1975) pp. 689-703 4 Spiegel, E.A., Convective instabi l i ty in a compressible atmosphere I, As trophys.
J. 141 (1965) pp. 1068-1090 5 Gough, D.O., Moore, D.R., Spiegel, E.A. and Weiss, N.O. Convective instabi l i ty in
a compressible atmosphere I I , As__trophys. J. 206 (1976) pp. 536-542 6 Graham, E. & Moore, D.R., The onset of compressible convection, To appear
CONVECTION IN ROTATING STARS
F.H. BUSSE University of California, Los Angeles
SUMMARY
It is shown that many features of convection in rotating spheres and spherical
shells can be understood on the basis of plane layer models. The phenomenon of
differential rotation generated by convection is emphasized. The potential applications
and limitations of analytical and numerical models for problems of astrophysical
interest are briefly discussed.
I INTRODUCTION
In thinking about the effects of rotation in stars a variety of thoughts comes
to mind. Some are more negative: Rotation is a breaker of symmetry. It spoils our
notion of a star as an ideal spherically symmetric body which is in static equilibrium
except for the convection zones. Even the latter can be regarded as spherically symme-
tric with respect to their gross properties in the absence of rotation. Although the
deviations from spherical symmetry are small in most rotating stars, the effects of
rotation are only partially known and continue to irritate the theoretician involved
in computations of stellar evolution.
On the other hand stellar rotation is an exciting subject because of the variety
of interesting phenomena associated with it. The generation of magnetic fields is in
general connected with rotation. The shape of surfaces of equal potential in a
rotating star may become unstable in phases of contraction. Rotation can cause
meridional circulations and mixing processes and, in addition, there is a variety of
phenomena connected with differential rotation.
For the theoretical fluid dynamicist rotation brings to mind still other thoughts.
Whenever the Coriolis force becomes dominant the dynamics of fluids are profoundly
altered. The intuition developed from experience with hydrodynamics in non-rotating
systems is no longer valid. Intuitive concepts like mixing length theory appear to be
even less applicable in the case of low Rossby number convection, i.e. when the vor-
ticity of motion relative to the rotating system is small compared to the rotation
rate. On the other hand, theories developed for small amplitude convection appear to have
a much larger range of validity than in a non-rotating system. The two-dimensionality
enforced by a dominating Coriolis force tends to suppress instabilities and restricts
the degree of freedom for turbulent motion.
In the following theoretical and experimental results for the small Rossby number
N
o)
O)
a) Convection columns aligned with the axis of rotation.
b) Convection rolls in a
layer rotating about a
vertical axis.
FIGURE
]
158
case will be presented. Among the nonlinear phenomena caused by convection in rotating
systems we shall emphasize the generation of differential rotation. In discussing the
application to rotating stars we shall restrict our attention to the Sun and Jupiter.
The detailed surface observations available in both cases offer the best hope for
eventual quantitative tests of theoretical concepts.
2 BASIC EFFECTS OF ROTATION ON CONVECTION
The dynamics of nearly stationary motions in a rotating system are
governed by the Proudman-Taylor theorem which states that a small amplitude stationary
velocity field of an inviscld incompressible fluid must be independent of the
coordinate in the direction of the axis of rotation. It is of interest for astrophy-
sical applications that the theorem holds for barotropic fluids as well if the velocity
vector ~ is replaced by the momentum vector p~: By taking the curl of the equation
of motion
2 ~ x pv = - Vp- pV~
and using the equation of continuity
V- p~ = 0
the relationship
2~" V~ = 0 (I)
is obtained. In the following we shall restrict our attention, however, to the
case of incompressible fluids, or, more exactly, Boussinesq fluids for which the
temperature dependence of the density is taken into account in the gravity term
only.
We start the discussion of convection in rotating systems by considering two
simple cases as shown in Figure 1. In case (a) the vectors of gravity and rotation
are at a right angle and convection solutions satisfying the Proudman-Taylor theorem
are possible. The Coriolis force is entirely balanced by the pressure gradient in
that case and the critical value of the Rayleigh number for the onset of convection
becomes the same as in a nonrotating system. Since the Coriolis force always increases
the critical Rayleigh number unless it is balanced by the pressure, the solution
corresponding to convection rolls aligned with the axis of rotation is physically
preferred. It can be easily realized in the laboratory by heating a cylindrical
rotating annulus from the outside and cooling it from the inside and using the
centrifugal force as gravity (Busse and Carrigan, 1974).
While the stabilizing effect of the Coriolis force vanishes in case (a) it
reaches its maximum in case (b) when the vectors of gravity and rotation are parallel.
This is realized when a fluid layer heated from below is rotating about a vertical
axis. Release of potential energy by convection requiresa vertical component of
I
I I .~
..4 .
...
i -
s
(o)
g---
~
Convection columns in an annulus with
inclined top and bottom boundaries.
Fig
ure
2
: Convection layer inclined with
respect to axis of rotation.
f
/
(C)
Convection layer with
changing light.
160
motion which cannot occur without violating the Proudman-Taylor theorem. In order to
overcome the constraint of the Proudman-Taylor theorem viscous friction must become
sufficiently strong, thus playing a destabilizing role in this case. The non-dimen-
sional number describing the ratio between viscous friction and Coriolis force is
the Ekman number
E = ~ / ~ d 2 (2)
where d is the thickness of the layer and ~ is the kinematic viscosity. Since E is
very small in most applications, the horizontal scale of convection must become much
smaller than the vertical in order to increase friction. More detailed analysis (we
refer to Chandrasekhar's (|961) book) shows that the horizontal scale decreases like
E I/3 and the Rayleigh number for onset of convection increases like E -4/3 for small
E. Besides the Ekman number and the Rayleigh number, which is a measure of the buoyan-
cy, the Prandtl number is the third dimensionless parameter of the problem. It des-
cribes the ratio between thermal and viscous time scales of convection. For Prandtl
numbers less than a value of about I, oscillatory convection offers an alternate way
to overcome the constraint of the Proudman-Taylor theorem wihtout changing, however,
the power laws in the dependences on E.
3 EFFECTS OF INCLINED BOUNDARIES
The two extreme cases (a) and (b) of Figure I obviously correspond to equatorial
and polar regions, respectively, of rotating spherical fluid shells heated from within
and subjected to spherically symmetric gravity. There are, however, some important
deviations because of the finite dimensions of the spherical shells. To discuss these
effects let us consider the influence of inclined boundaries in (a) and (b). If top
and bottom boundaries are added in case (a) convective motions satisfying the Proud-
man-Taylor theorem are still possible as long as the boundaries are parallel and vis-
cous friction is negligible. Boundaries inclined with respect to each other, however,
require a dependence of the velocity field on the coordinate in the direction of the
axis of rotation, which we shall cal z-coordinate. A typical example is shown in
Figure 2 (a). The deviation from the Proudman-Taylor condition is accomplished by a
combination of time dependence and viscous friction in this case: Convection still
has the form of columns aligned with the z-axls, but the columns are travelling like
Rossby waves in the prograde or retrograde azimuthal direction depending on whether
the distance between top and bottom boundaries decreases or increases with distance
from the axis. In addition the azimuthal wave number ~ becomes large in order to
increase frictional effects. In the limit of small values of E we find
R = ( ~ ~ )4/3 , ~ = ( ; )I/3 , -I
(3)
/-
0 Figure 3:
Buoyancy force A and inhibition C
caused by inclined boundaries in
a sphere as a function of distance
S from the axis.
Figure 4:
Sketch of motion at the onset of
convection in a rotating sphere.
162
rotation rate, and n is the tangent of half of the angle between the inclined
boundaries. A detailed theory and an experimental study of the stabilizing effect
of inclined boundaries can be found in the papers by Busse (1970b) and Busse and
Carrigan (1974). In section 4 we show how the theory can be applied more or less
directly to the case of convection in a sphere.
Since convection at the mid-latitudes of a rotating spherical shell corresponds
to intermediate angles between gravity and the rotation vector, it may be anticipated
that it shows properties intermediate to those of the extreme cases (a) and (b) of
F~gure I. Indeed, it is easily shown (Chandrasekhar, 1961) that both the Rayleigh
number and the wave number at the onset of convection are governed by the expressions
derived for case (a) if the rotation rate ~ is replaced by its vertical component
cos ~ . Convection occurs in the form of rolls aligned with the horizontal component
of ~ , as indicated in Figure 2(b). Accordingly, the component of the Coriolis force
proportional to ~ sin ~ is balanced by the pressure and drops out of the dynamical
considerations.
When applying the theory of plane parallel convection layers to spherical shells
the strong dynamical coherence of the fluid along any line parallel to the axis of
rotation must be kept in mind. For this reason convection in a spherical shell
exhibits the effects of non-parallel boundaries even though the distance between the
boundaries is constant. Since the tangential surfaces to the spherical boundaries
of the shell are not parallel at the points intersected by the same line parallel to
the z-axis, the dynamics of convection exhibit the same effects as in the case of the
convective layer shown in Figure 2(c). The variation of "height" with distance from
the axis of rotation induces a wave propagation property of the convective motions
similar to that of the convection columns in Figure 2(a). Because of the particular
phase relationship between buoyancy force and motion the phase propagation velocity is
opposite that of Rossby waves, at least for Prandtl numbers of the order I/3 and
larger (Busse and Cuong, 1976).
4 CONVECTION IN ROTATING SPHERES AND SPHERICAL SHELLS
The problem of convection in a self-gravitating rotating fluid sphere has been
traditionally considered for the case of homogeneous internal heating. Both gravity
vector and temperature gradient vary linearly with distance r from the center in this
case. Roberts (1968) gave a detailed mathematical analysis of the problem. The physically
realized mode was determined by Busse (1970b).
An approximate solution of the problem can be obtained without any numerical
analysis by applying the concept of convection in a rotatin~ annulus, as shown in
Figure 2(a). Because of the coherence in the z-direction enforced by rotation and the
small length scale of the convection columns in the perpendicular direction,
convection in any cylindrical section of the sphere behaves as in the corresponding
163
Figure 5: Laboratory simulation of convection in a rapidly rotating sphere. The motions are made visible by small flaky particles which align with the shear.
164
annulus problem. In Figure 3 the stabilizing effect C of the Coriolis force owing
to the inclination of the boundary has been plotted as a function of the distance
s from the axis together with the buoyancy force A~ which is given by the product of the s-
components of gravity and temperature gradient, since the z-component of the buoyancy
force has little effect on the convection motion. The minimum of C/A at a distance s
of about half the radius indicates the cylindrical surface where the onset of
convection will occur as the critical value of the temperature gradient is reached.
Figure 4 gives a qualitative sketch of the solution of the problem.
The fact that only the component of gravity perpendicular to the axis of
rotation enters the dynamics in a first approximation is the basis for the laboratory
simulation of the convection process (Busse and Carrigan, 1976). By using centrifugal
force in place Of gravity and by cooling the sphere from the inside and heating it
from the outside the convection flow described above can be realized in a laboratory
experiment. The onset of convection occurs in the form of regularly spaced columns,
as shown in Figure 4. When the buoyancy force increases beyond the critical value,
the region of convection is extended until the entire sphere is filled by convection
columns. While amplitude fluctuations and the difference in the speed of propagation
cause deviations from the regular picture at low amplitudes the perfect alignment of
the columns persists, as shown in Figure 5.
The analysis of the spherical case applies directly to the equatorial region of
spherical shells outside the cylindrical surface touching the inner boundary at the
equator. In all cases the Rayleigh number for the onset of convection is lower in that
region than in the other parts of the fluid shell. Inside the cylindrical surface the
onset of convection can be described approximately by applying locally the theory of an
inclined convection layer if the effects discussed in connection with Figure32(b) and
2(c) are taken into account. Of particular interest is the prograde propagation of
convection modes everywhere except at the poles. An asymptotic analysis for different
radius ratios and for varying Prandtl number P is given by Busse and Cuong (]976).
Figure 6 shows the local Rayleigh number for onset of convection as a function of the
distance from the axis in a typical case. The corresponding wave number and frequency
of convection are also shown. The asymptotic results agree reasonably well with the
earlier numerical results obtained by Gilman (1975) at finite values of E in the case
P= I and for a radius ratio ~/~ = 0.8, which is appropriate for the solar convection
zone.
Figure 7 illustrates the most important feature of convection in a rapidly
rotating spherical shell: The change in the character of convection across the
cylindrical interface s = r i. While the vorticity of the motions is nearly z- independent
for s > r i the z- component of vorticity changes sign between lower and upper parts
of the convection cell for s < r.. This change in the symmetry of convection has z
important effects on the nature of the differential rotation generated by convection
and on the heat transport.
165
t00
RE-Z/3
50
Figure 6:
R a
\
\ ,
/
a i # / ,'~
,, . . - ; : q ~t ~
1 ; J
/
" ' ' " I I , i t I
.5 S
R
1 I I I
2
S ,,,"?'./wE-
Rayleigh number R for the onset of convection in a spherical shell with radius ratio r4/r~ = 0.6 as a function of distance S from t~e ~xis. Wave number a and frequency e of convection columns are shown by dashed lines.
I66
j/
I
/
Figure 7: Sketch of convection modes in a spherical shell.
iO s
RO
TATI
ON
AL
REG
IMES
D
UE
TO
CO
NVE
CTI
ON
IN
R
OTA
TIN
G
SP
HE
RIC
AL
SH
ELL
SO
LID
R
OT
AT
ION
R
~
T.8
T z
/3
I r,."
10 4
bJ
El
Z
I (3
Ld
.d
10 3
>
- S
OL
ID
RO
TA
TIO
N
R ~
0
,84
T z
/3
Pra
nd
tl
Num
ber
P=
1 S
tre
ss
fre
e
bo
un
da
rie
s
I0 z I0
z
IO 3
I0 4
I0 s
TA
YL
OR
N
UM
BE
R
T -"
" Figure 8:
Regimes of differential rotation from Gilman
(1976a).
IO s
168
5 NONLINEAR ASPECTS
The phenomenon of solar differential rotation has stimulated much of the
recent effort to understand convection in rotating spherical shells. It was first
shown by Busse (1970a) that convection in a spherical shell can generate a differen-
tial rotation of the same form as that observed on the Sun. While Busse used an
analytical perturbation method in the thin shell limit, Durney (1970) independently
developed a mean field approach for the solution of the problem from which he
obtained--after using the wave propagation property demonstrated by the analytical
theory--essentially the same results. The exciting aspect of the observed solar
phenomenon as well as of the theoretical results is that a prograde differential
rotation occurs at the equator. This eontradicts the earlier notion of angular mixing
by convection which would have led to a deceleration of the equatorial region.
That the hypothesis of angular momentum mixing by convection is incorrect
can easily be demonstrated in the case of convection in a cylindrical annulus
discussed earlier. Since the Coriolis force can be entirely balanced by the pressure
in this case, the influence of rotation disappears from the full nonlinear equation
for two-dimentional convection rolls. Differential rotation cannot be a part of the
solution since the basic equations are identical to those in a nonrotating region in
this case and since a preferred azimuthal direction cannot be distinguished. Generation
of differential rotation obviously depends on secondary features such as the curvature
of the boundaries, and cannot be predicted by simple physical arguments.
How complicated the phenomenon of differential rotation in a convecting
spherical shell can become at higher Rayleigh and Taylor numbers is evident from
the numerical computations of Gilman (]972, 1976a,b). Because both the Reynolds stresses
of the fluctuating convection velocity field and the meridional circulations caused
by the inhomogeneity of convection contribute to the generation of differential rotation,
small changes in the parameters of the problem may change the form of differential
rotation dramatically. Figure 8 from Gilman (1976a) shows how the equatorial maximum of
angular velocity changes into a relative minimum as the Rayleigh number is increased.
The influence of boundary conditions also appears to be important. The almost exclusively
used stress-free boundaries actually represent a singular case in the thin shell limit
(Busse, ]973) since an equilibration between Reynolds stresses and viscous stresses can
take place only in the latitudinal direction.
In order to investigate the generation of differential rotation in a conceptually
simple ease, the problem of convection in a rotating cylindrical annulus has recently
been studied both experimentally and theoretically. Since the measurements are still in
progress we restrict our attention to the qualitative picture, as shown in Figure 9. No
differential rotation is generated in the case of straight top and bottom boundaries
S
/
/
---~--~--Figure 9:
Differential~
rotation generated by convection
~-in
a rotating cylindrical annulus.
170
of the annulus. The experimental observations show an increase of the gradient of
angular momentum for convex boundaries and a decrease for concave boundaries, in
agreement with theoretical predictions.
Meridional circulation and latitudinal variation of the convective heat
transport are other important nonlinear properties of convection in spherical shells.
Both phenomena are closely linked since the variation of the mean temperature caused
by an inhomogeneous heat transport is the most important cause of meridional circulation.
The lack of observational evidence for either phenomenon on the solar surface has been
a source of controversy in the interpretation of theoretical models. We shall return
to this point in the next section.
6 APPLICATIONS TO THE SUN AND JUPITER
It is fortunate for the theory of convection in rotating stars that there
exist two quite different celestial bodies for which detailed surface observations are
available. In the case of the Sun the influence of rotation is relatively small: The
Rossby number is large compared to unity at least for the velocity field in the
upper part of the convection zone. Jupiter represents the opposite case of a rapidly
rotating system characterized by a small Rossby number. Although about half of the
energy emitted from the surface of Jupiter is received from the Sun, the convective
heat transport required for the other half is the dominating source of motions in the
Jovian interior. In this respect Jupiter does indeed represent a low Rossby number
example of a rotating convecting star.
The application of theoretical models which are valid at best for systems of
laboratory scales to systems of stellar dimensions faces obvious difficulties. It is
con~mon practice to take into account the effects of turbulence owing to motions of
smaller scale than those considered in the form of an eddy viscosity ~e which
replaces molecular viscosity in the equations of motion. The main justification for
this procedure is that it appears to work well in many cases.
If ~ is chosen sufficiently large that the Rayleigh number and Taylor number e
4E -2 are not too large the differential rotation observed on the Sun resembles that
predicted by the theoretical models fairly well. There is also evidence for the large-
scale convection cells, often called giant cells, girdling the equator like a cartridge
belt (Howard and Yoshimura, 1976). Figure I0 shows a laboratory simulation. The radius
ratio in the laboratory experiment is closer to unity than in the solar case and the
number of cells is correspondingly larger. Otherwise the cells show a surprising
resemblance to those observed on the Sun by Walter and Gilliam (1976). Because the
latter authors show magnetic regions a direct physical interpretation of the phenomeno-
logical resemblance is difficult, especially since the simultaneous occur~neeOf magne-
tic features which are syrmetric or antisymmetric with respect to the solar equator is
not well understood.
171
Figure 10: Laboratory simulation of convection in a rotating spherical fluid shell with inner radius r i = 4.45 cm and r = 4.77 Cm.
o
172
The measurement of the Coriolis deflection of the horizontal motion in
supergranules by Kubicela (2973) appears to be the only direct determination of the
effect of rotation on solar convection. Kubicela interprets the observed deflection
of the velocity as the Coriolis acceleration multiplied by the lifetime of a super-
granule. Using a lifetime of 20 h he finds reasonable agreement with the measurements.
Since the supergranular velocity field is defined as the mean over a field of highly
fluctuating granular motions, the eddy viscosity concept can be used as an alternative
possibility of interpretation. Using the linear solution for a convection cell in a
rotating layer with stress-free boundaries (Chandrasekhar, 196]) we find the expression
2e tgy
~ 2d2 e
for the angle 7 of deflection, where d is the depth of the supergranular layer.
For simplicity we have assumed that the horizontal wavelength of the cells is large
in comparison with d. Using ~ ]09 cm and ~ = 2.6 " 20 -6 sec -2 we derive from the
observed angle y ~ 20 ° an eddy viscosity of the order 2 • j0]2cm2sec -2 , which is in
reasonable agreement with values derived from other more heuristic considerations.
For the larger scale of giant cells a slightly higher value of ~e appears to be
appropriate yielding an Ekman number of approximately ]0 -2, which is of the same order
as the value used by Gilman (]976b) in his numerical simulation of the solar convec-
tion zone.
It should be mentioned that earlier theories of the solar differential rota-
tion by Kippenhahn (2963) and others used the concept of an anisotropic eddy viscosi-
ty proposed by Biermann (1958). This concept often mimics the anisotroplc dynamical
influence of large-scale eddies. If the deviations from rigid rotation are described
in terms of an anisotropic viscosity it would seem reasonable in view of the more
detailed theory described above to use a latitude-longitude anlsotropy rather than
a horizontal-vertical anisotropy as proposed by Biermann.
Raylelgh numbers for stellar convection zones are based on the superadiaba-
tic part of the temperature gradient, which amounts in general to only a small frac-
tion of the total temperature gradient. A small change in surface temperature causes
a disproportionately large change in the Rayleigh number and an even larger in the
convective heat transport. The convection zone reacts like a highgain amplifier to
any change of the temperature at the surface and it is not surprising that no subcri-
tical large-scale variations of the solar surface temperature are observed. Since the
temperature determines the energy emission, the convective heat flux must adjust itself
to a uniform value. Ingersoll (]976) has emphasized this point in the case of Jupiter,
where the convection heat transport adjusts itself in such a way that large-scale va-
riations of the surface temperature vanish.
For this reason the heat flux variations and associated meridional circulations
N
Ob
serv
ed
C
om
pu
ted
,, -
' ,
~ _
l\
~~
C~
:~
'
~ \ \
\ \ \
~ \\\
~\
\ ~\ \
\ ~ \
\ 1_
1 ~r~
I
1\
\ k i,
\\ \
\\\
\ \~
\\\:
\\\
\\\x
t\\'
\l
\'t\~
l\~lk
"
l \
II
I1
\
..... I,
i ..
..
.
i .
..
..
..
..
..
..
..
Figure ii:
Comparison between theoretical predictions and observations
of bands on Jupiter (from BUSSe, 1976).
174
of low Rayleigh number models do not have much meaning for high Rayleigh number
stellar convection zones. Even in laboratory experiments it is apparent that the
inhibiting influence of rotation on the convective heat transport reverses itself
with increasing Rayleigh number. Rossby's (1969) measurements even show a slight
increase in heat transport owing to rotation at high Rayleigh numbers. The
generation of differential rotation, on the other hand, depends on the alignment
effect rather than the inhibition effect of rotation. It seems intuitively reasonable
that the former effect, which does not have direct energetic consequences, persists
at high Rayleigh numbers, while the latter effect is diminished by nonlinear processes.
Because of its low Rossby number, convection in the planet Jupiter may be
more accessible than solar convection to theoretical analysis. A simple model has
recently been proposed (Busse, |976). It is generally believed that a transition
from molecular to metallic hydrogen occurs at a radius of about 5/7 of Jupiter's
outer radius and that the interface inhibits penetration by convection. Accordingly
we are faced with the problem of convection in a rotating shell as sketched in Figure 7,
which was actually drawn to apply to Jupiter. The fact that a relatively sharp transition
from the low latitude band structure to the polar region of random eddy motion is
observed on Jupiter at about 45 ° latitude appears to be the strongest argument for a
dynamical influence of rotation along the lines outlined in this paper. To obtain a
more detailed comparison as shown by Figure 11 the concept of an eddy viscosity must
be invoked again. The value of ~ required for a fivefold layer of convection columns e
is in good agreement, however, with the eddy viscosity deduced from convection models
for the heat transport. More elaborate models are clearly possible and Jupiter may
well become the testing ground for future theories of convection in rotating stars.
175
REFERENCES
BIERMANN, L. 1958 IAU Syrup. N ° 6, 248
BUSSE, F.H. 1970a Astrophys. J. 159, 629-639
BUSSE, F.R. 1970b J. Fluid Mech. 444, 442-460
BUSSE, F.H. ]973 Astron Astrophys. 27, 27-37
BUSSE, F.H. 1976 Icarus, in press
BUSSE,F.H. and CARRIGAN, C.R. 1974 J. Fluid Mech. 62, 579-592
BUSSE F.H. and CARRIGAN, C.R. 2976 Science 192, 81-83
BUSSE, F.H. and CUONG, P.G. 1976 Geephys. Fluid Dy., in press
CHANDRASEKHAR, S. 1961 Hydrodynamic and Hydromagnetic Stability Oxford Clarendon Press
DURNEY, B.R. ]970 Astrophys.J. 26]_, ]II5-]127
GILMAN, P.A. ]972 Solar Phys. 27, 3-26
GILMAN, P.A. 1975 J. Atmos. Sci. 32, 1332-2352
GILMAN, P.A. 1976a Proc. IAU Symp. 71, in press
GILMAN, P.A. 1976b J. Fluid Mech., submitted
HOWARD, R. and YOSHIMURA, H. 1976 Prec. IAU Syrup. N ° 71, in press
INGERSOLL, A.O. 1976 Icarus, in press
KIPPENHAHN, R. 2963 Astrophys.J. 137, 664
KUBICELA, A. ]973 P roe. 1st European Astr~ Mt~. Solar Activity and Related Interpla-
netary an d Terrestrial Phenomena, J. Xanthakis, ed. Springer
ROBERTS, P.B. |968 Ph~l. Trans. Roy. Soc0 London A 263, 93-I]7
ROSSBY, H.T. ]969 J. Fluid Mech. 36~ 309-335
WALTER, W.T. and GILLIAM, L.B. 1976 Solar Phys., in press
MAGNETIC FIELDS AND CONVECTION
N. O. Weiss
Department of Applied Mathematics and Theoretical Physics
University of Cambridge
SUMMARY
In a highly conducting plasma convection is hindered by the imposition of a
magnetic field. Convection may set in as direct or overstable modes and behaviour
near the onset of instability depends on the ratio of the magnetic to the thermal
diffusivity. Vigorous convection produces local flux concentrations with magnetic
fields that may be much greater than the equipartition value. The interaction
between magnetic fields and convection can be observed in detail on the sun and is
essential to any stellar dynamo.
I. INTRODU6TION
Magnetic fields - whether primeval or maintained by dynamo action - are
ubiquitous. Any rotating, convecting star seems able to generate a magnetic field,
though the interaction between convection, rotation and magnetic fields bristles with
problems for the theorist. We can usefully distinguish between the problem of
maintaining large scale fields by dynamo action, which will be discussed by Dr
Childress, and that of the interaction between small scale convection and an imposed
magnetic field. I shall assume that any convective timescale is short compared with
the lifetime of large scale magnetic fields and I shall not concern myself with their
origin.
The scale of ordinary laboratory experiments is too small for them to model
hydromagnetic behaviour in astrophysical plasmas. However, the sun provides a
marvellous laboratory where such phenomena can be observed. Sunspots are dark
because normal convection is suppressed by the strong magnetic fields; on a smaller
scale, it is now possible to resolve features a few hundred kilometres across and
to follow the interaction between weak fields and granular convection.
This increase in resolution has revealed more magnetic structures and strongsr
magnetic fields than had been expected.
The theoretical description of a convecting system is particularly rich when
stabilizing and destabilizing effects compete in it (Spiegel 1972). Dr Huppert has
reviewed thermohaline convection; the nonlinear Lorentz force makes magnetic con-
vection yet more complicated. I shall first summarize the results of linear theory
and then discuss various nonlinear problems: is motion steady or oscillatory? are
there subcritical instabilities? how is energy transport affected by the field?
what limits flux concentration between convection cells and how strong are the fields
177
produced? Not all these questions are yet answered but nonlinear magnetic con-
vection is gradually being understood. Finally, I shall try to relate this theory
tosolar magnetic fields and to some aspects of the dynamo problem.
2. LINEAR THEORY
In the absence of a magnetic field a stratified gas is stable to adiabatic
perturbations if Schwarzschild's criterion is locally satisfied. The imposition
of a unlform magnetic field inhibits the onset of oDnvection: a plane, perfectly
conducting layer is eonvectively stable if
(Gough and Tayler 1966), where B O is the vertical component of the magnetic field,
T is the temperature, p the pressure, ~ the ratio of specific heats, /x the
permeability and the adiabatic gradient (dlnT/dlnP)ad = (~-i)/~ for a perfect gas.
Strong magnetic fields can therefore hinder the onset of convection in a star,
though the difference between the adiabatic and the radiative gradient is usually
large enough for instability to occur before the latter is attained (Moss and
Tayler 1969, 1970; Tayler 1971).
When the conductivity ~ is finite, plasma can move across the lines of force
and the stabilizing effect of the magnetic field is relaxed. What happens depends
on the relative values of the magne~c diffusivlty ? = ~#~)" and of the thermal and viscous diffusivities ~ and ~ . In typical stellar conditions, ~ q ~ ~.
The onset of instability in a Boussinesq fluid has been studied in detail (Thompson
1951; Chandrasekhar 1952, 1961; Danielson 1961; Weiss 1964a; Gibson 1966). For a
plane layer of depth d the stabilizing effect of a uniform magnetic field is
measured by the dimensionless Chandrasekhar number
_ ~ a ~
which is the square of a Hartmann number and can be regarded as a "magnetic Rayleigh
number" (Spiegel 1972). A configuration is defined by Q, by the RayleiKh number
R = g~d4/~9 , where ~ is the coefficient of thermal expansion and ~ the super-
adiabatic temperature gradient, and by the Prandtl number ~= ~/~ and the
magnetic Schmidt (or Frandtl) number
=
If, for simplicity, we adopt "free" boundary conditions (Chandrasekhar 1961,
Gibson 1966) then the linear modes have the form
, , , , = Wc ) e ,
where W(z) = WosinrCz (0 4 z ~ d) and
178
with W and a constant, referred to cartesian co-ordinates with the z-axis vertical. o
If ~ q (~)T) linear instability sets in as in ordinary Rayleigh-B6nard
convection. The growth rate s is real and instability sets in as a direct mode~
corresponding to an exchange of stabilities, when R = R Ce) . (Semantics are
succinctly sunnnarized by Spiegel, 1972.) For large Q, R (e) is a minimum when the
dimensionless horizontal wavenumber
so convection first appears in vertically elongated cells at R = R (e~ ~ ~2Q. c
Standing hydromagnetic waves in an unstratified fluid produce oscillations
which are damped by ohmic and viscous dissipation. When ~ >~ these oscillations
may be destabilized by the thermal stratification (Cowling 1976a), so that con-
vection sets in as overstable oscillations when R = R t~ For sufficiently large
Q, overstability first occurs in elongated cells, when
When ~< ~ , therefore, R (°) ~ R (~ and instability first appears as overstable e c
oscillations. At R = R (~ there are two complex conjugate growth rates but as the c
Rayleigh number is raised IIm (~[ decreases until for some R = R ~O the growth
rates are purely real. Thus convective instability sets in with direct modes at
R=RC° AsQ+ ~ , for ~<<~<' (~?,>~) ~ ~CZ~Q, =(~)~ and the minimum value of R ~+) is ~++) ~ (~/=) =~ ~ (Danielson 1961; Weiss
1974a); thus R +~ <~ RIO << R+~ For R +O ~ R < R +~ there are two distinct c c c
positive real growth rates. One of these changes sign when R = R(e) but this
exchange of stabilities has no physical significance.
So far we have considered only free boundary conditions. Analogous results
hold also for other boundary conditions (Chandrasekhar 1961, Gibson 1966). In
particular, the effect of superposing a stable layer on top of the unstable region
has been investigated by Musman (1967) and Savage (1969). The treatment has also
been extended to include some effects of compressibility (Kato 1966; Syrovatsky and
Zhugzhda 1967, 1968; Saito and Kato 1968). If the Alfv6n speed is small compared
with the sound speed, slow magnetosonic oscillations become overstable; if the
Alfv~n speed is large, the fast magnetosonic mode can be destabilized (Cowling
1976b).
3. NONLINEAR cONVECTION
In a Boussinesq fluid the magnetic field satisfies the induction equation
~B
~t
179
while the vorticity _&) and the temperature T are governed by the equations
~ . g - - - - ~ - - - -
"~T
and
~.~ ~ 0 , V_B = O.
Here ~ = ~-J ~zA~ - is the electric current, ~ is the velocity and ~ the density.
For two-dimensional convection, with ~ and B confined to the xz-plane and independent
of y, B can be described by a flux function (the y-component of the vector
potential) A such that
and
while the vorticity equation reduces to
The most convenient boundary condition on the field is obtained by setting B x = 0
at z = O, d. This is somewhat artificial but corresponds to the free boundary
conditions adopted for linear theory.
Near the exchange of stabilities (R = R (e)) finite amplitude solutions can be
constructed using modified perturbation theory. Veronis (1959) observed in a
footnote that subcritical instabilities were possible when ~ . Busse (1975) has
considered a two-dimensional model in which the magnetic field affects the
amplitude, but not the form, of the motion. For R near Rc, the critical Rayleigh
number in the absence of a magnetic field, he combined a perturbation expansion for
the velocity with a computed solution for the distorted magnetic field. He showed
that when ~ >> ~ stationary convection is possible with R c ~ R ~ R :~ In
these solutions the magnetic Reynolds number
U~
(where Q is a typical velocity) is large and the magnetic field is confined to
narrow regions so that its overall stabilizing effect is correspondingly reduced; an
analogous argument applies to the thermohaline problem that Dr Huppert has described.
Since subcritical convection appears when ~4 T and R~)~< R ~) this technique
cannot rigorously establish whether steady finite amplitude solutions are possible
before the onset of overstability. However, Busse's results do suggest that such
subcritical instabilities may occur and Proctor (private communication) has developed
a simplified model of magnetic convection which shows steady motion when R is close
t o R . e
180
Perturbation methods are reliable only while the P~clet number (U d/w) is low
and the convective energy transport relatively small. The efficiency of convection
is measured by the Nusselt number N = (F -~ad)/~, where F is the total thermo-
metric flux and ~ad the adiabatic temperature gradient. The effect of a magnetic
field on the Nusselt number was investigated by van der Borght, Murphy and Spiegel
(1972)~ using the mean field approximation (Spiegel 1971). They considered only
steady convection which, in this approximation, is independent of both ~ and ~ .
For a fixed Rayleigh number, N decreased monotonically and smoothly with increasing
Q until the exchange of stabilities was reached~ thereafter convection was completely
suppressed. Van der Borght (1974) has also attempted to describe the time dependent
problem.
Fully nonlinear two-dimensional computations show a different range of
5ehaviour (Weiss 1975). For free boundary conditions with ~= I, ~ ~ i there is
a general tendency to generate nonlinear oscillations. When ~ = 5, convection first
appears as overstable oscillations in accordance with linear theory. If Q is then
decreased, while R is kept fixed, these oscillations are stabilized at some small
but finite amplitude and convection remains comparatively inefficient. As Q is
further reduced~ R ~) becomes less than R and linear theory predicts direct,
exponentially growing modes. These appear in the numerical solutions but eventually
develop into periodic nonlinear oscillations. The Lorentz force is quadratic in
and linear theory underestimates the restoring force. Hence nonlinear magnetic
convection differs from thermohaline convection, where the stabilizing force remains
linear. If Q is decreased yet further the oscillations develop into irregular
aperiodic motion and, eventually~ into steady convection. The tlme-averaged Nusselt
number rises monotonically as Q decreases but there are no noticeable discontinuities,
nor could any hysteresis be detected,
Dr. Galloway will describe his numerical study of axisymmetrlc convection in a
magnetic field. The results are qualitatively similar, though he found some
hysteresis~ indicating different solutions when Q was decreased from the critical
value and subsequently increased. So far, however, numerical experiments have
provided no evidence of any Jump in the Nusselt number or of any metastable conducting
state associated with suberitlcal convection. Computations on thermohaline convection
(~uppert and Moore 1977) demonstrate that such phenomena can occur. Busse's finite
amplitude results and Proctor's simple model both suggest that, with suitably
chosen parameters~ metastable magnetic configurations should exist. Further com-
putations, with a wider range of diffusivities, are needed to establish w~ether
subcritical convection can be found. It is obviously important to determine what
parameter ranges allow metastable states and whether linear stability theory has any
relevance to convection in a strong magnetic field in a star.
4. FLUX CONCENTRATION
In the limit when Q is su~dclently small the magnetic field is weak and the
Lorentz force has no dynamical effect. The velocity R can then, in principle,
181
be derived from some theory of ordinary Rayleigh-B~nard convection. If ~ is then
fed into the induction equation the kinematically distorted magnetic field can be
calculated. This has been done for various plausible velocity fields (Parker
1963; Clark 1965, 1966; Weiss 1966; Clark and Johnson 1967; Busse 1975). When the
magnetic Reynolds number is high, magnetic flux is rapidly swept to the edges of
convection cells to form ropes. Within a ceil, the field is wound up until the
lines of force eventually reconnect and magnetic flux is expelled. If R >> i the m
pattern of motion must persist for many turnover times before the expulsion process
is completed. However, ropes are formed between the cells by the time they have
turned over once. Within these ropes the field strength has an approximately
Gaussian profile.and the peak field
B* ~ R ½ B in two dimensions mo
R B in three dimensions~ mo
where B is the average initial field (or the field in the absence of convection). o
Astrophysical length scales are large and ~l is big enough for enormous
magnetic fields to be produced locally if concentrations were purely kinematic.
Eventually, the J^B force in the flux rope must become powerful enough to halt the
concentration: amplification of the field is then dynamically limited. But it is
not immediately obvious what limiting field strength can be produced. Partly on
dimensional grounds, it has been popularly supposed that the local field strength
cannot exceed the equipartition field Be, where
~ : , U ~ = ~
The principal argument for this limit depends on considering pressure fluctuations
associated with convection but in a Boussinesq fluid the pressure can be eliminated
from the equations and the equipartition limit should therefore be irrelevant.
Busse (1975) showed that for small amplitude two-dimensional convection
B*~ (~)i/4. Hence B*/B e could be made arbitrarily large by a suitable choice of
. The full two-dlmensional problem has been investigated numerically for con-
vection driven by imposed horizontal temperature gradients (Peckover and Weiss
1977) and by heating from below (Weiss 1975), and Dr, Galloway has computed solu-
tions for axisymmetric convection. The maximum value of the peak field B* can be
estimated by a simple argument. The two-dlmensional results show that kinematic
amplification is halted when ohmic dissipation in the flux rope becomes comparable
with viscous dissipation throughout the convection cell. It follows that the
maximum field B* ~.~/4,-- a result confirmed by the computatio~ In three dim- max
ensions a similar argument yields a maximum field B*max~'~'(galloway etal. 1977).
By choosing ~" sufficiently large, B* can be made much greater than B and solutions e
have been obtained with B*/Bem 5 (though the particular value has no significance).
Once the magnetic field becomes dynamically significant, vorticity is
generated in the flux ropes, where there is a local balance between the magnetic
and viscous terms in the vorticlty equation. The buoyancy force generates vorticity
182
with one sense, while the Lorentz force generates vorticity with the opposite sense
and viscosity maintains a balance. The resultant vorticlty distribution corresponds
to a velocity field with a reduction in the transverse flow that concentrates the
flux. A simple physical description confirms that two-dimensional amplification
is halted as ohmic dissipation reduces the overall flow. In three dimensions motion
can be excluded from the flux rope slightly earlier (Galloway, private communication).
As B is further increased, the flux ropes grow broader and develop a different o
structure. The field within a rope is more nearly uniform, dropping abruptly near
its boundary. The ensuing current sheath produces a Lorentz force which prevents
the motion from entering the flux tube. Ultimately the layer separates into con-
veering cells, from which magnetic flux has been expelled, and stagnant flux ropes
in the interstices between them.
5. SOLAR MAGNETIC FIELDS
Cowling (1953, 1976a), Sweet (1971) and Mullan (1974b) have reviewed magnetic
fields in the sun. The discussion of flux concentration relates most directly to
intense, small scale magnetic fields in the photosphere (Weiss 1977). Over the
last eight years ground-based observers have succeeded in resolving magnetic
structures with a scale smaller than that of the granulation and fields of up to
about 1500 G (Schr~ter 1971; Harvey 1971, 1977; Dunn and Zirker 1973; Mehltretter
1974; Stenflo 1976). These features are formed between granules and have lifetin~s
similar to those of individual granules. The fields are much larger than the local
equipartition field (Be~5OO G) and the magnetic pressure alone is almost sufficient
to balance the external gas pressure. Such high fields can only be contained by that
gas pressure (Parker 1976a). A full theory of convective transport in strong
magnetic fields is needed to explain the formation of these flux ropes but a crude
extrapolation from ~he Boussinesq results indicat~that the field can be amplified
to reach the strengths observed (Galloway et al. 1977).
On a larger scale, magnetic flux is swept aside by supergranules and concen-
trated at their boundaries to form a network in which most of the small scale
features are located. Irregular small scale fields have recently been detected
within the network (Harvey 1977) but the flux involved is relatively slight. As
more flux is brought together the magnetic field inter~res with convection so that
the gas is cooled. Dark pores or sunspots then appear between the supergranules.
The magnetic flux that emerges through a sunspot is presumably assembled into a
rope deep in the convective zone, though supergranules certainly play a part in the
formation of a spot. Conversely, though small flux ropes can be shifted to fit the
pattern of supergranular convection, large sunspots are anchored deeper down and
long-lived, stable convection cells may form around or near them (Harvey and Harvey
1973; Livingston and Orrall 1974; Meyer et al. 1974).
In the umbra of a sunspot the magnetic field is nearly vertical and strong
enough to suppress convective instability, Following a suggestion by Biermann
183
(1941), Cc~ling (1953, 1976 a,b) argued that motion across the magnetic field is
inhlbitedj and that convection is limited to predominantly vertical, oscillatory
motion, in slender elongated cells. Theoretical models of sunspots (Chitre 1963;
D~inzer 1965; Chltre and Shaviv 1967; Yun 1970) show that radiation alone cannot
supply the energy emitted from the umbra and microturbulsnt velocities (Backers
1976) provide some observational evidence for convection. (Umbral dots are too
sporadic to be an essential feature of the transport process.) Various attempts
have been made both to relate linear stability theory to umbral and penumbral
structure in sunspots (Danielson 1961, 1965, 1966; Weiss 1964b, 1969; Musman 1967;
Saito and Kato 1968; Danielson and Savage 1968; Savage 1969; Mullah and Yun 1973;
Moore 1973) and also to study overstability in an isolated magnetic flux tube
(Parker 1974b, B. Roberts 1976, Defouw 1977).
Parker (1974a,h, 1975a, 1976a,b) has recently emphasized the importance of
mechanical energy transported by transverse hydromagnetic waves, which may escape
either upwards or downwards from the umbra. He suggests that thermal energy which
would otherwise have reached the photosphere is carried away by these waves, which
are so efficiently coupled to sub photospherlc convection (cf. Mullah 1974a) that
they refrigerate the sunspot, cooling by Alfv~n waves requires extreme efficiency
and this mechanism has been criticized by Cowling (1976b). Moreover, the corona
absorbs only a comparatively small amount of energy and excess X-ray emission is
associated with active regions, not specifically with sunspots. So magnetic
inhibition of convection still provides the most obvious explanation for the cooling
of pores and spots.
Unlike the umbra, the penumbra of a sunspot is essentially inhomogeneous, and
the radial filaments are correlated with convective motion (Backers and Schr~ter
1969, Schr~ter 1971). According to linear theory convection in rolls lying in the
plane of B is affected only by the vertical component of the field. The inclination
of the magnetic field increases across the penumbra until it becomes almost hori-
zontal at the edge of the spot. Danielson (1961) and Pikel'ner (1961) therefore
suggested that the filamentary structure is caused by convection in horizontal rolls
and this axplanation is qualitatively convincing. Linear theory indicates that the
penumbra may be convectively unstable, to direct rather than to overstable modes
(Danielson 1961; Musman 1967; Saito and Kato 1968; Savage 1969). Nonlinear results
imply that convective transport would then be significant, though motion might
still be periodic (Weiss 1975). These theoretical models are obviously oversimplified.
In particular, the boundary conditions are too stringent: one might, for instance,
expect that vigorously convecting plasma from below the penumbra would be able to
penetrate through the shallow magnetically dominated region (Meyer etal. 1977). A
more complete theory should also explain the Evershed outflow as a consequence of
convection (Galloway 1975).
184
6. CONVECTION AND DYNAMOTHEORY
There is some observational evidence that other stars with outer convection
zones have magnetic cycles llke the sun (Wilson 1976) and flux concentration is
inevitable in any stellar dynamo that is driven by convection, Turbulent motion
tends to remove magnetic fields from a convective zone: flux tubes may emerge from
the surface and be carried off by a stellar wind or they may be expelled downwards
into the radiative zone. Systematic differences in the velocity~ caused for example
by the radial density gradient, may pump flux preferentially in one direction (Moore
and Proctor 1977). A more important topological effect was pointed out by
Drobyshevsky and Yuferev (1974). In three dimensional convection with upward motion
at cell centres, the sinking fluid forms a continuous network, while regions ef
rising fluid are separated from each other. Since a flux tube can wind continuously
through downward moving gas there is a tendency to pump flux downwards and to concen-
trate the field at the base of a convecting layer. Topological pumping competes with
magnetic buoyancy (Parker 1955j 1975b; Gilman 1970; Unno and Rihes 1976). If the
field in the flux rope has the equipartitien value then the rope floats upward
relative to the ambient gas at about the Alfv~n speed, which is equal to the downward
convective velocity. Se the net motion of the flux tube cannot readily be estimated,
though it seems unlikely that flux can remain within the star unless it is surrounded
by sinking gas. A proper description of the inhomogeneous magnetic field must he included in any
realistic dynamo model. The theory of turbulent dynamomhas often been reviewed (eg.
Parker 1970; P. H. Roberts 1971; Vainshtein and Zel'dovich 1972; Gubbins 1974; Mestel
and Weiss 1974; Moffatt 1977). Without systematic helicity, homogeneous turbulence
is unlikely to maintain a field (Moffatt 1977), and helicity is caused by rotation.
The Coriolis force, like the Lorentz force, tends to inhibit convection but these
costraints may be relaxed if both are simultaneously ~resent (Malkus 1959;
Ohandrasekhar 1961; Eltayeb and Roberts 1970; Eltayeb 1972~ 1975; van der Borght and
Murphy 1973; Roberts and Stewartson 1974, 1975). Attempts to solve the full hydro-
magnetic dynamo problem will be discussed by Dr. Childress.
REFERENCES
Backers, J. M., 1976. Astrophys. J. 203, 739.
Backers, J. M. and Schr~ter, E. H., 1969. Solar Phys. i_O0, 384.
Biermann, L., 1941. yiertel~ahrscPr. Astr. Ges. 76, 194.
van der Borght, R., 1974. Men. Not. R. Astr. Soc. 166, 191.
van der Borght, R., and Murphy, J. 0., 1973. Austr. J. Phys. 2~6, 617.
van der Borght, R., Murphy, J. O. and Spiegel, E. A., 1972. Austr. J. Phys. 2~5, 703.
Busse~ F. H., 1975. J. Fluid Mech. 7~i, 193.
Chandrasekhar, S., 1952. Phil. Mag. (7) 4~3, 501.
185
Chandrasekhar, S., 1961. Hydrodynamic and hydromagnetic stability. Oxford.
Chitre, S. M., 1963. Mon. Not. R. Astr. Soc. 126, 431.
Chitre, S. M. and Shavlv, G., 1967. Solar Phys. ~, 150.
Clark, A., 1965. Phys. FIo ~, 644.
Clark, A., 1966. Phys. FI. ~, 485.
Clark, A. and Johnson, A. C.+ 1967. Solar Phys. ~, 433.
Cowling, T. G., 1953. The sun, ed. G. R. Kulper, p. 532. University of Chicago Press.
Cowling, T. G., 1976a. Magnetohydrodynamics. Hilger, Bristol.
Cowling, T. G., 1976b. Mon, Not. R. Astr. Soc. 177, 409.
Danielson, R. E., 1961. Astrophys. J. 134, 289.
Danielson, R. E., 1965. Solar and stellgr magnetic fields (IAU Symp. No. 22), ed. R. LUst, p. 314. North-Holland, Amsterdam.
Danielson, R. E., 1966. Atti de l c0+nve~no sulle macchie solari, ed. G. Righini, p. 120, Barbara, Florence.
Danielson, R. E. and Savage, B. D., 1968. Structure and development of solar active re~ions (IAU Symp. No. 35), ed. K. 0. Kiepenheuer, p. 112. Reidel, Dordrecht.
Defouw, R. J., 1977. Astrophys. J. (in press).
Deinzer, W., 1965 Astrophys. J. 141, 5~8,
Drobyshevsky, E. M. and Yuferev, V. S., 1974. J. Fluid Mech. 65, 33.
Dunn, R. B. and Zirker, J. B., 1973. Solar Phys. 33, 281.
Eltayeb, I. A., 1972. Proe. Roy. Soe. A 32__~6, 229.
Eltayeb, I. A.~ 1975. J. Fluid Mech. 71, 161.
Eltayeb, I. A. and Roberts, P. H., 1970. Astrophys. J. 162, 699.
Galloway, D. J., 1975. Solar Phys. 44, 409.
Galloway, D. J., Proctor, M. R. E. and Weiss, N. 0., 1977. Submitted to Nature.
Gibson, R. D., 1966. Prec. Camb. Phil. Soe. 6_22, 287.
Gilman, P. A., 1970. Astrophys. J. !62, 1019.
Gough, D. O. and Tayler, R. J., 1966. Mon. Not. R. Astr. Soc. 133, 85.
Gubbins, D., 1974. Key. Geophys. Space Phys. 12, 137.
Harvey, J., 1971. Publ. Astr. Soc. Pacific 83, 539.
Harvey, J., 1977. Highllghts 0f astronomy, ed. E. MUller, Reidel, Dsrdrecht.
Harvey, K. and Harvey, J°, 1973. So!at Phys. 28, 61.
Clarendon Press,
186
Huppert, H. and Moore, D. R., 1977. J. Fluid Mech. 78, 821.
Kato, S., 1966. P ubl. As tr. Soc. Japan, 18, 201.
Livingston, W. and Orrall, F. Q., 1974. Solar Phys. 39, 301.
Malkus, W. V. R., 1959. ~ . 130, 259.
Mehltretler, J. P°, 1974. Solar Phys. 38, 43.
Mestel~ L. and Weiss, N. 0., 1974. Masnetohydrodynamics. Swiss Soe. Astr. Astrophys., Geneva.
Meyer, F., Schmidt, H. U. and Weiss, N. 0., 1977. Mon. Not.R. Astr. Soc. (in press).
Meyer, F., Schmidt, H. U., Weiss, N. O. and Wilson, P. R., 1974. Mon. Not. R. Astr. Soc. 169, 35.
Moffatt, Ho K., 1977. Masnetic field seneration in electrically, conduetin$ fluids. Cambridge University Press.
Moore~ D. R. and Proctor, M. R. E., 1977. In preparation.
Moore, R. L., 1973. Solar Phy s . 30, 403.
Moss, D. L. and Tayler, R. J., 1969. Mon. Not. R. Astr. Soc. 145, 217.
Moss, D. L. and Tayler, R. J., 1970. Mon. Not. R. Astr. Soc. 147, 133.
Mullah, D. J., 1974a. Astrophys. J. 187, 621.
Mullan, D. J., 1974b. J~ Franklin Inst. 298, 341.
Mullah, D. J. and Yun, H. S., 1973. Solar Phys. 30, 83.
Musman, So, 1967. Astro~hys. J. 149, 201.
Parker, E. N., 1955. Astrop y .h s J. --121, 491.
Parker, E. N., 1963. Astrophys. J. 138, 552.
Parker, E. N., 1970. Ann. Rev. Astr. Astro~h~s. 8, i.
Parker, E. N., 1974a. Solar PhTs. 36, 249.
Parker, E. N., 1974b. Solar Phys. 37, 127.
Parker, E. N., 1975a. Solar Phys. 40, 275.
Parker, Eo N., 1975b. Astrophys. J. 198, 205,
Parker, E. N., 1976a. ~ . 204, 259.
Basic mechanisms of solar activity (IAU Symp. No. 71), ed. Parker, E. N., 1976b. V. Bumba and J. Kleczek, Reidel, Dordrecht.
Peckover, R. S. and Weiss, N. 0., 1977. In preparation.
Pikel'ner, S. B., 1961. Osnovy kosmieheskoy elektrodinamiki.
Roberts, Bo, 1976. ~ . 204, 268.
Nauka, Moscow.
Roberts, P. H., 1971. Mathematical prob!ems in the geophysical sciences, ed. W, H. Reid, p. 129, Am. Math. Soc., Providence.
187
Roberts, P. H. and Stewartson, K., 1974. Phil. Trans. A 277, 287.
Roberts, P. H. and Stewartson, Ko, 1975. J. Fluid Mech. 68, 447.
Saito, M. and Kato, S., 1968. Solar Phys. ~, 531.
Savage, B. D., 1969. Astroph~s. J. 156, 707.
Schr~ter, E. H., 1971. Solar magnetic fields (IAU Symp. No. 43) ed. R. Howard, p. 167, Reidel, Dordrecht.
Spiegel, E. A., 1971. Ann. Rev. Astr. Astrophys. 9, 323.
Spiegel, E. A., 1972. Ann. Rev. A str. Astroph~s. I__OO, 261.
Stenflo, J. O., 1976. Basic mechanisms of solar activity (IAU Symp No. 71), ed. V. Bumba and J. Kleczek, p. 69. Reidel, Dordrecht.
Sweet, P. A., 1971. Solar magnetic fields (IAU Symp. No. 43), ed. R. Howard, p. 457, Reidel, Dordrecht.
Syrovatsky, S. I. and Zhugzhda, Y. D., 1967. Astr. Zh. 44, 1180 (Soy. Astr. II, 945, 1968).
Syrovatsky, S. I0 and Zhugzhda, Y. D., 1968. Structure and development of solar active re$ions (IAU Symp. No. 35), ed. K. 0. Kiepenheuer, p. 127. Reidel, Dordrecht.
Tayler, R. J., 1971. 9" J" R. Astr. Soc. 12, 352.
Thompson, W. B., 1951. Phil. Ma~. (7) 4-2, 1417.
Unno, W. and Ribes, E., 1976. Astrophys. J. 208, 222.
Vainshtein, S. I. and Zel'dovich, Y. B., 1972. Usp. Fiz. Nauk 106, 431 (Sov. Phys. Usp. 15, 159).
Veronis, G., 1959. J. Fluid Mech. 5, 401.
Weiss, N. 0., 1964a. Phil. Trans. A 256, 99.
Weiss, N. 0., 1964b. Mono Not. R. Astr. Soc. 128, 225.
Weiss, N. 0o, 1966. Proc. Roy. Soc. A 293, 310.
Weiss, N. O., 1969. Plasma instabilities in astrophysics, ed. D. G. D. E° Tidman, p. 153. Gordon and Breach, New York.
Wentzel and
Weiss, N. O., 1975. Adv. Chem. Phys. 32, IO1.
Weiss, N. O., 1977. Highlights of astr0nom[, ed. E. MUller, Reidel, Dordrecht.
Wilson, O. C., 1976. Basic mechanisms of solar activity (IAU Symp. No. 71), ed. V. Bumba and Jo Kleczek?Reidel, Dordrecht.
Yun, H. S., 1970. Astrophys. J. 162, 975.
AXISYMMETRIC CONVECTION WITH A MA~ETIC FIELD
D. J. Galloway Astronomy Centre
University of Sussex Brighton, BNI 9QH
England
Summar~
The n o n - l i n e a r B o u s s i n e s q e q u a t i o n s d e s c r i b i n g a x t s y m m e t r i c c o n v e c t i o n i n a c y l i n d e r
w i t h an i n i t i a l l y u n i f o r m m a g n e t i c f i e l d have been i n t e g r a t e d f o r w a r d i n t i m e
n u m e r i c a l l y . When t h e f i e l d i s weak a s t r o n g c e n t r a l f l u x r o p e i s formed a t t h e
axis. In this case the maximum field strength can be limited either klnematlcally
or by dynamical effects, and the equipartitlon prediction B 2 ~ 4~pu ~ is eg~ily max
exceeded. If the field is strong oscillations can occur and hysteresis is posslble
as the field is increased and decreased.
i. Introduction
The interaction between convection and a magnetic, fleld determines many features
observed in the solar photosphere. Sunspots and smaller scale magnetic fleld
elements are symptoms of the ability of convection to concentrate a weak average
field into strong fluxropes. Oscillatory phenomena such as running penumbral waves
can occur in the presence of a strong field. To study such effects it is necessary
to solve non-linear problems, and clearly to do So in three dimensions if at all
possible. The recent work of Jones, Moore and Weiss (1976) on axlsymmetric con-
vection is easily extended to include the presence of a ma~etlc field with average
strength B . This problem is geometrically three-dlmenslonal but depends mathe- o
matlcally on only two variables, thereby rendering Itself tractable to nnmsrlcal
computation. The normal equations of Bousslnssq convection are modified to include
the effect of the Lorentz force in the vortlclty equation, and in addition the
electromagnetlc induction equation is solved to update the magnetic fleld as the
system is integrated forward in time.
S. The Problem
To solve the equations it is convenient to set up stream functions for the
velocity and magnetic fields thus:-
u= Va(o, , o) = (- r ~z o, r ar ,
B= V^(O, ~r' O) = (- r ~z O, r az "
We use cylindrical polar coordinates (r, 8, z). The geometry of the problem is
shown in figure 1.
189
Bo- In~t~a ll~.,, =,q~co,',~.
F=O
_
ff"e:rn p. T : T,,+AT
Temp.T--To
ease r'= a.
Fig. 1
Geometry for the axisymmetrie problem, showing basic cylinder on left and
axis-edge cross-section on right.
The equations V.~ = 0 and V.~ = 0 are automatically satisfied by Zhe above
and B flelds. Those remaining can be non-dimenslonalized and put in the following
form:
~-~ = -- p3 R -- - r - ~ r + ( V . ( r V ( r 2 ~ ) ) , ( 2 . 1 )
~_TT = -V. (Tu) + 1 V. (VT) Bt -- (pR)½
( 2 . 2 7
~t p3(pR)~ *
(2.3)
w h e r e ~ = (VA--u)o = - r ~ r2 - r ~ r ~z 2
J = (V^S--)e = - - r ~r ~z
There are five dimensionless parameters specifying each solution to the problem;
these are
2 2 __ 3 B d
R = g~wdz o K K~) ' Q = 4W~O~ ' p = ~ ' P 3 = ~ , a n d
a , t h e r a t i o o f c e l l w i d t h tO c e l l h e i g h t . The n o n - d i m e n s i o n a l i z a t i o n h a s b e e n t
c o n d u c t e d w i t h l e n g t h s c a l e d ( l a y e r d e p t h ) , t i m e s c a l e ( d / g o , AT) ~, f i e l d s t r e n g t h
scale B and temperature scale AT. The density and coefficient of volume expansion o
of the fluid are Q and ~ ;~ ,<, and i'] are its viscous, thermal and magnetic
190
diffusivities respectively, and g is the acceleration due to gravity.
The following boundary conditions are used:
T = T O + AT, B r = O, @ = O, ~= 0 (z = O)
T = T , B = 0, @ = 0, ~= 0 (z = 1) o r
~T - - = = ~r O, X const,, ~ = O, ~ = 0 (r = a)
~T ~--~= O, × = O, ~ = O, ~ = 0 ( r = O)
The conditions on the fluid are those commonly known as stress-free. The con-
straint X = const, at r = a fixes the total flux in tho cylinder and circumvents
Cowllng's theorem, so that steady solutions are possible.
The above equations and boundary conditions have been solved by finite-difference
methods similar to those described in Moore~ Peckover and Weiss (1973). The
equations were integrated forward in time until the solutions converged to a steady
state or a repeating oscillation. In many cases one solution was started from
another, and in this way the effects of continuously varying one parameter could be
investigated.
3. Discussion of Results
To correspond most closely with highly conducting astrophysical plasmas the
program WaS run with values of </n ranging from 10 to 50. For fixed and moderately
non-linear values of the Rayleigh number the followlng types of solution are found
as Q is increased.
i) For very weak fields the convection is unaffected and concentrates all the flux
kinematically into a central rope. The structure of this rope is fixed by the
balance between diffusion and ndvection in the induction equation. The maxi-
mum field strength B m is higher than the input field B ° by a factor of the order
of the magnetic Reynolds number, and the profile of the rope is Gaussian. Such
solutions have been described by Weiss (1966) and Clark and Johnson (1967).
ii) As Q is increased a regime ensues where the imposed field remains compressed in
a fluxrope hut can exert a dynamical influence on the convective flow. Within
the rope motion is minimal: at its edge, typically a few mesh points from the
axis, there is a shear layer and the velocity reaches a value comparable with
that in the absence of the field. The dynamics are dominated by a balance
between the total thermal and magnetic torques; dissipation can be ohmic or
viscous, and the maximum field can be Successfully predicted by s power law of
the form
191
Bo ~ The numerical experiments yield average ~ = ~= 0.63 and ~ = 0.77; the power-law
behaviour extends over typically two orders of magnitude. The greater ability of a
three-dimensional geometry to concentrate flux means it is far easier to chart this
regime in the axlsymmetric case than for two-dimenslonal rolls. It is also possible
to advance physical arguments based on a boundary-layer structure (Galloway, 1976),
and predict a law similar to (3.1). The exact values of ~,8 and 7depend on whether
viscous or ohmic dissipation is dominant, and the formulae involve weakly varying
logarithms, but agreement with the numerical experiments is generally very good.
An example of one of these dynamically limited solutions is shown in figure 3,
which is the case Q = i00, R = 20Rc, p = 1, P3 = I0, and a = 4/3. (Here R c =
(27/4)~4). The fluxrope is almost stagnant; st its edge there is a current
sheet which generates a large localized amount of negative vortlcity. Within the
fluxrope horizontal temperature gradients cause a very weak countercell to develop;
this has an advective effect on the field and causes the fluxrope to develop a
maximum some distance away from the axis. The run of the fluxrope profile as Q
increases is shown in figure 2.
Q = 1 q = 5 Q = 20 Q = I00 Q = 1000
Fig. 2. Fluxrope profiles for R = 20R , p = I, </n = I0, a = 4/3 e
i i i ) The c e n t r a l f l u x r o p e b r o a d e n s a s Q i s f u r t h e r i n c r e a s e d a n d e v e n t u a l l y i t
o c c u p i e s a b o u t a h a l f o f t h e r a d i u s o f t h e c e l l . At t h i s s t a g e t h e r o p e b e g i n s t o
o s c i l l a t e w h i l s t t h e o u t s i d e c i r c u l a t i o n r e m a i n s s t e a d y , a n d t h e r e i s a c o r r e s -
p o n d i n g v a r i a t i o n i n t h e h e a t t r a n s p o r t , T h i s f o r m o f s o l u t i o n h a s a n a t u r a l
e x p l a n a t i o n . T h e i n i t i a l l y i m p o s e d f i e l d B ° i s w e a k e n o u g h t o a l l o w s t e a d y c o n -
v e c t i o n ~ w h i c h c o n c e n t r a t e s t h e f l u x i n t o a r o p e o f s t r e n g t h B a n d r a d i u s ~ a . F l u x m
conservation suggests Bm % 4]30, and this means that, considered in isolation, the
central rope is overstable to linear theory. The frequency of the computed solution
agrees moderately well with such a linear prediction. When the senses of the two
192
a) magnetic lines of force
f
b) streamlines
c) isotherms
I///~ "///.////~////~/ d) vortlcity ~ (axis on left)
193
circulations are opposite, upward moving plumes are adjacent and the heat transport
is a maximum, When the senses are the same, the cold downdraught of the countereell
is next to the hot updraught of the main flow and lateral diffusion reduces the heat
transport to a minimum.
iv) Finally Q is so strong that only finite-amplltude oscillations are posslble.
These are confined mainly to the outer half of the radius, so that the solutions
are quite different to the elgenfunctions of linear theory. Periods are typically
10% - 20% faster than the linear values, presumably because the essentially quad-
ratic Lorentz t e rm in (R.l) is badly underestimated in the linear approximation.
The nature of the solutions depends on the five dimensionless parameters defined
earlier. However there are also occasians when the system adopts a configuration
dependent on the initial conditions, so that hysteresis occurs. This effect is
encountered when the field is fairly strong. A solution with given (Q,R,p,~/~,a)
can then be steady if it is part of a branch with Q inereasin~and oscillatory if
part of a branch with Q decreasing - the system remembers what it was doing for
earlier values of Q. This effect can be quantified by using the Nusselt number N,
averaged in time if necessary, as a measure of the amplitude. A graph showing the
variation of N with Q as the latter is increased and decreased is shown in figure
4. For this example, Q = 10,300 marks the onset of overstability and Q = 2535 the
transition from steady to oscillatory modes according to linear theory. The slight
increase in N as Q increases in the lower branch at Q = 4000 appears real.
5
3
!
N
"\
i 0 | 0 0 i O 0 0 LO)O00
Fig. 4. Variation of N against log Q for R = 20Rc~p = I, </q = i0, a = 4/3.
...... oscillatory solution -.-.-.- mixed steady-oscillatory solution --steady solution.
194
It is interesting to compare these hysteresis effects with the results of
Huppert (1978) on double-diffusive convection, also described elsewhere in these
proceedings, Broadly slmilar results are obtained but the magnetic results are
more regular and do not show the sudden Jumps in N found in the salt case. Further-
more no subcritieal instabilities have yet been found in the present study.
Applications of this work to the production of intense solar magnetic fields
are discussed in Galloway, Proctor and Weiss (1976). The fluxrope solutions give
fields limited either by the magnetic Reynolds number or by formula (3.1); in a
Boussinesq fluid the equipartition argument B 2 % 4~Wpu 2 is quite irrelevant since max
the pressure can adopt arbitrarily high values. The numerical results give fields
up to six times greater than this prediction. We conclude that in the solar photo-
sphere the maximum field strength is limited by the gas pressure.
Acknowledgements
I am very grateful to Dr D. R. Moore, who wrote the program on which this work
was based, and to Dr N. O. Weiss for many helpful discussions. I thank the Science
Research Council and Trinity College Cambridge for financial assistance.
References
Clark, A. and Johnson, A.C., 1967. Sol.Phys.2, 433
Galloway, D.J., Proctor, M.R.E., and Weiss, N.O. 1976. Nature, to be published
Galloway, D.J., 1976. Ph.D. Thesis, University of Cambridge
Huppert, H.E., 1976. Nature, 263, 20
Jones, C.A., Moore, D.R. , and Welss, N.O., 1976. J.Fluid Mech. 73, S53
Moore, D.R., Peckover, R.S., and Weiss, N.O., 1973. Computer Phys,Comm. 6, 198
Weiss, N.O., 1966. P roc0. Roy. Soc. A , 293, 310
CONVECTIVE DYNAMOS
S t e p h e n C h i l d r e s s Courant Institute of Mathematical Sciences
New York University New York, N. Y. 10012
i. INTRODUCTION
Convective dynamo theory can be regarded as combining two kinds of
physical problems, each involving an electrlcally conducting fluid
medium, but differing in the role of the magnetic field and in the
physical processes described. On the one hand, if the fluid is taken
to be permeated by a prescribed magnetic field B, under suitable
conditions, involving a sufficiently strong flux of heat for examplej
convective motion of the fluid will ensue. On the other hand, kine-
mGtic dynamo theory insures that a sufficiently compllcated fluid
motion u can sustain or excite a magnetic field. In a convective
dynamo the origin of the magnetic field is internal and we must regard
the applied and excited fields as one and the same (Figure i). In the
present paper we shall outline some of the current work on such sys-
tems. The research has been motivated primarily by the search for
tractable models of planetary and solar magnetism, and the focus in
this paper will be on models of the geodynamo. For simplicity we
restrict attention to Boussinesq fluids and emphasize asymptotic solv-
able problems rather than a realistic description of the Earth's core.
We shall, however, require that the dynamo be essentially convective,
in that no auxiliary driving forces are needed. (The convective
process could of course involve any advected, diffusing substance
which changes the weight of a fluid element.)
u
Figure i.
~ rotatlon?
kinematic induction
magnetic convection
heat~flux
J
The Convective D~namo Cycle.
The simplest physical system admitting a convective dynamo cycle
is not obvious (to this author), although in the case of the geodynamo
it would appear that large-scale rotation of the fluid is sufficient
196
if not essential. In the models discussed below it is precisely the
combination of bouyancy and Corlolls forces which create the neces-
sary flow structure, so in this respect at least they may be relevant
to the processes at work in the Earth's core.
2. BOUNDS AND ESTIMATES
One way to define dynamo action is simply to require that the mean
magnetic field of the system, obtained by appropriate integrals over
space and time, b e positive. With this definition we can, in a sense,
"prove" dynamo action by showing that the convective system with B - 0
is unstable to magnetic fields; that isj small "seed" fields are
always ampllfled. This property can be tested rather easily since the
two parts of the system depicted in Figure 1 decouple when the magne-
tic field is weak. Now it is an essential feature of our problem that
the mean magnetic energy of the ultlmate state(s) of the dynamo is
an internal property of the system~ although we may assume that
such a mean energy may be defined and that it will depend upon the
various parameters~ the geometry, etc. This being the case, it is of
interest to determine, without studying the evolution of the system in
detail, an u pr~oPi upper bound on mean magnetic energy.
This intriguing question was apparently first studied only recent-
ly by Kennett (1974) in the case of B~nard convection between free,
perfectly conducting plane isothermal boundaries rotating about a
vertical axis (cf. Section 5 below). We will use the following nota-
tion: ~ - magnetic permeabillty~ p = density, ~ = kinematic viscos-
ity, K - thermal dlffuslvity, n = magnetic dlffusivity, a - coeffi-
cient of thermal expansion, all of the above being taken to be
constant~ P = ~/~ = Prandtl number, P = ~/K, R = Rayleigh number 4
~ased on a temperature gradient 7) = uyEL /K~ , M = Hartmann number
- BL/(~p~) 1/2, Ta = Taylor number = 4~2L4/~2~ where G ~s the
angular speed of the system. If E B denotes the time and volume mean
of B 2 Kennettls result may be written i
_3/2.2,^~2p2 E B ~ 4~ .01~ ~, B 0 - (~pV)I/2/L . (i)
This estimate is obtained by equating the mean dissipation to the
mean work done by the gravitational forces, and involves extensions of
the familiar power integrals of the B~nard problem.
Although as a general rule analysis of this kind rather severely
overestimates energies, (i) is interesting as an indication of the
influence of the various material properties. The parameter
197
P is evidently significant in determining the magnetic energy
realized by convection at a given Raylelgh number, Incidentally, P
has a value of about 106 in the earth's core, but may be as small as
10 -5 in stars because of radiative cooling, so that ideally we would
like a dynamo model to retain Pn as an arbitrary parameter. The fact
that the bound (i) diverges as n ÷ 0 probably ref12cts the infinite
amplification that can be achieved in a perfect conductor by the
twisting and stretching of field lines. But note that the bound also
diverges llke V -I/2 in the limit of small viscosity. While such a
divergence might be expected for convection between isothermal bound-
aries in the limit of zero Prandtl number, it seems unlikely for
Systems driven by a fixed rate of heating; in this case E B should be
bounded independently of the viscosity.
Such a result is in fact implied by the interesting thermodynamic
arguments of Malkus (1973), and Hew~tt, McKenzle, and Weiss (1975).
Following these authors we consider a spherlcal region of (current)
conducting fluid surrounded by a rigid non-conductor. Let the bound-
ary r = L be held at a fixed temperature T O and the interior be
heated uniformly at the rate qo" We seek a bound for the magnetic
energy in terms of the material constants, L, and q0" Let . E B , qj ,
qv • and W now denote the time averages of global B 2, Joule dlsslpa-
tlon, viscous dissipation, and bouyancy work, respectively, all
normalized by the volume V of the sphere. Assuming internal energy
is bounded in time the first law requires
w = qj + qv " qj Z o, qv S 0 . (2)
On the other hand, a well-known property of currents in a homogeneous
spherical conductor is (see e.g. Backus 1958)
qj ~ n~2EB/~L2 (3)
Then f r o m (2) and (3)
E B ~ ~L2Wln w2 (4)
To estimate W we use the temperature equation in the Bousslnesq limit,
8T 8-q.+ ~-vr - K72~ - qo/Oep . (5)
Le t , wi th ~-~ = u z r g o / L ,
198
L
W = (4~spg0/LV) I r3 w( r ) dr (6)
0
so that w is the time average of the spherical mean of u T. If 8 r
denotes t he same operation on T, (5) yields upon integration the flux
balance
3 3 dO 4q0 /30c p r w - Kr ~ = r (7)
o
We may measure 8 in K. Integrating (7) from r = 0 to r - L and
using 8 ~ 0 we have
L
r3w dr ~ Lbq0/15pe p + ~L3T0 (8)
0
Combining (4), (6) and (8) there results
nEB/~L2q 0 ~ (8/5~2)(I + 15K0CpT0/L2q 0) (9)
Here 8 ffi ~g0L/Cp is the ratio of L to the temperature scale height
and is necessarily a small number in the Bousslnesq approximation.
(The inclusion of the dissipation terms on the right of (5) changes
(9) by terms O(8~.) But the left-hand side of (9) should be independent
of the origin of the temperature scale and, indeed, it can be shown
that for uniform heat addition (9) holds with T o = O. Qe therefore have
nEB/~L2q0 ~ 8/5~ 2 (i0)
This provides us with a useful (and small) measure of the efficiency
of a convective dynamo. Other estimates of this kind are contained in
the references cited above.
If we introduce the Raylelgh number
Rq = ~g0qoLb/ocp~2~ ,
then (I0) may be rewritten
E B < R B~/5~2p 2 (ll) -- q o
and thus has the form of (I) reduced by a factor 9/(20 RI/2), the
bound now being independent of the viscosity.
In the above we have dropped viscous dissipation ~ecause of the
199
inequality (3), but if this term is now retained one obtains
L4qv/P n 2 + ! q/Sp 411.)
in place of (ii). If, relative to a rotating frame, the no-sllp
condition is satisfied on the boundary, qv can be bounded from
by a multiple of p~L-2Eu , in which case the terms on below the left
of the inequality would be comparable provided the ratio of kinetic
to magnetic energy is roughly P /P. It is known (Childress 1969a)
that L2E / 2 must exceed a fixed positive bound for a dynamo effect u
to be possible in a given domain of fluid. Consequently in the
El/2 E I/2 plane a convective dynamo must lie within the first B - u
quadrant of an ellipse, and to the right of a vertical line determined
by the dynamo condition. In reality, of course, the radius of the
ellipse should be altered to express the existence of a critlcal
Rayleigh number, and it would be of interest to extend (Ii*) to
account for this shift, perhaps by applying the method used by
Kennett (1974).
Since the above arguments completely ignore the dynamical process
by which the dynamo effect is realized, 411) tells us little about the
behavior of any given system. Suppose, however, that some dynamics
allows the bound (11) to be obtained, and take 1 ~ P~ > 1. (Through-
out we use the symbol ~ as follows: a ~ b if a = O(b) and b = O(a).)
Then 411) implies M 2 ~ R , which is reminiscent of the relation q
R e ~ M 2 (M >> i) obtained for the or~ti~al Rayleigh number for con-
vection between isothermal planes dominated by the magnetic field
(Chandrasekhar 1961). That is, the "optimal state" of the convective
dynamo is close to marginal in the context of linear stability theory.
If the process by which the optimal state were reached involved rapid
rotation (Ta >> i), the analogous stability results of Eltayeb and
Roberts 41970) and Eltayeb (1972) show that if P > .67659, then
Rc(M,Ta) is minimized when M 2 ~ Ta I/2, R c ~ Ta I/2, which is again
compatible with (II) when P > 1. (And note that M4/Ta is also
independent of viscosity.) The Eltayeh-Roberts ordering may be loose-
ly interpreted to imply that magnetic energy with Hartmann number T I/2 a
would be acquired by a rapidly rotating body once the Raylelgh number
was raised to a value ~ T 1/2 But again the argument assumes the a necessary dynamo action by the convection. In particular t~ere is no
implied critlcal rotation rate.
If both fluld inertia and viscous stresses can be neglected (as
seems to be the case in the Earth's core outside Ekman layers~ cf.
Roberts and Soward 1972), the dimensionless parameters of the heated
200
convective system may be reduced to a "Raylelgh number"
2 1/2 = Rq/P Ta (12)
together with P~, by the choice of ~/L, (2~pn) I/2, and q0L2/Pep~ as
units of speed, magnetic field strength, and temperature respectively.
The dimensionless equations are then
÷ ÷ _~g;l Vp + ~ - 1 ~ + Bx(V×B) = ~z, V-~ = Or (13)
, ,
B_~T + ~-VT - p-Iv2T = i . (15)
For given P,
ible with boundary conditions, and from these determine the one
with maximum E B (now dimensionless). By (11) this value cannot
exceed R/5~ 2. Such an operating state, where the mean magnetic energy
is as large as possible for a given heating rate, may be taken as
"optimal", since it is presumably stable locally and can only lead
to smaller energy under a finite perturbation. Of course it is not
clear that the system admits uny nontrlvial solutions (5 # 0).
Note that if such a solution existed for some R, and if it were
known that rotation was essential for a dynamo effect, then it would
be necessary that solutions terminate for sufficiently small Gnd
sufflciently large values of R.
The existing theory of convective dynamos has concentrated on
cases whlchare probably far from optimal in the above sense. Viscous
effects are freely admitted, parameters and geometry chosen to a11ow
the convective modes to be determined by considerations at marginal
stability, and various devices are used to simplify the analysis of
electromagnetic induction. In the following sections we study various
aspects of these highly idealized models, but return to some of the
questions raised above at the end of the paper.
we can seek solutions of (13)-(15) which are compat-
201
3. KINEMATIC INDUCTION
This aspect of the problem has a large recent literature (see e.g.
the reviews of Roberts 1971, Weiss 1971, and Gubbins 1974). One of
the more direct evaluations of the regenerative effect is possible if
the fields are taken to be periodic in space and time. (The spatially-
periodic case was treated by Childress 1967, 1969b, 19701 the theory
in its most general setting was developed by G. O. Roberts 1969, 1970,
1972.) This situation arises naturally in planar or almost-planar
models involving simple boundary conditions. The analysls is facili-
tated (and can be made explicit) if the two dimensionless numbers
r = ~ /tlk~ r k = U/~k, (16)
where U, k , and ~ are the speed, wavenumber, and frequency charac-
teristic of the velocity field, satisfy
r k = o(1), r - o(1). (17)
That is, the magnetic Reynolds number of a fluid eddy must be small,
and the time scale of the motion should be of the order of the decay
time of a magnetic field structure of the same size. With (17) it
becomes rather easy to demonstrate self-excltatlon of a magnetic field
which is slowly-varying relative to the scales k, ~. (I¢ is unlikely
that the first of (17) is satisfied in the Earth's core, but the basic
inductive mechanism, which goes back to the pioneering paper of
Parker (1955), can in fact he deduced without such a restriction
(G. O. Roberts 1970).)
We return to the dimensional induction equations, which are
V-~= 0
(is)
(19)
Consider the solenoidal velocity field
~(a) = U(O, sin o, siu(~+~)), a = kx + ~t, (20)
and suppose t h a t ~ has t he d e c o m p o s i t i o n
= ~ + g, g = o(i), ~ slowly-varylng. (21)
202
Uslng (20) and (21) in (18), (19), one sees that the part ~ will
approximately satisfy
- f-Vu ,
so that fl k d~
g = (nk 2 ~+ ~u) n2k4+~ 2
The slowly-varylng component will then satisfy
-- - = V x ( u x g ) = ~ t
422)
( 2 ~ )
V"~ = 0, (24)
where the overbar denotes the o-average and A is a constant pseudo-
tensor. For (20) the only non-zero component of A is
All = . (qk3U2(sln ¢)/(~2k4+~2)) ~ (25)
Thls additional contribution to mean electromotive force is usually
referred to as the "a-effect". The most general Q-effect, involving
arbitrary symmetric A, can be created by suitably combining linearly
independent modes of the form (20). It is easily seen that, by exam-
ining the case of diagonal A, that (24) can be made to admit exponen-
tlally-growing spatially periodic solutions (and note that (17)
insures that they will be slowly-varying).
Let us look more closely at the underlying inductive mechanism
when ¢ - ~/2. From (22) it is clear that the source of small-scale ÷
magnetic structure is proportional to the x-derivative of u, l.e. to
the sheur of the flow. Now trigonometric spatial modes of the diffus-
ion equation decay without change of shape, but there is a phase shift
between the solution and moving sources. Combining this shift with
that introduced by differentiation, we see that ~ is proportional to
~(O + $)where ~k2/~ - tan ~. As O varies, ~ and ~ rotate in the yz
plane while maintaining thls phase difference, so the induced current,
obtained as a cross product, is independent of o and proportional to
sln ~.
For a given mode 420) the corresponding entry in A is maximized
when ¢ = ~ TM W/2 s l.e. the motion is both quasl-steady and Beltraml
(vortlcity and velocity everywhere parallel). Thls maximizes
the mean heZ~u~ty (Moffatt 1968), defined as the volume average of
u. Vxu, for a given mean kinetic energy. Note that the mean hellclty
is opposite in slgn to ~ for these elementary Beltrami modes.
To summarizer time-lndependent velocity modes having the property
203
that the velocity is orthogonal to the wavenumber vector and the
two orthogonal components are 90 ° out of phase, provides a basic
element of a particularly efficient kinematic dynamo process, charac-
terized by a constant mean heliclty. A variety of other, less effici-
ent dynamo mechanisms (involving an A which either vanishes or has
rank 1~ see e.g. case IV in G. O. Roberts 1972) can he studied by a
refinement of these procedures~ but in the present context it is
rather a slightly different point of view which is needed, since the
relevant convective modes cannot be regarded as exclusively small-
scale. We accordingly consider next the dynamo mechanisms which are
compatible with the dynamics of convection.
4. DYNAMICS
The efficient kinematic dynamos considered above are very special
in that the hellclty of the flow may be averaged over space and time
to obtain a non-zero pseudo-scalar H. It appears to be difficult,
however, to find physical systems which will exhibit this property.
For example, owing to dissipative processes we expect a rotating
sphere of heated fluid to settle down so that H can be defined Indepen-
dently of the initial conditions. Now restart the system but with
initial conditions T(-~,0), -~(-~,0). If the magnetic field is zero
the Bousslnesq equations are invarlant u~der this reflection (recall
g - g0r), so the system will evolve toward a mean hellclty -H = H,
implying H = 0. To argue this in a different way, the mean state of
the system should depend only upon the mean heating rate and various
material parameters (all scalars) and the pseudo-vector ~, from which
it is impossible to construct a pseudo-scalar. If the system is +
endowed with a magnetic field having mean dipole moment m (a vector) ~
H could be expressed as an odd function of m.~, but the record of
magnetic reversals suggests that for the Earth there is no preferred
polarity and therefore m = O!
Ifj nevertheless, rotation is to be regarded as essential to the
convective dynamo, its action must be not to create mean hellclty,
But rather to "polarize" heliclty in space (or time) in such a way
that the resulting pattern of induced currents can be self-exclted.
This self-excltatlon is difficult to visualize and compute when the
length and time scales are unique (as in the system (13)-(15)) since
the induced currents resulting from the polarlzatlon are bound up
closely with the "eddy" currents which dissipate the field. For the
purpose of analysis of the effect it is therefore fortunate that
sufficiently rapid rotation of the fluid introduces two spatial scales
204
into the marginal stability problem, at least for sufficiently weak
magnetic fields (Chandrasekhar 1961), so that large-scale induction
and small-scale dissipation can be clearly distinguished.
To take a concrete example of a zero mean hellcity dynamo, con-
sider the solenoidal fleld
u = (- sin kx cos az, sin kx cos az, cos kx sin az). (26)
If k >> a the induction problem can be solved approximately as in
Section 3, with similar results except that now there is an additional 1 factor ~ sin 2z in A. But note that (26) can also be written
g . sln(k +az) ÷ sln(k -a )+ , ( 2 7 )
t h a t i s , a s a sum o f two O ( l ) r a p i d l y - v a r y i n g f i e l d s , e a c h h a v i n g c o m -
p o n e n t s in p~ase, so that each fails as an "efficient" dynamo of the
kind considered above. An analogous calculation can be carried out
for the standing wave
= (0, sin kx cos at, cos kx sin at), a << nk 2 (28)
to obtain helicity varying as sin 2at, and (28) can be expressed as
a sum of two progressive waves, moving in opposite directions with
phase speed a/k.
More generally, let
= ~ s i n ~ + ~ ' s i n ~ ' ~ - + ~ t , ( 2 9 )
and aseume Lk - k'I << k, I= - ='l <<~k 2" One them finds, using the
n o t a t i o n of Section 3,
A - ~k2 ,In~-a') (~,x~)o~ (30) n2k4 + 2
The operation k + k' can be thought of as a reflectlon across a
plane normal to ~ - ~', and if = = =' the dispersion relation for
the modes must be invarlant under this reflection; in addition, from
(30) it is clear that the two corresponding amplitudes must not be
parallel. The special case ~ = ~', ~' = -~, which could arise in
a conservative system, gives the tlme-periodlc induction.
These properties pertain to infinite fields having the proper
structure. In oan#u~ned rotating fluids hellclty can also be polarl-
205
ized as a result of "Ekman pumping" into a quasi-geostrophic flow. If
the latter has wavenumber k, the secondary flow set up by the Ekman
layer is of magnitude ~ Ta-i/dkL, L being the length scale for the
container, and the resulting helicity may or may not he comparable to
that introduced by other processes, depending upon the magnitudes of
k and Ta. Under certain conditions it can be demonstrated that the
Ekman layers are essential to a convective dynamo effect (see Roberts
and Soward 1972), but in certain idealized models (Section 5) they
definitely are not.
The polarity of the resulting hellcity is fixed by the direction
of rotation and is easily c~puted. At a point on the boundary with
outer normal ~, the nearby heliclty has the sign of -~-~. For the
rapidly rotating Benard layer (case I below), the polarity is the same
as that introduced globally by the convection mode for free boundar-
ies, but the distribution is different and (as Just noted) it is
smaller, by a factor Ta -I/12.
We turn now to the situation in rotating convection. We have in
mind, of course, convection in a heated sphere or spherical annulus,
but to get a qualitative picture it is helpful to consider several
planar "approximations" to parts of a spherical annulus, which we
indicate in Figure 2.
Figure 2. Planar "approximations". The heavy lines indicate isother- mal boundaries to be represented by tangent planes. In IV the included angle is made small. The polarization of helicity is indi- cated for rotation in the direction shown at the top. In a homogene- ous sphere~region ~ can be regarded as extending from top to bottom.
206
We consider these regions in turn. Letting g and ~ be parallel and
oonstant~ and choosing units of length, time, and speed to be L, L2/~
and K/L, we find the dimensionless linearized equations to be
8~ + Vp + Ta I/2 ÷ ÷ ffi T' ~--[ i~u - v2~ - R l g , v.~ = o (31)
!
@T' - V2T = 0 (32) ~ T i - ÷ ~ - I g
where ~g and ~ are unlt vectors, T' is the temperature perturba-
tion, and we take Ta >> i.
Case I. This classical problem is treated by Chandrasekhar
(1961). In the limit Ta ÷ ~, provided P exceeds about 0.68, convec-
tion ensues as a small-scale pattern. The critical wavenumber vectors
(L is now the layer thickness) are almost perpendicular to ~G, the
motion is quasl-geostrophic, and under reflection across the plane of
the layer the vertical velocity component changes sign. There is
accordingly polarization of the helicity along the axis of rotation,
in a manner similar to that obtained for the motion (26). The criti-
cal parameters are
R = 3(~2/2)2/3Ta 2/3, k ~ (2Ta/2)l / 6, (33) c e
Using the term "roll" to denote the convection field corresponding to
a given wavenumber vector ~ in the plane of the layer, and setting
~G = (0,0,1), a single roll has, in the case of free boundaries, the
form
~ sin ~z cos • ~3 + (~2Tal/2/k4) ÷ ÷ cos
E a c h s u c h r o l l c o n t r i b u t e s a n e n t r y i n t h e u p p e r l e f t 2 x 2 s u b m a t r i x
o f A, w h i c h i s a n e g a t i v e m u l t i p l e o f s i n 2 ~ z . The c o r r e s p o n d i n g
h e l i c i t y h a s t h e p o l a r i t y s h o w n i n F i g u r e 2 . F o r r i g i d b o u n d a r i e s t h e
results are similar over the interior, the secondary Ekman flow being
smaller by a factor k Ta -1/4 ~ Ta -1/12 according to (33). c
From the point of view of dynamo action, the essential feature of
this ease is the high degeneracy of the geostrophic flow, allowing
rolls of arbitrary direction. The fact that A can be made to have
rank 2 implies a relatively efficient dynamo process based exclusively
on small scale motions (see Section 5). This realizes in a simple
planar geometry the so-called ,, 2,, kinematic dynamo (Roberts 1971).
On the other hand region I can hardly be regarded as typical of the
sphere as a whole, especially sinoep strictly speaking, the geostro-
207
phic contours have this degeneracy only at the poles!
Case II. Here gravity and i~ = (0,0,1) are orthogonal, and the
choice of geostrophic velocity ~ = Y×$(x~y)~ 3 reduces the problem
to classical B~nard convection without rotation. There is no dynamo
effect from these rolls since particle paths lie in planes. (Indeed
from (18) it follows that B 3 decays, and the resulting two-dimenslon-
al non-dynamo can be regarded as a special instance of Cowling's
theorem, of. Roberts 1967.)
However~ these special solutions completely neglect the presence
of "sidewalls" which might represent the effect of the sloping
spherical boundary. As Busse (1970) has emphasized, the sidewall
constraint drastically upsets the geostrophic balance and the near-
equatorial region is best approached through case IV below.
Case Ill. We let ~g = (0,0,-i), ~ = (sin ~,O,cos ~). For
steady convection the Rayle~gh number is g~ven by
R = [k 6 + Ta(klSln ~ + k3cos ~)2]/-(k 12+. 2K2 )
where ~ is now an arbitrary vector. To make this expression an even
function of k 3 and therefore obtain modes of the kind needed to
satisfy conditions at the plane boundaries, it is seen that k I must
vanish, in which case the problem reduces to case I above but with Ta
replaced by Ta cos2~. The effect of the obliqueness is therefore to
reduce the critical Rayleigh number somewhat, and to restrict the
locally horizontal wavenumber vector to be nezrly perpendicular to
the plane of ~ and ;. Thus the 2 dynamo of Case I is reduced to an
incomplete or near incomplete s-effect, strongly biased toward induc-
ing i 2 current from i 2 fleld. In kinematic dynamo theory this induc-
tion is nevertheless essential to the success of the "~" dynamo
(Roberts 1971)~ as orlglnally envisaged by Parker (1955). In the
a~ mechanism the a-effect is supplemented by large-scale shear~ which
here replenishes ~2 field. Thus case III, which might be taken to the
be typical of a large fraction of spherlcal annulus~ suggests a
natural mechanism for obtaining "one-half" of the dynamo effect from
convection.
Case IV. Here one seeks to represent the effect of the sloping
sidewalls. A class of such models was studied by Busse (1970) and
subsequently used in a convective cycle (Busse 1975p see Section 6
below). The model has the advantage of describing rather closely, in
a simple geometry~ the essential physics of the convective instability
in a rapidly rotating, heated homogeneous sphere (Busse 1970).
The sidewalls upset geostrophy through Ekman pumping as well
208
as by their inclination to the axis of rotation~ but the latter
effect can be made to dominate. In this case Busse's results can be
obtained rather easily by inverting a device of the oceanographer and
replacing a slow decrease of the depth by a slow increase of ~. Let
= (-l,0,0) ~ - (0,0,i) and replace Ta I/2 by Tal/2(l" + Xx) in g , 3
(31). L e t t i n g
ffi V x ~(x,y,t)~ 3 , p - -(I +Xx)Ta I12 ~ + p'
and eliminating pt by cross-dlfferentlation we obtain
2 2 (-~ - V2)V2, = -~y (XTal/2* + RT') •
~T' V~T' ~-~ 2 ~2 ~2 _ = = +
P ~t ~y • V2 ~x 2 ~y2 "
If we look for modes proportional to exp(i~t + lax + iby)
( P i e + c 2 ) ( i ~ c 2 + c 4 + i a A T a 1 / 2 ) = Ra 2 ,
(35)
( 3 6 )
we have
( 3 7 )
where c 2 ffi a 2 + b 2. Thus
ffi - ( a A T a l / 2 ) / ( p + 1 ) c 2 • a2R ffi c 6 + (P/(P+I))2a2ATa/c 2
a n d t h e c o n d i t i o n s f o r m a r g i n a l s t a b i l i t y b e c o m e
= 2 - 1 1 6 ( ~ P l ( t + p ) ) l l 3 T a 116 , b = O, R = 3 ( X P I ( l + p ) ) 4 1 3 ( T a 1 2 ) 2 1 ~ ac c c
(38)
If X is regarded as small• the perturbation is on the strict geostro-
phic equilibrium of case If, but it is a singular perturbation with
strong roll selection. If A is regarded as % i, the ordering (38) is
in accord with that of cases I and IIl~ although the mode in the
present case has some features of a Rossby wave. In Busse's formula-
tion A is a typical sidewall slope and should be taken as positive,
so the phase speed given by (38) is "eastward". (If one regards the
observed "westward drift" of the non-dlpole geomagnetic field as a
phase speed of a flew field associated with the ~-effect, this result
is disappointing. However• as Busse notes the g~oup velocity ~s west-
ward, and this raises the question of whether sufficiently super-
critical convection in this system might not take the form of inter-
mittent wave packets.)
As we have noted the model can be interpreted as an appropriate
local section of a sphere, parametrized by the latitude of the inter-
209
section with the boundary andjif this is done~reasonable agreement is
obtained with the stability analysis of a rapidly rotating heated
sphere (Roberts 1968, Busse 1970). Roberts found that convection
begins on a cylindrical annulus oriented parallel to the axis of rota-
tlon~ with radius about half that of the sphere, and consists of
vertical rolls of dimension % Ta -I/6 along the aximuth, the radial
thickness of ~he convection zone being % Ta -I/9 both relative to the
radius of the sphere. It would appear that in the planar model the
radial structure is lost, or rather reflected only in the vanishing
of b c in (38). Certainly the spherical case should have a local
approximatlon with radial structure, which could be incorporated into
a dynamo model. Soward's remarks at this meeting concerning his
current calculations of the localized D-effect point toward such a
possibility, which would in effect open the way to an asymptotic
analysis of the spherical dynamo.
5. B~NARD-TYPE MODELS
We c o n s i d e r i n t h i s s e c t i o n t w o e x a m p l e s o f a e o n v e c t l v e d y n a m o
cycle based upon a classical B~nard layer. Busse (1973) considers
B~nard convection without rotation. It is assumed that only one set
of rolls is present, so hellclty is created by adding a shear flow
alone the axis of the rolls. Such a flow is somewhat artificial for
a B~nard layer, but it can in principle be driven by a modification
of the mean temperature profile. Moreover, such distortions might
well arise in a different geometry through convective heat transport
obllque to the direction of gravity. To effect a scale separation it
is assumed that the (spatial) mean magnetic field is dominated by a
component orthogonal to the roll axis and slowly-varylng along it.
In this model the mean hellclty vanishes, since the unidirectional
shear flow passes down rolls of alternating sign. The kinematic
dynamo effect is therefore not the first order D-effect of Section 3,
but rather a hlgher-order mechanism involving the spatial derivatives
of the mean magnetic field. (In the terminology of Roberts 1971 the
dynamo is of "B~" type.) Busse uses a numerical method to study the
equilibration of the system and the partitioning of internal energy.
While his approach, being essentially quasi-steady, does not deal with
the dynamics of equilibration, it does reveal an interesting balance,
similar to that suggested by (ii*), which is perhaps typical of near-
critical convective dynamos. Namely, if R - R > 0 is sufficiently c
+ bE B = constant, where a and be are posi- small, one finds that aE u *
tive constants, but that E must exceed a critical value E in order U U
210
to have a dynamo effect. The conclusion is that for E > E* the u u field energy will increase as the kinetic energy falls, while for
E < E the kinetic energy will rise as the field decays. Ultimately u u the magnetic field will be sustained at the maximum energy compatible
with the above dynamic constraint as well as stationary dynamo action.
The second model, put forward by Childress and Soward (1973) and
worked out by Soward (1974) for equilibrium at minimum field energy,
is based upon the rotating B~nard layer considered as case I in Sec-
tion 4. This model utilizes the rotation of the fluld to effect the
scale separation, in a manner which permits the dynamics of equilibra-
tion t o be followed in detail. We take P~ ~ i.
The "weak-field" case studied by Soward (1974) assumes that the
Hartmann number of the induced field is ~ I. This a pr~or~ hypothesis
on field intensity is then Justified by exhibiting consistent, appar-
ently stablejoperating states. The rolls have the form (34) with ~ an
arbitrary vector in the plane of the layer. The length L is here the
thickness of the layer, so that for regeneration of the field we must
have (of. (25) with ~ = 0)
R 2 ~ k ~ Ta I/6 , m c
where Rm = UL/~ is a magnetic Reynolds number based on roll ampli-
tude U. Thus Rm % Ta 1/12 and the small-scale field satisfies
g ~ Ta-I/12f. This ordering generates a series solution in powers
of Ta -1/12 .
The mean field equations are easily obtained for a discrete or
continuous distribution of rolls, and in the former case, with
= [Bl(Z,t),B2(z,t),O), take t h e form
~B i ~2B i + 2~A ~ [sin 2Wz Mij(t)B j] ...... 0 (39)
~z 2
where, in terms of the A in (24),
M = [ -A21A11 -A22 IA12 (40)
Here the unit of time is L2/n. Note that, since we have in mind that
roll structure is to be determined by auxiliary equations of evolution,
the discreteness or continuity of the roll pattern is determined by
the initial conditions. In (39) and (40) the normalization is such
that All + A22 = 2 so that A(t) i s a parameter proportional to
the kinetic energy of the flow.
For the weak-field solutions, it turns out that near marginal
211
instability A is fixed by the quantity (R - Rc)/R c. Thus the dynamics
of the model reduces to the study of how the magnetic field detez-
mines the partitioning of a fixed constant kinetic energy among the
various rolls. Soward was able to show that the roll structure indeed
evolves on the same time scale as the field. If q(t,~) denotes the
kinetic energy in a given r011, where ~ = ~/Ta I/6, the equation for
q takes the form
d ft. 2 ~ Q(K,K ) q ( t , K ) q ( t , K ) d t
1 + - [~(t,~) - ~ ~ ~(t,K') q(t,K')] q(t,K) = 0 , (41)
where Q(~,~') and B(t,K) are given explicitly, the former being skew
in its arguments. The magnetic field is contained in the quantity B,
which takes the form
= f(~) + g(~) . ZjKIKj ~(Bisj7 , (42) i i
~(f) = [ ( W2c°s2Wz - K2eln2~z)f dz i
0
Finally, the matrix A in (40) is obtained from the q's by
(t,~7 (43) Aij = ~ K 2 q K
where we may set K = K . c
The weak-field model is then given by (39)-(437 with R as para-
meter. Soward examinee 2 and 3-roll solutions of this system, as well
as an interesting continuous-roll solution, and finds a tendency for
the kinetic energy to localize itself at any one time in rolls near
a single direction, but the direction itself changes with time. In
fact, as the number of admitted discrete rolls is increased, the solu-
tions tend increasingly to resemble a single rotating roll, a property
then explicitly exhibited in the continuous roll example, where energy
is dispersed about an orientation which rotates with uniform speed.
Physically, the magnetic field, at each instant, favors rolls with a
certain orientation (determined by the term ~ in (41)). From (39) one
sees that the ~-effect then feeds energy into the component of the
field pGral~el to the preferred roll axis. If the field were indepen-
dent of z, the analysis of Eltayeb (19727 would apply and it could be
concluded that rolls with axis o~thogonu~ to the field are most
unstable. Thus both field and roll axis rotate, as the ~-effect keeps
212
up with the destabilizing effect of the field. This argument would
imply that rotation is opposite to the large-scale rotation of the
layer (of. M in (39) when A is diagonal), a property that was always
obtained in Soward's calculations.
The continuous-roll solution consists of at least two branches
when E B is plotted as a function of A, with subcritical bifurcation
occurring from E B = 0 , A = 1.5974, and it seems likely from the 2
and 3-roll calculations that some of these are stable on the time
scale of the model. (Soward establishes dynamic stability on the
relevant short time scale.) On the other hand, it is by no means
obvious that these solutions are stable to finite increases of
initial field energy, and indeed the Eltayab-Roberts ordering men-
tioned in Section 2 would suggest that they are not.
Preliminary attempts by the author to test the stability of the
dynamo by startin E it with magnetlc energy correspondin 8 to a Hartmann
number ~ Ta 1/12 (the Intermediate-field regime) have in fact uncovered
several kinds of instability, and recent unpublished calculatlons of
Yves Fautrell in the strong-field regime Ta I/6 < O(M) < Ta I/4 also
indicate instability under certain conditions. It is not definitely
known at this time whether or not there are regimes other than that of
weak field where local stability is obtained. It is possible that
stability is retained only at the "very strong field" level M ~ Ta I14,"
but there the multiple-scale procedure is ineffective since k c % i.
At the intermediate level One finds, first, that dynamic stability
on the fast time scale, shorter than L2/n by a factor Ta -I/12, is
upset. Examination of some two-roll solutions show that this
Instability represents collapse onto a single roll (without the dis-
persion of energy about a preferred direction which characterized the
weak-field ¢ontlnuous-roll solution), Single-roll solutions are
dynamically stable on the fast time scale at the intermediate level.
Single-roll solutions are found to be unstable, however, on the time
scala L2/~ ! To sea how this happens we write out the equation for q
at the Intermediate-fleld level:
Ta-i/6 d__qq = dt q[Cl + c2e(t) + c3 ~-(~'~)z)] ' (44)
where the cts are positive constants. Here GTa -I/6 is an amplitude of
the perturbation of the mean temperature, satisfylng
d._00 + c4 e = - c5 A (45) dt
where again the c's are positive constants. In particular, at the
213
fntermediate level A must he retained as a function of time even at
fixed Rayleigh number, Let us assume tha~ the system (39) p (44), (45)
is started "most stably" by adjusting the roll angle and 8 to make the
right-hand side of (44) take on its maximum value, and that this value
is zero. Assuming that this configuration (stable on short time scales)
is maintained, we may neglect the left-hand side of (44), solve for Q,
and use this expression in (45) to obtain an expression for A in terms
of B, which expression can then be used in (39) to obtain an equation
for ~.(This procedure also fixes the roll angle as a function of B.)
Numerical studies clearly show that in general this system allows solu-
tions which diverge in a finite time, owing to the quadratic dependence
of A upon B, and quite apart from the dynamo or non-dynamo property of
a single roll. In effect it appears that the field has destabilized
the convection to the extent that divergent behavior of the field is
caused by the rapid increase of the kinetic energy and heat flux, rather
than by dynamo action,
One reason for these difficulties may lie in the degeneracy allowing
multiple-roll solutions. Indeed, if a single roll direction could be
fixed by other considerations (as in Cases III and IV in Section 4),
the component of the field which is amplified by dynamo action is ortho- +
gonal to K, and so does not enter into (44). So long as more than one +
roll direction is permitted, however, the quadratic growth of A with B
would probably have the instability of the rotating roll.
It can also be asked whether the instability is not an indirect result
of the isothermal boundary condition, which more realistically should be
replaced by a condition of constant mean heat ~£ux. Some tentative work
at the intermediate level did not indicate a stabilizing effect, but the
question remains undecided at the higher field levels.
6. THE ANNULUS MODEL
As an outgrowth of his quasi-planar analysis of convection in case IV
above, Busse (|975) has considered a corresponding convective dynamo model.
His approach, as in the stability analysis, is to consider a simultaneous
expansion for large Ta and small %D, where I is a typical boundary slope
and D >> | is the width to height ratio to the annulus. The geometry is
summarized in Figure 3.
214
T = T 2
a ~ 0,112)
I T ~ T I < T 2
1 (DI2,0,0) ',4- g
Figure 3. Busse's annulus model; lengths are in units of the mean height. The almost horizontal boundaries are given by 2z = ~ e x p ( - ~ x ) .
The top boundaries are treated as rigid interfaces between conductor
and non-conductor, and on the vertical ones the temperature perturba-
tion as well as the x-component of velocity vanish.
The latter conditions are particularly important, since they
enforce a fixed roll structure on the convection, effectively remov-
ing the degeneracy of case I. Indeed, from the analysis of Section 4,
we see that a 0 and R e again have the asymptotic expressions (38),
but now b = w/D~ so the most unstable velocity mode has the form c
= ATal/2(sln(Wx/D + w/2) sin acY, a c
(46)
(W/acD) eos(wx/D + ~/2) cos acY, 0).
Of course, once R exceeds R a band of wavenumbers with continuously c
varying b will grow~ but rather than introduce a Fourier transform we
follow Busse and take (46), which can be expressed as a sum of two
rolls with wavenumber vectors a ± T/D, as typical of the convec- c
tire mode.
The hellcity corresponding to (46) vanishes Identlcally, so the
kinematic dynamo action rests on the combined effects of Ekman pumping
and sidewall slope. For suitable parameter values the former effect
can be made to predominate, and an a-effect is achieved, but one which
is strongly biased, the induced current associated with a y-component
of the mean field being (ae/bc)2 H 1/e 2 times that associated with a
215
comparable x-component. The steady-state mean field is thus found to
satisfy an equation obtainable from (39) by replacing sin 2wz by z,
and M by
I °I 0 l/e
where
(47)
m = [(A2a5Ta3/4)/8~ (w~ + a4p2/p23(w/D). c cq
(48)
These expressions can be derived using the familiar results for the
by the Ekman layer, together with (25). Since ~c/a~%p-l- flow induced
we see from (47) and (48) that the quantity
r = p2A2acTa3141D(I+P~) (49)
must exceed a positive number of order unity if we are to have a
dynamo effect.
A second condition is imposed by the two-scale expansion. The
small-scale magnetic field can be estimated from (22) as follows:
The dimensionless velocity amplitude is a A Ta 1/2, and since the c
x-component of the field predominates (see below), the relevant shear
is I/D times this amplitude. Thus
g/f % ac A Tal/2D-I 2 p2 2 4 -i/2 (mc + n /pac) << 1
is a condition on the expansion. Combining (50) with the condition on
r and using (38) we have
Tal/4zD >> 1 , (50)
and this inequality is easily met by the assumed ordering. On the
basis of his solution of the kinematic dynamo problem Busse concludes
t h a t if
acbc/P~Tall4 >> 1 (51)
(the inequality followlng equation (5.3) of Busse 1975), the expan-
sion is consistent. This adds a much stronger condition, which can
only be met, with a c given by (38) and b c ~ l/D, by making P smGlZ.
Since it is important to retain P~ as a large parameter in a geodynamo
model, it will be of interest to know if (51) can be relaxed while
216
maintaining a consistent expansion, or if (50) by itself might be
sufficient.
The equilibration of the system is studied using an equilibrium
calculation as for the non-rotatlng layer model of Section 5, with
similar results: For slightly supercrltlcal convection the magnetic
and kinetic energies are linearly related and the system equilibrates
as the dynamo effect becomes stationary.
The model has a number of advantages over those of B~nard type,
the foremost being that it is constructed to represent the convection
within a region of a homogeneous rotating sphere. Roll structure is
independently fixed by the boundary conditions, rather than evolving
in response to the mean magnetic field. The a-effect is of a new
type, induced by the response of the domain to a Rossby-llke wave.
As Busse notes, the rather stringent conditions on the parameters
can probably be considerably relaxed without affecting the qualitative
features of the model. Moreover, the behavior of the system as a weak
~2-type dynamo is probably of secondary importance compared to the
insight it gives into the possible origin of the ~-effect in a sphere.
In this connection it should be mentioned that the effect of the
boundary (absent in the realization of the case IV considered in Sec-
tion 4) enters into u as O(A2), and thUS is a reflection of boundary
our~Gture. This boundary contribution is independent of viscosity and
can be made to predominate over that due to Ekman pumping, although
Busse does not investigate the full dynamo cycle in that case.
On the other hand, certain features of the solution, imposed by
its asymptotic form, should be noted. First, the s-effect is such
that B 1 ~ E-2B2 , and since B 1 here represents the "meridlonal"
component of the field, the dynamo is characterized by a small
"toroidal" component. As Busse notes, the implication is, if one
accepts the model when ~ ~ I, that the two components are comparable,
but it is disturbing that this state is approached through an
asymptotic ordering that is usually regarded as improbable in the
Earth's core. A second point concerns the possibility of subcritical
instabilities. We have seen in the case of the rotating B~nard model
that the locally stable weak-fleld case may not be stable to finite-
amplitude perturbations in the magnetic field, and the question arises
as to whether or not a similar state of affairs prevails here. If one
examines the stability in case IV with an applied u~form magnetic
field of the form B(~ + E2J), it can be seen that (37) is replaced by
4 M2b 2 (IP~ + c 2) (iec 2 + c + + lalTa I/2) = Ra 2 (52)
217
where M is the Hartmann number based on B. From (52) it is easily
seen that convection at Raylelgh numbers ~ ATe I/2 can be realized
provided that a ~ b % 1 and that M 2 ~ lTa I/2. This is fully
analogous to the Eltayeb-Roberts ordering mentioned earlier, and we
suggest that there may be similar implications for the present model
at higher field energies.
7. MODAL EXPANSIONS
Numerical calculations utilizing truncated expansions in funda-
mental modes have played a prominent role in the kinematic dynamo
theory (we mention in particular the pioneering paper of Bullard and
Gellman (1954) and the recent study of Gubhins (1973)) as well as in
the simulation of thermal convection (GouEh , Spiegel, and Toomre 1975).
St is natural to consider the application of these methods to the
convective dynamo.
One immediate difficulty is the choice of appropriate "fundamental
modes", capable of representing the system at a rather low level of
truncation. The asymptotic models of the kind discussed above, which
have something of a "modal" character near the critical Eaylelgh
number, can be helpful here. The practical problem is, of course,
that if the asymptotic solution were to represent a globally stable
state, its finlte-amplitude modal counterpart offers a modest and
perhaps unnecessary extension. On the other hand, the value of the
modal approach lles in 8~muZut~o~ of the dynamo, and there the struc-
ture of the asymptotic solutions may be misleading. To take a speci-
fic example, in rapidly rotating non-magnetlc B~nard convection the
roll structure is given by (34). As we have seen, however, the
appropriate horizontal scale of the convection may increase dramati-
cally once a magnetic field is developed and M 2 ~ Ta I/2. Generally
the horlzontal scales of the modes are prescribed at the outset and
it is not obvious, u pP~or~,what value should be used.
In a rotating B~nard layer, the fundamental modes for velocity or
magnetic field will generally consist of a "pololdal" part
= Pz(Z,t)Vf(x,y) - F(z,t)V2f(x,y)~ 3
a n d a "toroldal" part
= C(z,t)~ 3 × Vg(x,y)
218
where f and g are functions chosen to represent the horizontal struc-
ture. The fields are built up from a finite number of such terms,
each corresponding to a choice of a,b in the equations q2f + a2f ffi 0,
V2g + b2g = O.
This approach has been applied by Baker (1973) to a B~nard-type
convective dynamo. Baker focuses on a "2-mode" closure (the number of
distinct F and G) and specifically on square convection cells generat-
ed by the choice: f = cos(ax), cos(ay). If the system does not
rotate, the model can be further reduced to "1-1/2 modes" by express-
ing the magnetic field in terms of one poloidal and one toroldal
component, and the velocity field in terms of two poloidal modes. One
then obtains six equations, second order in z and first order in t,
for the undetermined functions. The full 2-mode system includes
additional toroidal parts of the velocity field and takes account of
the influence of the Coriolls force, so it would appear to be the
simplest modal realization of a rotating dynamo.
In the l-I/2-mode closure dynamo action was found to occur over
a range of parameter values and for various boundary conditions.
In Figure 4 we show the energies developed in one of the oscillatory
dynamos. In this example the mean magnetic energy is about 2.5 times
the mean kinetic energy, and the tendency for the peaks to be out of
phase is consistent with Busse's quasl-equilibrlum analysis at margi-
10 4
W Z W
0
t o t a l
0 TIME
Figure 4. ~n oscillatory dynamo with 1 1/2 modes, for perfectly conductin~ rigid walls, R = 105 , P = 0.i, P = I, a - 3.I (from Baker 1973)).
219
nal stability (Section 5). The effect of rotation was not studied in
the same detail, but in some preliminary computations with 2-mode
closure the rotation was found to enhance the dynamo effect. Perhaps
most interesting is the fact that the calculations suggest the exis-
tence of a convective dynamo effect in a B~nard layer without rota-
tion, for square cells whose horizontal dimension i s comparable to
the thickness of the layer. Unfortunately, Baker notes a rather poor
convergence in going from 1-1/2 to 2 modes, so it must be regarded as
possible that the dynamo effect is illusory at this low level of
truncation.
Probably the simplest model of B~nard convection involves single
modes for velocity, perturbed temperature, and mean temperature, and
a further projection of the vertical structure onto a mode of the form
exp(Imwz). The resultlng system of three first-order ordinary differ-
ential equations in time is known as the Lorenz-Howard-Malkus or "ABC"
convection model (Lorenz 1963, Malkus 1972). Recently Kennett (1976)
has extended thle system to encompass magnetohydrodynamic convection,
by the addlt~on of terms representing the pololdal and toroldal field
components. The resulting "ABCDE" model can be thought of as a pro-
Jection of the vertlcal structure of Baker's 1-1/2 mode system, and
is simpler by one equation because of the absence of one pololdal mode
in the veloclty. Indeed it is probably the mlnlmal modal system for
a non-rotatlng convective dynamo. An interesting aspect of the form-
ulatlon is that it should allow systematic study of periodic and
aperiodic behavior; the latter is known to occur in the ABC model
(Lorenz 1963) as well as in other third-order systems (Baker, Moore,
and Spiegel 1971). The "CBE" part of the system, moreover, bears a
certain resemblance to the shunted dlsk-dynamo model studied by
Robblns (1975).
Kennett shows that the system admits equilibrium solutions with
non-zero magnetic field provided that
R > Cl +
where the o's are constants determined by the form of the horizontal
modes. For a range of parameter values these equilibria are unstable,
however, and by applying the method of averaging it is shown that
there exist in that case, in the limit of large time, nearby linearly
stable periodic solutions with non-zero magnetic field.
Because of their relative simplicity these systems are very useful
and deserve further study. It would be interesting to know, for
examplej what insight could be gained concerning the role of rotation
220
in the dynamo process, through the addition of a toroldal velocity
mode. Also, it is to be hoped that as our understanding of the pro-
cess deepens, a mode structure can be devised which converges rapidly
with the truncation level.
8. TOWARDS SIMULATION OF THE GEODYNAMO
We have not dealt in this paper with the well known solution of
the kinematic dynamo problem discovered by Braginski~ (see the review
of this work in Roberts 1971), since this approach was not exploited v
in the convective dynamos discussed above. The Braginskii dynamo has
the advantage of making no special assumptions regardin E the distri-
bution of spatial scales. Rather, a simplification is achieved by
requiring the magnetic Reynolds number of the velocity eddies to be
large. This enforces a certain symmetry on the fields (near axial
symmetry in a spherical core), which are then dominated by their
symmetric toroidal parts.
Recently Braginskii has initiated a study of the corresponding
spherical convective dynamo (Braginskii 1975). In this first paper
the fluctuating component is assumed given, so the problem reduces to
equations for the symmetric components of the fields. The questions
raised by the multi-scale convective dynamos, concerning the origin
of an s-effect and mean Lorentz force from the small-scale convection,
are thereby avoided, and the dynamic balance for the symmetric fields
can be studied at energies believed realistic for the geodynamo.
Braginskii proposes a solution in which the meridlonal magnetic field
within the core is predominantly parallel to the rotation axis. The
field is matched with its mantle counterpart through a magnetic boun-
dary layer at the core-mantle interface. Since the dynamo is of
"~" type, the azimuthal velocity which provides the "~-effect" must
be determined from the dynamics of the symmetric fields. As Roberts
and Stewartson (1974) have emphasized, this is a crucial step in the
construction once M 2 ~ Ta I~2. Braginski~'s model, which determines
the azimuthal flow by a process involving electromagnetic coupling of
core and mantle, thus confronts a problem not faced in the idealized
layer systems. (For a different approach to this question, devised
for u2-dynamics, see Malkus and Proctor 1975.) It is probably fair
to say that the convective origin of the s-effect is only one-half,
and perhaps the easier one-half, of the dynamical problem, and we
await with considerable interest the further development of this
approach to the spherical convective dynamo.
221
We conclude with a few general observations. For the sake of
argument we adopt a conservative attitude, as will be clear from the
following list of postulates for the geodynamo:
(I) The field is maintained by heating at a uniform rate in a
region of size L, fixed within the core relative to the rota-
tion axis.
(2) Within thls region, core motions are irregular (in particular
poloidal and toroldal components are comparable) and can be
characterized by a speed U and length scale L.
(3) Within this region the magnetic field is also irregular, with
field strength B > 0 and length scale L.
(4) Within this region the Coriolis, Lorentz, and buoyancy forces
acting on a fluid element are comparable.
(5) The system varies on a time scale of magnetic diffusion.
Given that heating is uniform and the system is Boussinesq, these
hypotheses are close to the "worst possible" if the aim is a pertur-
bational analysis. Indeed from (2) and the existence of a dynamo
effect it follows that R m - UL/~ ~ I, so that kinematic dynamo
problem is without a small parameter. Balancing the Corlolis and
Lorentz force we then have M 2 ~ Ta I/2, an ordering already encounter-
ed in Section 2. With (5) the units are fixed and, if it is addi-
tionally postulated that viscous and inertial forces are negligible,
the system reverts to the dimensionless form (13)-(15) (for example).
It is plausible that (with the possible exception of Ekman layer
effects and core-mantle coupling) the resulting equations contain the
relevant physics and the important matter is the ordering of terms.
In each perturbational model one or more of the above postulates is
relaxed.
A crucial question is the appropriate magnetic Reynolds number of
velocity eddies. Estimates range from 1 to 104 (Gubblns 1974). How-
ever, in view of the uncertainty over the possible size and location
of a convecting region in the Earth's core (cf. Busse 1975) a value
in the range i0-i00 is not unreasonable. This would tend to favor
Braginskii's ordering of the kinematic dynamo, but there is a
possible alternative, namely that the symmetry of the field with
respect to the rotation axis is a result of the location of the con-
vective region and the nature of the dynamo effect within it. In that
case (2) and (3) might reflect irregular motion with moderate concen-
trations of magnetic flux (Weiss 1966).
Regarding the induction problem, it is tempting to add a postulate
to our llst, namely that the dynamo is of "u~" type, even though if
Rm % 1 the ~ and e-effects are difficult to separate. We suggest that
222
the u-effect could be realized as in Busse's annulus model, or as in
case I~I of Sectlon 4. Busse's model is especially attractive, since
it also suggests how the corresponding w-effect could be developed.
Suppose we alter the direction of gravity to reflect the inclination
which occurs over most of the cylindrical annulus of rolls in the
marginally convective heated sphere. The convective heat transport
is then oblique to gravity, so the mean temperature field is distorted,
in such a way that the ~-effect arises from the "thermal wind". One
can Check that if the distortion is of the order of the equilibrium
mean temperature profile, the magnetic Reynolds number of the thermal
wind is indeed ~ i provided M and R are ordered as above. These q
estimates are likely to be modified somewhat if the convective zone is
only a small fraction of the electrically conducting region.
The geometry of the convecting zone relevant to the e-effect may be
significantly altered if, as Kennedy and Higgins (1973) suggest,
convection in the Earth's core occurs only near the inner core. In
that case the appropriate annulus model may involve a depth which
~norease8 with distance from the rotation axis, implying an a-effect
from ~estwurd-movlng waves.
Equations (13)-(15) indicate that Pn is a significant parameter in
our problem, a point that has been emphasized by Roberts and
Stewartson (1974) in their study of dissipative M.A.C. waves arising
in rotating magnetoconvectlon (cf. Roberts and Soward (1972)). It is
not clear whether ultimately the most profltable course will be to
take P~ ~ i, or rather to use the singular limit process P~ ~ (pre-
sumably leading to localized convective heat transport and a reorder-
ing of the variables) as intrinsic to the geodynamo.
One aspect of the problem which would appear to deserve further
study is the possibility of obtalnlnE more refined estimates of solu-
tions along the lines of the calculation of Kennett (1974), perhaps
wlth a view to maximizing magnetic energy in a system driven by
internal heating. It is likely that the Eeodynamo operates in a
state which is "optimal" in the realized mean magnetic energy (cf.
Section 2), and once the nature of this state is determined we can
expect, on the basis of the considerable advances made over the last
decade~ that it will then not be too difficult to secure a dynamical
model of the process.
ACKNOWLEDGEMENTS
The author is indebted to E. A. Spiegel for conversations, and to
L. Baker for Figure 4. This work was completed with the help of a
generous grant from the Guggenheim Foundation.
223
REFERENCES
Backus, G. (1958) Ann. Phys. 4, 372
Baker, L. (1972) Thesis, Astronomy Dept., Columbla University
Baker, N. H., Moore, D° W. and Spiegel, E. A. (1971). Quart. Jour. Mech. Appl. Hath. 2_~4, 391
Braginski~, S. I. (1975) Geomag. and Aaron. 15, 149
Bullard, E. C. and Gellman, H. (1954) Phil. Trans. Roy. Soc. Lend. Set. A 247, 213
Busse, F. H. (1970) Jour. Fluld Mech. 44, 441
Chandrasekhar, S. (1961) Hydrodynamic and Hydromagnetic Stabillty, Oxford University Press
Childrass, S. (1967) Courant Inst. Report AFSOR-67-0124
Childress, S. (1969a) Theorle magn~tohydrodyuamique de l'effet dynamo, Lecture Notes, M~canique theorique, Facult~s des Sciences, Paris
Childress, S. (1969b) in The Appllcatlon of Moder n Physics to the Earth and Planetar 7 Sciences , edited by S. K. Runcorn, Wiley, London
Childress, S. (1970) J. Math. Phys. 11, 3063
Eltayeh, I. A. and Roberts, P. H. (1970) Astrophys. Jour. 162, 699
Eltayeb, I. A. (1972) Prec. Roy. Soc. Lend. Ser. A. 326, 229
Geugh, D., Spiegel, E. A., and Toomre, J. (1975) Jour. Fluid Mech~ 68, 695
Gubbins, D. (1973) Phil. Trans. Roy. Soc. Lend. Set. A 274, 493
Gubbins, D. (1974) Rev° Ceophys° Space Phys. 12, 137
Hewitt, J. M., McKenzie, D. P. and Weiss, N. O. (1975) Jour. Fluid Mech. 68, 721
Kennedy, G. C. and Higglns, G. H. (1973) J. Geophys. Res. 78, 900
Kennett, R. G. (1974) in Notes on the Summer Study Program in Geo- physical Fluld Dynamics, Woods Hole, 94
Kennett, R. G. (1976) Stud. in AppI. Math. 55, 65
Lorenz, E. N. (1963) J. Atmos. Sci. 20, 130
Malkus, W. V. R. (1972) M~m. Soc. Roy. Sci. Liege, 6th series, 4, 125
Malkus, W. V. R. (1973) Geophys. Fluid Dyn. 4, 267
Malkus, W. V. R. and Proctor, H. R. E. (1975) Jour. Fluld Mech. 67, 417
Moffatt, H. K. (1968) Jour. Fluid Mech. 35, 117
224
Roberts, G. O. (1970)
Roberts, O. 0. (1972)
Roberts, P. H. (1967) Elsevier
Roberts, P° H. (1968)
Parker, E. N. (1955) Astrophys. Jour. 122~ 293
Robbins, K. (1975) Theslsp Dept. of Math., M.I.T.
Robberts, G. O. (1969) in The Application of Modern Physics to the Earth and Planetary Interiors, edited by S. K. Runcorn, Wiley, London, 603
Phil. Trans. Roy. Soc. Lond. Set. A 266, 535
Phil. Trans. Roy. Soc. Lond. Set. A 271, 411
Introduction to Magnetohydrodynamics, American
Phil. Trans. Roy. Soc. Lond. Set. A 263, 93
RoBerts, P. H. (1971) in Mathematical Problems in the Geophysical Sciences, Vol. 2, edited by W. H. Reid, A.M.S., Providence, 129
Roberts, P. H. and Soward, A. M. (1972) Ann. Rev. Fluid Mech. ~, 117
Roberts, P. H., and Stewartson, K. (1974). Phil. Trans. Roy. So¢. Lond. Set. A 277, 35
Weiss, N. O. (1966) Prec. R. So¢. Lend. 293, 310
Weiss, N. 0. (1971) Q. Jour. Roy. Astr. Soc. 12, 432
PENETRATIVE CONVECTION IN STARS
Jean-Paul ZAHN
Ohservatoire de Nice - FRANCE
I. Introduction
Penetrative convection occurs in a fluid whenever a conyectively unstable region
is bounded by a stable domain. This situation is encountered in many stars, and it is
also a very com~non circumstance on Earth: in the oceans and in the atmosphere. One
would therefore expect that the astrophysicists may largely benefit from the experience
accumulated on this subject by the geophysicists.
However, this is only partly the case. In the ocean, salinity plays a very
important role and especially so at the interface between a stable and an unstable
(mixed) region. In the atmosphere, the behavior of the convective planetary boundary
layer is dominated by the 24 hour thermal cycle, so that a steady state is never achie-
ved ~ as it is in a star (at least in one that is not pulsating). Furthermore, the ratio
between viscosity and conductivity, as measured by the Prandtl number, is of order unity
for water and air, but it drops to 10 -~ and less in a star. Finally, the effec~of stra-
tification are much stronger in stars where convective regions often span several density
scale heights.
For all these reasons, the astrophysicists have developed methods of their own
to describe stellar convection, even though some are widely inspired by those used by
the geophysicists. The same is true for convective penetration, whose study cannot he
separated from that of convection itself. The purpose of this review will he to recall
those methods, and to verify if they are suited to describe the penetration of convective
motions into stable surroundings.
~I. Phenomenological approaches
In those approaches, one hypothesizes a flow which is plausible in that it does
not seemingly contradict the laws of fluid dynamics and that it conserves heat and
kinetic energy. One then calculates the gross parameters that characterize this flow:
convective flux, mean temperature gradient. The most commonly used of such procedures
are based on the concept of mixing length, and have already been discussed in this
colloquium by D.O. Gough.
226
I. Non-local mixing-length treatment@
All mixing-length procedures applied to stellar convection are in fact based
on the two differential equations describing:
i) the variation with height z of the density excess 60 between a convective element
and the surrounding medium, in which the densities are respectively p* and P
~zz(~p ) d__e* - d_@_p ( ] ) dz dz ,
ii) the variation of the kinetic energy of that convective element
d l (2) d--z (5 P v2) = - 6P g ,
where g is the gravity.
The standard prescription (Vitense ]953) is to replace these equationsby
dO l ~ (3) ~P :k~ ~ ~ ~ ,
] v 2 ~Sp ~, (4) ~- = - C g P 2 ,
being the mixing length and C an efficiency factor which allows for the production
of turbulent energy. In this treatment, both the density excess and the convective
velocity are functions of local quantities only (the mixing length and the density
gradients); by construction the convective motions cannot penetrate into the stable
adjacent regions.
That constraint may however be relaxed by treating the original differential
equations in a less crude way. This was done by Shaviv and Salpeter (1973), Maeder (1975a)
and Cogan (1975), to be specifically applied to the overshooting from a convective stellar
core. The differential equations are integrated over one mixing length (or up to the
point where the velocity vanishes, whichever happens first):
d0 1 (" 1
1 v 2 6 p d z ( 6 ) y =-c g T ' • z-z.~£ l 1
227
(To formally recover certain results of the standard scheme, Maeder identifies the
integration distance with half the mixing length). The density stratification dp/dz
of the ambient medium is adjusted until the constancy of the total energy flux
(convective plus radiative) is realized.
This non-local mixing-length treatment permits the description of many ~eatures
of penetrative convection in the laboratory or in the Earth~atmosphere. A convective
element ceases to be buoyant at some distance from the unstable region, where also the
convective flux vanishes; from there on its momentum carries it still further into the
stable region, and since it is cooler than the surrounding medium, the convective flux
is of opposite sign. In a stellar core, the P~clet number is very high and thus the con-
vection is extremely efficient; it follows that the whole domain where the motions occur
is kept nearly adiabatic.
The main weakness of this approach, as one may expect, is that all quantitative
predictions depend on the assumption made for the mixing length. Another parameter plays
here also Some role, and it too can only be guessed: it serves to measure the fraction
of space filled by the convective elements. In the bulk of the unstable domain this
parameter is probably close to unity, but in the overshooting region, it drops to one
half and possibly much less, because it is unlikely that many downwards moving elements
are present there.
In a generalization of the mixing-length procedure proposed by Spiegel (]963),
the number of convective elements is not fixed a priori, but is governed by an equation
of conservation similar to the radiative transfer equation. Travis and Matsushima (]973)
have applied this non-local theory to the solar atmosphere, and they obtain an apprecia-
ble overshooting into the photosphere. In order to match the solar limb-darkening obser-
vations, they must choose a ratio of mixing length to pressure scale height of 0.35 or
less. Unfortunately, this value is too small to yield the correct solar radius, within
the assumptions that can be made for the chemical composition. Travis and Matsushima
suggest that this discrepancy be removed by allowing the above mentioned ratio, between
mixing length and scale height~ to vary with depth.
228
2. Other procedures
A different approach has been used by the meteorologists to model cloud
dynamics (Stommel 1947). It is based on the concept of thermals, and has since been
applied to a variety of other problems; it was Moore (1967) who brought it to the
attention of the astronomical community. A thermal is an organized cell which, llke the
eddy of the mixing-length treatment, exchanges heat and momentum with the surrounding
medium, but has also the property of gaining or loosing matter through entrainment or
turbulent surface erosion.
The only serious attempt to apply this concept to an astrophysical case was
made by Ulrich (]970 a, b), who used it to build a model of the solar atmosphere. He
had to overcome such difficulties as the absence of any ground level (from where the
thermals start on Earth), fragmentation (since the thermals are bound to move over
several scale heights) and radiative exchanges (the P~clet number becomes rather small
above a certain level). His model displays substantial overshooting well into the
photosphere, but one may wonder whether this is not due mainly to a simplifying assum-
tion he made for the correlation between the velocity of a thermal and its temperature
excess. Another consequence of this is that there is no sign change of the convective
flux in the stable region.
A similar treatment has been proposed recently by Nord!und (]976), in which the
medium is organized in two streams of rising and falling fluid. Those behave like the
thermals in the sense that they too exchange matter, heat and momentum, but here there
is no ambient medium. Dimensional arguments are invoked to write down the equations
governing the exchanges between the two streams. Solar models constructed with this
procedure are characterized by an appreciable penetration up to an optical depth of
T = 0.]; the quantitative predictior~of course depend on the choice of the dimen-
sionless parameters that occur in the equations.
229
III~ Direct approache s
In the past ten years a new approach has been explored thanks to the fast
computers with large memory storage that are now available: one can start directly
from the fluid dynamics equations, instead of replacing them by simpler ones that are
more tractable. Of course, it is not feasible yet to treat the most general problem:
as we will see, the solutions obtained to date all suffer from some kind of restriction.
But at least they help to build up an intuition which has been lacking so far. We
shall consider here only the nonlinear investigations; the main interest of the linear
studies has been to determine the critical conditions (Gribov and Gurevich ]957,
Stix |970, Whitehead 197]), but they cannot be used to predict the extent of penetration,
which is strongly related to the amplitude of the solution.
]. Bo_____ussinesq convection
The prototype of penetrative convection in the laboratory is the ice-water
experiment suggested by Malkus (]960) and performed among ethers by Townsend (1964) and
Myrup e~t ~. (|970). Water has the peculiar property of presenting a density maximum
at 4°C, so that a tank of water whose bottom is kept at 0°C will he conveetively
unstable up to the level of maximum density, and stable above. Veronis (1963) gave the
criterion for the onset of the instahility, which is of the finite amplitude type. There-
after Musman (]968) made the first quantitative predictions for the extent of penetra-
tion, using the so-called mean-field approximation (Herring 1963). The next improvemer
came from Moore and Weiss (]973), who solved the two-dimensional problem without furthe~
simplification.
A slightly different experiment is that of a fluid heated in its bulk by Joule
effect, in which the parabolic temperature profile creates two superposed domains of
respectively unstable and stable stratifications (Tritton and Zarraga 1967). This
experiment has been modelled by Strauss (|976)~ again with a two-dimensional code; his
results are similar to those of Moore and ~eiss (1973).
These two-dimensional studies are fairly ~ueeessful in predicting, at moderate
Rayleigh numbers, the mean temperature profile and thus the extent of penetration. But
it is doubtful that they can be extrapolated to the parameter range which is of astro-
physical interest (high Rayleigh numbers and low Prandtl numbers). Moreover, these
two-dimensional studies are unable to describe the time-dependent temperature fluctuations
which are observed at the boundary of the well-mixed region. These seem to be excited
randomly, and are essentially three-dimensional in their nature. The astrophysical
importance of these oscillations must not be underestimated: in the Sun, they would
occur just at the base of the photosphere and would generate gravity waves.
230
Another suggestion that the two-dimenslonal studies may be somewhat misleading
comes from the results obtained hy Latour ~t ~. (1977). They analyze the penetration
of convective motions from an unstable slab into the stable adjacent regions. The
solutions are expanded into orthogonal modes in the horizontal, and a finite differences
scheme is used in the vertical. In the special case of a single mode with a two-dimen-
sional planform, this procedure reduces to the mean-field approximation of Herring used
by Musman (]968). But one can also choose a three-dimensional planform representing, for
instance, prismatic cells of hexagonal base. The comparison of solutions derived with
the two types of planforms reveals that penetration is much stronger when the conveetlve
motions are allowed to be three-dimenslonal (Figure ]). In the simplest three-dimen-
sional case, where only a single planform is retained, the solutions are asyr~netrieal:
the overshooting occurs mainly on the side to which the centerline flow is directed in
the hexagonal cells. The mean temperature profile becomes symmetrical again when one
superposes two patterns of hexagonal cells with opposite centerline velocities;
remarkably enough, the total kinetic energy of the flow does not vary as one switches
from the one-mode solution to this two-mode solution. And the total extent of penetra-
tion too remains unchanged, if it is defined as the sum of the penetration depths at
either side of the unstable layer.
2. Convection in a stratified medium
In the laboratory (or Boussinesq) case, the extent of penetration is related to
the only natural length that characterizes the problem, namely the thickness of the
unstable layer. But what should one expect in a stratified medium, such as the solar
convection zone, where the unstable domain spans several density or pressure scale-
heights?
This question has not been answered yet. Toomre ~t ~. (]976) have studied the
penetration from the deeper convection zone of an A-type star; this zone is due to the
second ionization of helium, and it measures about one pressure scale height. Using the
technique mentioned above of truncated modal expansion, and retaining only one single
three-dimensional mode, they find that the motions penetrate up to one scale height
into the stable region below. More recently, they have established that the convective
motions penetrate also above, as far as to build a link between the deeper convection
zone and the upper one, which is caused by the ionization of hydrogen. But the situation
considered is admittedly not one of severe stratification, and these results cannot
be extrapolated to the Sun, for instance, Moreover, the solutions obtained so far are
all stationary, missing thereby the time-dependent character of penetrative convection
which may be of primordial importance.
Another difficulty with these drastically truncated modal calculations is that
they depend on the choice made for the horizontal wavelength of their single planform.
Fortunately, the results are not too sensitive to this parameter, which is felt mainly
in the horizontal heat exchanges; it does not play the dominant role of the mixing
length in the phenomenological approaches.
231
I
a
i I "~I # l
| I
,,f,, 'ii
-"t!
t - -
o i"'%
l ! 1 1 ! 1
V
b
: ' l . \ / - I / ! I l ~'.
. . . . . . . . '~ I / ! I t " - ~ - -
\ 1 ] \ II
, J ~f
c / ' ' k ~ W ; % 1
i*x e l
. . . . . . 4 . , I \ i l l ' , . . . . . . o . . . . : ' * / \ ~I
; I
0 ~ • " ° " "
i *
o 1 z
Figure I . Modal solutions for penetrative Boussinesq convection.
The unstable layer, which extends in depth from z = O to z = ], is imbedded in an infinite stable domain from which only a fraction of thickness ~z ~ 2 on each side is shown here. The same Raylelgh number R ~ |0 s characterizes the stability and the instability of the three superposed layers (it corresponds to about thousand times critical). The amplitudesof the vertical velocity, W, and of the temperature fluctuations,G, are displayed as functions of z. Figure la shows a single two-dimensional mode (which may be visualized as a horizontal roll), figure Ib a single three-dlmensional mode of hexagonal horizontal planform, and figure |c two non-interacting three-dimensional modes of that same geometry. In all cases, the value of the horizontal wavenumber is 2, and the Prandtl number is I. Notice that the overshooting into the stable surroundings is much more pronounced with the three- dimensional motions.
232
The only way to avoid any extra assumption would of course be to directly
integrate the basic equations in three-dimensional space. This has been done hy
Graham (]975), whose latest results are presented in this colloquium. But even the
most powerful computers which are presently available set a rather low limit on the
number of grldpoints that can be used. This in turn fixes the highest Rayleigh or
Reynolds numbers that can be reached: typically one hundred times critical. There is
thus still a very long road to go before meeting the numbers characterizing a stellar
convection zone, but in the meanwhile these numerical experiments are very useful as
a workbench to test the various approximations that have been proposed.
IV. Observational tests
It is relatively easy to confront theoretical predictions of Boussinesq
penetrative convection with laboratory experiments. But, as we were already reminded
by K.H. Bbhm, the comparison of astrophysical models with stellar or solar obser-
vations is more delicate, for the physical parameters that can he determined often
depend on other factors than just the properties of convection.
For the stars, one is forced to rely on the few gross parameters which can
be observed. In principle the classical tests for probing the internal structure
of a star may be used to determine the extent of the regions which are in nearly
adiabatic stratification, at least once their location is roughly known. These tests
can complement each other: the apsidal motion test (see Sehw arzschild ]958) is more
sensitive to the overall mass concentration in a star, whereas the pulsational period
of a variable star (see Ledoux and Walraven ]958) depends more on the stratification
of its envelope. There is even a s]ight hope to interpret the properties of the dynamical
tide in a close binary system, which are closely related to the size of the quasi-
adiabatic core of the two components (Zahn ]977).
But the most promising tests are probably those which sense the inhomogeneitles
in chemical composition. Prather and Demarque (1974) and Maeder (]975b, ]976) have
included some amount of overshooting in their calculations of evolutionary stellar
models. They find that the evolutionary tracks, lifetimes and cluster isochrones all
are appreciably modified by an increase of the convective core. Prather and Demarque
obtain the best fit between their theoretical isoehrones and the cluster diagram of M 67
for a penetration depth of about ]0% of the pressure scale height; Maeder's value is
slightly less and he uses it to calibrate his non-local mlxing-length procedure.
The thickness of a convective envelope (together with its penetrative extension)
may be inferred from the abundance of elements which undergo nuclear destruction at
moderate remperatures, such as llthium~ beryllium and boron. In the case of the Sun,
additional information can be gathered from the composition of the solar wind (Boehsler
and Geiss ]973). But when interpreting such observations, one must keep in mind that
other instabilities than convection may also lead to a thorough mixing of the stellar
material.
233
It looks at first sight as if the Sun should be the ideal object on which to
check the theories of penetrative convection. The solar atmosphere becomes eonvectively
unstable below optical depth T = 2, which means that the overshooting motions should
occur in the photosphere and thus be visible. The difficulty however is to distinguish
in the observations of Doppler-shifted lines what is due to waves or oscillations, and
what is due to genuine penetrative convection. The accuracy of correlation measurements
between velocities and temperature fluctuations is still not sufficient to permit the
separation of both types of motions (for a recent and complete review on such measure-
ments, see Beekers and Canfield ]976). And one encounters the same problem when it
comes to the interpretation of the non-thermal energy flux: the convective (enthalpy)
flux is blended with the flux of kinetic energy, which is carried by both convection
and waves. But the solar observations are rapidly progressing toward better precision
and spatial resolution, and one may hope that these questions will he settled in the
not too distant future.
284
Biblio~"~aphy
Beckers,J.M., CsLufield,R.C. 1976, Physique des Mouvements dans les Atmos- pheres Stellaires, R.Cayrel and M.Steinberg eds., CNRS, p.207
Bochsler,P., GeisstJ. 1973, Solar Phys. 32, 3
Cogan,B.C. 1975, Astrophys. J. 201, 637
Graham,E. 1975, J. Fluid Mech. 70, 689
Gribov,V.N., Gurevich,L.E. 1957, Soviet Phys. JETP 4, 720
Herring,J.R. 1983, J. Atmos. Sci. 20, 325
Latour,J., Toomre,J., Zahn,J.P. 1977 (in preparation)
Ledoux,P., Walraven,Th. 1958, Hsndhuch der Physik, t.51, p.353 (Springer)
Maeder, A. 1975a, Astron. & Astrophys. 40, 303
Maeder, A. 1975b, As,ton. & Astrophys. 43, 61
Maeder, A. 1976, As,men. & Astrophys, 47, 389
Malkus,W.V.Ro 1960, Aerodyn. phenomena in stellar atmosph., p.346 (Thomas edit)
Moore,D.W. 1967, Aerodyn. phenomena in stellar atmosph., p.a05 (Thomas edit.)
Moore,D.R., Weiss,N.O. 1973, J. Fluid Meqh. 61, 553
Musman,S. 1968, J0 Fluid Mec h, 31, 343
Nordlund,A. 1976, Astron. & Astrophys. 50, 23
Prather,M.J. Demarque,P. 197~, Astmophys. J. 193, 109
Sehwarzschild,M. 1958, Structure and evolution of stars, p.iA6 (Dover)
Shaviv,G., Salpeter,E.E. 1973, Astrophys. J. 184, 191
Spiegel,E.A. 1963, Astrophys. J. 138, 216
Stix,M. 1970, Tellus 22, 517
Stommel,H. 1947, J. Meteorol. 4., 91
8t~auss,J.M. 1976, Astrophys. J. 209, 179
Toomre,J., Zahn,J.P., Latour,J., Spiegel,E.A. 1976~ Astrop~s. J. 207, 545
Townsend,A.A. 1964, Quart. J. Ray. Meteorol. SOs. 90, 248
Tmavls,L°D~, Matsushima,S. 1973, Astrophys. J. 138, 216
Tritton,D.J., Zarraga,M.N. 1967, J. Flui d Mech. 30, 21
Ulrleh,R.K. 1970a, Astrophys. & Space Sci. 7, 71
Ulrlsh,R.K. 1970b, Astrophys. & Space Soi, 7, 183
Veronis,G. 1963, Ast~oph~s° J. 137, 641
Vitense,E. 1953, Z. fur Astroph. 3~2~ 135
9~itehead,J.A., Chen,M. 1970, J. Fluid Mech. 40, 549
9~itehead,J.A. 1971, Geophys. Fluid Dynamics 2, 289
Zahn,J.P. 1977, AstTon% & Astrophys. 57, 383
THE BOUNDARIES OF A CONVECTIVE ZONE
A. MAEDER
Geneva Observatory
It is worth noting that various definitions for the boundaries of a convective
Zone may be considered. Their importance for stellar evolution is very unequal.
I. A level r N is defined at the place where the Nusselt number N = I, the Nusselt
number being the ratio fo the total heat transfer in the turbulent state to that
in absence of turbulence (Spiegel, 1966). Thus, r N is the level reached when the
Contribution of convection to the energy transport changes of sign. If there is a
negligible transport by sound waves, the usual equation of energy transport in
stellar structure may be written :
T G Mr T I V t a d
M 4 = r ~ P N r
where N may be determined by an iterative process in a non-local form of the mixing-
length theory. For example, at the edge of a convective core, there are usually
2 levels rNi and rN2, the first one marks the transition from the convective zone to
the overshooting zone (convective motions with N < I), while the second one marks
the transition from the overshooting zone to the radiative zone (NE|). A frequent
but unsatisfactory treatment in stellar models is to consider rNl = rN2.
2. The level r T is defined at the place where the mean temperature excess AT of
a fluid element vanishes. Thus, at rT, the forces acting on the elements also vanish
and this level may be called the dynamical edge of the core. For subsonic convec-
tion, the levels r T and r N are evidently equal.
3. Following Shaviv and Salpeter (1973), a level r~ may be defined at the place
where ~ ffi O, where 6 is
(dT / dr )
236
~T/ ~r is the gradient in the surrounding medium, in the non-local formalism adopted
(Maeder and Bouvier, 1976) it is a non-local quantity. It was shown that the tempera-
ture fluctuations of the turbulent medium are able to make 6 ~ 0 at many places in
convective cores. So, this boundary r6 has no true meaning.
4. The level r is defined at the place given by Schwarzschild's criterion, i.e.j e
at the place where e = O, with
(dT / dr)ra d E = -- ]
(dT / dr)ad
Formally, r E and rN] do not coincide, e may be written
(dT / dr)ef f = N l ,
(dT / dr)ad
where (dT/dr)ef f is the fictious gradient, necessary if all the energy was carried by
radiation in the convective zone. In the calculated models, this gradient is slightly
subadiabatic for r + rN]. Thus, r E lies slightly below rN] , but due to the very small
deviations from adiabatieity, these 2 levels are essentially undiseernible at the
edge of a convective core.
5. A kinematical edge r v may be defined at the level, where the velocity of a
mean fluid element becomes zero. This level evidently coincides with the level rN2
defined before. It is this level which determines the extention of the zone of con-
vective mixing.
In a convective core, the significant levels are, in order of increasing dis-
tance from the centre, re < rN! = rT < rv = rN2. This order will be reversed at
the bottom of a convective zone, provided the convection is adiabatic there.
Numerical models show that the distance of overshooting (rN2 - rNi) /
expressed in terms of the mixing length is very insensitive of the various efficiency
parameters of convection. Comparisons with observations of open star clusters show
that an overshooting amounting to about 7 % of pressure scale height is likely to
occur in upper MS stars.
Bibliography
Maeder,A., ]975, Astron. Astrophys. 400, 303
Maeder,A., Bouvier, P., 1976, Astron ,. Astrophys. 50, 309
Shaviv,G., Salpeter, E.E., 1973, Astrophys.J. ]84, 19]
237
CONVECTIVE OVERSHOOTING IN THE SOLAR PHOTOSPHERE;
A MODEL GRANULAR VELOCITY FIELD
Ake Nordlund
NORDITA
Blegdamsvej 17, Copenhagen, Denmark
The solar granulation, with its horizontal temperature fluctuations,
and its associated velocity field, is a consequence of overshooting
convective motions. Theoretical estimates of the magnitude of the
temperature fluctuations and mass fluxes involved were obtained in a
recent paper (Nordlund, 1976, Astronomy & Astrophys. 50, 23). Here, a
simple model of the instantaneous granular velocity field is presented,
and the effects of this velocity field on photospheric spectral lines
are described.
The vertical velocity component is modelled by a simple, parameterized
expression:
pv z (x,y,z) = ~o(~2/2) sin (~x/d) sin (~y/d)e-Z/Zo/(l+e-Z/Zo).
The three parameters specify the amplitude (4o) of the vertical mass
flow, the horizontal size (d) of a model granule element, and a typical
scale (Zo) for the vertical variation of the mass flux.
The horizontal velocities are determfned from the vertical velocity by
the condition of continuity. Since the granular motions are slow and
approximately anelastic, the condition of continuity can be well approxi-
mated by
div (~v) = O.
This simple, quadratic pattern'of alternating vertical velocities, with
the corresponding horizontal velocities determined by the condition of
continuity, represents a crude model of the instantaneous granular
velocity field. Some important conclusions are possible, however, using
this model to fit the non-thermal broadening of photospheric spectral
lines:
The three parameters (~o' Zo" and d) can be used to fit the observed
238
half widths of a set of photospheric spectral lines, at two different
center to limb distances. The values 4o=0.35 kgm-2s -1, Zo=100 km,
d=1500 km produce a reasonably good fit. The velocity along a line of
sight varies along the llne of sight in the model granular velocity field.
When the line of sight velocity varies, the line absorption coefficient
is shifted in and out of the intensity profile of the line, and an
increased absorption results. With the given parameter values, this
effect is negligible for vertical sight lines. However, because of the
increase of typical optical scales for increasing angles of inclination,
the variation of the velocity along a line of sight becomes important
with growing angle of inclination.
Due to this effect, the observed center to limb behavior of the equiv-
alent widths of the spectral lines is also reproduced, without the need
for classical microturbulence. With a classical macro/microturbulence
model, the same behavior could have been achieved only by assuming a
depth-dependent and anisotropic macro- and micro-turbulence.
Turbulent motions on scales smaller than granular are certainly gener-
ated by the larger scale, granular motions. However, the (apparently
anisotropic) center to limb effects on the equivalent widths, which
correspond to classical microturbulent velocities of the order 1 - 2
kms -1, can be explained entirely as a consequence of the granular scale
velocity field.
In conclusion, this study shows that the observations of llne broadening
and line strength are consistent with a situation where the amplitude
of the velocity field is at maximum on granular scales, where the motion
is being driven by convection, and with the amplitudes of smaller scale
motion being progressively smaller and smaller.
THERMOSOLUTAL CONVECTION
Herbert E. Huppert
Department of Applied Mathematics and Theoretical Physics, Silver Street,
Cambridge CB3 9EW England
I. Introduction
The aim of this contribution is to survey a relatively new form of convection,
which is very easy to investigate in the laboratory, plays an important role in
the oceans and many chemical engineering situations and is likely to prove
essential in the understanding of some areas of stellar convection. Thermosolutal
convection (or double-diffusive convection as it is often called) owes its exis-
tence to the presence of two components of different molecular diffusivities which
contribute in an opposing sense to the locally vertical density gradient. The
different sets of components studied have covered a wide range including
a. heat and salt - two components relevant to the oceans and a number of
laboratory experiments;
b. heat and helium - two components relevant to certain stellar situations;
c. salt and sugar or two different solutes - components useful for laboratory
investigations; and
d. heat and angular momentum - components which are likely to be relevant to
some stellar situations. In each case, the most rapidly diffusing component has
been listed first. Thus, while in the paper the terminology of heat and salt
will be used, different components can be envisaged by reference to the above
examples.*
Aside from its many applications, thermosolutal convection has received
considerable attention because it can induce motions very different from those
predicted on the basis of purely thermal convection, that is, convection with
only one component. In particular, diffusion, which is known to have a stabil-
izing influence in thermal convection, acts in a destabilizing manner in thermo-
solutal convection. By the action of diffusion, instabilities can arise and
vigorous motion take place in situations where everywhere throughout the fluid
heavy fluid underlies relatively lighter fluid.
*Ed. Spiegel paraphrases this by the maxim: for salt, think helium. Is this his secret of gourmet cuisine?
240
An example of naturally occurring thermosolutal convection which highlights
its counter-lntuitive nature is afforded by Lake Vanda. Situated in Antarctica,
approximately 5 km long, 11 kmwlde and 65 m deep; Lake Vands has a permanent ice
cover of 3 - 4 m. Just below the ice the water temperature is 4.7°C and t~e
temperature increases with depth, often in a step-like fashion, until at the
bottom the temperature is 24.8°C (Figure l). There is a corresponding increase in
density, from 1.004 Em cm -3 just beneath the ice to a maximum of I.I0 Em cm -3
- - J
52
54~
-------~S-- --B 62--
TEMPERATURE °C
Figure 1. The temperature profile in Lake Vanda as a function of depth indicated in meters (taken from Huppert and Turner, 1972). Note the existence of a layer of uniform temperature (7.6°C) between 14.2 and 37.9 m which has been partially omitted from this figure.
at the bottom, due to the presence of salt. Vigorous convective motions take
place in the upper portions of the lake, maintaining the regions of uniform
properties, which are the hallmark of thermosolutal convection. Any model of the
lake based solely on considerations of temperature, or density, is doomed to
failure. Only by incorporating thermosolutal effects can a successful model be
derived (Huppert and Turner, 1972).
The plan of this survey is as follows. The two fundamental mechanisms of
thermosolutal convection are described physically in §2. These form the foundation
of the quantitative analysis of a suitable Rayleigh - Benard convection problem,
whose linear and nonlinear aspects are discussed in §3. The mechanism by which a
series of layers and interfaces can be maintained, as in Lake Vanda, is considered
in §4. In §5 a few ways in which a series of layers and interfaces can originate
are described. The structural stability of such a series is investigated in §6.
Conclusions are presented in §7.
241
2. The Fundamental Mechanisms
The first of the two fundamental mechanisms of thermosolutal convection occurs
in a fluid for which the temperature and salinity both decrease with depth, while
the (overall) density increases with depth, as indicated in figure 2a. In this
(a)
v
T S 0
?
(:b) T S p
Figure 2. Typical temperature, salinity and density profiles for: (a) the finger situation and (b) the diffusive situation, including a sketch of the motion of a disturbed parcel of fluid.
statically stable situation, the dynamic instability that arises can be examined
by considering a parcel of fluid displaced vertically downward. Initially warmer
and saltier than its surroundings, the parcel comes to thermal equilibriumbefore
its excess salinity can be diffused. It is thus heavier than its surroundings and
continues to descend. The ensuing motion consists of adjacently rising and
failing cells, interchanging their heat, and to a much smaller extent their salt,
much llke a heat exchanger. The kinetic energy of the motion is extracted from the
potential energy stored in the salt field. Experiments indicate that in typical
conditions, the plan form of the cells, called salt-fingers, is squarish with a
horizontal length scale of {(~g/KTV) (d~/dz)} -1/4, where ~ is the coefficient of
thermal expansion, g is the acceleration due to gravity, <T is the coefficient of
thermal diffusivity, v is the kinematic viscosity and ~d~/dz) is the mean
(positive) vertical temperature gradient. This length scale, discussed further
in the next section, represents a balance between dissipative effects acting pre-
ferentially on small scale motions and the increasing inefficiency of diffusing
heat over ever larger horizontal distances.
The second fundamental mechanism occurs in a fluid whose temperature,
salinity and (as before) overall density increases with depth, as indicated in
figure 2b. Displacement of the typical fluid particle vertically downwards now
places it in a warmer, saltier and more dense environment. As before, the thermal
field of the parcel begins to equilibrate with its surroundings more rapidly than
does the salt field. The parcel is then lighter than its surroundings and rises.
But due to the finite value of the thermal diffusion coefficient, the temperature
field of the parcel lags the displacement field and the parcel returns to its
242
original position lighter than it was at the outset. It thus rises through a
distance greater than the original displacement, whereupon the above process
continues and leads to a series of growing oscillations, or overstability, which is
resisted only by the effects of viscosity. This oscillatory form of motion has
been experimentally documented (Shlrtcllffe, 1969) and some of its characteristics
explored by Moore and Spiegel (1966) in an imaginative paper which develops an
analogy between this form of thermosolutal convection and the motion of a flaccid
balloon in a thermally stratified fluid. For sufficiently large temperature
gradients, steady motion can occur because a large temperature field can overcome
the restoring tendency of the salinity field. The criteria at which this first
occurs are discussed in the next section.
3. The RaTleish-B~nard problem
The fundamental mechanisms of the previous section form the basis of all
quantitative calculations. The most straightforward and hence frequently considered
calculation relates to the extension of the classical Rayleigh-B6nard problem:
what is the motion of a fluid confined between two horizontal planes across which
there is a temperature difference AT and a salinity difference AS? A major
motivation behind such studies is the expectation that just as the purely thermal
problem has successfully explained a variety of phenomena, as summarised by
Spiegel (1971), so also will the thermosolutal extension. And indeed this expect-
ation has already been partially fulfilled.
All calculations so far performed have essentially assumed two-dimensional
motion, dependent on one horizontal co-ordinate, x, and the vertical co-ordinate z.
Considering this restriction and non-dimensionalising all lengths with respect to
D, the separation between the planes, time with D2/KT and expressing the v e l o c i t y
q* in terms of a s t r e a ~ u n c t i o n ~ by
q* ~ (KT/D) (Bz~ , - Bx~), (3.1)
the temperature T* by
and the salinity S* by
T* m T O + AT (i - z + T) (3.2)
S* = S + AS (I - z + T) (3.3) o
where T and S are constant reference values, we can write the governing o o
Boussinesq equations of motion as
o-Iv2~t~-o-Ij(~,V2~) = -~3 x T+R s ~xS+V4~, (3.~)
~t T + ~x ~2-J(~,T) = V2T (3.5)
~t S + ~X~ -J(~,S) =TV2S (3.6)
where the Jacobian, J, is defined by
243
J(f,g) = ~xf~z g - ~zf~x g. (3.7)
We have also assumed the linear equation of state
P* = Po (I - aT* + 8S*), (3.8)
where ~ and 8 are taken to be constant, in the expression for the body-force term
in (3.4).
Four non-dimensional parameters appear in (3.4)-(3.6): the Prandtl number
~=V/KT; the ratio of the dlffuslvlties T=KS/K T, where K S is the saline dlffusivity,
which is less than KT; the thermal Rayleigh number ~ = ~gATD3/(KTV); and the
saline Rayleigh number R S = 8gASD3/(KTV).
To these equations must be added a series of boundary conditions. Mainly
because of their mathematical simplicity, the most frequently used conditions are
those obtained by assuming that both horizontal planes are stress free and per-
fectly conducting to both heat and salt. That such an assumption is a reasonable
one for a model which is to apply in the interior region of a star can be fairly
well defended (and often has been). One aspect of the defense incorporates the
belief that the use of other, possibly more realistic, conditions is likely to
lead to only slight quantitative differences. Indeed, Huppert and Matins (1973),
in a series of experiments described below, give an example of this. Mathematically,
free-free boundary conditions, as the above are often loosely called, are expressed
by ~=~2zz $ = T - S = 0 (z = O,I). (3.9)
a) Linear Disturbances
The equations governing infinitesimal motions are obtained by deleting the
nonlinear Jacobian terms of (3.4)-(3.6). The resulting differential system has
constant coefficients and a solution in terms of the lowest normal modes
~(x, z, t) = 9 ° slnwax ) (3.10a)
T(x, z, t) = T O cos~ax I ePtsin~z (3.10b)
S(x, z, t) = S o cos~ax (3.10c)
leads to the dispersion relationship
where
p3+(c+T+l)k2p2+ {(O+T+l)k 4 - ~2c~2k-2(RT-Rs)}p
+oTk6+~2~a2(Rs-T~) = O,
(3.11)
k 2 = ~ 2 ( 1 + a 2 ) . ( 3 . 1 2 )
S i n c e ( 3 . 1 I ) i s a c u b i c w i t h r e a l c o e f f i c i e n t s i t s z e r o s a r e e i t h e r a l l r e a l o r
c o n s i s t o f one r e a l r o o t and two compl ex c o n j u g a t e r o o t s . E x c h a n g e o f s t a b i l i t i e s ,
w h i c h a r i s e s when one o f t h e r o o t s e q u a l s z e r o , i . e . p~ O, o r e q u i v a l e n t l y
~t~0, occurs first for ~= 2 -1/2 and
244
- RS/T + 27~4]4 (3.13)
O v e r s t a b i l l t y , wh£ch a r i s e s w h e n the pa r r of complex-conjugate roo t s c rosses the
imaginary axis, that ~s Pr = O, occurs first for the same wavenumber, ~ ffi 2 -1/2
and ~ ffi (O+T)Rs/(~+I) + 27~4(l+r)(l+ro-l)/4. (3.14)
I0000
R S
5000
~" ~I~ (3.13)
(3.74)
I I | I I -I0000 -5000 ~ 5000 IOO00
-5000
-I0000
Figure 3. The linear stability results for 0 = i0 -I, T = I0 -I. Along (3.13) one of the temporal elgenvalues, p, is identically zero; along (3.14) two of the (complex conjugate) p are pure imaginary; and along C two complex conjugate elgenvalues coalesce on the real axis.
In the RS, ~plane the linear stability boundary is a combination of (3.13)
and (3.14), as depicted in figure 3, which presents a complete smmnary of the
linear results for ~ = 10 -I, T ffi I0 -I.
An investigation of the fastest growing mode evaluated by linear theory has
been presented by Balnes and Gill (1969), although owing to the linear constralnt~
the results are of at most academic interest. ~n agreement with the result
245
originally calculated by Stern (1960), Balnes & Gill find that in the salt finger
region, the unstable portion of the third quadrant in figure 3, the wavelength of
the disturbance of most rapid growth is much smaller than the marginal value,
23/2~, except very close to the marginal stability llne (3.13). This is in accord
with the physical description of §2, which indicates that a thinner mode acts as a
be~ter heat exchanger.
The following experiment is an example to which results based on linear theory
can be profitably applied. A uniform layer of hot, salty water is carefully placed
over a unlform layer of relatively colder, fresher water. The temperature and
salinity distributions across the initially paper-thln horizontal interface evolve
by diffusion, leading to a situation similar to that considered at the beginning
of this section. Equating the central gradient of the diffusing distribution to
the temperature and salinity gradients which appear in the marginal stability
criterion (3.12), Huppert & Martins (1973) calculate that under typical laboratory
conditions, specifically ~, T-IRs >> 27~4/4, salt-fingering should occur if
~S/C~T > T 3/2, (3.15)
where AT, AS are the initial temperature and salinity differences across the inter-
face. The results of a series of experiments, conducted with a variety of pairs of
solutes with different values of T, are in very good agreement with (3.15).
b) Nonlinear Disturbances
Fully nonlinear, but two-dimensional, investigations have been conducted by
Straus (1972) for ~, R S < 0 and by Huppert & Moore (1976) for ~, R S > 0.
The former reduces the complexity of the governing equations by assuming that
T ÷ O, R S ÷ 0 with Rs/T fixed.
In this limit, the inertial terms in the momentum equation and the adveetlon of
the disturbance temperature, but not the disturbance salinity are negligible.
Straus calculates the solutions for a variety of different values of u as
increases from the marginal stability value. + He also tests the linear stability
of these solutions. His principal conclusions are that as ~ increases, both
extremes of the range of stable wavenumbers increase significantly. Beyond a
specific ~, dependent upon Rs/T, the range of stable wavenumbers no longer
includes the wavenumber at marginal stability. The form of motion with the most
stable wavenumber corresponds to the long thin cells of the (somewhat different)
experiments and is close to that wavenumberwhieh leads to a maximum salt flux.
These results are interesting and suggestive, hut the two-dlmensional and small T
assumptions might limit the generality of some of the specific conclusions.
The calculations for ~, R S > 0 of Huppert &Moore aim to follow the form of
solutions as ~ increases for fixed RS, ~ and T. Drawing on the results of a
+The marginal stability point is supercrltical, that is, there is only the con- ductive solution for ~ less than the marglnal value.
246
number of numerical experiments, they put forward the following general con-
clusions. There are two rather different branches of solutions. Along one branch,
which may be initiated either subcritically or supercritically from the linear
oscillatory critical point given by (3.14), the solutlons are oscillatory. In
R~
4 ~
i
3(I0(I
10000
RT
90OO
8000
2"0 4.0 6 0 X 0
.Ms
, t i _ _ 1 21o ~ ' 4!o 6.0
M e
Monotonic
...! |0~0
Figure 4. The stabl~ solutlon branches in a thermalxRayleigh number, maximum Nusselt number at z 0 plane for (a) ~ = I, T = i0 -~, R s = I04 and (b) 0 = I, T = I0-I, R s = I07/2. Where relevant both local maxima are shown and the rapidly oscillating curve indicates that no definite maximum can be assigned to the aperiodic motion in this range. The dots indicate the transitions that can take place between the oscillatory and monotonic branches.
general, as R T increases, a transition point is attained at which the solution
changes from being relatlvely simple to being fundamentally more complicated, yet
247
still periodic. At a yet larger value of ~ another transition takes place, a
transition to aperiodic motion. Finally, beyond a still larger value, stable
aperiodic solutions cease to exist and only solutions which are ultimately steady,
and make up the second branch of solutions, can he found. The two branches for
particular values of o, T and R S are graphed in Figure 4. Steady motion exists
in a thermosolutal fluid because of the tendency of the temperature fleld to cause
an almost isosolutal core to be produced, confining all solute gradients to thin
boundary layers. The temperature, salinity and density fields for a typical steady
solution are presented in figure 5. For given o, T and RS, there is a minimum
T S p
Figure 5. The temperature, salinity and density fields for R T = 10 4 , O = I and T = 10-2.
= 10700,
value of ~ for which steady motion exists and one of the aims of the investigation
by Huppert & Moore is to calculate this minimum. For details of this result and
others the reader is referred to the original paper. The major finding is that
for sufficiently small T, steady convection can occur for values of ~ less than
that obtained from the llnear stability boundary (3.14) (and thus much less than
the value at which linear theory suggests non-oscillatory convection occurs).
The specific results obtained by Huppert & Moore, primarily by numerical
computation, were limited to 14 different values of o, T and ~. Guided by these
calculations, M. R. E. Proctor and independently Huppert & Gough are currently
attempting to obtain analytic expressions for various limiting cases, in
particular, the astrophyslcally relevant situation T ÷ O.
4. LaTers and Interfaces
As the experiment of Huppert and Manins, described in the previous section,
progresses, the fingers and the interface between the two layers extend in length.
Within the interface there is a strong background gradient of density~ and the
interface is hence an ideal site for internal waves, which are generated by dis-
turbances induced by the salt-finger motion. These internal waves cause the
fingers to sway back and forth, like a banner fluttering in the breeze. If the
fingers become too long, this motion causes them to loose their vertical coherence,
248
or break, much like a long strut subjected to an oscillatory transverse load.
Guided by the Navier-Stokes equations of motion rather than the analogies used
above, Stern (1969) argues that for very small T an established field of salt
fingers is limited in length by the requirement that
8FsI ( ~ j < C, (4.1)
where F S is the salt flux through the interface, to be discussed below, d~/dz is
the mean temperature gradient and C is a constant of order unity. Thus for a
fixed salt flux, the length of the salt fingers and the thickness of the inter-
face increase until equality in the constraint (4.1) is reached.
At the two edges of the interface the salt fingers impart an unstable
buoyancy flux on the adjacent layers, which causes the layers to convect.
Developing an analogy with purely thermal convection at high Rayleigh number,
Turner (1967) argues on the basis of dimensional analysis that the relationship
between the saline Nusselt number and the Rayleigh number is of the form
Nu$ ~ F S D/(KSAS) = fF(c~T/~AS, ~,T)R~/3 , (4.2a~b)
where D cancels in (4.2h) and fF is some function of its three argments. Also,
argues Turner, the resultant heat flux, FT, is related to F S by
C~FT/~F S = gF(~AT/BAS, ~, T) . (4.3)
Turner obtained experimentally the explicit form of fF and gF for heat and salt
in water and found that for the range of edT/BAS considered, 2 < ~AT/~AS < I0, fF
is such that as 0~T/~S ~ I the salt flux is approximately 50 times as large as
if the same salinity difference were maintained across a region hounded by two
solid boundaries, and fF decreases slowly with increasing ~AT/~AS. The constancy
of gF indicates that~ independent of o~T/~AS, a constant fraction of the potential
energy released by the salt field is supplied to the temperature field. Linden
(1973) experimentally evaluated fF and gF using a different technique and deter-
mined the same fF but a different, yet still constant, gF" Which result is in
error is still not known.
If a uniform layer of hot, salty water is placed below a uniform layer of
relatively colder, fresher water, heat and salt are transferred upwards through
the thin interface primarily by diffusion, with the resulting unstable buoyancy
flux driving convection in the layers as before. For this case, known as the
diffusive situation, the relationships equivalent tO (4.2) and (4.3) are
Nu T E F T D/(KTAT) = fD(6AS/~AT, O, T)~/3 (4.4a,b)
and
6FS/a~ T = gD(6~S/~T , ~, T) (4.5)
for some functions fD and gD" Using the results of another series of experiments
by Turner (1965) with heat and salt in water , Huppert (1971) suggests that for
249
thi8 partlcular case
Nu T = 3.8 (SAS/cU~T) -2 1~/3 (4.6)
=/l 85 0 85(B S/ T) I < S S/ T < 2 (4.7)
8Fs/~F T ] tO.15 2 < 8AS/~AT. (4.8)
A deductive model of the diffusive interface has not as yet been obtained, though
a number of ad hoc arguments, some of them described by Turner (1974~ lead to
formulae in soma agreement with equation (4,8). A few experiments with two solutes,
rather than heat and salt t have been performed. For both salt-fingering and dif-
fusive cases, all the experiments indicate a constant value of the flux ratios,
(4.3) or (4.5), for a large range of BAS/o~T. The deduction explanation for this
fact is awaited and is one of the major theoretlcal prizes still to be gained. To
be more general, a major advance in the subject would be achieved on building a
mathematical model which predicts the heat and salt fluxes for all values of
and T.
Notwithstanding our current lack of knowledge, the important conceptual
statement that can already be made is that the above mechanisms can be extended to
include a series of convectlng layers, separated by fingering or diffusive inter-
faces, as the situation demands. This is the explanation of the profiles of Lake
Vanda, those obtained under the drifting Arctic Island T3 displayed in figure 6,
and of many other oceanographic examples. The main aim of this review is to
support the suggestion that a process which occurs so readily on earth must also
play a fundamental role in stellar convection.
5. The Buildin~ $f Layers
As suggested in the previous section, a series of convecting layers separated
by thin interfaces can be easily constructed in the laboratory by carefully placlng
one layer on top of another. They arise in natural situations by a large number of
different mechanisms. A few situations have received a fair amount of quantitative
analysis and will be described here.
Consider a fluid with a uniform salinity gradient increasing with depth
subjected to a constant heat Tlux, F H, at its base. Initially a growing overstable
oscillation occurs. Shortly thereafter a convectlng layer develops adjacent to the
bottom because hot fluid rising from the base can penetrate only a finite height
into the stable salinity gradient. As time proceeds the height of this layer, h,
grows according to h = (2Bt)~/Ns, (5.1)
2 dS where B ffi - ~gFH/(Pc) and N S ffi - gS"~ • (5.2)
The relationship (5.1) is a consequence of the conservation of heat and salt and
the experimentally observed fact that the density (but not the temperature or
salinity) is continuous across the top of the layer (Turner, 1968).
This growth does not, however, continue indeflnltely. There is a thermal
250
3 0 0
3 2 0
E
3 3 0 - -
t
340 1
Temperature (degrees Celsius) - - 1.0" , ,,O"
(o) Typicol remperoture Prof/le Sect/on
- - ( b J
Section o.f Profile Recorded ot High Gain
0,01" C
l< - - O . I 'C - 4
150
E --250
C~
f~ I
500
350
Figure 6. The temperature profile under the Arctic Ice Island T-3 (taken from Neal e_.t_tal, 1969).
boundary layer ahead of the advancing front and when a critical Rayleigh number,
Rc, is reached the region above the first layer ceases to grow. This can be
calculated to o c c u r when h = ---(~<~RcB31K)I/41N ~ . (5 .4)
The second layer then grows, the thermal boundary layer ahead of its advancing
front becomes unstable~ and so in time a series of layers is built up. Heat and
salt are transferred across the interfaces~ in the manner of the last section~ and
in the course of time some of the lower interfaces disappear because the density
difference across them tends to zero. A combined theoretical and experimental
investigation of the depths of all the layers and time scales for their formation
is currently being undertaken by Huppert & Linden (1977).
This heating a salinity gradient from below mechanism, but acting in reverse,
that is, cooling a sallnity gradient from above, produces the layers under T3
shown in figure 6.
The above mechanism involes an entirely one-dimensional model. Many natural
phenomena can be expected to be two - or even three-dimensional. As yet such
extensions are only in the early stages of investigation.
251
In a series of qualitative experiments, Turner &Chen (1974) show that even a
relatively small disturbance applied to one side of a thermsolutal fluid, thereby
introducing horizontal inhomogeneities, can have significant effects. For example,
the raising of a small flap at the wall of a vessel containing a thermosolutal
fluid induces, in the salt-flnger situation, a rapidly propagating wave motion
which is accompanied by the initiation of convection over large horizontal distances.
In the diffusive situation, the disturbance propagates horizontally more slowly and
can cause local overturning which leads to the initiation Of salt fingers.
Another example concerns the introduction of a small source of warm, salty
water into a uniform layer of relatively colder, fresher water of exactly the same
density. Fluld which conwnences to fall diffuses its heat to the surroundings,
thereby beco~ng heavier. Nelghbourlng fluld, having been warmed, is relatively
lighter and rises. Large vertical motions, both upwards and downwards, in the
form of plumes result. As the motion proceeds, the density difference between
each pl-me and the surroundings increases. Thus, starting only with fluid of
uniform density, solely by diffusion both heavier and lighter fluid are formed.
If the plumes impinge on horizontal boundaries, they spread out and build a series
of layers and interfaces through the entire vertical extent of the fluid.
A final example is afforded by introducing an insulated sloping boundary. In
a stably stratified fluid, stratified with respect to only one component, such a
sloping boundary induces a thin slow steady upwards motion adjacent to the boundary.
The flow provides a convective density flux equal to the diffusive flux in the
interior and allows the isopycnals, horizontal in the interior, to bend near the
boundary and intersect it at right angles. In a fluid stratified with respect to
two components, the curves of constant T and constant S must intersect the boundary
at right angles and no steady boundary layer flow can accomplish this. Alternatively,
it is not posslble for a single boundary-layer to give rise to convective T and S
fluxes which balance the unequal diffusive T and S fluxes in the interior. Instead,
a series of layers and interfaces from throughout the fluld, as shown in figure 7
to build that characteristic structure of a thermosolutal fluld.
Figure 7. A series of layers and interfaces set up in a laboratory tank by introducing a solid sloping boundary (from Linden and Weber, 1977)o
252
The size of layers and the tlme-scale of their initiation is dependent upon
the angle of the sloping boundary, or more generally their existence is due to its
presence. However, this specific example presents another illustration of the
basic point: initially small horizontal inhomogeneities in a stably stratified
fluid leads to large scale layering with vertical transports considerably in excess
of those calculated on a molecular basis.
6. The Destruction of LeT era
In the experiment of heating a salinity gradient from below, discussed in
section 4, the tendency of the lower layers to merge as the density difference
across them tends to zero was mentioned. Such merging is due to the continually
imposed flux of heat from the bottom. Layers can merge, or be destroyed by more
natural, internally imposed conditions, which will be described in this section.
Consider a three-layer system consisting of two semi-infinite layers of
uniform T and S between which there is a finite layer of intermediate properties.
All layers are assumed to be convecting~ with temperature and salinity fluxes
across the interfaces in accord with (4.2) and (4.3) or (4.4) and (4.5). There
is then a single-valued relationship between the temperature and.salinity in the
intermediate layer for which the flux through the lower interface equals that
through the upper. The conditions under which such equilibrium situations are
stable is partially answered by Huppert (1971). Assuming that merging takes place
without any vertlcal migration of the interfaces~ he shows that only if the
conditions across each interface are in the 'constant regime', gF or gD equals
constant, will the layer system persist. Otherwise one or other of the interfaces
will disappear and two semi-lnflnite layers separated by one interface remain.
The analysis can be extended to any number of intermediate layers to yield the
same result. Thus the prediction is that no stable system of diffusive layers of
hot salty water exist if BAS/~AT < 2. A controlled laboratory experiment to test
this prediction has yet to be performed, although Turner and Chen (1974) and
Linden (1976) have observed merging which they believe to be due to the above
mechanism. Turning to large scale measurements, we can at present report that no
series of layers has been observed under T3~ in the Red-Sea or elsewhere with
BASIaAT < 2.
Experiments by Linden (1976) indicate that another form of instability is
possible, whereby merging occurs by the vertical movement of one interface to
coalesce with its neighbour. A quantitative analysis of this situation has not yet
been performed. It would clearly be interesting to know which instability is
favoured under specified conditions because layer merging will need to be accounted
for in any future quantitative model building.
7. Su~aar~ and Conclusions
This review has attempted to bring out the following salient points. Fluids
253
Stratified with respect to two (or more) components can exhibit motions very
different to slngly-stratlfied fluids. Instabilities can arise even when the
overall density is statically (very) stable by drawing on the potential energy
Stored in one particular component. The typically observed signature of a thermo-
solutal fluid is a series of convecting layers separated by thin interfaces through
which properties are transported by either diffusion or the action of fingers.
This transport is very much larger than one based on consideration of purely
molecular diffusion across a quiescent region.
Large stars have a heated hellum-rich core surrounded by lighter hydrogen.
The Composition gradient in the core/envelope regions is thus of the diffusive
type and it would be expected that convection of this form predominates. Some
attempt has been made to incorporate this process in a semiconvection zone,
although quantitative calculations would benefit from a precise description of
physics in this zone.
The salt-flnger type of instability has been hypothesised to occur in the
outer layers of differentially rotating stars (Goldreich and Schubert~ 1967),
where the two components with different diffusivities are heat and angular
momentum. Turner (1974) appears to be of the opinion, however, that such an
effect would be obliterated by baroclinic instabillties which occur on a much
larger scale. This concluslon should be questioned in view of the large obser-
vational evidence for the existence of salt-fingers in the ocean, also subject to
baroclinlc instability.
We conclude by suggesting that what is achieved so easily in the laboratory
and the oceans might also be attained by the stars~
This survey benefited from a careful reading of a first draft of the manu-
script by Dr N. O. Weiss.
References
Baines, P. G. & Gill, A. E. 1969 On thermohaline convection with linear gradients, J. Fluid Mech. 37, 289-306
Goldrelch, P. & Schubert, G. 1967 Differential rotation in stars, Astrophys. J. 150, 571-587
Huppert, H. E. 1971 On the stability of a series of double-dlffusive layers, Deep-Sea Res. 18, 1005-1021
Huppert, H. E. & Turner, J. S. 1972 Double-diffuslve convection and its implications for the temperature and salinity structure of the ocean and Lake Vanda, J. Phys. Oceano~. 2, 456-461
Huppert, H. E. &Manins, P. C. 1973 Limiting conditions for salt-fingerlng at an interfacej Deep-Sea Res. 20, 315-323
254
Ruppert, R° E. & Moore, D. R. 1976 Nonlinear doubte-diffusive convection, ~. Fluid Mech. 78, 821-855
Huppert, H. E. & Linden, P. F. 197 ? On heating a salinity gradient from below, (work in progress)
Linden, P. F. 1973 On the structure of salt fingersp Deep-Se.a Res. 20, 325-340
Linden, P. F. 1976 The formation and destruction of fine-Structure by double- diffusive processes, Deep-Sea Res. 23, 895-908
Linden, P. F. & Weber, J. E. The formation of layers in a double-dlffusive system with a sloping boundary, J. Fluid Mech. (to appear)
Moore, D. W. a Spiegel, E. A. 1966 A thermally excited nonlinear oscillator, Astrophys. J. 143, 871-887
Neal, V. T., Neshyba, S. & Denner, W. 1969 Thermal stratification in the Arctic Ocean, Science, 166, 373-374
Shlrtcllffe, T. G. L. 1969 An experimental investigation of thermosolutal convection at marginal stability, J. Fluid Mech. 35, 677-688
Spiegel, E. A. 1971 Convection in stars. I. Basic Boussinesq convection, Ann. Rev. Astron. and Astrophys. 9, 323-352
......
Stern, M. E. 1960 The 'salt-fountaln' and thermohallne convection, Tellus, 12, 172-175
Stern, M. E. 1969 Collective instability of salt fingers, J. Fluid Mech. 35, 209-218
Straus, J. M. 1972 Finite amplitude doubly diffusive convection, J. Fluid Mech. 56, 353-374
Turner, J. S. 1965 The coupled turbulent transports of salt and heat across a sharp density interface, Inst. J. Heat Mass Transfer. 8, 759-767
Turner, J. S. 1967 Salt fingers across a density interface, Deep-Sea Res. 14, 599-611
Turner, J. S. 1968 The behaviour of a stable salinity gradient heated from below, J. Fluldldech. 33, 183-2OO
Turner, J. S. 1974 Double-diffusive phenomena, Ann. Rev. of Fluid Mech. 6, 37-56
Turner, J. S. & Chen, C. F. 1974 Two-dimensional effects in double-diffuslve convection, J. Fluid Mech. 63, 577-592
THE URCA CONVECTION
Giora 8haviv
Departement of Physics and Astronomy
Tel-Aviv University
Ramat Aviv, Israel
The possible role that B-decays may play in stellar collapse was first discussed
by Gamow and Schoenberg (1940, 194]). The above authors proposed this mechanism a
mean of extracting quickly the energy content of the star and transporting it out-
side. In this way they hypothesized that stellar collapse may proceed.
The URCA process is composed of B-decay and inverse B-decay scouring inside the
Star. Let (A, Z-i) be a B-decay unstable nucleus in vacuum (i.e. on earth), namely
(A, Z-l) + (A, Z) + e- + U (I)
where (A, Z) is a nucleus with Z protons and A nucleons (protons plus neutrons) and
9 is the emitted antineutrino. The energetics of the process is seen in Fig. I. Note
that AQ must include the rest mass energy of the electron. The transformation of a
neutron into a proton leads to a lower energy configuration. The transition can go
from the ground state of the nucleus (A, Z-l) to the ground state of the nucleus (A, Z)
(arrow ]) or from the ground state to an excited state (arrow 2) if such exists and
if the transition is allowed. Only under high temperatures of the order of about
MeV is the (A, Z-l) nucleus excited and then transitions from excited states are pos-
sible. There is no basic difference between ground state and excited states trans-
itions. The most important properties for convection are: The energy difference
between the two states is shared by the two emitted particles, the electron (inclu-
ding its rest mass) and the neutrino. The electron is usually more energetic than
the surrounding and hence is slowed down (quickly) and deposits its energy in the
surrounding. The neutrino has an extremely small cross-sectlon for interaction with
matter (about 10 -44 2 cm ) and hence escapes from the star. The energy carried by the
neutrino is a net loss to the star. The rate of the decay depends on the density
(we shall return to this question) but is usually of order of minutes and longer.
T AQ
± ( A
,Z-I
)
(A,Z
)
Figure I.
The energetics of the E-decay.
inve
rse
~-de
cay
Ep s
hell
Figure 2.
The density profile in the star
and the URCA shell.
m/M
tot
Pshe
ll
257
The matter in the star is practically fully ionized at the relevant densities
(above 106 gm/am 3) and the electrons are degenerate. The fermi energy, which is the
average energy needed to add an electron at thermal equilibrium, is a monotonic
function of the density (at constant temperature ~$ep I/3 at the relevant densities).
At a density of about 5 x I05 gm/cm 3 the fermi energy is about I/2 MeV, namely it
equals the rest mass energy of the electron. As the density continues to increase
a moment will come at which the degenerate electrons, which are pushed to higher and
higher energy levels will be energetic enough to cause the inverse process, namely :
(A,Z) + e- ÷ (A,Z-I) + v , (2)
Here ~ is the antineutrino which has the same nuclear properties as the neutrino
(cross section ~ I0 -~# emZ)and escapes from the star, in general, so long as the
densities are below nuclear ones. At P=Pshell when the inverse process (2) occurs we
get the URCA shell. The name URCA was given by Gamow and Schoenberg after the famous
Casino in South America where the gambler was bound to lose his money. The URCA shell
is quite narrow, i.e. the process occurs for Ap << Pshell" At densities p < p shell
the dominant process is B-decay while for p > p shell inverse S-decay is dominant.
The location of the URCA shell is given by the condition
2 ef = AQ - meC (3)
where AQ is the energy difference between the two nuclei.
Tsuruta and Cameron (1969) considered the effect of density variations on the rate of
URCA losses. The picture of Gamow and Schoenberg is static. As the star contracts
the location P=Pshell advances outward and gives rise to neutrino (and energy) losses.
In the case of Tsuruta and Cameron the same nucleus can oscillate (by means of general
stellar vibrations) around P=Pshell" When the nucleus (A,Z) moves to a higher density
it absorbs an electron and emits an anti-neutrino, when the product nucleus (A,Z-I)
moves to the low density, it finds that the phase space of the sea of electrons around
it is free and it B-decays releasing the electron and neutrino. The sum of the two
processes can therefore be written schematically as:
(A,Z)-~ (A,Z) + v+ v (4)
Paczynski (J972) considered the effect of convection on the URCA process. The funda-
mental problem and motivation was the difficulty in the theory of neutron star form-
ation. There are some statistical arguments (Gunn and Ostriker 197]) that hint at
the fact that stars in the range 4-10 ~should be the progenitors of pulsars. However,
when models of such stars are calculated it is found that due to the particular
behaviour of stars with nuclear shell burnings, the carbon ignites at high densities
109 - 1010 gm/cm 3 and low temperatures. The ignition of the 2C ÷ Mg reaction under
258
these conditions is dominated by the corrections to reaction rates due to the high
density and gives rise to detonation. Numerical calculations byvarious people have
shown that no remnant is left. On the other hand, if carbon ignition is not achle~ed
with a violent reaction rate, e.g. if a fast and efficient cooling mechanism would be
available~ then the collapse could be delayed to still higher density and the outcome
of the explosion is a neutron star. The fast and concentrated energy production by
the carbon gives rise to convection. Paczynski considered the effect this convection
has on the URCA pairs. The estimates of Paczynski, based on mixing lengh theory
and some properties of stellar convective cores~ yielded the following result
L ~ T 170 (5) c
where L is the neutrino luminosity and T the central temperature. c
This very high temperature sensitivity follows from straightforward application of
the expressions derived by Tsuruta and Cameron (|970) for stellar vibrations to
convective cores. The oscillations considered are very fast compared to the typical
~-decay times and hence the nuclei will be out of equilibrium (and not in equilibrium
as assumed by Tsuruta and Cameron (]970) and Paczynski (1973)). Next, one has to
conserve the number of decaying nuclei. Suppose a convective core inside which an
URCA shell exis~ is given and a nuel~s inverse-8 decays far away from the URCA
shell on the high density side. This nucleus Cannot (practically) B decay before
it crosses the URCA shell to the low density side. Hence the energy loss must be
evaluated first per cycle and not immediately per unit time.
The extreme sensitivity of the neutrino lossesto temperature (much more than the
nuclear reactions energy production) led Paczynski to the conclusion that the UKCA
neutrinos can cool the star sufficiently fast and control earbon burning. Consequently
he assumed stable carbon burning which delayed the collapse of the star and
yielded the desired conditions for the formation of pulsars~aczynski 1973).
A new development came when Bruenn (1973) showed that an accurate calculation shows
that the final outcome of the URCA pair may be heating and not cooling. Consider
first the B-decay. Assume that convection turn-over time scale is short compared
to 8-decay rate (a good assuption). The B-unstable nucleus will therefore be
carried by the convection way past the URCA shell to regions of low density. The
escaping neutrino causes of oourse an energy loss but the emitted electron may
(since the electrons are emitted with a certain spectrum of energies) have energy
well above the average energy of the electrons in the medium (~f at that place). The
fast electron is slowed down and transfers its extra energy to the medium i.e. it
heats the medium.
The same situation occurs when the (A,Z) nucleus is transferred by the convection
to the high density region. Again~ the inverse B-decay process is slow compared to
conveetion velocities and the inverse process may oecur at densities for which ef
259
is much greater than the energy difference between the nuclei. Since the energy diffe-
rence is fixed, so will be the energy of the absorbed electron. Under these conditions
the absorbed electron will come from deep in the sea of electrons. Thus a "holeUis
created below the fermi level. The subsequent thermalization of the distribution, name-
ly the relaxation to a new thermodynamic equilibrium with a smaller number of electrons
will convert some of the energy difference~ef - Eth > 0 (the difference between the
average energy and the energy of the absorbed electron) into thermal energy of the whole
sea of electrons. Said in other words, a high energy electron may jump into the hole
and give its extra energy to the rest. This particular behaviour of 8 decays is well
known to nuclear physicists, namely if you put a G unstable nucleus in a container the
radioactive decay heats the medium in spite of the fact that the neutrino escapes.
The basic argument raised by Bruenn can be demonstrated in the following way. Let
E be the internal energy of the matter in which the species Ni 8 decays. We have from
the first law :
DE DE DE dE = (-~-) dV + (-~--) aT + E (-~) DN.
T'Ni V'Ni i l V,T,N i
= - <E > - pdV v (6)
where the heat lost from the unit mass considered is replaced by <E > the average
energy of the emitted neutrinos. Bruenn assumes the process to occur at constant
volume. This is not the case in stars. The fast pressure equilibrium will give
rise to compression in the case of electron capture (there are fewer particles) and
to expansion in electron emission. In spite of thls neglec?, the qualitative result
is correct. The change in temperature due to the electron capture is therefore given
hy
f ~E DE = L - - { + (
DE ~E + ( - 7 ) ] / ( ~ )
e V,T V,Ni (7)
The sum of the second and third terms on the right hand side
DE 3E ("ST') = (~-Tb"-') = Ef + %c 2 (8)
e V,T e V,S
is exactly AQ. Since we consider only very degenerate matter for which kT <<El, we can
approximate the thermodynamic derivatives with those evaluated at T=O. Hence one gets:
260
2 where m c
e is the rest mass energy of the electron. One finally gets:
~E ~E dT = ( - <e~> + Ae ) / (-~--) a"T" > 0 (9)
V,N i
Where Ae = ef - eth is the difference between the electron fermi energy and the 2
electron capture threshold and eth = -meC " AQ. Cooling will occur only if
<eg> > Ae. The calculations by Bruenn have shown that for T < Tneut (0) the e-capture
will result in heating and vice versa. The effect occurs at p ~ 109 gm cm -3, for T ~ 10 ~ °K, namely higher than the carbon ignition temperatures and hence Bruenn
concluded that the URCA process cannot stabilize the carbon burning and the story
was back the beginning. Figure 3, taken from Regev (1975) demonstrates the basic
result. Close to the URCA shell the electrons have very little extra energy and the
cooling dominates, however, outside a very narrow strip A0 << Pshell heating domina-
tes and if a convective core extends over a sufficiently large density gradient the
total heating may overcome the total cooling and convection may have the opposite
effect: heating instead of cooling.
The pendulum swung in the opposite direction after Couch and Arnett (1974) intro-
duced the idea of a cycle. Consider a given mass element moved up and down by the
convective currents. The energy balance at the high density side is
A E ec = ec - AQ E ee (10)
and at the low density:
aFi = aQ - E -~ (11)
Here ~ = ef + met2 is the chemical potential of the medium. The index ec denotes
that it must be evaluated at the point at which the e-capture takes place and vice
versa, ~- is the chemir.al potential at the place of e-emlssion. Consider now the
full cycle. The mass unit starts at the high density side by absorbing an electron.
It then moves upward where it decays as soon as the density falls. We find that
the downward moving mass unit has one extra electron compared to the upward moving
mass-unit. Consequently there is a net transfer of electrons downward. (The upward
moving electrons are "hidden" in the form of a neutron.) Couch and Arnett add therefore
the two energy equations to get:
ee - ec AE ec + AE = ~ - ~ - (e +av ) (12)
Theynow reason that the difference between the fermi energies of the two locations
must be invested in maintaining the convective flow since the electron must be
brought back to the original place. Note that the electrons move downward and hence
261
u
O~
%
O .
1.6
1,4
1.2
1.0
Heating
Cooling
Heat ing
I i I
2 3 ~; 5 6 T / 1 0 8 "K
Figure 3.
The regions in the O T plane for which heating occurs. The calculation is carried
for the Mg 25 - Na 25 URCA pair with uniform abundance (X = I0-4"I). The broken line 2
indicates the place where AQ =m e c - the URCA shell.
262
release potential energy rather than absorb. Moreover, in hydrostatic equilibrium
the total chemical potential must he constant, hence 9 mp ~ const or d~ mp
where # is the gravitational potential and m the mass of the proton. The gravita- P
tional potential is determined by the C and 0 nuclei and not by the electrons and
much less so by the URCA pair.
The fact that the change in chemical potential of the electrons must be equal to the
change in gravitational potential due to the nuclei gives rise to the high degeneracy
of the electrons. The difference ec _ P is therefore not equal to the difference in
potential energy of the electrons m e A~ but to the difference in potential energy Of
the nuclei - and this in turn is not so relevant (see later).
Couch and Arnett argue that the convective blob (of given mass) contracts upon
e-capture (pressure equilibrium - just the term ignored by Bruenn) and becomes denser
than the surroundinglwhile the downward moving blob emits an electron and becomes
lighter than the surrounding&. Thus the convective flow has to carry denser matter
upward and lighter downward. This difference costs the extra energy that appears
in the term d~ = ec - ~ . A similar argument to the previous one shows that this
is not the case. Consequently, the final result of Bruenn remains valid.
Regev and Shaviv (|975) considered the question of convective stability of the URCA
process. If heating is important,then convection may start before the Schwarzschild
criterion is violated because a small perturbation in the bubble may heat it. It was
found that convection may start earlier than assumed before (according to the Schwarz-
sehild criterion). However, the rise times are quite long and it is impossible to give
a final answer as to what will happen without a detailed stellar evolution calculation.
The analysis was a local one.
Lazareff (]975) considered the details of the convection process with URCA heating
and concluded that no stationary convective core can exist. This conclusion is
correct but for completely different reasons. Lazareff assumed mixing-length theory
in which only rising bubbles exist and considered the detai~ of this motion. He
assumed that during this motion the URCA process releases heat and concluded that if
this process is integrated over the whole convective zone the total entropy will
increase in time and hence no stationary state exists. The error in this kind of
treatment is readily seen in the case of no URCA pair. One finds that the entropy
density continues to increase even in the case of normal stellar convection. The
problem appears because (a) the downward bubbles are not included,
(h) the work done hy the buoyancy force (absorbed by the
bubble but lost by the surrounding) is not accounted for and
(c) the mixing-length theory discusses the perturbed quantities
and not the actual quantities and hence integral theorems are unavailable.
ShaviV and Regev (1976) proceeded in two steps. First the motion of a single bubble
was analyzed and then the global properties of the convective zone were discussed.
26,3
The change in temperature of a blob in surroundings
by (Regev (1975)) :
aT I aT ( r , g ) = v ( r , ~ ) ~ v ( r , ~ ) d--f
ad
containing an URCA pair is given
, T , X ) (13)
C P
when ~ is the place of formation, v the velocity, X the composition at time t.
Quantities with asterisk denote the values inside the blob. C is the specific heat P
at constant pressure. The composition of the blob changes in time according to
dX I ~ ( r , ~ ) • X v(r,E) dr = - X I X I (r,~) + ~2 (X - X I (r,g)) (14)
where X; ~ is the mass-fraction of the first species of the URCA pair and X is the
total mass-fractlon of the pair.
The equations of motion are then integrated in order to find the motion. A typical
result is shown in figure 4. We find that (a) for most cases of rising blobs the URCA
heating "helps" the buoyancy forces and the velocity is increased. (b) downward moving
blobs are somewhat disturbed by the heating (c) a subadiabatic gradient may lead to
unstable upward-~noving blobs (in agreement with the local stability analysis of Regev
and Shaviv (]975)) but prevents blobs moving downward.The most important result is:
(d) the URCA losses have a negligible effect on the motion of the rising bloh.Actually,
as the blob starts to move, its velocity is small and the effect of the URCA on acce-
leration very large, but as soon as the velocity becomes large, the time scale becomes
too short to have any effect on the motion. We find therefore that the URCA losses are
the result of spreading the URCA isotopes uniformly over the whole convective core.
Consider now a convective core as a zone with q~(r) losses and q(r) heating per unit
mass and time. Clearly, close to the URCA shell~q~ is greater than the heating but far
away q(r) l which includes the nuclear and the URCA heating;is the dominant factor.
The two contributions have different spatial behaviour.
The energy equation is :
P d-tdE + PV.~ = pq - pq~ + pqf - V.F (15)
where F is the radiative flux through radius r and qf the rate of heat generated by
dissipation. The equation of motion is given by
d~ Vp - ~ - pV~ (16) 0 dt
where f is the frictional force per unit volume and ~ the gravitational potential.
The integration over the whole convective zone down tb the place where ~=o yieldspafter
264
u
E u
>
0 M
7
6
3 w
5
iIi
×I
! I I I ........ I i
6 7 8 9 40 44 42
r/40 "4 R e
-6
-8
X
O
-10
" -12
Figure 4.
The velocity of a blob in the case of Na 23 URCA pair. Line ] is log v
for an adiabatle blob and line 2 is log v for a blob with the URCA pair.
Line a is the equilibrium abundance of Na 23 in the surrounding medium,
I is the distance of one mixing length,
265
Some manipulation ~the following result :
( ] / 2 pv 2 . . . . . . . . . . (17) -- + E ) = pq - pq~ + L~ - ~t In Lout
where Lin and Lou t are the radiative flux into and out of the convective zone respec-
tively. The bar denotes an integral over the whole convective zone. Consider first the
case of no URCA process, i.e. q = 0 and q is the energy generation due to nuclear reac-
tions. In a steady state, the time derivative must vanish and we find that the net out-
come of convection is to spread the nuclear energy generation over a large volume so
that radiative flux can carry the energy from the boundary. When the URCA pair is pre-
sent and the steady state is preserved, it follows that the total heating by the URCA
process (added to q) must be equal to the total neutrino losses, q . If this balance is
not maintained the convective core will not be stationery. A detailed balance can exist
only if the convective core has a definite extent. Moreover, even if such a steady state
convective core exists, it is• unstable. The analysis of the URCA losses shows that qv
dominates near the URCA shell but the heat gain dominates elsewhere. Thus if the nuclear
reactions increase their energy production and the convective core expands, the URCA
process will increase the heating even more unless the radiative losses increase faster,
which is not the case. We conclude therefore that steady state eonvectioD cannot con-
trol the carbon burning and the problem of the fate of these models and the progenitors
of pulsars remains.
A question of principle remains : how come that a process which conserves material has
as its outcome net heating ? The solution is that the URCA process is out of equilibrium.
The net heating is given by (Regev (1975))
q = A--N° meC2 ([AQ - meC2 _ e~ %2X2 - X2L 2 + [ef + meC2 - AWl]X ] - X]L! ) C]8)
where L l and L 2 are the neutrino loss rates by the e.c. and 6 decay rates per nucleus
respectively. N and A are the Avogadro number and atomic weight respectively. When the o
URCA pair is spread uniformly/q > 0 and we have heating, but at equilibrium X]Xl=%2L 2
and the expression for q becomes
No 2 q = - ~-- mec (X]L l + X2L 2) (19)
and we have cooling only.
We are led finally to the question of the distribution of the URCA pair. Two time-scales
affect the distribution : the convection mixing time rconv = £/Vconv and
TURCA = (%| + 12 ) -]j which is the decay time. Define a new parameter by
= -l
ami x Tconv TURCA = 3,5 x JO 7 (%! + Am)v -! z conv (20)
266
where the convective velocity is given in cm/sec. The limit of complete mixing is
obtained for s . << I while equilibrium is reached for a . >> I. In reality we find mlx mlx
a. = ]. Consequently, at the beginning of the convection the process is close to mix
equilibrium but as time goes on the URCA pair is driven away from equilibrium and the
heating appears. The entropy added into the convective zone is due, as pointed out by
Lazareff (75)) to the non-equilibrium state of the URCA pair.
Acknowledgement : It is a pleasure to thank Mr O. Regev for discussions that made this
analysis and presentation possible.
BIBLIOGRAPHY
I. Bruenn S. W. (1973) Ap J. Let~. 183, L125
2. Couch R. G. and Arnett W. D. (1974) Ap J. 194, 537
3. Gamow G. and Schoenberg M. (1940) Phys. Rev 58, ]I]7
4. Gamow G. and Schoenberg M. (1941) Phys. Rev 59, 539
5. Tsuruta S. and Cameron A. G. W. (1970) Astr & Space Sci. Z, 314
6. Gunn J. E. and Ostriker J. P. (1971) Ap J. ]60, 979
7. Lazareff B. (1975) Ast. & Astrophys. 45, 14]
g. Paezynski B. (]972) Astrophys. Left l_i], 53
(]973) ibid ]5, !47
(1973) Acta Astronomica 23, l
9. Regev O. (September 1975) On the Interaction Between Convection and the URCA
Process M.Sc. thesis Tel Aviv University
]0. Regev O. and Shaviv G. (1975) Ast. & Space Sci 37, [43
I]. Shaviv O. and Regev O. (I976) Ast. & Astrophys. (in press)
PHOTOCONVECTION
E.A. SPIEGEL Astronomy Department Columbia University
New York, New York 10027 U.S.A.
Convection under the influence of dynamically significant radiation fields
occurs routinely in hot stars (Underhill 1949 ab) and probably also in a variety of
other objects near the Eddington limit (Joss, Salpater, and Ostriker 1973), Yet
this topic, which is here called photoconvection, has not been actively investigated
prior to the present decade. Except for limiting cases, the stability condition
does not seem to have been worked out and only some preliminary notions exist about
the highly unstable case. This is somewhat surprising since it has long been sus-
pected that some of the vigorous dynamical activity observed in hot stars (Huang and
Struve 1960, Reimers 1976) is caused by radiative forces (Underhill 1949 ab). In
the hope that this neglect may be compensated for by the application of some of the
techniques described at this meeting, I shall sketch some of the main features of
this topic. Three aspects are considered. First, I list a set of approximate equa-
tions for plane-parallel photoconvection. Then I give a schematin treatment of the
onset of instability. And finally, I shall outline some of the arguments for be-
lieving that photon bubbles occur in the nonlinear regime.
I. EQUATIONS OF PHOTOHYDRODYNAMICS
The interaction of electromagnetic radiation with a plasma is a complicated
subject with a long and controversial history. However, many of the difficulties
are avoided if we consider densities and radiation frequencies that keep the index
of refraction of the medium quite close to unity. In that case, we can describe
the radiation field by transfer theory if we take due notice of the motion of the
material medium. The simplest description arises if we simply take the first two
moments of the transfer equation and supply a constitutive relation for the radia-
tive pressure tensor. For the matter, we shall adopt the model of a perfect gray
gas. Then the matter field is described by the velocity u, the density Q, and the
pressure p, while the radiation field is characterized by the flux~, the energy
density E, and the pressure tensor~.
These variables are expressed in the inertial frame of the system (star), in
which we will generally be working. It will be useful, however, to make use of the
expressions for radiative flux and energy density in the local rest frame of the
matter. These are
268
.÷ ~. 2 (l.la) E ffi E - zu°~/c
- P'u, (l.lb) F - ~- E~ +~÷
where c is the speed of light. These expressions are valid only to order lul/c,
which is the level of accuracy (at best) aimed for here. Nevertheless, in the
equations used below, we shall see some factors of c -2, because the radiation field
is relativistic. In particular, the quantity ~/E qualitatively plays the role of
a velocity for the radiation field and in the surface layers of stars the magnitude
of this velocity may be comparable with c.
In addition to the field variables, we have to specify certain quantities that
measure the effective interactions between the two fields. These interactions we
shall take to be Thomson scattering, absorption, and emission. We shall assume that
the Compton effect can be modeled by a suitable choice of absorption coefficient.
We shall call K the absorption coefficient and ~ the scatterlngcoefflcient (both
per unit mass); G will be constant and K may depend on density and temperature. The
source function (divided by c) is denoted by S and depends only on the matter's
temperature, as indicated below.
The equations describing the conservation of matter and the force balance of
the medlumare
~= -pvJ a t (1.2)
and
(1.3) d+
p~'~-= - W, - gp~-+' c F
where g~ is the acceleration of gravity, ~ is a unit vector, and
d d-[ = ~ + ~.V .
The last term on the right of (1.3) is the usual expression for the radiative force.
Analagous equations exist for the radiative fluid:
DE p(K+~) + +~ ~-[ + v-~ - p~c(S-~) - c u.F (1.4)
and
(1.5) -- -- - (S-E) u. 2 ~t c c
c
For the pressure tensor of the radiation field a standard form is
~-+ i ~++ ~÷ ÷÷ 2 - (uF + Fu)/c - ~+ (1.6) P ~ Z
where I is the idemtensor and T is a viscous tensor. In component form,
F' Bui ÷ (l.6a) TiJ = nL~ + ~ - 2 (V'u)6ij~3
where 6ij is the Kronecker symbol and the viscosity is approximated by
269
8E ( 1 . 6 b ) ~ . . . . . . 3p(10K + 9a)c "
Expression (1.6) arises when the radiation pressure tensor i s approximated
in the matter frame by the usual Eddington approximation plus a viscosity tensor.
For the constitutive relations for the matter we adopt
( 1 . 7 ) p = RpT and
(1.8) S = aT 4
where T is the temperature and R and a are constants. (We shall not specify
here.) The introduction of the temperature calls for another equation, as in
normal convection.
If • is the specific entropy of the matter, we may write
(l.9a) pT ~t = -pKc(S-E),
orj if we use the expression for the entropy of an ideal gas,
aT dp = -p~c(S-E), (1.9b) pCp d-~- dt
where C is the specific heat at constant pressure. P
These governing equations are consistent andmoderately accurate sets of
governing equations. I have said little about the basis of them (but see Simon 1963
or Hsieh and Spiegel 1976) since their physical content is reasonably clear. If
anything, these equations are, for present purposes, too complete. It appears that
there are a number of generally small terms which will hinder calculations and ob-
scure meanings. But many of these terms are unfamilia~and the challenge is to
discover when we can discard which terms. In what follows, I shall make a number
of guesses about this; I hope that these are not too misleading. In fact, much
of the discussion is just aimed at seeing what some of these terms do and in such a
schematic treatment you would not expect to see boundary conditions. I shall hardly
disappoint you. But before I comudt mayhem on the equations, let us modify the ap-
pearance of the last one by combining it with (1.4). We obtain, with the help of
(z.2),
PC dT dp + dE 4 E dP -V.(~ - 4 ÷ ( l . l O ) p d'-f - dt d-[ - ~ ~ d'-T" ~ Zu)
- - - u -F . 3 c
We may note that the left hand side of this equation is
the total (matter plus radiation) specific entropy. pTd6tot/dt where Ato t is
270
II. THE HYDROSTATIC STATE
As background to the problem of photoconvection it is useful to know the solu-
tion of the basic equations which describe the state in which the matter is static.
But note that this solution is not photostatic; the radiation is flowing through mat-
ter llke a fluid through a porous medium.
We consider stationary solutions whose properties are independent of horizon-
tal coordinate. If K# O, equations (1.9), (i.I), and (1.8) indicate that
(2.1) E = aT4;
if K = 0 this relation is not forced and T is an arbitrary function of t h e ver- +
ileal coordinate, z. In either case F is constant and is in the i-direction.
Now (1.2) is identlcally satisfied and (1.3) gives the hydrostatic equation
dz g*P ~ (2.2)
w h e r e
~c-I-O (2.3) g , = g - c F
is the effective gravity. (In the Eddington limit, g , = 0.) The radiative flow
equation (1.5) becomes
dE -3p ~ F, (2.4) d-~ = c
and (1.7) is unmodified. Thus all the governing equations are accounted for and we
have a simple system to solve once K is known. In general the problem is handled
numerically, but some analytically tractable cases exist. Let us look briefly at
the simplest: ~+u = constant.
We may introduce the total pressure
i (2.5) P = p + ~ E,
and combine (2.2) and (2.4). We find that
dP m (2.6) dz -go ,
and, on d i v i d i n g by ( 2 . 4 ) , t h a t
dP (2,7) 3 ~ = (K+~)F
The integral of this equation, after some rearrangement, may be written
g,c
(2.8) p = 3(~+o)F (E-E1) '
271
(2.12)
where
(2.13)
where E 1 is an arbitrary constant. It is often conveD/ent to choose E 1 as the
value of E at the top of the "atmosphere ".
We may now write a simple differential equation for E, or T, and find the
solution
i _ i +. (2.9) -z =~[7-~T Itan [ -yT ltanb -1 )] ,
where E 1 = aTe._ If T 1 - 0, this represents a complete polytropic atmosphere. In
any case, the medium is polytropic for z << 0 and T is proportional to -z
down there. For z >> 0, T - T 1 decays exponentially as we move upward and the
atmosphere extends to infinity for T 1 # 0.
In principle, all the other details could be worked out from this, but
nt~erlcal work is generally needed. However, some things are still simply expres-
sible in terms of the optical depth
(2.10) ~ = [-(K+c)pdz. #
z
In particular,
.ll) E = F(T+TI), (2
where T 1 is a constant of order unity.
Another quantity of interest in the static atmosphere is the temperature
gradient. In the present instance this is most simply expressed in the familiar
nondlmenslonal form
dlnT R dT _ 7 - 1 CD dT V = - d - ~ n P = gB dz ~' g8 dz
~=~.
For the atmosphere with K+o constant we find
(2.14)
where
(2.15)
1 l-s V = m ~ , 4 1-B
g, ~ i I - - +
g
tzt. THZ O+SST OF CONVECTZO~
The action of radiative forces under suitable conditions may promote wave
amplification (Hearn 1972, 1973; Berthomleu, Provost, and Rocca 1976) and posslbly
overstability (e.g., Spiegel 1976). The nature of this overstability seems to place
272
it more in the domain of stellar pulsation theory than convection theory, though the
two may become enmeshed in the nonlinear regime. On the other hand, monotonlc in-
stability, that is exponential growth without oscillation, is more clearly linked to
the development of convection when the time scales are dynamic, and I shall confine
myself here to discussing that topic.
The procedure for deciding whether convective instability arises is straight-
forward in princlple~ especially when we are not trying to study overstabillty or
flnite.amplitude instability. We decompose each dependent variable into a hydro-
static part and small perturbation.~ Here we shall indicate the latter type of
quantity by a prime, except in the case of velocity. We restrict ourselves to the
situation where ~/~t = 0. Then, on linearlzing in the usual way, we find from
(1.10) that
pCpblW = V'F , (3.1)
where
(3.2) 0Cpb I -(pCp dzd-~T-~+--'dS AS dO. = dz 3 ~dz ) "
But we may also proceed in this way on the basis of (l.9b) and in that case we oh-
rain the equation
(3.3)
where
(3.4)
pCpb2W = pKc(S'-E'),
oCph 2 = -(pCp d_.!_ dd_.~.z) dz
Now in a full treatment of the problem it would not matter which of these two
routes is taken since the final answer would be the same. But the stability criteria
that are normally used are obtained with approximations and the two approaches may
differ in that case since they have suggested different approximations to dif-
ferent people. In particular~ people have simply written down criteria for in-
stability with respect to adiabatic disturbances with differing notions of wh&t they
mean by adiabatic. Thus, the commonly encountered criterion results from equating
?.~' to zero. If we do this we find that hlW must vanish at marginal stability.
Since w in that case has small but arbitrary amplitude, we obtain the critical
condition b. = 0, which is the conventional one (Chandrasekhar 1939). On the other
hand, if we set the right hand side of (3.4) equal to zero (e.g., Wentzel 1970,
Spiegel 1976) we obtain b 2 = 0 as the condition for marginal stability. This cri-
terion holds strictly when absorption and Compton*scatterlng are omittedand its use
otherwise is dangerous.
The two criteria represent valid approximations under certain circumstances
and P. Vitello (private communication) has recently investigated what these are.
273
The discussion of this question shows that the conditions under which one or
other neutral stability criterion holds depends on the perturbation being made.
This is a common situation and we expect that the correct instability criterion is
to be found by choosing the most unstable mode.
To see how the problem goes let us begin to do the stability calculation.
From (1.5) we find for marginal linear perturbations that
= p(K~) p(m+o)
with
(3.6) ~ = "~ E'~+ ( ~ + ~) *F /c 2 - ~ ,
Also from (l.9b) we obtain
Cb (3.7) ~' = s' - P 2w.
KC
If we combine these results with (3.1), making use of other equations as needed, we
find an equation of the form
pCp[B2V2w + pKA ~z - 3p21<(K+O)BlW] = 3p2K(<+°)V'(KVT)'" (3.8)
where
(3.9)
and B I, B 2, and A
If n = 0 and C P
(3.10a)
(3.Z0b)
and
(3.10c)
4act 3 K = 3p(K~)
are quantities whose dimensions are temperature over length.
is constant,
81 =bl- 3p2(Kd~) dz 2 3P dz
Fd - ~T, [(r+~) ~]-I_ F Q(K+O)C dz 2 '
B 2 = b 2
Now consider the case in which the right hand side of (3.8) is set equal to
zero. That is, instead of trying to speak of an adiabatic disturbance, let us sim-
ply ask what happens to a perturbation when radiative conductivity is suppressed.
If geometrically small horizontal scales are the most unstable, as they are in
ordinary invlscid, non-conducting convection, we may replace V 2 by -k 2 where k
is the horizontal wave number. For qualitative purposes, we may also omit the term
274
with coefficient A since this is not important in the limiting cases we wish to
consider. Also, for the present argument, I shall set B I = bl, since this discus-
sion is merely schematic. The approximate condition then becomes
(3.11) k2b 2 + 3p2K(K+O)bl = O,
and in terms of the quantities defined in (2.12) and (2.15) we find the instability
criterion
(y-l) [~2 (4-,3~) ] V> ,
-- S[~{S+4(y-I) (1-6) ] + $2 [y62+4(y- l ) (1-6) (4+6) ] (3.12)
where
(3.13) ~2 = 3p2K(<+O)
k 2
This criterion holds approximately in the limit of zero viscosity and with the omis-
sion of radiative conduction terms as indicated. The dimensionless quantity ~2j
which arises in radiative cooling problems (Unno and Spiegel 1966), should be cho-
sen so as to minimize the right hand side of (3.12). The resulting value of ~ is
then inserted to give the local stability criterion. Of course, if we are led to
extreme values of ~ we should worry about the possible violation of physical con-
straints that have been removed in this simplified analysis. (I~nextremis, we could
Just solve the problem properly.)
To make the appropriate choice we observe that for y > 4/3 the right side
of (3.12) increases as ~ decreases. In that case, the instability criterion is
obtained with the largest possible values of ~, hence with modes of large horizon-
tel scale in the length unit [3p2K(K+O) ]-1/2. In stellar interlorsmost scales of
interest satisfy this condition and the conventional criterion would apply. In
transparent regions, however, it may be that geometrical constraints intervene and
large ~ cannot be achieved. In that case, the maxlmumvalues allowed for
should be taken, and here we should note that once ~ exceeds unity there is not a
large difference from the results at very large ~,
In cases where 7 < ~ the situation is reversed and the right hand side of 3'
(3.12) decreases as ~ decreases. The preferred value of ~ is now the smallest
one possible; that is we want the largest allowed value of k. If the particle mean
free path is much less than the photon mean free path we can choose small ~ with-
out worrying about the breakdown of fluid dynamics, But we do have to make sure
that we don't choose a k which is so large that diffusive effects wipe out the in-
stability. In fact this amounts tO finding the preferred mode in the usual way, but
here the choice determines not just diffusive corrections to the critical gradient,
but also the effective adiabatic gradient itself. Unfortunately, there is a compli-
cation that arises in this situation.
275
& The case y <~ will normally occur in ionization zo-~es and therefore has to
be treated with some care. In fact, Underhill (1949b) ~as evaluated the "adiabatic"
temperature gradient with partial ionization and in the presence of an important
radiation field. But that calculation was what I have been calling the conventional
criterion. That is, she has applied the condition of zero total (matter plus radia-
tion) entropy gradient which corresponds to the marginal stability conditions with
=m. Howeverj the possibility exists that finite values of ~ may be more correct
since the zones of partial ~onlzatlon tend to occur in stellar envelopes. We could
then ha~ a somewhat increased convective instability but the modes involved, being
radlatively leaky, might not carry heat effectively. It appears therefore that for
most purposes the standard convection criterion is good. However, it would be more
comfortable to have a detailed treatment of this problem, and I predict that there
soon will be one.
IV, PHOTON BUBBLES
In thinking about ordinary stellar convection we maybe guided by solar ob-
servations, but we have not such direct experience to guide us in photoconvection.
Instead, we may appeal to observations of a laboratory flow that is analogous to
photoconvection. We have already seen that the radiation in this problembehaves
(in the Eddington approximation) like a fluid flowing through a deformable porous
medium. This closely resembles the situation in a fluldlzed bad (Thorns 1973,
Prendergast and Spiegel 1973). Though the analogy is not a perfect one (Spiegel
1976), it can be used to suggest the qualitative nature of nonlinear photoconvection.
And one of the most striking implications of this analogy is that instead of convec-
tive thermals having relatively low densities, we should expect real bubbles
in photoconvectlon. These are filled with radiation and contain virtually no
matter. How this modification of the normal convective process may influence the
heat flux can only be crudely estimated (Thorns 1973), but there are also other
features of convection which are strongly affected. In partlcular~ bubbles feel
the full effect of gravity rather than the reduced gravity of ordinary eonvection~
hence large (that is, sonic) convective speeds may be anticipated.
In this section, I shall sketch an approach to the treatment of photon bubbles
borrowing heavily from the literature on fluidized beds (Jackson 1970, Rowe 1971).
In comparison to fluidlzation, this theory suffers from the disedvantage that we
have not yet seen a photon bubble. However, John Lin at Columbia is looking serious-
ly at the prospects for removing this drawback experimentally.
We wish then to study a photon bubble of radius r ° rising at speed V. We
shall assume that r O << H,~ where H, ~ R T/E~, and that the bubble may be taken to be
quasl-steady when described in its own reference frame. We may nevertheless intro-
duce the dynamical time scale r /V. Let us assume that this time is much shorter o
276
than the thermal time of e region of size r ° and then presume from this that
there is validity in neglecting thermal effects. Then we may tentatively set K = 0
in the basic equations.
Now let E and F be representative values for the ambient radiant energy
density and flux and let p, be a representative ambient matter density. The fol-
lowing dimensionless parameters are of interest:
V EV = V 2 (4.1) c =-f,~ =T,f2 .r,= p,cr o.
gr o
We assume that ~ << 1 and, from the analogy to fluidizatlon, we anticipate that
f2 is of order unity. If E and F may he estimated from their static values
(see (1.11)), we have E/F ~ e/T, for T >> i, where T is the optical depth. Hence
~ ~T. Then, when the bubble is only a few radii below the surface, T, ~ T and
we have that 8 ~ ET,, which is the regime we shall study here. A further restric-
tion to be used in the following analysis is ~ << I, but l shall mention at the
end what may happen at larger depths when 8 becomes of order unity.
If we nondimenslonalize the basic equations and make use of the foregoing
approximations, we obtain a greatly reduced set of equations. In dimensionful form
these are
(4.2) P ~t + u.Vu) - -Vp-gp~ + c F
(4.3) ~-E + V'(p~) = 0 ~t
(p/p~f) + u . V ( p / p "f) - o (4.4) 3-~
( 4 . s ) v.~" = o
(4.63 3 VE - ~c ~"
This description is about as primitive as it can be while still involving the ele-
ments of photohydrodynamice. Let us now seek approximate solutions for a bubble
rising at constant speed V. We presume that the medium is unstable, which is
true if = < 0.2.
Suppose, in first approximation, that the bubble is a spherical hole of
radius r • If the bubble does not greatly disturb the ambient density, we see that o
equations (4.5) and (4.6) are simply the transfer equations for a static medium with
a hole in it. This results because in t h e present approximation the radiation field
adjusts quickly to the state of the medium (~ << i); also the motion is so slow
that the difference between ~ and ~ may be neglected (~ << i)° We have also
assumed that the bubble radius is much less than the local scale height~ hence O
in (4.6) is approximately constant outside the hole. Equations (4.5) and (4..6) may
then he solved separately with D = 0 inside the spherical cavity.
277
Let us introduce a spherical coordinate system with origin at the center of
the hole and with % = 0 at the top of the hole. Far from the hole, the flux is
Fo~, where F ° is a constant given by the static solution and we have the condition
3p~F o ^ ( 4 . 7 ) VE ÷ - - - z a s r ÷ ~ .
c
M o r e o v e r , E and t h e component of ~ n o r m a l t o t h e b u b b l e s u r f a c e a r e c o n t i n u o u s
across the surface. Since (4.6) implies that E is constant inside thehole, we
have the boundary condition
0 1 1 r = r ( 4 . 8 ) E = E ° o
where E is the constant value of E inside the hole. Now (4.5) and (4.6) show o
that E is a harmonic function and, with conditions (4.7) and (4.8), we find
2 r o
oro - c o s e . (4.9) E - E ° - c o r
Since cE/(3p~) is a potentlal for ~, we have
(4.101 ~ ffi V-{Fo o r': [__.r. (~]2]coa e}. o
Alternatively, we can express F in terms o f a Stokes stream function:
1___!__ - 3T Fe i ~ (4.11) Fr = r2sin 8 ~8 ' = r sin 8 ~r '
w h e r e
( 4 . 1 2 ) r ~=_~1 For2[1 + 2(~)3]sln2e
The flux co, isis of ~e original unifo~ part plus a d~ole generated by the hole,
a reset familiar from analogue problems ~, for example, electroata~cs. ~e
radiative flow is shown in the neighborhood of ~e bubble in ~e figure on the
following page. Inside the bubble, the fl~ is 3Fo, if equation (4.6) ~y be wed.
~is last point is a ~llcate one as we have used ~e Eddin~on ~proxi~tion
for the transfer ~eo~. However, this approximation holds if the radiation is iso-
tropic ~d, when pot ° >> i, it probably is. ~e reason is that for T, >> 1 indi-
~dual photons will scatter off the b~ble surface (actually a l~er of ~ickness
(p~)-l) many times before escaping, hence the r~iation field inside ~e b~ble
should be reasonably isotroplc.
~e deformation in the radiation field produces an additional for~ on ~e
~tter. ~e total exte~al force density is 3
-gPz + 0~ F" -g*Pz " c ~pV cos 8
278
/
J
e %.. .'2
Streamlines for the radiative flux around a hole, according to (4.12). The solid circle ehc~s the original hole. ~ The dotted curve indicates the estimated deforma- tion of the hole obtained by setting h = 0 in (4.22) with ~ = 0 and V chosen as in (4.24).
279
(4.13)
v-V(p/p 7) 0, (4,14) ÷ =
(4.1s) v.(~) = o,
where
roF r 3,
and
where g* " g-gR and gR ffi Fo/C" The additional dipole force produces a fluid
circulation which causes the hole to rise and, in general, to deform. Let us study
these effects.
If we work in the bubble's frame and assume a stationary situation we have
the equations
v'Vv ~ -Vh-g,~-V~ ,
I have not changed notation to indicate the coordinate transformation except to
call v = u - V z the new velocity. The correction to ~ due to the motion of
the bubble is of order 6 and is neglected.
First we shall determine V on the assumption that the bubhle remains
spherical. This we do with the approximation p = const, whence
(4.1s) v-~ = 0.
We may therefore take ~ to be the incompressible flow around a spherical ob-
stacle. Such a flow has a vanishing normal component on the bubble boundary and
-V ~ as r -~ ~. Solutlons of this problem are well known and if it approaches
we also set
(4.19)
we find
V x v = O ,
r 3
(4.2O) ~ffi-VV[r cos 811 +~r31 ].
Moreover, because of (4.19) we may rewrite (4.13) as
(4.21) V [h+#+g, zq~2/2 ] = 0;
hence for r ~ ro,
280
(4.22)
h = gro{l-[~ + (l-s) ] cos 8}
- ~V {i+ (1-3 cos 2 @) +~ (1+3 cos 2 @)},
where ~ = g,/g and an arbitrary constant has been chosen so that
On r = r we have o
9 2 (4.23) h = gro[l- cos 8 -~f (l-cos 2 8)].
h(ro,O,O) = O.
For p = const, h = p. Alternatively, the choice p/p7 = const, which satisfies
(4.14), gives h ~ p(7-1)/y. In either case we would like to have h = 0 on r = r o since p = 0 inside the bubble (and E is continuous across the interface). But
(4.23) shows this to be impossible with the present approximate treatment. However,
we do have the freedom to choose f2 to match the pressure boundary conditions as
well as possible. In fluidization theory the procedure used by Davies and Taylor
(1950) for ordinary bubbles is usually adopted. In the present instance this comes
down to setting ~2h/~@2 = 0 at r = r o, 8 = O, whence f2 = 4/g (see also
Batchelor 1967). The argument for this is that h and ~h/~8 are already zero at
r = to, 8 = 0, and we would like to extend the region where h is very small as
far as possible. Let us adopt this choice; (Any other choice of this type would
also give a value of f of order unity. For example we might mlni~zethe inte-
gral of h 2 over the surface r = r .) Thus we have an estimate of the speed of o
rise of the bubble which can also be used to see the magnitude of the distortion of
the spherical hole by the dipole force. For the latter purpose we may simply com-
pute the surface on which h = 0 with
= 2 (4.24) v ~(gro)l/2.
For ~ = 0 this surface is the dotted line indicated in the Figure above. The
distortion of the hole is caused by the need to balance the fluld-dynamical pressure
~2/2 and it represents a problem which is also encountered in the theory of ordi-
nary gas bubbles in liquids (Moore 1959). As long as appreciable speeds occur next
to the bubble this difficulty arises. In an actual fluidization bubble the problem
is resolved by the formation of an indentation at the rear of the bubble. The in-
dentation fills with particles which effectively move with the bubble. This feature
has to be built into the theory in a self-consistent way.
With the present estimates a second problem arises, namely that for r >> r o and @ > 0 we encounter a region of negative h when ~ > 0. This difficulty does
not arise in fluldization theory since that subject is confined to ~ = 0. We
therefore have no experimental guide to the meaning of this result. There are some
speculations that might be offered here but perhaps the message is simply that bub-
bles only occur when ~ is very close to zero.
281
Now it is evident that the foregoing discussion does not really provide an
acceptable theory. It might be different if photon bubbles were an observed phen-
omenon that we were trying to understand qualitatively. But the real question is
whether photon bubbles actually exist and the answer will almost surely have to be
given experimentally. In spite of these worries, I would llke to close this
theoretical discussion of bubbles with one further qualitative remark about what
may happen at very large optical depths.
The total radiant energy density includes the usual energy density plus the
pressure, hence it is ~E. The energy flux divided by this energy density gives
a speed to be associated with the radiant fluid.
F
3 When ~ exceeds some critical value %T , V exceeds v R, and the bubble is
moving faster than the radiative fluid. In that case, the radiation does not ad-
Just quickly to the matter. Rather, we may expect the radiation associated with
the bubble to be swept along with the bubble, much as in the corresponding case
Of fluldlzatlon where one sees a trapped cloud of fluid circulating in and around
the bubble. When this occurs, I expect that photoconvective transport should be-
come very efficient. The optical depth at which this occurs is given approximately
by
T ~ lO 24/9 I~eff] 4/~ • [i--~5 j
V. CONCtUSION
The main questions considered here have to do with the nature of photoconvec-
tlon and the conclusion which is tentatively adQpted is that the two-fluid nature of
the process may make for some qualitative differences from basic Boussinesq convec-
tion. I have tried to sketch how photon bubbles may behave in a~alogy with fluidi-
zation bubbles. The analysis is sufficiently simple that one can easily see what
is going on, but there is one point about the results that I want to emphasize.
The bubble is not simply held open by an excess of radiation pressure inside it.
The radiative force is vital to the process and this is proportional to the flux.
The figure in ¶IV is helpful in seeing how this works: flux converges onto the
bubble from below and diverges upward from the bubble. This forces the fluid
flowing by the bubble to go around it which in turn causes the hole which produced
the flux pattern in the first place. This seems to be a dynamically consistent
situation, whether or not the equations have been completely solved. Whether the
thermodynamics of radiation interacting with matter (and which has not been dis-
cussed at all properly here) can spell the picture, seams difficult to decide, and
that is, to me, the biggest question to be faced at present. But i£ we put doubts
282
aside for now we may imagine some astroDhysicall7 interesting aspects of photon
bubbles.
The generation of large amplitude, complicated velocity fields in hot
stellar atmosphere is one of these. Another is suggested by the rapid separation
of particles of differing properties in bubbling, fluidlzed beds. In this process,
called elutriation, particles of relatively large drag are carried up through the
bed by the bubbles. Similarly we can imagine that particles with large scattering
cross section may be carried swiftly through stellar material By photon bubbles.
Moreover, there are some interesting consequences involved when bubbles collapse
near a stellar surface. The heatlngmay cause hot bursts of radiation (as
J. Pringle has suggested) or radiation of acoustic and shock noise. Also, non-
spherical collapse could squirt matter off the stellar surface at high speed, as a
preliminary computation by J. Theys confirmS.
But these are presently speculative topics and more immediate aimS should
also command attention in this subject.
We need a more complete stability theory, a study of flnlte-amplltude
stability, and some numerical simulation. In this respect, we should be aware of
related work on high density plasmas (Estabrook, Valeo, and Kruer, 1975), though
much of what I have said here leaves out plasma kinetic effects and assumes rela-
tively low density s such as is encountered in stars.
I should like to conclude by acknowledging my indebtedness to the many
people whose remarks have influenced aspects of the presentation and to list Just
a few of them: S. Childress, L.B. Lucy, K.H. Prendergast, and J.C. Theys. I am
~rateful to G. Baran for running his contour routine. And flnslly, I thank the
National Science Foundation for supporting the work reported here under Grant
NSF PHY-7505660.
REFERENCES
Batchelor, G.K. 1967, An Intrqductign t O Fluid Dyqamlcs, Cambridge Univ. Press, p. 475
Berthomleu, J. Provost, A., and Rocca, A. 1976, Astron. and Astrophye., 47, 413 Chandrasekhar, S. 1939, "An Introduction to Stellar Structure " Davies, R.M. and Taylor, G.I. 1950, Prec. Roy. Soc. Lon. A, 200, 375 Estabrcok, K.G., Valeo, E.J. and Knuer, W.L. 1975, Phys. Fluids, 18, 1151 Hearn, A.G. 1972, As tron. and Astrophys., 19, 417 Ream, A.G. 1973, Astren. and Astrophys., 23, 97 Hsleh, S.-H. and Spiegel, E.A. 1976, Ap. J~., 207, 244 Huang, S.-S. and Struve, O. 1960, Stellar Atmospheres, J.L. Greenstein, ed.,
Univ. of Chicago Press, p. 300 Jackson, R. 1971, Fluidization, J.F. Davidson and D. Harrison, eds., Academic Press,
p. 65 Joss, P.C., Salpeter, E.g., and Ostrlber, J.P. 1973, Ap, J., 191, 429 Moore, D.W. 1959, J.F.M., ~, 113 Prendergast, K.H. and Spiegel, E.A. 1973, Co mmentsA p. Space Phys,, ~, 43
283
Reimers, M.D. 1976, Physique des Mouvements dans. les Atmospheres Stellalres, R. Cayrel and M. Steinberg, eds., C.N.R.S., p. 42]
Rowe, P.N. 1976, Fluidization, J.F. Davidson and D. Harrison, eds., Academic Press, p. 121
Simon, R. 1963~ 3. QuaBt ~ Spectr0sc. Radiat. Transfer, ~, 1 Spiegel, E.A. 1976, Phys~iquedes Mouvements dans les Atmospheres .Stellaires,
R. Cayrel and M. Steinberg, eds., C.N.R.S., p. 19 Thorns, V.A. 1973, Report RC59, Dept. of Computer Science, Univ. of Reading Wegener, p.p. and Parlange, J.-Y. 1973, Ann. Rev. Fluid Mech. , ~, 79 Wentzel, D.G. 1970, Ap. J., 160, 373
CONVECTION IN THE HELIUM FLASH
A. J. Wickett
Department of Astrophysics and Nuclear Physics Laboratory,
University of Oxford
Abstract
The evolution of a star through the helium flash depends upon uncertain aspects
of convection theory. Observations place some constraints on the theory of convection
in stellar cores.
Evolution of Stars in the Mass ranse 0.6 M@ to i~ M@
When stars finish burning hydrogen to helium in the core, the burning region
moves outwards to form a shell source. Matter moves inwards through the burning region
into the core which becomes progressively hotter and denser.
As the density of the core increases, the electron gas becomes degenerate. The
hydrogen outside the hydrogen-burning shell forms a diffuse convective envelope of
low density and great spatial extent (radius ~ 5xlO10m). The star is a red giant.
Figure 1 shows the structure just before the helium flash.
The site of the Helium Flash
In the eorejthermal conduction by degenerate electrons is efficient and there
are no heat sources before helium burning starts, so to a first approximation the core
is isothermal. However heat is lost from the densest part of the core by the plasmon
neutrino process
Y plasmon ~ ~ + 9"
A slight temperature inversion therefore appears, which causes helium burning to ignite
some distance (- D.1 M@ ) from the centre. The peak temperature T m " 108K.
285
9.0
8,0
7.0 log p
6.0
(kg m-3
5.0
~1.0
3,0
2.0
1.O
0.0
-1.O
-2.0
-3.0
-4.0
6.0
7.o
log T
6.O
5.O
4.O
m
m
m
m
m
m
m
I .... i i i I ! | ....
- j
, , , I I ...... I I I I I 0.i 0.2 0.3 0.4 0.5 M 0.6 0.7
Me Figure !. Temperature and density profiles for star of 0.75 M@ and zero-age composition X = 0.7, Z = 0.0004 at start of helium flash
286
The rate of the reaction
34He ~ 12 C + Y
(the 3~ reaction) is proportional to about the 40th power of the temperature. Since the
degenerate electron gas can take up much heat without a large pressure increase, a
thermal runaway can occur.
Quasistatic evolution through the Helium Flash
The energy generation rate in the hellnm-burning shell is great enough to require
a convective zone outside it to transport the heat away. The peak temperature rises
a~d the convective zone extends; the rest of the star adjusts its structure adiabat-
ically. When the electron degeneracy is lifted in the burning region~expansion takes
over and the peak temperature falls. Figure 2 shows how the structure of the centre of
the core changes through the flash.
Calculations with an accurate equation of state (Thomas 1970, Demarque and Mengel
1971, Zimmermann 1970, Wickett 1976) show that if, in the convection zone, the temper-
ature gradient is adiabatic or slightly superadiabatic as prescribed by the standard
mixing-length theory and if the convective zone does not reach further towards the
centre than the initial helium ignition zone, the burning timesoale at the peak of the
flash is large compared with the convective timescale. Thus the traditional formulation
of convection theory leads to a quasistatic helium flash.
Convective Uncertainties
Edwards (1975) shows however that the traditional convective stability criterion
3tar < adiabatic ~> unstable
is modified in the presence of a very temperature-dependent energy generation rate.
Even a positive gradient (temperature rising with radial coordinate) is unstable if,
as is the case here, the reaction rats depends sufficiently strongly on the temperature.
It is not clear exactly how this oonclusion affects the evolution; however a calc-
ulation was performed in which the convective zone was extended to the centre and the
temperature gradient was the adiabatic one. The formulation is more fully described in
the author's thesis (1976). At the peak of the flash, the burning time
8,.2
log
T
8.1
8.o
7.9
7.8
7.7
7.6
6.5
Figure 2.
...... I .
....
....
....
....
..
t ......
.....
I i
~odel
.....
/Will
'
I .
..
.
I I
7.0
7.5
8.0
log~
(kg ~3) 8.5
Temperature-density plots for core of star of figure i
during helium flash
r
9.o
288
T T n=~
for constant density and no heat flow at the hottest point was equal to the convective
timescale
i c
T C U " C
U and 1 are the convective speed and mixing length evaluated by the usual theory. o c
The time for thermal runaway is considerably less than T n as the burning rate rises
sharply with temperature. So convection cannot keep up with the rate of increase of
heat production. Thus if this picture of the evolution is correct a thermal runaway
and explosion must occur.
D~namicCalculation
A hydrodynamic code including gravity and nuclear reactions for a spherically
symmetric model was used to follow the explosion.
A spherical detonation wave with peak temperatures ~ 2.5 x 109K and propagation
speed l0 times the sound speed in front and twice that behind passes through the core.
Half of the helium burns to neon (or species of similar mass number) releasing
~ 5 x 1043j. The entire star is disrupted with energy typical of a supernova of type I.
Observational Constraints
We assume that the 108 Galactic globular sluster stars (Allen 1973) have masses
distributed according to Salpeter's (1955) mass function
dN = M. -1"35 d in M.
with a lower limit of 0°l M@. The variation of age at helium flash, T hf" with mass
din T hf - - - -5.31 dlnM,
was calculated using Eggletonls (1971, 1972) code. For age i0 i0 years we are interested
in N, = 0.92 M® and find that Galactic globular cluster stars now undergo 5.5 x l0 "13
helium flashes per star per year or, 5 x lO -5 per year for the whole Galaxy.
289
Since one would expect to see a supernova remnant for l06 years (Woltjer 1972)
and no supernovae remnants or gas of any sort is seen in globular clusters, we con-
clude that the supernova rate is less than l0 -l# per star per year. So there is an
Upper limit of one supernova for 50 helium flashes.
Conclusions
It is not clear how to calculate the temperature gradient or the convective time-
Scale when the energy generation rate is very temperature-dependent. It is however
clear that the traditional prescription does not work.
As to the helium flash supernova model, the observations allow 2% of helium flashes
to be that violent. If this is the case then the small fraction might arise either from
rather narrow ranges of mass, rotation etc. being faveurable or, in some ways more
appealingly, from fluctuations in the convection. This implies that the evolution of
two stars with the same gross properties may be substantially different because of the
random nature of the convective process.
Acknowledgements
This work was supported by the United Kingdom Atomic Energy Authority and the
Science Research Council.
References
ALLEN, C. W., 1973. Astrophysical Quantities, 3rd ed., Athlone Press, London
DKMARQUE, P. and MENGEL, J. G., 1971. Ap. J. 164, 317
EDWARDS, A. C., 1975. M. N. R. A. S. 173, 207
EGGLETON, P. P., 1971. M. N. R. A. S. 151, 351
EGGLETON, P. P., 1972. M. N. R. A. S. 156, 361
SALPETER, E. E., 1955. Ap J. 121, 161
THOMAS, H.-C., 1970. Ap Space Sci. 6, 400
WICKETT, A. J., 1976. The Hydrodynamics of the Helium Flash, D. Phil. thesis,
University of Oxford
WOLTJER, L., 1972. Ann. Rev. Astr. Ap 10, 129
ZIMMERMANN, R. E., 1970. .The Hydrodynamics of a HeliumShell Flash in a Star of one
Solar Mass, Ph. Do thesis, University of California, Los Angeles
WAVE TRANSPORT IN STRATIFIED, ROTATING FLUIDS
M. E. McIntyre
Department of Applied Mathematics and Theoretical Physics,
University of Cambridge
SUMMARY
Momentum and energy transport by buoyancy-Coriolis waves is illustrated Dy means of a simple model example. The need for careful conslderation of a complete problem for mean-flow evolution is emphasised, especially when moving media are involved. Then a recent generallsation of the wave-action and pseudomomentum concepts is introduced, and used to exhibit in a very general way the roles of wave dissipation, forcing, or transience in the mean flow problem, for a certain class of "nearly-unidirectional" mean flows. This class includes differentially-rotating stellar interiors, which could well be systematically changed by wave transport of angular momentum. Similar results hold £or MHDand self-gravitating fluids. Finally the physical distinction between momentum and pseudomomentum is discussed.
i. INTRODUCTION
Some of the most spectacular natural manifestations of wave transport
effects are those believed on the basis of recent evidence to Occur in the
stratospheres of Earth and Venus I-4. Closely analogous effects appear
likely to influence the evolution of the rotation of stellar interiors 5,
and to be important in other astrophysical contexts 6. They are often
associated with rather complicated kinds of low-frequency fluid-dynamical
waves, in which buoyancy and Coriolis forces are essential. The waves set
up a "radiation stress" whereby the mean azimuthal velocity at one height
and latitude can undergo systematic acceleration at the expense of a
corresponding deceleration at a more or less distant location. Thus
transport of angular momentum by the waves is involved. This transport can
result in drastic changes to the pattern of differential rotation (which
in turn can drastically affect the wave propagation and lead to some
interesting feedback effects~.
291
An idealisation illustrating this kind of "radiation stress" phenomenon
is the model problem suggested in figure i. Inertio-gravity waves, which
are the simplest type of buoyancy-Coriolis wave, are being generated by a
slippery, corrugated boundary moving parallel to itself with constant
velocity c:
C
FIGURE I. Inertio-gravity waves being generated in a stably-stratified fluid (specific entropy increasing upward) by a rigidly-moving boundary. The frame of reference is rotating with constant angular velocity~= (O,O,A) and the effective gravity g = (O,O,g) = constant. (It can be shown that the waves have their crests or'lines of constant phase sloping forward, unlike the sound waves which might be generated if the boundary moved much faster.)
Here the Cartesian x direction plays the role of the azimuthal direction,
and the mean state is assumed independent of x. The mean pressure gradient
has no x-component; thus the fluid is free to accelerate in the x direction
in response to the radiation stress.
If the waves are being dissipated in some layer L at the top of the
picture, there is a systematic tendency for the mean flow to accelerate
there. So the wave-drag force which the boundary exerts on the fluid is not
felt at the boundary, as far as the mean flow is concerned; it is felt at
L. This is a typical radiation-stress effect.
If the waves were generated not at a boundary but by a moving system
of heat sources and sinks in some layer in the interior of the fluid, then
total momentum would be constant, and the mean acceleration at levels where
the waves are dissipated will be accompanied by a corresponding deceleration
where they are generated 3'4. The close connection between mean flow changes
and wave dissipation or forcing can be verified by detailed solution of
the appropriate sets of equations, but is not usually obvious from the
equations themselves.
Where the waves are dissipated will depend not only on the physics of
292
the dissipative processes but also upon the solution of the wave propagation
problem for the particular mean-flow profile involved. Other things being
equal, we usually get enhanced dissipation of the waves in places, if any,
where their intrinsic frequency (i.e. frequency in a frame of reference
moving with the local mean flow H ) is Doppler shifted towards zero - that
is, we tend to get enhanced dissipation near an actual or virtual "critical line "8 5(y,z) = c.
In this review I shall pay particular but not exclusive attention to
the class of problems exemplified by figure i. Their characteristic feature
is the existence of a coordinate x (cartesian or curvilinear) such that mean
quantities are independent of x, and "mean" can be defined as an average
with respect to x. Such problems, which I shall call "longitudinally
symmetric" happen to comprise an area of recent advances, and also serve
to illustrate some of the subtleties and pitfalls which can arise in thinking
in general terms about the transport of conservable quantities such as
energy and momentum by waves in material media,and most particularly waves
in moving media. (We are, of course, dealing with moving media par excellence
as soon as Coriolis forces are relevant.) In section 2 some of these points
are illustrated by describing in more detail what happens in the problem
of figure i. Most of the phenomena encountered can be found in one or other
of several related problems which have been discussed in the literature by Eliassen 9, Phillips IO, Matsuno II, Uryu 12, Grimshaw 13, and others.
In sections 3 and 4 I turn from illustrative example to general theory,
and survey some very recent developments which appear to be of quite wide significance, but which have proved to be especially powerful for longitud-
inally symmetric problems. A simple yet very general version of the
"wave-action" concept is involved, resulting from a synthesis and extension
of ideas from "classical field theory "14 and the more recent work of Eckart 15, Hayes 16, Dewar 17, and Bretherton 18. Equally relevant is the
pioneering work of Eliassen & Palm 19 and Charney & Drazin20; and a related
but not identical line of development is contained in the work of Soward 21 on the Braginskii dynamo problem. A remarkable feature of the general
results is that they enable useful statements to be made without requiring
validity of approximations of the "slowly-varying wavetrain" type and attendant concepts like "group velocity'. Also, they can be developed for
finite-amplitude waves 22. Their special value in longitudinally-symmetric
problems is that they lead to ways of expressing the problem for the
mean-flow changes which do directly exhibit the abovementioned general connection between those changes and wave dissipation or forcing 22-25.
Finally (section 5) I shall make some remarks about that elusive entity,
or rather, nonentity, wave "momentum'.
293
2. MORE ABOUT THE PROBLEM OF FIGURE 1
2.1 Equations
The simplest relevant set of model equations is the usual set for a
Boussinesq, incompressible, stratified fluid in a rotating £rame of
reference, with constant angular velocity JL :
~,t + ~'~ + 2J~^B + polyp - e~ = -X (2.1a)
8,t + U.V8 = -Q (2.1b)
V.u = O . (2.1c)
Here u = (u,v,w) is velocity, ( ),t stands for 6( )/6t, ~ is the unit vector
(O,O,i), e is the buoyancy acceleration given by minus the effective
gravity-plus-centrifugal acceleration (assumed constant in this model)
times the fractional departure of the density from its constant reference
value Po" For descriptive purposes we shall think of 8 as a measure of
temperature or potential temperature. The departure of the pressure from
the hydrostatic value associated with Po is denoted by p . The terms X =
(X,Y,Z) and Q may be thought of as representing arbitrary body forces and
~eating, which may or may not be functionally related to the fields of motion
but which in any case will be zero if the waves are neither dissipated nor
generated internally.
2.2 Excess momentum flux, and the mean-flow problem
Let an average with respect to x be denoted by an overbar: for instance
the mean velocity in the x-direction in figure 1 is 5(y,z). If we average
(2.1a) and make use of (2.1c) the result may be written in suffix notation
(i,j=i,2,3) as
ui,t + {uiuj + ~ij},j ÷ (2~A~)i - ezi = -{u[u~},j - xi " (2.2a) Po
(It will be convenient in what follows to use (x,y,z) and (u,v,w)
interch___angeably with (Xl,X2,X 3) and (Ul,U2,U3).) Here ( )" is defined as ( ) - ( ) , the departure from the mean, and (),j means d( )/6xj . Eq.(2.2a)
contains mean-flow quantities only, except for the term involving the
Reynolds stress uiu j . The equation tells us that uiu j is the excess mean ...... > i momentum flux due to the waves. Note that uiu j is a wave property, by which
I mean something which can be self'consistently evaluated as soon as you know the linear wave solution, i.e. when you know the fluctuating quantities ( )" to leading order.
294
It might be tempting to conclude that nothing more need be said: Eq.
(2.2a) states that the momentum transport by the waves is equal to uiu j ;
so "obviously" -ui~ is the stress whose divergence will give the mean
acceleration u,t, or at least the contribution to this acceleration
attributable to the waves. The average of Eq.(2.1b), namely
8,t + {ujS},j = -{u~8"},j - Q , (2.2b)
is irrelevant, one might think, because how, after all, can the excess heat
flux u{8" due to the waves affect momentum transport?
This conclusion would, however, be wrong (for reasons to appear
shortly), and the fact that it has appeared in the past literature
illustrates the dangers of "incomplete reasoning" about wave transport
effects on the basis of superficial consideration of a relevant-looking wave
property - in this case the excess momentum flux u~u~. Another illustration
will be encountered in section 2.6. In fact the only safe general recipe
for getting a self-consistent picture is to include a conslderation of the
complete problem for the mean flow and its solution correct to second order
in the wave amplitude a. In the present example, the wave properties u[u~
and ~ appear as forcing terms in the mean-flow problem; and both turn
out to play essential roles.
The result o£ averaging (2.1c) is
~.~ = 0 , (2.2c)
and this completes the set of equations, (2.2), for the mean quantities
and 8 . To obtain a well-determined model problem it is simplest to suppose
that the flow is bounded laterally by a pair of vertical walls y = O, b
on which the normal component of velocity vanishes, implying that
= O on y = O, b . (2.3)
We must beware, however, of assuming that ~ vanishes at z = O; in fact for
a rigidly-translating, corrugated boundary whose shape is described by a
given function h ,
z = h(x-ct, y) , (2.4)
where h=O(a), h=O, and c is a (real) constant, it can be shown that
+ O(a 3) at z = 0 . (2.5) = (v'h),y
This illustrates the fact that ~ , which is an average along a horizontal
line such as ~ in figure I, can represent a vertical mass flux, into or out
of the thin region betweeen ~ and the actual boundary, which is continuous
with a horizontal, O(a 2) mass flux within that region, associated with any
tendency for the disturbance velocity to be one way along troughs and the
other way along ridges in the boundary.
295
In fact, such a tendency turns out to be the rule rather than the
exception when Coriolis effects matter; for instance if h is of the form
a sin k(x-ct) then v" for conservative, plane inertio-gravity waves on a
uniformly stratified basic state of rest turns out to be exactly in
quadrature with w" and therefore exactly in phase with h at z=O. This can
easily be verified by setting 8,z = constant, 5 = 8,y = 0 , and X=O, Q=0,
and calculating the elementary plane-wave solutions ~ exp i(kx + mz -~t)
of the linearised disturbance equatlons derived from (2.1) (namely (3.2)
below). Other pertinent features of such plane-wave solutions are that 8",
being proportional to the vertical displacement through the basic stable
stratification 8 is (like h at z=O) in quadrature with the vertical ,Z '
velocity w'; also incompressibility dlctates that u" is in phase with w',
since (2.1c) implies iku" + imw" = O . Thus u'w', v'8" are nonzero, and v'w',
zero, in a plane inertio-gravity wave. The frequency of such a wave ~,
(= kc), satisfies the dispersion relation
~2 = (~ zk2 + 40~2m2)/(k2 + m 2) (2.6)
when H = O. (It should be noted that this implies that c 2 must lie between
4jl2/k 2 and O,z/k 2 for the inertio-gravity waves to be generated.)
2.3 Solution
I shall now describe, for the simplest relevant example, the result of solving the O(a 2) mean flow problem; ~ and Q will be set to zero, so that
we are talking about the effect on the mean flow of the waves alone. The
waves are supposed to have propagated upwards as far as L either because
they are being dissipated there or because a finite time has elapsed since
the bottom boundary started moving. Well below L we can take the waves to
have reached a steady state and the moticn to be conservative - we assume
that X'and Q'are zero there as well as ~ and Q. To keep life as slmple as
possible we shall assume that H = 0 initially, and follow its evolution as long as it can be considered to be O(a2). We also take e,z = constant +O(a 2)
for the moment.
The simplest kind of mathematical analysis for the waves (we omit the
details, since the results of section 4 will supersede them) makes the usual
kind of "slowly-varying" approximation, in which the plane wave solution
is locally valid. This involves inter alia an assumption that the layer L
is sufficiently deep compared with a vertical wavelength. We also take h
to be of the form a.f(y).sin k(x-ct) , where f(y) is a sufficiently slowly-varying function (which we assume vanishes at y=O,b). Then by the
properties of plane inertio-gravity waves previously mentioned, the important term on the right of the x-component of (2.2a) is -(u'w') and
Fz that on the right of (2.2b) is -~SVT,y . The remaining terms are not of
course exactly zero, because plane waves represent only the leading
approximation; but in fact it is consistent to neglect them. The response
of the mean flow to the forcing -(v'8"),y together with the forcing
represented by the inhomogeneous boundary condition (2.5) involves a mean
"secondary circulation" indicated schematically by the arrows in figure 2.
296
The picture assumes that the wave amplitude is a maximum near y=~band falls
monotonically to zero on either side, so that (~) changes sign once, ,Y
near y=~b. The mean flow feels an apparent "heating" on one side of the
channel, and "cooling" on the other (about which more will be said in section
2.5). This gives rise to an O(a 2) mean vertical velocity~which beautifully
satisfies the boundary condition (2.5) and, by Eq.(2.2c) , demands a mean
motion across the channel, i.e. a contribution to V, in the vicinity of the
layer L where the wave amplitude goes to zero with height.
"Z- f
1 WAVES
i
I
!
t
I
!
w
!
t
t
FIGURE 2. Left: end view (looking along the x axis) of the problem of figure i. Right: typical profile of the mean acceleration in the longitudinal or x direction. The left-hand picture indicates how the secondary circulation ~, ~ is closed by a mass flux "in the bottom boundary', associated with a positive correlation between the disturbance y-velocity, v', and the depth -h of the corrugations in the boundary.
The Coriolis force associated with this O(a 2) contribution to ~ accounts
for a contribution to H t which is generally comparable with that from the
Reynolds stress dzvergence-(u w ),z zn the x component of Eq.(2.2a). In fact
the two contributions, in the present simple problem, can be shown to stand
approximately in the ratio
-6s 2 Reynolds stress divergence _~ Coriolis force associated with wave heat flux 4_Q- 2
. (2.7)
The two contributions are comparable in magnitudewnenever the Coriolis term
is significant in the dispersion relation (2.6); indeed if k2<<m 2 in (2.6),
~2 e 4~2 and the two contributions are almost equal and opposite. In that
297
case an estimate of effective momentum transport from the Reynolds stress
-~-~walone could be too large by an order of magnitude. (It is always wrong
in order of magnitude in another example, namely that of quasi-geostrophic,
vertically propagating Rossby waves II'12 - in which case it is too small,
by a factor of order the Rossby number.) Only when both contributions are
accounted for will the calculated total rate of change of mean momentum
u,tdydz agree, as it must, with minus the horizontal wave-drag force F
exerted by the fluid on the lower boundary. F is defined correct to second
order as the integral with respect to y of
= P'h,xlz= O + O(a3) . (2.8)
This agreement (the detailed verification of which is omitted here) provides
a useful check on the correctness of the overall picture.
2.4 Lagrangian-mean flow and "radiation stress"
There are crucial differences between the foregoing picture, which is
based on Eulerian averaging, that is averaging at fixed values of (y,z),
and the same problem solved using a Lagrangianmean (definable approximately
as the mean following a fluid particle). It turns out that the Lagrangian-
mean secondary circulation is negligible sufficiently far below L. In a
region of steady waves, when Q=O, the fluid partlcles merely oscillate about
a constant mean level, and have no systematic tendency to migrate up or down.
This is no more than mlght be expected for adiabatic motion in stable
stratification. So in a Lagrangian-mean description there is no secondary
circulation linking the regions of wave generation anddlsslpation, and thus
no "Coriolis" contribution to the effective transport of momentum by the
waves. The analogue, in the Lagrangian-mean momentum equation, of the
Reynolds stress - ~ in the Eulerian-mean momentum equation (2o2a), thus
gives a more direct description of the momentum transport, as was recognised
by Bretherton 18,26, who suggested that it be identified as the radiation
stress. Its xz component, taking the place of -u'w r in the present,
Eulerian-mean description, is equal to the "wave-drag" force in the x
direction per unit area across amaterial surface whose undisturbed position
is a plane z = constant - which force is evidently the same as (2.8) when
z=O. These ideas have been further developed by Grimshaw 13 for the
slowly-varying case, and an exact "generalised Lagranglan-mean" descrip-
tion for arbitrary, finite-amplitude waves has been developed by Andrews
& McIntyre 22. The reader is referred to those papers for more discussion
of the differences between the Lagrangian and Eulerian-mean descriptions,
and to Bretherton 26 for a sufficient physical explanation, in terms of the
average Coriolis force on a thin piece of fluid bounded above by a corrugated
material surface and below by a flat, "Eulerian" control surface, as to why
the corresponding momentum fluxes differ in general; see also (4.7) and
(4.9) below.
2.5 More details about the Eulerian-mean secondary circulation
Returning to the example of figure 2, we describe in a little more detail
298
how, in the present description, the forcing term - ~ , y "gives rise"
to the Eulerian-mean vertical velocity W, since this will help motivate the
more general analysis of section 4. In the region below L, where we may
suppose the waves to be in a steady state and not dlsslpating (X'=O, Q'=O),
the forcing term -(v'e~,y is in fact balanced mainly by ~ 8,z on the left
of (2.2b), when we rewrite that equation with the aid of (2.2c) as
8,t + ~ 8,y + ~ 8,z = -{v'S'),y . (2.9)
The term ~ e ~, is O(a 4) and therefore negligible, because ~ is O(a2), and
e,y is also O(=a 2) for the following reason. We have taken ~=O(a2), which
implies that 8,y=O(a 2) because the x component of the curl of (2.2a) gives
(when X=O and ~,~ are O(a2))
~,y = -231U,z + O(a 2) . (2.10)
(This is known in geophysical fluid dynamics as the "thermal wind
equation" ) The other term 8 t on the left of (2.9) turns out to be negligible
unless we are within a distance of order the "Rossby height"
H R = 2~b/(e,z)%
£rom the layer L where the waves are unsteady or dlssipating 9. This point
will be further explained in section 4.6 below. Thus, sufficiently far below
L we have a balance between W~,z and -(~,y , which implies that
~ - ~ y , (2 ii)
where we have defined
= v'8"/(O,z) , (2.12)
again using (2.10) and our temporary assumption that H=O(a 2) in order to
neglect e,y z .
There is another didactic point to be made here, incidentally: it has
o£ten been assumed in the literature, for instance in connection with
thermodynamic arguments, that the nonzero value of ~-~ signifies a tendency
for the waves to transport heat across the channel. Even more than with , this is true but misleading. There is no tendency at all for the
mean temperature to rise on one side and fall on the other, if we are
sufficiently far below L. The adiabatic heating or cooling associated with closely compensates the divergence of ~ This compensation is
intrinsic to the nature of the wave motion, as is underlined by the already-mentioned consideration that indlvidual fluid particles are not
being heated or cooled below L, because the motion is adiabatic there.
The right-hand half of figure 2 schematically indicates the profile
o£ the mean acceleration H If the layer L is shallower than the Rossby It" height H R, then additional contributions to ~ and ~arise in a layer of depth
H R centred on L. These adjust the values of e,t and 5,t in such a way as
299
to keep the thermal-wind equation (2.10) satisfied; there is
a circulation only in a layer of depth H R (see Eq.(4.12) 'room" for such below).
It turns out that Eq.(2.11) still holds for mean flow profiles H(y,z)
and 8(y,z) which vary sufficiently slowly and are such that the Richardson number ~,z/(U,z )2 is large. Then
V e ~,z (2.13)
can be significant below L. The associated Coriolis force does not, however,
lead to an acceleration of the mean flow well below L, because it turns out
that it is always cancelled by an equal and opposite Reynolds stress
divergence if the waves are steady and conservative. The fact that this
cancellation must take place will be seen as a corollary of the much more
general results to be described in section 4.
2.6 Energetics
Our simple example is also quite instructive as regards questions of
energy transport. Suppose that the waves are dzsslpating in the layer L and
heating the fluid there. (The amount of heat involved does not change
significantly within the Boussinesq approximation, but that is beside the
point.) What is the source of this energy? Obviously, the work done by the
agency moving the bottom boundary. How does the energy get from the boundary
up to L ? Answer: there is a vertical energy flux p'w" due to the waves. Indeed, the rate of working by the boundary
-~C = -cp h,x z=O = P w z=O ' (2.14)
in virtue of (2.8) and the fact that w'= -ch
here are wave properties. ,x" All the quantities involved
What we must not forget, however, is that this simple picture, while
correct to O(a2), depends crucially on the circumstance that 5 is zero, apart
from the O(a 2) contribution indicated in figure 2. If we look at the
similar problem in which the boundary is brought to rest
and the fluid is moving past it with velocity H = -c + O(a2), the picture
is quite different. Clearly the boundary can now do no work. The source of
energy is now the kinetic energy of the mean flow near L, whose density is changing at a rate Po times
(~u2),t = UU,t =-cu,t + O(a4) • (2.15)
The integral of (2.15) over the yz domain is, indeed, equa I to Fc, by the remarks above Eq.(2.8).
I n t h i s p r o b ] e m there is no need for the waves to transport
any energy into L at all; and indeed it does turn out that there is no net transport, despite the fact that p'w" is still the same as before. Note first
that in the region of steady, conservative waves below L, the work done
300
across a material surface corrugated by the waves is obviously zero, because
in the present p r o b ] e m such a surface is immobile. Alternatively,
in an Eulerian-mean description of the energy budget correct to O(a2), based
on the model equations (2.1) and (2.2), the total energy tlux across a
horizontal control surface contains two terms which combine to cancel the
contribution Ip'w'dy . The first comes from the O(a 2) part of the advection
by w" of the leading contribution ~po(U + u') 2 to the total kinetic energy:
PoW'(~52 + 5u" + ~u "2) = po H u'w" + O(a 2) . (2.16)
(This contribution to the total Eulerian energy £1ux has been drawn
attention to in the literature just about as often as it has been forgotten
aboutL) The second contribution is the mean pressure-working f~dy
associated with the y-dependent part of the mean pressure whose gradient
balances the Coriolis force associated with H .
The general conclusion to be drawn is that, whenever moving media are
involved, we must expect that a solution to the O(a 2) mean flow problem will
be essential to a self-consistent picture of the way in which waves
contribute to the energy budget. We must also remember that, as always, use
of the energy concept requires us to pay attention to £rames of re£erence!
There is no such thing as "the" net energy transport due to the waves; and
the transport can be identified with p'u" only if the medium is everywhere
at rest.
None of this affects the quite separate fact that the wave property
p'~" is the quantity usually related to the group velocity (when that concept
is applicable). It usually turns out that
p'u" = Ex(group velocity relative to the local mean flow) (2.17)
for plane waves, where E is intrinsic wave-energy density, a wave property
which in the present problem takes the form
E =]~po ( u.2 + S.2/~,z) (2.18)
(The second term is the "available potential energy "27 associated with
vertical displacements of particles in the stable stratification; there is
no internal energy because our model assumes incompressible flow. Brether-
ton & Garrett 28 have analysed the idea of "wave-energy" as a physical concept
in some depth, and in particular have established condltions under which
E can be uniquely defined in a general manner independent of the mathematical
formulation of the wave problem. I shall not go into that here except to
say that, roughly speaking, E is the work you would do in setting up the
disturbance, in a frame of reference in which the mean flow is at rest -
which clearly makes approximate sense in problems of moving media only when
the mean state varies sufficiently little over a wavelength.) I said that (2.17) "usually" holds, by the way, because there are some exotic cases such
as Rossby waves where the two sides differ by an identlcally nondivergent
term. However, in the case of our plane inertio-gravity waves the two sides
301
can be verified to be equal.
3. THE GENERALISED WAVE-ACTION PRINCIPLE
I deliberately avoided writing down the full conservation equations
for the energy budget, partly because the details get quite complicated for
all but the very simplest mean flow structures. Besides, I want to leave
space to introduce another kind of conservable quantity, generalised
wave-action, which apart from its wider significance will lead to a more
powerful approach to problems of the kind just dlscussed. This approach will
depend in no essential way upon any "slowly-varying" approximations.
Wave-action is an O(a 2) wave property which, in its most general form,
satisfies a conservation relation, apart from source terms in X" and Q',
for any mean-flow structure whatever. This remarkable property is to be
contrasted with the equation for wave-energy, whose "right-hand side"
contains a complex of terms representing exchange of energy with the mean
flow:
~E + ~. (~E + p'u') = 5t
. . . . . • - -
= po[-5i,juiuj - (~ij-zizj)e,iuj e - 2 , zt+UjS,zj )]
-Po[~'.~" + 8"Q'/8,z] • (3.1)
wave-energy fails to be conserved as soon as you have a moving medium. Eq.
(3.1) is just for the Boussinesq case, and is derived by dotting the
linearised versions of (2.1a) with PoU" and of (2.1b) with poe'/e,z and adding. We should always keep in mind that (3.1), as implied by the
discussion just given in section 2.6, represents only a part of the whole
energy budget. The linearised equations corresponding to (2.1) are
Dtu + u'.~7u + 2 NAu ÷ pj " - 6)" = " (3.2a)
DtS'+ u'.~z~ = -Q" (3.2b)
V.~" = O , (3.2c)
where the linearised material derivative
Dt( ) = ( ),t + ~.V( ) .
Here we shall define the generalised wave-action correct to O(a 2) only.
(It can be defined exactly, for finite-amplitude disturbances, once one has the generalised Lagrangian-mean description 22, but that is beyond the scope
of this review.) Two preliminaries are needed. The first is to introduce
the O(a) particle displacement field ~(x,t), which is defined to satisfy
302
~7.~ : O, ~ = O, and
Dt~ = u i , (3.3a)
where u ~ is the Lagrangian disturbance velocity
u" u = ~ + [ . V ~ ( 3 . 3 b )
For the problem of figure i, (3.3) simplifies, correct to O(a2),to
Ot~ = v', Ot~ = w', Ot~ = u" + ~u y+ ~U,z , (3.4)
where ~, ~, and ~ are the components of ~.
The second preliminary is a formal device (see Hayes 16 and Bretherton 25)
which is introduced for the sake of the greatest possible generality: we
reinterpret the Eulerian averaging operator ( ) as an ensemble average, over
an ensemble of wave solutions distinguished by a single, smoothly varying
parameter o~ . In a stochastic problem o~ would range over a "sample space';
but random waves are merely one possible case. In the determznlstic problem
of figure i, for example, we can generate a suitable ensemble just by
translating the boundary and the wave pattern a distance ~ in the x
direction. Then (--~ may be trivially re-defined in terms of an integral over
o~ rather than over x. For the axisymmetric mean flows important in
astrogeophysical applications the principle is the same but the details less
trivial 22'25. Quite generally, we nave the basic property
{( ),~} = {( )},~= 0 , (3.5)
whenever the ensemble of disturbance fields depends differentiably upon the
parameter c~ , which we shall take to be the case.
Instead of dotting the linear ised momentum equation (3.2a) with PoU ",
we now take its dot product with the derivative Po~,o~ and average. After some manipulation there results 22'23
~_ +()A ~7.B = -po( ~,o~ .X" + ~,o~ q') + O(a4) (3.6)
where
and
s so that q = O and
A =
B=~A+ t~p"
q" = - e" - ~.7g + 0(a3),
3.7a)
3.7b)
3.8a)
303
(3.8b)
Thus q" = O when Q" = O, i.e. q" = O for adiabatic motion. So A is conserved,
with flux ~ , whenever X" and Q" are zero.
In the derivation of (3.6), the property (3.5), and its corollary that
if f'(~,t,~) and g'(x,t,~) are any two disturbance fields then
f',~ g" = -f'g',~ , (3.9)
and ~ =O(a2). are needed a number of times. Also (2.2) are used, with
Eq. (3.6) corresponds when X" and Q" are zero to one of a class of exact
conservation relations pointed out by Hayes 16, which arise from replacing
certain space or time derivatives by ( ),~ in the classical %nergy- m omentum-tensor formalism 14. The relationship between these exact conserv-
ation laws and the adiabatic conservation laws discovered by Whitham 29 is
discussed in some detail in Hayes" paper. Hayes took the range of ~ to be
(0,27), which makes A unique and is convenient for applications to periodic
or almost periodic waveS, since ~may then be interpreted as phase. However,
it is convenient here not to fix the range of ~ , in order to leave a little
more flexibility in applications. Then A is defined only up to a
multiplicative constant.
Hayes called his conserved quantity the "action" irrespective of what
variational principle was used as the starting point. It can be shown 25 that
A (with O<~<2~) is equal to Hayes" invariant when the governing variational
principle is Hamilton's principle, expressed in its classical sense in terms
of the particle displacements I. A is to be carefully distinguished from
the other conservable quantities to which other varlational principles may
lead, via Hayes" modification of the energy-momentum-tensor formalism, and
which may or may not be wave properties. For example, Hayes" invariant is
not a wave property when the Clebsch-Herivel-Lin variational principle 30
is used as the starting point.
A, or more properly its generalisation to finite amplitude 22, repre-
sents the fundamental, exactly conservable wave propertywnich, in problems
of slowly-varying, conservative waves (X, Q both zero), reduces to the adiabatically-conserved wave-action whose physical meaning and precise
relation to Whitham's adiabatic invariants was elucidated by Bretherton &
Garret, 28. The connection between A and Bretnerton & Garrett's wave-action
can be made via the scalar virial theorem 22, i.e. the result of dotting the
momentum equation with ~ rather than with ~ (This reduces to
"equipartition of energy" in the non-rotating case.) Bretherton & Garrett's
wave-action is defined in those "slowly-varying" circumstances in which the
wave-energy E is uniquely defined, and is then equal to E divided by the
intrinsic frequency, or frequency in a frame of reference moving with the local mean flow, ~÷.
The result (3.6), or more generally (3.11) below, appears to have two
distinct types of appl~cation. One is the same as that envisaged by
304
Bretherton & Garrett, namely to computing the spatial and temporal
dependence of wave amplitude. The second application is to the calculation
of mean-flow evolution. Both applications depend on the fact that A and
are wave propertles, and it has been found in both appllcations that the required information is obtained, in at least some cases 23, from far less
computation than would otherwise be needed.
For reference we quote the corresponding result for a compressible fluid of density p and general equation of state
p = S(8,p) , (3.10)
where p is pressure and 8 potential temperature or specific entropy; 8still
satisfies Eq. (2.1c) in either case. The result is almost as simple as before
(again quite unlike the corresponding wave-energy equation):
[ .... 1 %y + v.~ = -P!,~ .x" - ~ ~ q" p,~ + O(a 3) • (3.11)
where A and B are still given by (3.7) with~) O replaced by E , the mean density, and 32
§ = ~-iOs(~,~)/6~ .
(3.12)
(3.13)
The definitions of ~ and q" are the same as before, except that because of
compressibility we have ~.~ = - p~/p + O(a 2) (pL= p-. ~.~).
4. THE GENERALISED ELIASSEN-PALM RELATIONS, AND THE CONNECTION BETWEEN
MEAN-FLOW ACCELERATION AND WAVE GENERATION OR DISSIPATION IN LONGITUD-
INALLY-SYMMETRIC PROBLEMS
4.1 Conservation of pseudomomentum
For simplicity we now revert to the assumption that the mean flow is
independent of x i, as in the example of figure 1 (where i=l). That is, the
mean flow is invariant to translations in the x i dlrection. Associated with
this invariance is a conservable wave property
Pi = -Po~,i "(ul +~^~ ) (4.1a)
because in this case we may replace ( ),~ by -( ) ,i in the generalised
wave-action principle (3.6). The associated flux is
Qij = 5jPi - ~j,i p" ' (4.1b)
and (3.6) is replaced by
305
Pi,t + Qij,j = Po(~,i "~" + {i q') + O(a4) " (4.2)
(Of course I could have derived (4.2) in the £irst place by dotting (3.2a)
with -~i rather than ~,~. But I wanted to go via the wave-action principle
because of its importance as the starting point for applications in various
other contexts. For instance it is not quite so obvious how to find the
analogue of (4.2) in the more general Kind of longitudinally-Symmetric
problem where the mean flow is rotationally invariant, until we take (3.6)
or (3.11) as starting point and apply it to the ensemble generated by
rotating the disturbance pattern22'25.) For reasons to be explained in
section 5 I shall call Pi the density of pseudomomentum.
There is still no obvious connection between (4.2) and the mean-flow
equations (2.2) - although there would have been at this stage had we been working with the generalised Lagrangian-mean description 22. Because we are
using the Eulerian-mean description, some more analysis will be needed.
First, we use the remarks in section 2.5 to motivate a simple transforma- tion 23,24,32 of the mean-flow equations, which will take them one step
closer to the connection with (4.2). The idea is to subtract out the
contribution to the O(a 2) Eulerian-mean secondary circulation expressed by the stream function ~ of (2.12). The second and £inal step 3,4,23,
24,26,32-35 will involve manipulation of the linearised equations in a way
foreshadowed by the celebrated work of Eliassen & Palm 19 and recently brought to a very general form by Andrews & McIntyre 23'32.
4.2 Preliminary transformation of the mean-flow problem (2.2)
Now take x=x I as the direction of symmetry. Define v and by
v= ~ +v , w=- ~ +w (4.3) ,z y ,
_, _, a 2) so that v , w represent the "residual" O( mean secondary circulation
left over after subtracting out the part corresponding to (2.12). Then a
small amount o£ manipulation converts the mean-flow problem (2.2) into the form
u,t + Uy~* + UzW* = -Sxy,y - Sxz,z -
2Jlu,t + P,ty + v,tt = -Y,t
-8,t + P,tz + w,tt = -Z,t
O,t + ~ yV* + e --* , , z w = -G - rz
_, w,
V~y + W,z = O
(4.4a)
(4.4b)
(4.4c)
(4.4d)
(4.4e)
where
Uy = H,y - 237_ , Uz = H,z (4.5a)
308
Sxy = ~ - UzV~e~/e,z (4.5b)
SXZ = u'w" + UyV'8"/8,z (4.5c)
O = w'e ''~ + v'e' e,y/e,z (4.5d)
-- (v "e"/~, z) = (v ~" ) ,y + (v'w'),z + ,zt + Y + O(a4) (4.5e)
= (v'w'),y + (w---~") ,z - (v'e'/e,z) ,yt + ~ + °(a4) • (4.5f)
Correct to O(a2), (4.4) can be regarded as a set of equations for the five unknowns
{~,t , V*, ~*, §,t ' P,t} , (4.6)
since Uy, U z, ~,y and e,z can be regarded as known "coefficients" apart from contributions O(a2). We are of course thinking of the linearised wave problem as having been solved, so that the wave properties on the right are
known forcing terms.
The transformation (4.3) is dependent on our choice of coordinates; a coordinate-independent preliminary transformation may be used instead, at the cost of getting more complicated-looking versions of the results 32.
4.3 Excess momentum fluxes and generalised Eliassen-Palm relations
We now multiply the x component of (3.2a) by ~ and then by ~ 26, and average. The resulting pair of relations reduces, after a little manipulation in which we use (3.4), (3.9) (with ~ replaced by x) , and our
assumption that ~ = (~,O,O) + O(aZ), to
u'v" + pjl~7, x p- _ Uz ~ = ~ + _IUy(-~),t + (~u'),t + O(a4) (4.7a)
u w + po -I ~,x p" - Uy~-~ = ~ + ½Uz(~),t + (~u'),t + O(a4)" (4.7b)
The second term in (4.7b) is to be compared with (2.8), the wave-drag force - are the excess momentum fluxes per unit area, and indeed ~,xp" and - p;
(or minus the "radiation stress" components) in one of the forms in which they appear in Lagrangian-mean analogues of (2.2a) 13'18'22. Eqs.(4.7)
relate these to the "Eulerian" excess momentum fluxes u'v" and ~-~. Note that -~,x p~ and - ,x~ are also equal to the y and z components of the nonadvective part of the flux of the pseudomomentum component PI; see
(4. ib).
Next we multiply (3.2b) by ~ and average to get
-v'8" + (~e'),t + ½e,y(~),t + e,z~ = _-~v + o(a4 )
and ~ times the first plus ~ times the second of (3.4) gives
, (4.8)
307
i ~ i , t = ~ + ~"w"~ (4.9)
These may he used to eliminate ~v? and ~w" from (4.7). The result is a pair
of relations like (4.7) except that Sxy and Sxz appear in place of u'v" and ~ , and the remaining terms, apart from ~xp-~ and ~ , are either of the form [--5, t or contain a factor X" or Q'. If the first relation is differentiated with respect to y and the second with respect to z, and (4.2) used to eliminate the two terms in p" , there results
Sxy,y + Sxz,z -- (~X~,y + (~---fS.z +
+ .&x~ + ~,x=V uz(~l~,z),y + UyC~V/~,zl,z
+ + - pol l - Oz C o' ÷ -
+{Uy[(~-~ + ~Oy ~)/[9,Z] + Uy~ + ~Uz~},z ] + O(a 4) . (4.10a
By multiplying (3.2b) by 0", averaging, and differentiating with respect to z, we get
G,z = I~/~,z),~ + ~-~/~,z],z + o(a4) 1 (4.10b)
4.4 Deductions
The results (4.10) imply that the O(a 2) forcing terms on the right of the mean-flow problem (4.4) can be expressed as a sum of terms each of which falls into one of three categories:
(i) the Eulerian-mean external forcing terms ~ and Q (which we shall choose to regard as unconnected with the waves),
(ii) wave terms of the time-differentiated form ( ),t ,and
(iii) wave terms all involving X', Q', or q" , that is, all depending explicitl Z on the forcing or dissipation of the waves.
This immediately shows that in the proDlem of figure l, for instance, the forcing of mean-flow changes vanishes below the layer L, where the waves are steady and conservative, for any initial pro£iles of mean flow and stratification, no matter how complicated, and no matter whether or not approximate, "slowly-varying" descriptions of the waves are valid.
It also carries the implication that although strictly conservative waves or instabilities (X', Q" and q" zero) can change the mean flow if they themselves are growing or decaying in amplitude, such changes are temporary
in that no net change to the mean flow persists if the waves propagate cut of the region of interest. This is almost obvious 23 from the fact that all the O(a 2) wave terms on the right of (4.4) can then be written in the form (-~,t ,with the aid of (4.10). However, if avery small amount of dissipation
308
is present, its effects can be greatly enhanced by the occurrence of
mean-flow changes that would otherwise be temporary. For a striking example
of this, see reference ii. A change in the mean-flow profile due to wave
transience can bring about an approach to "critical-line" conditions
somewhere, giving the dissipation terms a chance to take over locally, in
turn causing further and more permanent mean-flow changes. Theoretical work
on such highly nonlinear feedback effects, which could easily be important
in the evolution of stellar differential rotation, for example, is still in its infancy 7'8.
4.5 Extensions
Precisely analogous results, enabling the same qualitative conclusions
about mean-flow evolution to be drawn without solving wave problems in
detail, have been derived for:
I) rotationally as well as translationally invariant mean flows, with
mean velocity predominantly in the longitudinal (azimuthal) direction 22-2~ 32
2) a fluid with a general equation of state p = S(8,p); the Boussinesq
approximation is not essential 22'25'32
3) a self-gravitating fluid 22
4) a conducting fluid, with mean magnetic field as well as mean velocity
predominantly longitudinal 31.
4.6 Simplifications for the problem of section 2
It is of interest in connection with the previous discussion to exhibit
the approximate form taken by the transformed mean-flow problem for the
almost-plane inertio-gravity waves of figure i. A self-consistent set of
approximations requires that the Richardson number ~,z/(H,z)2 be large, and
the depth of the layer L, and other scales of mean variation in the vertical,
including the Rossbyheight H R= 2_O.b/(~,z )i , large compared to the vertical
radian wavelength m -I. The horizontal scale b is then large also, compared with k -1, assuming that both terms in the dispersion relation (2.6) are not
too different in magnitude. The upshot of such approximations is that all
the terms of the form (-~,y or (--) ~ in (4.10) become negligible, and (4.4)
can be shown to simplify, when X and ~ are zero, to
U,t - 2/i~* = - ~,x.X" - ~,X q" + poIPl,t (4.11a)
2]lu,t + P,ty = O (4.11b)
-8,t + P,tz = O (4.11c)
- -* (4. lld) ~,t + ~9,zW = O
V,y + w, z = O • (4.11e)
808
According to the second and third equations the mean flow stays in
geost~ophic and hydrostatic balance as it changes: knowledge of P,t implies
knowledge of ~,t and 8,t " (The thermal-wind equation (2.10) holds, with
its O(a 2) contribution negligible). If we eliminate 5 t and 8 t in favour of P,t in (4.11a) and (4.11d), and cross-differentiate to eliminate ~* and m* w via (4.11e), there results
{--~(~yT. + ~Z--~ -~-HR~ ~I P,t- = 2J9-{ ~,x.X" + ~,X q; - polPl,t} y, • (4.12)
The form of the elliptic operator on the left shows why the Rossby height
= 2J~b/(8 z)% takes over as the vertical scale of the response of the H R mean flow, whenever the forcing on the right of (4.12) is confined to a layer
L of depth smaller than H R .
5. PSEUDOMOMENTUM IS NOT MOMENTUM
It is surprising how often one seems to encounter the conceptual mistake
that since waves can transport momentum, they must possess it. No less than
Lord Rayleigh 36 appears to have been under this impression when he wrote
that "if the reflexion of a train of waves exercises a pressure upon the
reflector, it can only be because the train of waves itself involves
momentum'. Nowadays one often reads about "the momentum of the waves" and
abo~t waves "exchanging'(their) momentum with the mean flow, and suchlike;
and there appears to be a tendency to assume that this momentum which the
waves are supposed to have is to be identified with the wave property Pi defined in (4.1a), or rather the wave property
Pi = Eki/~+ , (5.1)
to which Pi can be shown to reduce in those "slowly-varying" circumstances where Bretherton & Garrett's 28 arguments apply.
On the other hand, as Brillouin 37 pointed out in 1925, Rayleigh's
statement is a non sequitur because in a material medium you can perfectly well have a nonzero flux of momentum unaccompanied by any momentum density
- try leaning against a brick wall. Two specific counterexamples to
Rayleigh's statement which I happen to know are the obvious one of waves in solids (phonons) 37'38 , and a simple fluid-dynamical example Ipublished
in 197340 . In the latter example, apacket of "inertia" (pure Coriolis) waves
propagates along a waveguide comprising an incompressible , homogeneous
liquid between rigid, parallel boundaries in a rotating frame of reference.
The mean momentum is zer____~o, for reasons of mass continuity. Nevertheless
310
there is a non-zero recoil force when the wave packet is reflected from an immersed obstacle.
That problem has the further interesting feature that the wave packet
does have a well-defined "fluid impulse', I, which gives the recoil force.
is not equal to the integral of P; indeed it can have the opposite sense]
(In fact the momentum flux due to the mean pressure ~ plays a leading role in this particular problem.)
An example of a somewhat different kind is the celebrated problem of
a packet of electromagnetic radiation in a refractive medium. This problem
has long been controversial, but has been convincingly clarified in recent years by Penfield & Haus 42, Gordon 39, and Peierls 38, to whose papers,
together with the review by Robinson 41, the reader is referred for some very
interesting history. In this problem, provided we neglect dispersion, there
is a definite, non-zero total momentum M i which travels with the waves.
Again, this is not generally equal to the integral of Pi (or rather its
electromagnetic counterpart, the "Minkowski quantity'), nor is it equal to
the electromagnetic part of the total momentum (the "Abraham quantity').
On the other hand recoil forces, are, this time, simply related to Pi (but not to Mi!) in at least some circumstances 39.
Brillouin's point is that waves don't have to possess momentum; the
examples show that in fact they sometimes do and sometimes don't - and that
when they do, the momentum is not necessarily related to recoil forces. The
wave property Pi may or may not be closely related to either; and whether
it is depends in fact on 91obal considerations - on the full O(a 2) mean
problem and its boundary conditions. So we must either say that Pi may
sometimes "be interpreted" as momentum, and sometimes not, depending on the
global problem - surely a most unsatisfactory conceptual structure - or we
must decide that Pi is simply an entity in its own right, not necessarily related to any momentum, whereupon the conceptual problems disappear. This
is why I like to have a separate name "pseudomomentum" for Pi, just as one
likes to have separate names for other pairs of quantities, like energy and
torque, which have the same dimensions but different physical natures. I
want to use the term "momentum" in its ordinary, elementary sense, of course
(which we use when thinking intuitively about forces and accelerations).
The terminology follows the usage of workers in solid-state physics, to whom all this has naturally been more obvious than to most. Blount and Gordon 39
have carried the terminology, and the distinction between momentum and
pseudomomentum, into classical electrodynamics, and shown how it helps
clarify the issues of the so-called Abraham-Minkowski "controversy" over
electromagnetic waves referred to above.
"But, the reader may say, you said in section 3 that Hayes
conservation relation results from replacing certain space or time
derivatives by ( ),~ in the variational derivation of the conservation
relation for the energy-momentum tensor. So the result of going in the
reverse direction, which is just (4.2) without the "dissipative" terms, is
nothing but the conservation law satisfied by the "momentum" part of the
311
energy-momentum tensor itself. So surely Pi i_ss a momentum."
My reply is that that would follow if the same mathematical formalism
always represented the same physical entity. Here, however, the blanket
term "energy-momentum tensor" tends to obscure the fact that different
variational principles, springing from different basic formulations of the
physical problem, can be used as starting point. Different basic formul- ations carry different implications about the kinds of invariance
properties associated with conservation laws. Certainly Eq. (4.2) is
associated with invariance under translations in space, just as is
conservation of momentum. But the translational invariances referred to
are in fact quite different, a point well made by Peierls 38. Conservation
of the i-component of momentum, for instance, depends on invariance of the
basic physical problem, including force potentials (e.g.gravitational),
under translations in the x i direction. Conservation of Pi depends on
translational invariance of the mean flow, insofar as it enters into the disturbance problem. This condition does not necessarily involve external
force potentials. Other necessary conditions for the two kinds of conserv-
ation law to hold are also quite different. For instance, whether or not
momentum is conserved has nothing to do with whether or not the motion is
adiabatic, while conservation of Pi does depend on the motion being
adiabatic since it requires, inter alia, that q" = O in (4.2).
It is the description in terms of particle displacements (and Hamilton's
principle in the classical sense) that lies behind the appearance of
pseudomomentum rather than momentum in the "energy-momentum" tensor. By
contrast, if we form components of the Eulerian-mean energy-momentum tensor
from the usual pure-Eulerian variational principle in fluid dynamics, namely the Clebsch-Herivel-Lin principle 30, then in place of Pi and its flux
Qij we get the Eulerian-mean density and flux of momentum ,
pu i and P6"ij - Puiuj , (5.2)
to within an identically nondivergent contribution. This should be no
surprise in view of the foregoing remarks on translational invariance.
Conservation of pseudomomentum, as distinct from momentum, is connected
with invariance to a displacement of the disturbance pattern while mean
particle positions are kept fixed, as distinct from a displacement of the whole system, particles as well as disturbance pattern 38. The idea of "fixed
mean particle positions" cannot be directly expressed within a purely
field-theoretic or Eulerian description, which does not keep track of where
fluid particles are. But it is implicit in a description of the disturbance
in terms of particle displacements (from "mean particle positions'). (What
this means for finite-amplitude disturbances is dealt with in reference 22.)
One possible reason why momentum and pseudomomentumhave sometimes been
mistaken for one another may be that in certain examples, even~ore idealised
than those already cited, not only are both quantities conserved (requiring
X'and Q~to be zero in the case of pseudomomentum), but also their conservation
relations reduce to the same form. If in these examples we generate the waves
312
starting from an initial state in which momentum and pseudomomentumare both
zero, it can happen that they evolve in parallel and remain equal. The
simplest example is the trivial one of an electromagnetic wave in vacuo.
Here there is no medium present to make the translational symmetry
operations different. But there are also examples involving waves in media,
all of them longitudinally-symmetric problems, in which the lon@itudinal
components of pseudomomentum and mean momentum evolve in parallel. Perhaps
the most celebrated example is that of Stokes" periodic waves on the surface of an infinite, inviscid ocean. The initial conditions of no motion are
hidden in the assumption of irrotational motion~ Approximate longitudinal
symmetry would be enough for approximate conservation of pseudomomentum;
but we need also that there be no mean horizontal pressure gradient and that,
concomitantly, the mean mass continuity equation (which constrains the
distribution of mean momentum but not that of pseudomomentum) plays no
significant role. Exactly the same considerations explain why a further such
example is provided by the problem of section 2, in the case when H R << H, the scale of the layer L. Eqs. (4.11) then imply that, for conservative
waves,
poU,t = Pl,t (X=O, Q=O, HR<<H). (5.3)
These examples are very special, and in any case the question of whether
or when there happens to be a momentum density equal to Pi is not the most
relevant one in practice. Statements which are more useful and general can
be made about the fluxes of momentum and pseudomomentum. Especially when a Lagrangian-mean description is used, the excess momentum flux due to the
waves is often simply related, although not usually equal, to the flux of pseudomomentum. It is basically this fact which accounts for examples of
the kind just mentioned. It is also why (4.2) could be used to eliminate
the terms in p" during the derivation of (4.10a) from (4.7). The reasons
for the existence of such relations are hinted at by Eqs. (2.14), (2.17),
(5.1), and the well-known argument about the relation between wave-drag,
phase speed, and the rate of working across a material surface.
More explicitly, in many "slowly-varying'situations it turns out that
the mean flow can be defined in such a way that (i) the excess momentum flux is the only wave term in the leading
approximation to the O(a 2) mean-flow problem (Eqs.(4.11) provide an example
of this), and (2) the excess momentum flux is then either equal to the pseudomomentum
flux, or differs from it by a contribution Cij which in some cases does not cause systematic mean-flow changes because it can be balanced quasi-
statically by the reaction of the medium. When MHD effects are not involved, and the fluid is compressible, Cij is
an isotropic~ pressure-like contribution which can be thought of as a kind
of acoustic "hard-spring" effect coming from the nonlinearity of the equation of state Ig'37'44. Analytically this results from redefining the
mean pressure in such a way as to avoid having a wave term in the equation
of state for the mean flow. For electromagnetic waves in refractive media Cij is again isotropic 42'39 and comes from "electrostrictive" and "mag-
The same remark applies in the classical theory of sound waves.
313
netostrictive effects 41. In MHD problems Cij is not, however, isotropic 17.
Whether or not such statements are helpful or misleading depends on whether or not, in the problem in question, the difference Cij between the pseudomomentum and excess momentum fluxes has time to be balanced quasi- statically by the mean stress in the medium (it usually does have enough time in "slowly-varying; situations), as well as on whether that mean stress happens to affect the answer to the particular question being posed. (To take a classical example, Ci~ is relevant to the force exerted by the absorption of acoustic waves znto the end wall of a closed container, but not into an absorber immersed within a larger volume of fluid 43,44. It is
presumably the former situation more than the latter, incidentally, to which the problem of solar wind acceleration by Alfv4n radiation pressure is analogous.) When (5.1) holds, and "group velocity" is meaningful, it is often true that the analogue of (2.17) holds also, namely that Qij equals Pi times the jth component of the group velocity. So there are some slowly-varying situations (those in which the questions being asked permit Cij to be ignored in some sense) where one can say mnemonically that the
waves transport momentum as if a local momentum density equal to Pi were being carried along through a vacuum at the group velocity. But the difficulty of saying in general terms when this mnemonic is applicable, and
when it isn't, brings us back in the end to the point made earlier: the only safe and completely general recipe for studying wave transport effects is to consider not only the "wave properties~ which can be evaluated from the O(a), linearised problem, but also a self-consistent analysis, correct to O(a2), of whatever global mean-flow problem is relevant.
REFERENCES
i. Lindzen,R.S. 1973 Boundary-layer Meteorol.4, 327-43 2. Holton,J.R. 1975 The dynamic meteorology of the stratosphere and
mesosphere. Boston, Amer.Met. Soc., 218pp. 3. Fels,S.B. & Lindzen,R.S. 1974 Geophys. Fluid Dyn.6, 149-91 4. Plumb,R.A. 1975 Q.J.Roy.Meteorol.Soc.lOl, 763-76 5. Spiegel,E.A., Gough,D.O., personal communication 6. E.g. Sakurai,T. 1976 Astrophys.& Space Sci.41, 15-25 7. Holton,J.R. & Mass,C. 1976 J.Atmos. Sci.33, 2218-25 8. E.g. Grimshaw,R. 1975 J.Atmos.Sci.32, 1779-93 9. Eliassen,A. 1952 Astrophysica Norvegica 5, 19-60 iO.Phillips,N.A. 1954 Tellus 6, 273-86 ll.Matsuno,T. 1971 J.Atmos.Sci. 28, 1479-94 12.Uryu,M. 1974 J.Meteorol.Soc.Japan 52,481-90 13.Grimshaw,R. 1975 J.Fluid Mech.71, 497-512 14.Landau,L.D. & Lifshitz,E.M. 1975 The classical theory of fields, 4th
English edition. Pergamon, 402 pp. See also Dougherty,J.P. 1970 J. Plasma Phys. 4, 761-85
15.Eckart,C. 1963 Phys.Fluids 6, 1037-41 16.Hayes,W°D. 1970 Proc.Roy. Soc.A 320, 187-208 17.Dewar,R.L. 1970 Phys.Fiuids 13, 2710-20 18.Bretherton,F.P. 1971 Lectures in Appl. Math. 13, 61-102 (Amer.Math. Soc.)
314
19.Eliassen,A. & Palm,E. 1961 Geofys. Publ.,22#3, 1-23 20.Charney,J.G. & Drazin,P.G. 1961 J.Geophys.Res.66, 83-109 21.Moffatt,H.K. 1977 Magnetic field generation. Cambridge Univ. Press (to
appear) 22.Andrews,D.G. & McIntyre,M.E. 1977 Submitted to J.Fluid Mech. 23.Andrews,D.G. & McIntyre,M.E. 1976 J.Atmos. Sci.33, 2031-48 24.Boyd,J. 1976 J.Atmos.Sci. 33, 2285-91 25.Bretherton,F.P. 1977 Submitted to J.Fluid Mech. 26.Bretherton,F.P. 1969 Q.J.Roy.Meteorol.Soc.95, 213-43 27.Lorenz,E.N. 1955 Tellus 7, 157-67 28.Bretherton,F.P. & Garrett.C.J.R. 1968 Proc.Roy. Soc.A 302, 529-54 29.Whitham,G.B. 1974 Linear and nonlinear waves. Wiley. 30.Bretherton,F.P. 1970 J.Fluid Mech 44, 19-31 31.Andrews,D.G., personal communication 32.Andrews,D.G. & McIntyre,M.E. 1977 To appear in J.Atmos. Sci. 33.Eliassen,A. 1968 Geofys. Publ.27#6, 1-15 34.Dickinson,R.E. 1969 J.Atmos.Sci.26, 73-81 35.Uryu,M. 1973 J.Meteorol.Soc.Japan 51,86-92 36.Rayleigh,Lord 1905 Phil.Mag.lO, 364-74. (Sci.Papers,5, 262-71) 37.Brillouin,L. 1925 Annales de Physique 4, 528-86 38.Peierls,R. 1976 Proc.Roy. Soc.A 347, 475-91 39.Gordon,J.P. 1973 Phys. Rev.A 8, 14-21 40.McIntyre,M.E. 1973 J.Fluid Mech 60, 801-11 41.Robinson,F.N.H. 1975 Phys.Reports (See.C.of Physics Letters) 16, 314-54 42.Penfield,P. & Haus,H.A. 1966 Phys,Fluids 9, 1195-1204 43.Brillouin,L. 1936 Revue d'Acoustique 5, 99-111 44.Rooney,J.A. & Nyborg,W.L. 1972 Amer.J.Phys. 40, 1825-30
WAVE GENERATION AND PULSATION IN STARS
WITH CONVECTIVE ZONES
Wasaburo Unno Department of Astronomy, University of Tokyo
Bunkyo-ku, Tokyo, JAPAN
SUmmary. Wave generation processes are classified in (i) strong and (2) weak, and (a)
Spontaneous and (b) stimulated processes. Then, the case (2b) operating in convective
Zones is discussed in detail. Both the dynamical and the thermodynamical coupling be-
tween pulsation and convection are formulated by use of the diffusion approximation for
the turbulent convection. A mixing length variable with time is thereby introduced.
The work integral is transformed so that each of its terms can reveal the mechanism re-
sponsible for the stellar stability. An important destabilizing mechanism associated
with the convective flux is found to exist among other known mechanisms. The mechani-
cal work is shown to be rather important.
Wave generation processes. The genuine hydrodynamleal generation of waves is due to
the nonlinear Reynolds stresses (Lighthill 1952, Unno 1964). The adiabatic wave gene-
ration in an isothermal atmosphere was thoroughly studied by Stein (1967). In this
Case, the waves are generated spontaneously. The propagation is not isotropic, but the
anisotropy is not very strong because of dominating quadrupole emissions. The wave am-
plitude in situ is small in subsonic turbulence, bu tthe effect can be appreciable after
the waves propagate in the outer layers. On the other hand, if the medium is made strong-
ly anisotropic by the presence of magnetic field or rotation, the monopole and dipole can
be very important, and the wave generation can be strong. The generation of Alfv~n
waves from turbulence under the presence of a strong magnetic field was found to be very
effective (Kato ]968, Roberts ;976). A change in the basic structure of the medium is
then expected. Spiral arms in galaxies, spicules and sunspots (Parker 1974) may be
the manifestation of such cases.
For waves that are trapped in some region of a star, the stimulated emission should be
Considered. The emitted wave and the underlying oscillation have a phase relation so
that the whole process forms a self-exciting system. Therefore, in principle, the
strong stimulated generation of waves may not be an inaccurate concept. Osaki (1974),
however, considered that the resonance between the nonradial oscillation and the over-
stable convection in a fast rotating core could be the cause of the 8 Cephei variabil-
ity. In such a case, the theory remains qualitative.
For trapped waves or pulsations, the weak stimulated emission can be accumulated and
become important. The thermodynamieal excitation of pulsation has been worked out by
many authors (Zhevakin 1953, Baker and Kippenhahn 1962, Cox 1963, Christy 1964) as the
316
cause of the variabilities of Cepheids and RR Lyr stars. Less investigation has been
done for stars having deep convective envelopes because of the theoretical difficulties
i~ the treatment of a convective zone.
Modulation of convection by pulsation can be calculated on the basis of the mixing-
length theory (Vitense 1953) slightly generalized to include the time dependence (Unno
1967). Recently, Gabriel, Scuflaire, Noels and Boury (1975) calculated the thermody-
namlca! coupling of the convection with the nonradial pulsation, and they demonstrated
an appreciable effect of the convective flux perturbation on the stability coefficient.
The dynamical coupling has been neglected so far. But, it is caused by the 2erturba-
tions in the turbulent pressure, visocsity and conductivity, and its effect on the sta-
bility is not negligible as shown later.
Table i. Classification of wave generation mechanisms
I. STRONG
(monopole) dipole anisotropic
2. WEAK
(quadrupole)
isotropic
A. SPONTANEOUS B. STIMULATED
noise {phase relation propagating "trapped
STRUCTURE CHANGE
8-Ceph spicules spiral arms sunspots (Osaki) (Kato, Parker)
Effects on Outer Layers Excitation of Pulsation
homog. (Lighthill) thermal & dynamical 5mn Oscill. pulsating stars
isothermal (Stein) ~stability
Table 1 summarizes the classification of the wave generation mechanisms discussed above.
The excitation of spiral arms in galaxies (Mark 1976) is considered as an example of a
strong emission mechanism. The exeitation of the solar 5 min. oscillation studied by
Ando and Osaki (1975) belongs to the weak stimulated emission mechanism. In these two
examples~ however, the coherence in spatial and temporal wave patterns is not completely
perfect, and the emission mechanism may better be considered as partially spontaneous
and partially stimulated. The solar stability (Dilke and Gough 1972, Boury, Gabriel,
Noels, Scuflalre and Ledoux 1975, Shibahashi, Osaki and Unno 1975) is an interesting
example involving the weak stimulated emission mechanism. The convection-pulsatlon
coupling is important. A qualitative change in the underlying solar structure discussed
in the present Colloquium may not take place, since the emission mechanism is weak.
Basic equations °f the pulsation-convection coupling. We shall hereafter restrict our-
selves to study the mechanisms of pulsational stability operating in the convective zone.
At present, no complete description of the compressible inhomogeneous turbulence is a-
317
vailable. We shall, therefore, approximate the nonlinear effects of the turbulent con-
vection by the eddy dlffusivltles. Then, the conservation equations of mass, momentum
and thermal energy are described by (Unno 1969)
d~ 1 dt ~- pV-m+~V-(<v>V~) %- ~V.~ , (I)
d~ 1 i d---{ = - ~ V(P+Pt) - V~ + ~ [V(<~>V-u) + (V.<~>V)n] , (2)
d! = ! (~N + ~V 1 1 dt T - ~ V'FR) + ~ V. (<%>VS) , (3)
where d/dt is the Lagrangian time differentiation, P, T, P, S and ~ denote density,
temperature, pressure, specific entropy, and the velocity, eN, EV, ~R and Pt denote the
nuclear energy generation rate, the turbulent viscous dissipation, the radiative flux,
and the turbulent pressure, respectively, and <l>, <~> and <v> describe the coeffi-
cients of turbulent conductivity, viscosity and diffusion that are approximated by
<pu'%>, Z being the mixing length and the prime indicating the convective fluctuation.
The small scale fluctuations inside a representative convective element have been smooth-
ed out so that the Pt(=<pu'2>), <l>, <~> and <~> appear from their nonlinear effects.
Since the turbulence has a continuous energy spectrum, the magnitude of these terms
should depend also on the scale of motion under consideration (Nakano 197~), but this
dependence will not be explicitly described in this paper. We shall also neglect <~>,
since mixing is efficient in the convection zone. The viscous dissipation ~V which is
dimensionally given by
C V ~ u'S/E (4)
in accordance with equation (2) should not be disregarded, since for the adiabatic con-
vection the following relations,
~V0 ~ (llP0)<%>oVTo'VS0 ' (5)
~co ~ - <X>o TorSo • (6)
reduce essentially to the original Biermann formalism (1932) and then the energy con-
servation in steady state,
P0~N0 = V'(FR0 + ~CO ) , (7)
is ensured by equation (3). Here FC denotes the convective flux and the subscript 0
indicates the statistically steady undisturbed state. The equilibrium structure is
determined by
(I/o0)V(P 0 + Pt0 ) + V{ 0 = 0 (8)
318
in addition to equation (7). Subtraction of these equilibrium equations from the basic
equations (2) and (3) yields the equations for perturbations. The difference in spatial
and temporal spectra between pulsation and convection can be used to separate the system
of equations governing pulsation and convection from each other.
The work in te$ral of the nonradial pulsation.
- im (~p /p 0 + ¥ ' 6 r ) = 0 ,
- m26t + (I/P0)WPI - (Pl/p02)¥P0 = ~i '
- i~6S = ~ ,
The equations of pulsation are given by
(9)
( i 0 )
( 1 i )
where ~q and qlrepresent the Lagranglan and Eulerian perturbations of any variable q, t
the time differentiation d/dt0=8/~t+u 0 .V 0 of the Lagrangian perturbations are replaced
by -I~ , and
1 Pl ~0 3 1 : - v¢1 - ~o v P t l + TjZo v e t ° - t v ( < ~ > ° " ~ ) + ( v ' < ~ > ° v ) ~ 1 ' ( n )
~ = ~[T-I (EN + eV - O-I¥'FR ) + P-I¥'<%>¥S] (13)
The effects of the modulated convection enter through Ptl in 71 and ~gV and ~<l> in
~. The calculation of these terms will be made later.
Now we can define the pulsation energy Ep per unit volume by
2 + ~ dPo ) tdlnP0 _ ~ ) - i P1 - p9~01 ) 2] I (14) Ep= ½Po [ml -c2po 2 (- ~ "71dr (71P0
where c2=YiP0/Po and 71 =(~inP/81nQ) S. It consists of the kinetic energy and the poten-
tial energies of acoustic and gravity waves. From equations (9), (i0) and (ii), after
some manupilations, we obtain
~Ep ~t + ¥" (PlUl + EpU0) = P0(~I" ~I + ~T6~ ) (15)
Integrating this equation over the whole volume of a star and assuming the boundary !
conditions of PI~I=0 and 40=0 at the surface, we obtain
~d I EpdV = W = WM + WT ~ Re I d t O0(~I*'~I+6T* ~ )dV" (16)
The result is independent of the quasi-adiabatic approximation and the Cowling approxi-
mation used above.
The mechanical work W M is the sum of the turbulent pressure work Wp and the turbulent
viscous stress work WS;
3 1 9
Wp = Re -i~r*.V~PtdV = 2m f Ptolm[(~p*/00)(~u'/u'0)]dV ,
W S = Re Ul*- (-~m) [V(<P>0V-~r) + (V-<~>0V)~r]dV
-- -c°~ f<~>0 (16P/Po 12 + IV~f2)dV
The calculation of Im[(Sp*/p 0)(6u'/u0)] will be given later.
W T consists of the nuclear energy work WN, the viscous dissipation work WV, the radia-
tive work WR, and the convective work W C. After some manlp~%lations, we obtain
WN=Re f p0(~T*/T0)6eNdV = f P0ENO(gN)T,adI~T/T0'2dV , (19)
W R = Re f p0(dT*/T0)6(O-IV-k'VinT)dV
- []-~'(~'0 ) + ~ I~01']dV + I 2V°]F"^Re[~T* ~6r,.. ~0 ~u ~0 @T'~v f (~26r 2 ~6r 2-£ (£+i)
+ Re FRO ~0" ~ - r ~r + r ~ 6r)dV , (21)
W C = Re I P0 (6T~/T0)6(Tp-IV'<X>VS)dV
= Y3-1 - F'~3-i- - u ~0 z0
where
+ Re f FC0 ~6T, 6u' ~0 ~ f F 2T~V.-_ -f~Q Re[~* ~r- T O Dr (u~ ~ + )dV + u i 0 -0 ~-~--]dv
f dT*f82~r 2 8~r 2-Z(£+I) 6r)dV + Re FC0 TO0 " ~ r Dr + r 2
.81nT. =d_~_~ H0 dr Y3-1 = [~-~no)S , V 0 dlnP0 ' dlnP 0 '
4acT ~ (81nK. .~l..~..~ I- K = 3<p ' ~ = t~--~TnT)S ' (gN)T,ad = t~inT )S
(17)
418)
The thermodynamieal work
(22)
L denotes the surface luminosity, and equation (6) has been used. The radiative con-
ductivity K appears in the expression of the radiative flux FR, ~R=-I<VInT. In calculat-
ing W R and WC, the following identity has been used,
~Bo (~2~r 2 ~r 2-£(I+i) dr) - 2V.(AoVB 0) ~r ~(V'AoVB0) = - A0 ~-r-- ~ r 8r + r Dr '
320
where the spherical harmonics yim(@,~) for the angular dependence of a normal mode has
been assumed. The work integrals will be transformed to more convenient forms for re-
vealing physical processes, after the convection modulated by the pulsation is discuss-
ed in the following.
Time dependent convection.
out from the basic equations (i), (2) and (3) as follows,
Equations of the time dependent convection can be separated
dp'/dt I + V.(pu') % V.(pu') % 0 , (23)
1 p' 1 , p' (d~l +~)~' +~VP'- ~r YP =- ¥~ ' - ~'Pt +~ YPt (24)
dS'/dt I +~"¥S = [T-1(e N + e V - p-IV'~R) + p-l¥'<X>VS]' , (25)
when d/dtl=~/~t+~l.¥ , TV~Z/u', the primed quantities are the Eulerian perturbations due
to convection, and the coordinate and the quantities without prime include the Lagrang-
Jan perturbations due to pulsation in addition to the equilibrium values.
Now we neglect the right hand side of equation (24) and use the Boussinesq approximation
instead of equation (23). Then, using the lateral component of equation (24) to elimi-
nate P' from the radial component, we obtain
(d+! , k~ 2 p' BP k~ 2 T' ~P Tv)U r = k - - ~ ~ k--- f- IpTl ~- (- ~) , (26)
where pT=(alnp/alnT)p and a spatial dependence of exp(i~-~) has been assumed for the
convection variables. Also, equation (25) can be rewritten as
!
(d~1+!+!-!)S' +u r ~s°=0 TC TR ~N Dr
(27)
where TC, TN, and T R denotes the time scales of the turbulent conduction, the heating
by nuclear burning, and the radiative cooling of a convective element. We shall here-
after write u' for u r' and T for ~V and T C without distinguishing the different numeri-
cal factors of the order of unity that are unimportant in the stability analysis.
Equations (26) and (27) have to be supplemented by the definition of the mixing length v
A which is modulated by pulsation. We assume that a convective element born at time t
has a mixing length equal to the instantaneous scale height H (=-~r/~inP) initially and
evolves according to the law 0A3=const. during its llfe time T. Then, the relative ex-
is given by ([6H]/H0)e-i~' at its birth and will be further increased by cess
(-i/3)([~p]/p0)(e-i~t-e-i~t') during the time interval from t' to t, the time dependence
in ~H and ~p being expressed explicitly and [~H] and [~p] being the compressible ampli-
tudes. For the average convective element, we obtain
321
[%~_~0 e-~0t = r[[6H___~] e_i~t' _ 1 [~P] (e_i~t _gi~t')]exp( - ~') d(t~t') ~0 H0 3 Po
or
~ 1 (~H iw'c 6p) o+ -Wo '
(28)
assuming a constant birth rate and constant life time for the convective elements. A
more precise theory for the time-dependent mixing length has been developed by Gough
(1976).
For equilibrium (d/dtl=0), equations (26) and (27) reduce essentially to the Vltense
(1953) formalism. The modulation of convection by pulsation can be calculated from
the Lagranglan variations of equations (26) and (27), (Unno 1967, Gabriel, Scuflaire,
Noels and Boury 1975). For qualitative study, we now consider an envelope (TN/TR -~°)
in which convection takes place almost adiabatically (TR/~-~) . The result comes out
to be :
BT' 1 (B% D6r) T O ' - l-i~T ~0 - Dr (29)
6u' i (~£ D6r) .H 0 D (6P) , u o' - l-i~ i~0- ar + 2-imT a-~ PO (30)
where the variations of PT and of the mean molecular weight have been neglected. The
value of B%/~0 is calculated from equation (28) in which by definition (H=-dr/dlnP),
~H/H 0 = D~rlSr + H0(D/Dr)(SP/P 0) (31)
Thus, the modulation of convection to be used for the calculation of the work integral
is now expressed in terms of (aT/T0) and (D~r/Dr)~ on account that Vad(~P/Po)=(Y3-1)
(6P/Po)=BT/To •
Some useful formulae. Readers can skip this section, unless mathematical details are
of interest.
The work integrals W R and W C should be simplified further by use of the equations of
adiabatic oscillations. Neglecting ~i and ~ , we obtain from equations (9), (i0)
and (ii) after some manipulations,
l(l+7)GM --~ar ( r 2 6 r ) - ~zy2 "r~r = - r2 (1 - ~2r2 " 0 p -
D GM= 1 D~P GMr 6_~ a--{ [--~Br) - ~2Br 00 Dr rZ P0
, ( 3 2 )
(33)
322
Multiplying equation (33) by r ~, differentiating, and using equation (32), we obtain
r 2 9,(9,+i)c~ ~t3 ~i r~ ~P ~-P0 ~r~ _ r2 ~r + 2-£(bl-l)r2 ~r = ---GMr (°~2 - ~ ~0 - ~r [G---M~.rp 0 ~r + -] , (34)
where the variation of Mr has been neglected as in the Cowling approximation ($i=0).
Thus, we have a formula appearing in equations (21) and (22) :
6T__*+82~r 2 8~r 2-£(£+i) Re[T 0 (-~fr - ~-r + r 2 6r)]
where
÷ - - I~TI~I (Y3_l)~og 2 Va d t 1T~ TO) I
- ~ [~-:+'ad i+Tl:' {"~ - ~ (+-~+)> i~oi+T +,j
+ + 0',=!- ~o + #-:.)' I~1: , [3 -' l ad
~g2 _= GMr/r 3
(35)
(36)
Next we shall derive the expression of 36r/~r. Differentiating equation (32) with r
and using equation (33), we obtain
32(r2~r)/Sr2_£(E+l)~r = _(8/~r)(r260/pO)-£(Z+l)~-2A~P/00 , (37)
where A=(dlnOo/dr)-(i/Yl)(dlnPo/dr). The operation : [(37)/r2-(34)] gives the expres-
sion of ~r/~r. Unfortunately, the resulting expression is not so simple. We shall
be satisfied with a crude approximation in which we introduce an effective number of
nodes n such that ~2(r2~r)/Sr2~-n26r and ~2(rZ6P/p0)3r2~-n2~P/P O. If we neglect A, we
obtain from equation (37),
~r/~r ~ - {n2/[n2+E(£+l)]} (6P/PO) (38)
Physical processes of stability. With the help of the results in the preceding two
sections, all the integrands in the work integrals given by equations (17)-(22) can be
transformed into the forms proportional to the power of pulsation.
After a number of partial integrations, we obtain finally,
WM= ~I ~t0'[2u}- r2Pt ,Z y3-± ~ ~-~ -
W T = (i/2)L(Vad -I - K T) I~T/T012r= R
- I m2)V' IK~R°+{%-I ) - ' - 2+u~R+'~TR}FC011+T/~0t~dV
323
+ I POeN0[ (~N)T, ad + (Y3-1)-1+2aT-Vad-r(l-Vad,P+eN,p/2) ]IST/ToI 2dV
+I I'dv ~0eV0[2(UT -£T ~'6T )-i-{~ In(r2p0gV0)+
+ I [FR0(Vad-1-V0-1 )+FcOVad-I ]H0[[ a (~T/T0)/ar] 2+{% (%+1)/r~} 16T/ToI 2 ]dV
d I-V ST 2 ,R ,R 9 aT 2 (4o)
where approximations (5) and (6) have been used, aT, ~ etc. are defined by, [equations
(28)-(31)],
~Sr ST nZ/(y3,11 ~T ' a r = ~T ~ 0 % - n2+£(%+I) T~ '
SH ST , 8 .ST. V_a~)~T H_O - 8 (ST) --H0 = HT TO0 + HT ~LTo0) = (aT + qad TO ÷ Vad ~TO0 '
, ~ ,~T,.
~0 ~0 +~T~'~o ~ =ra~H~ ~rl% HT
' ~ 8 .ST, ~u I aT , 8 caT. V~d-IVad.P) aT ( £T u0 To + = + + + "rr%' = l-iLoT 2-i~T ~0 l-im~
and the other symbols have been defined before except that
~N,P = d(IngN0)/dlnP0 " and Vad,P = d(inVad)/dlnP0
The turbulent pressure work represented by the first integral in equation (39) can be
positive or negative depending upon the mode (% and n) and the relative convection time
mT, while the turbulent stress work represented by the second integral is always nega-
tive. The ratio between them is of the order of (~T) -l which can be larger or smaller
than unity depending upon the mode and the stellar structure. If the integrands of the
thermodynamical work W T are estimated to be of the order of (Fco/~0) I6T/T012 and FCO ~
PoU0 in convection zones, the ratio WM/W T becomes to be of the order of ~T or (~T) 2
The mechanical work should, therefore, be taken into account in the stellar stability
calculation.
In equation (40), the first llne on the right hand side shows the spherical effect and
the <-mechanism. If K increases strongly with increasing temperature and density (nega-
tive KT) , larger radiation loss results at lower density higher entropy state, causing
the destabillzation of the oscillation. The sphericity has a similar effect, though
not so pronounced. The second llne shows the correction to the K and other effects. The
third llne shows primarily the e mechanism which is the destabilizing effect due to high
324
temperature sensitivity of the nuclear burning process. The fourth line shows the simi-
lar effect by the viscous dissipation, but its work like the turbulent pressure work can
be positive or negative. The term proportional to FRO in the fifth line describes the
Cowling-Souffrin-Spiegel (or radiative Cowling) mechanism (Graff 1976), and the term
proportional to FC0 describes the turbulent conduction mechanism (or convective Cowlin~
mechanism) which seems to be found by the present analysis. The sixth line shows the
corrections to these two mechanisms. Because of the latter corrections, the radiative
Cowlin~ mechanism may not work for high frequency modes, while the convective Cowling
mechanism may remain effective. If a fluid element moves up (or down) slowly (small ~) in a suDeradiahatic layer, the
temperature inside becomes higher (or lower) than outside, and the density inside be-
comes lower (or higher) than outside because of the pressure balance. Both the radia-
tion and the turbulent conduction decrease (or increase) the thermal energy inside
where the entropy is higher (or lower). This explains the radiative and convective
Cowling mechanisms. Both mechanisms are due to the superadiabaticity, but they differ
greatly in efficiency, because the gain (or loss) in thermal energy is of the same or-
der as the convective flux for the convective Cowling mechanism while it is smaller by
a factor of (?0-Vad) for the radiative Cowling mechanism. Although none of the mecha-
nisms can be neglected, the K mechanism and the convective Cowling mechanism are the
main destabilizing mechanisms of pulsation in a convective star. Various approximations have been employed in the derivation of equations (39) and (40).
Many of them should be easily avoided in numerical work of stellar stability.
The author thanks Dr Y. 0saki for discussion.
References. Ando, H., Osaki, Y. 1975, Publ. Astron. Soe.Japan, 27, 581 Baker, N., Kippenhahn, R. 1962, Z. Astrophys., 54, ll4 Biermann, L. 1932, Z. Astrophys., 5, 117 Boury, A., Gabriel, M°, Noels, A.,--Scuflaire, R.,Ledoux, P. 1975,Astron.Astrophys.
41, 279
Christy, R.F. 1964, Rev.Mod.Phys., 36, 555 Cox, J.P. 1963, Astrophys.J., 138, 487 Dilke, F.W.W., Gough, D.O. 1972, Nature, __240, 262 Dziembowski, W., Sienklewicz, R. 1973, Acta Astron.,23, 273 Gabriel, M., Scuflaire, R., Noels,A., Boury,A. 1975, Astron. Astrophys.,40, 33 Gough, D.0. 1976, Astrophys. J., 2i].4, 196 Graff, Ph. 1976) Astron. Astrophys., 49,299 Kato) S. ]968) Publ. Astron. Soc. Japan, __20, 59 Lighthill) M.J. 1952, Proc. Roy. Soc., A 211,564 Mark, J.W.-K. ]976, Astrophys. J., 206,]-~-~-- Nakano, T. 1972, Ann. Phys., 73, 326 Osaki, Y. 1974, Astrophys.J., ]89, 469 Parker, E.N. ]974, Solar Phys., 36,249 Roberts, B. ]976, Astrophys. J., 204, 268 Shibahashi, H., Osaki, Y., Unno)W', Puhl Astron. Soc. Sapan, 2__7, 40J Stein, R.F. 1967, Solar Phys., 2, 385 Unno, W. 1964, Transaction IAU XilB, Academic Press, New York, p.555. Unno, W. ]967, Publ. Astron. Soc. Japan, 19, ]40(ef.Proc.Astron.Soc.Australia,!,379 ,
1970) Unno, W. |969, Publ. Astron. Soc. Japan, 2[I, 240 Vitense, E0 1953, Z. Astrophys., 32, ]35 Zhevakln, S.A. ]953, Russian Astro-n.J., 30, ]6]
FULLY DEVELOPED TURBULENCE, !NTERMITTENCY AND MAGNETIC FIELDS
Uriel Friseh
C,N,R,S,
Observatoire de Nice, France
I. INTRODUCTION
Turbulence is usually associated with the idea of chaos, i.e. erratic behaviour of
some observable quantity. Let me stress that there are at least two different kinds
of chaos.
Temporal chaos is known to appear in certain systems having only a few degrees of
freedom. Take for example the Lorenz model (Lorenz, ]963) which has three degrees of
freedom. It is a crude truncation of the Rayleigh-B~nard problem with only one Fourier
component in the X- and the Z- directions so that the motion can in no way be consi-
dered as spatially chaotic. Nevertheless, there is a strong numerical evidence that
the temporal spectrum becomes continuous when the Rayleigh number crosses a certain
threshold, an indication that temporal chaos has developed. This kind of chaos can
appear on the largest scales of the system which makes it easy to observe. Possible
candidates for such theories are irregular variable stars, the geodynamo, etc.,...
Very different is the problem of fully developed turbulence which is essentially a
spatio-temporal chaos : when the Reynolds number goes to infinity all space and time
scales, down to infinitesimal are excited. Such chaos may develop in a finite time
and has universal sealing properties (e.g. a power-law energy spectrum). In the
astrophysical context fully developed turbulence may not always be directly observable
because of lack of resolution of the small scales. But it will always manifest itself
indirectly through a drastic modification of the transport properties.
In the next two sections we try to give simple phenomenologieal insight into univer-
sal properties of fully developed turbulence, particularly the question of intermit-
tency, or in other words spottiness of the small scales. Intermittency is very much
at the center of present theoretical studies (Kraichnan, |974, Frisch, Lesieur and
Sulem, 1976, Frlsch, Sulem and Nelkin, ]977). Experimentally, it is rather difficult
to observe because the small scales carry very little energy. However , magnetic fields
which are very sensitive to small-scale velocity gradients can he used as tracers of
the small scales (in the ~D case). It is therefore of great interest to note that re-
cent high resolution observations of the small-scale solar magnetic field indicate a
very intermittent structure (Stenflo, ]975). Non magnetic intermittent turbulence being
rather poorly understood it seems premature to consider the MHD case in detail. However,
many overall features are probably common to both cases in particular the steepening of
326
the spectrum. The reader interested in the non intermittent aspects of MHD turbulence
such as the non linear dynamo effect is referred to Pouqnet et al. (1976),
2, KOLMOGOROV 1941 REVISITED
Big whorls have little whorls
Which feed on their velocity
And little whorls have lesser whorls
And so on to viscosity.
L.F. Richardson, ]922
By the Kolmogorov 1941 (in short K41) theory, we mean the general class of arguments
developed by Kolmogorov, Obukhov, Onsager and others which has led in particular to
the 5/3 law (see Batchelor, 1953, and Monin and Yaglom, ]975, for review). The 5/3
law may be derived from dimension analysis, but more insight is gained from a simple
dynamical argument borrowed from Kraichnan (1972, page 213). We define the energy
spectrum E(k) as the kinetic energy per unit mass and unit wavenumber k. It is a
convenient simplification, with no significant loss of generality~ to consider a dis-
crete sequency of scales or "eddies"
= £ 2 -n n = O, |, 2, ... (2.1) n o
and of wavenumbers k = Z -I . The kinetic energy per unit mass in scales ~ £ is n n n
defined as
I kn + 1 E(k) dk- (2.2) E n = kn
Let us assume that we have a state of statistically stationary turbulence where ener-
gy is introduced into the fluid at scales % Zo, and is then transferred successively
to scales ~ ~|, ~ £2' ..., until some scale %d is reached where dissipation is able
to compete with non linear transfer (Fig. I). If we now make the essential assumption
that eddies of any ~eneration are space fillinb, as indicated in Fig. |, we may write
E ~ v 2 (2.3) n n •
where v is a velocity characteristic of n-th generation eddies (in short, n-eddles). n
In Eq. (2.3) and subsequentl~ factors of the order of unity will be systematically
dropped except when such factors would cumulate multiplicatively in successive cas-
cade steps. Note that v n is not the velocity with which n-eddies move with respect
to the reference frame of the mean flow, this being mostly due to advection by the
327
largest eddies. It is rather a typical velocity difference across a distance ~ £ n
the latter being the only dynamically significant quantity. (In this respect the
"velocity" in the second line of Richardson's poem is misleading). We now define
the eddy turnover time
t n % Zn/Vn . (2.4)
The quantity tn-! may be considered as the typical shear in scales ~ in' and there-
fore defines the characteristic rate at which excitation at scales % £ is fed into n
scales % in+l"
There are, however, at least two important exceptions to this statement. First we may
define a viscous dissipation time
t diss % i 2/v , (2.5) n n
where ~ is the kinematic viscosity of the fluid. If
t diss << t (2.6) n n'
then transfer is no longer able to compete with dissipation, and most of the excita-
tion in scales ~ £ is lost to viscosity. Second, if n
>> t = £o/Vo , (2.7) tn o
then the shear acting on scales % £ comes mostly from scales ~ £o' n
used instead of t as a dynamical time. n
and t o should be
Assuming that neither of these two exceptions applies, (this may be checked a poste-
riori) we make the fundamental assumption that in a time of the order of t a sizeable n
fraction of the energy in scales ~ Zn is transferred to scales % in+l" The rate of
transfer of energy per unit mass from n-eddies to (n+l)-eddies is then given by
E n ~ En/t n ~ Vn3/£n . (2.8)
Since we assume a stationary process in which energy is introduced at scales ~ Z o
and removed at scales ~ £d' conservation of energy requires that
en - ~ , £d < £n -< £o " (2.9)
328
NotiCe that ~ can be thought of as a rate of energy injection, a rate of energy trans"
fer or a rate of energy dissipation, From the point of view of inertial range dynamics,
the second of these three definitions iS the most relevant, Using (2.8) and (2.9) we
solve fO~Vn and En~
v n ~ ( ~ £n )I/3, mn ~ (~ £n )2/3 ' (2.10)
which by Fourier transformation yields the K41 spectrum
E(k) ~ ~ 213 k-513 (2.11)
The eddy turnover time of Eq. (2.4) is given by
t ~ ~ -I/3 I 2 / 3 ( 2 . t 2 ) n n
Equating (2.12) to the viscous diffusion time (2.5) determines the Kolmogorov micro-
scale
£d = (v3/;)114 (2.13)
gq. (2.13) gives the length scale at which the cascade is terminated by viscous
dissipation.
Injectio
Transfer
O0@C O© O00000000OOOO O0000C O00000000C O00
Dissipation Fig. I. The energy cascade according to the Kolmogorov 194] theory, Notice that
at each step the eddies are space-filling.
329
3. INTERMITTENCY : THE 8- MODEL
Big whorls have little whorls
Which feed on their vorticity
And little whorls have lesser whorls
Which hardly ever can you see.
Since the first experiments of Batchelor and Townsend (1949) there has been strong
evidence that the small scale structures of turbulence become less and less space
filling as scale size decreases. (Kuo and Corrsin, 1972; see also Monin and Yaglom,
1975, for review). Dynamically this spottiness of the small scales can be made
plausible by a simple vortex stretching argument. Consider the point M within a large
scale structure which at the initial time T o has the largest vorticity amplitude I~I.
This point is also likely to have a large velocity gradient IVy[ ~ [~I- The strai-
ning action of the velocity gradient on the vortieity may then be described by a
crude form of the vorticity equation
Dt ; (3.1)
hence it is expected that the vorticity downstream of M will rise to very large
values (possibly infinite at zero viscosity) in a time of the order of the large eddy
turnover time t ~ I~l -I. o
Even if the vortieity at time T has a very flat spatial distribution, the non- o
linearity of Eq. (3.]) will lead to a very narrowly peaked spatial distribution at
+ t . So we see that small scale structures may be generated in a very loca- time T o o
lized fashion. This argument can be made fully rigorous fur the Burgers equation, but
not for the Navier-Stokes equation (L~orat, 1975). For the Navier-Stokes equation
there is the important complication that the velocity gradient at a point ~ is not
related in any simple way to the vorticity at ~; instead it is given by a Poisson
integral with a fairly substantial local contribution, but also with some coupling
to nearby points. This could smooth out the vorticity peak, but the smallest scale
structures will still have some tendency not to occur uniformly.
Assuming that the small eddies do indeed become less and less space filling, let us
now define the 8-model. At each step of the cascade process any n-eddy of size
= 2 -n produces on the average N (n+])-eddies. If the largest eddies are space £n £o filling, after n generations only a fraction
330
B n = B n (B ffi N /2 3 - < ]) ( 3 . 2 )
of the space will be occupied by active fluid (see Fig. 2). Furthermore we assume
that (n+1)-eddies are positionally correlated with n-eddies by embedding or attach-
ment. (For the sake of pictorial clarity this feature is not included in Fig. 2).
I°
Injection/-
@C) C) ©©©
0 0 0 0
• 6 m
TransFer © o
Dissipation
Fig. 2. The energy cascade for intemittent turbulence. Notice that the eddies become
less and less space filling.
It is straightforward to work out the modification to K4! in the B-model. Let v n now
denote a typical velocity difference over a distance ~ ~n in an active re$ion. The
kinetic energy per unit mass on scales ~ £n is then given by
2 ( 3 . 3 ) E n ~ B n v n
331
The characteristic dynamical time for transfer of energy from active n-eddies to
smaller scales is still given by the turnover time t n = Zn/Vn as in K4I : the
generation of (n+])-eddies arises from the internal dynamics of the n-eddy in which
it is embedded. We can express the rate of energy transfer from n-eddies to
(n+1)-eddles exactly as in K4], and as in K4! this quantity must be independent of
n in the inertial range:
Vn3/£ n - en % En/tn ~ 8n ~ E (3.4)
Defining
= - log 2 B ,
we combine Eqs. (3.2 - 3.4) to obtain
-e I /3 £ 1/3 (Zn/£o)-~13 (3.5) Vn n *
e - - I / 3 £ 2/3 (£n/£o)U/3 (3.6) tn n I
En ~ ~ 213 Zn2/3 (Zn/Zo)+U/3, (3.7)
and
E(k) % ~ 213 k-5/3 (k ~o )-u/3. (3.8)
All the intermitteney corrections may be expressed in terms of the self-similarity
dimension D = 3 -~ , a special case of Mandelbrot's (1975) fractal dimension, which
is related to the number of offspring by
N = 2 D (3.9)
That D can suitably be called a dimension is made clear by Fig. 3 which shows three
very familiar objects : a unit interval, a square and a cube which have dimensions
D equal to I, 2 and 3 respectively. If we reduce the linear dimensions of these
objects by a factor of 2, as in the cascade process, the number of offspring needed
to reconstruct the original object is 2 D. For mere complicated self-similar objects
a natural interpolation is N = 2 D, where D need no longer take only integer values.
(Some rather exotic examples can be found in Mandelbrot (1975).) It has been shown
by Mandelbrot (1974) that D is also the Hansdorff dimension of the dissipative struc-
tures in the limit of zero viscosity. D = 2 would correspond to sheet-like structures,
but in view of the experimental value of the exponent for the dissipation correlation
function a more likely value is D ~ 2.5 (See Frisch et al 1977).
332
-~ ................ ' . . . . I
/ ~ I ; , i I ,
/ i ' 4
I I, s~ --]~--7
Fig. 3 When the linear dimensions of a D-dimensional object are reduced
by a factor I (here 2), ID pieces are needed to reconstruct the original.
Moze exotic examples with non integer D, such as probably oceurin turbulence,
may be found in Mandelbrot (1975).
333
Remark 3.1
Remark 3.2
Remark 3.3
Equation (3.8) relating the correction to the 5/5 exponent of the K4I
theory and the fraetal dimension was first derived by Mandelbrot (]976)
using the Novikov-Stewart (1964) model.
Since 0 < D < 3 the corrections to K4] can not make the spectral expo-
nent larger than 8/3. This same upper bound can be derived rigorously
from the Navier-Stokes equation for finite energy turbulence (Sulem and
Frisch, 1975). This reference also gives a heuristic argument to show
that D > 2.
When (3.4) is used for the largest scales we obtain
Vo3!~ ° , 7 (3 . J O)
the same result as in the non-intermittent case. This is important since
(3.10) is frequen~lyused in practical calculations. Intermittency may,
however, be of practical importance in other ways, particularly when
chemical reactions are involved (Herring, 1973).
Dissipation sea!e
Equating the turnover time (3.6) to the viscous diffusion time £n2/~ we obtain the
dissipation scale
R -31(4-p) (3.11)
where we have introduced the Reynolds number
R = £ v /v ~ e 1/3 £ 4/3 -I (3.12) o o o
Singularities
Both the K41 and the ~-model imply that the three dimensional Euler equation (Navier-
Stokes with zero viscosity) leads to a singularity in a finite time. Indeed, if we
start with very smooth initial conditions, say only large eddies, then the complete
hierarchy of eddies, down to infinitesimal scales should be established in a time
t± ~ E t ~ ~olVo (3.13) n=0 n
334
Since there is now no viscous cutoff the enstrophy given by
f° fl = k 2 E(k) dk (3.14) 0
will become infinite at time tx. There is in fact some numerial evidence that such
singularities exist (Orszag, 1976a, b)° There are also a few known exact solutions
which display singularities at a finite time, but these solutions are badly behaved
at large distances (Childress and Spiegel, 1976). Finally various stochastic models
or low order closures of the statistical Euler equation can be shown to produce sin-
gularities in a finite time (Lesieur and Sulem, 1976; Andr~ and Lesieur, 1977).
NOTE ON THE M H D CASE
The K4| theory can be easily modified to account for the effect of Alfv~n waves. It
then yields a k -3/2 spectrum (Kraiehnan, 1965; Pouquet et al., 1976). How intermitten-
cy can be handled in the MHD case is not yet clear, but it is again likely to steepen
the spectrum. That the spectral exponent can become as large as 2.5 as suggested by
certain solar observations (Harvey, 1976) is a possibility which cannot be ruled out.
Finally we note that singularities should appear in the MHD case as well as in the
nonmagnetic ease. There are even some indications that they are present in two-
dimensional MHD flows (Pouquet, 1976) although they are known to be absent in the
non magnetic two-dimensional case (Wolibner, ]933). The presence of singularities at
a finite time in the MHD equation implies that magnetic field line reconnection
at high kinetic and magnetic Reynolds number occurs in a time which does not depend
on the magnetic diffusivity : it is essentially the large eddy turnover time.
335
REFERENCE S
Andre, J.C. & Lesieur, M. 1977, Evolution of high Reynolds number turbulence,
J° Fluid Mech.,to appear
Batchelor, G.K. & Townsend, A.A. 1949, Proe. Roy. Soc. A 199, 238
Batchelor, G.K. 1953, Theory of homogeneous turbulence, Cambridge U. Press
Childress, S. & Spiegel, E. 1976, Private communication
Prisch, U., Lesieur, M. & Sulem, P.L. 1976, Phys. Rev. Lett. 37, 895
Frlsch, U., Sulem, P.L. & Nelkin, M. 1977, A simple dynamical model of intermittent
fully developed turbulence, submitted J. Fluid Mech.
Harvey, J.W. 1976, Private communication
Herring, J. 1973, Private cormnunieation
Kolmogorov, A.N. 1941, C. R. Aead. Sci. USSR 30, 301, 538
Kraichnan, R.H. [965, Phys. Fluids 8, 1385
Kraichnan, R.H. 1972 in "Statistical Mechanics : New concepts, New problems, New
applications", Rice, S.A., Fried, K.F. & Light, J.C. Eds., University
of Chicago Press, Chicago
Kraichnan, R.H. 1974, J. Fluid Mech. 6_~2, 305
Kraichnan, R.H. 1975, J. Fluid Mech. 67, 155
Kuo, A.Y. & Corrsin, S. 1971, J. Fluid Mech. 50, 285
L~orat, J. 1975, Thesis, Observatoire de Meudon, Meudon, France
Lesieur, M. & Sulem, P.L. 1976, "Les ~quations spectrales en turbulence homog~ne et
isotrope : quelques r~sultats th~oriques et num~riques" in "Proc.
Journ~es Math~matiques sur la Turbulence", Temam, R. ed., Springer Lecture
Notes in Math., to appear
Lorenz, E.N. 1963, J. Atmos. Sci. 20, 130
Mandelbrot, B. 1974, J. Fluid Mech. 62, 33]
Mandelbrot, B. 1975, "Les Objets Fractals : Forme, Hasard et Dimension", Flarmnarion,
Paris
Mandelbrot, B. 1976, "Intermittent turbulence and fractal dimension : kurtosis and
the spectral exponent 5/3 + B" in "Proc. Journ~es Math~matiques sur la
Turbulence", Orsay~ June 1974, Temam, R. ed., Springer Lecture Notes in
Mathematics, to appear
Monin, A.S. & Yaglom, A.M. ]975, Statistical Fluid Mechanics, vol. 2 , MIT Press
Novikov, E.A. & Stewart, R.W. 1964, Isvestia Akad. Nauk USSR, Ser. Geophys. n = 3,
p. 408
Orszag, S. |976a, '~tatistical Theory of Turbulence" in "Proc. Les Houches 1973",
Balian, R. ed., North Holland, to appear
Orszag, S. [976b, Private communication
336
Pouquet, A. 1976, "Remarks on two dimensional MHD turbulence", preprint
Pouquet, A., Frisch, U. and L~orat, J. 1976, J. Fluid Mech. 77, 32!
Stenflo, J.O. ]976, "Influence of magnetic fields on solar hydrodynamics :
experimental results", to appear in "Proc. of IAU Coll. n ° 36" held at
Nice, September 1976
Sulem, P.L. & Frisch, U. 1975, J. Fluid Mech. 72, 4i7
Wolibner, W. 1933, Math. Zeitsehr. 37, 698
TURBULENCE : DETERMINISM AND CHAOS
Y. POMEAU
CEA~ DPhT, Gif sur Yvette, FRANCE
After the article of Ruelle and Takens (I), there has been recently much interest
in the problem of the "onset of turbulence". That is, instead of trying to understand
the structure of a well established turbulence flow, one studies the way in which a
flow "jumps" from a quiet stable laminar state to a turbulent state whenits Reynolds
(or Rayleigh) number increases.
The idea of turbulence is connected with the one of "chaos". The ergodic theory (2)
allows one to give a precise content to this last notion (Take care that this differs
from the one given in the review paper by May (3) , see also (4)) . One first considers
a time dependent quantity, say u(t), as, for instance, the fluid velocity at a given
point in a turbulent flow of fluid under constant (or periodic, or eventually station-
nary "in average") external conditions, in such a way that one may define a gliding
average as | [t+T
<~[u(t)]> = lim ~ dr' ~(u(t')), T ~ ~ Jt
where ~ is any smooth function.
We assume that these averaged quantities are independent of the initial conditions,
at least for "almost" any choice of them and that they do not depend on t. The "signal"
u(.) has the property of mi~ng, that we shall consider as defining the chaos if :
<~[u(t)]~[u(t+t')] - <~><~>> + o
tw~ for any smooth ~ and ~ . This property expresses the idea that, after a sufficiently
long interval of time, say t', the system "forgets" the detail of the initial condi-
tions (= the fluctuations of u at two very distant times are uncorrelated).
A good example of such a "chaotic" signal, with astrophysical implications, is
provided by the time dependence of the magnetic field of earth (5) . The geological data
show that the earth'smagnetic dipole has reversed a large number of times. It is of
interest to know whether these reversals occur "regularly" or at random. For that pur-
pose, let us consider the autocorrelation function which is built up from the data (6)
as follows : ~ and ~ represent the same function. This function is equal to +I when
the polarity is the same as now, and to (-I) in the reversed case. The autocorrela-
tion function of this random signal is given in Fig. I. It shows rather clearly that,
from this point of view, the reversals are at random, and the reversals follow appro-
ximately a Poisson law.
t 338
dipo]e data from Heirzler et al. J of geoph.reso 73,2119 (1968)
There is often a misunderstanding about this idea of chaos, which is implicitly
connected with the one of "noise". At least, at the level of the equations of the
motion, it is often thought that chaos mus~ be introduced by some noise source and
that chaos may exist in non-deterministic systems only (as, say, a damped harmonic
oscillator in contact with a heat bath). In order to Understand how chaos may arise
very simply from a deterministic process, let us consider a "discrete" dynamical
system. This dynamical system mimics a system depending continuously on time, wherein
measurements are made at discrete instants, say t| , tl+T, tl+2T, .o., tl+nT, .... •
Thus we shall define for this dynamical system a "variable" and a time translation
operator (that is an operator which allows one to jump from the value of the variable
at any time t to its value at time t+O . This is a dync~nieaZ system if the transfor-
mation acts continuously and is inversible, in mathematical terms it is a homeomor-
phi8m) (= time can he reversed to get the initial data from the final state). To
define the variable of our dynamical system, we consider a set K with a finite num-
berp say k, of elements and the doubly infinite sequences of elements of K :
{ .... } u t = ...i_n , .... i_l~Zo~Zl~...~in, ln+l,...
where ij (j = 1,2 .... ,k) 6 K . Thus, giving the initial data u(t I) , means that a
particular sequence is known. The transformation that allows one to find ut+ T , once
u t is given, is just the shift of the sequence ; by definition :
in(Ut+ T) = in_1(ut)
339
If we consider now the class of functions of u defined by
m=-Oo
with +co
I max ~0m(i m) I 2 < oo , m =-o= i 6K
m
it is easy to see that, if the im'S are taken a t random in K w i t h the p r o b a b i l i t i e s
Pl'''''Pk (k= cardinality of K) such as jE=l pj = 1 , then
+oo K
<~(u) > = I ~ Pn t~m (Yn) ' m=-co n = ]
where Yn is the n th element of K , and
<[~(ut) - <~>] [@(Ut+N) - <@>]> --+ 0 ,
which proves the property of mixing for our discrete (and deterministic) system.
Actually this last property expresses a very simple fact : ~ and @ depend on terms
of the infinite sequence {i m} which are located in a fixed part of this sequence.
Shifting at each step this sequence on the left, one "loses " part of the knowledge
of the values of the {i } in this region, as new i's come from the right which are m m
uncorrelated with the already known i's . Obviously this double infinite sequence m
looks very much as the k-ary expansion of a real number (except that it is doubly
- instead of singly - infinite). This helps to understand that, in a mixing dynamic-
al system, the noise source might just be the infinite (say decimal) expansion of
the real numbers defining the initial datas.
Of course this notion of ten.oral chaos is not sufficient to define
turbulence, as another typical feature of turbulence in unbounded flows is the
absence of spatial correlation with an infinite range. There is an obvious extension
of the mixing property to the 8pat~al ease, that is a position dependent function
u(~) has this mixing property iff
<:~[u(~)] ¢ [u (~+~ ) ] - <~> <@>> , o ,
where the averages are now to be understood as gliding space averages. This definition
implies, of course, that the flow is unbounded in some direction and that the turbu-
lent state is invariant under the translations along this direction. Although there
is clear evidence (7) that the turbulent flows have the mixing property both in
time and space, we shall only consider the time dependent properties, as the connec-
tion between spatial chaos and the original non linear Navier-Stokes equation is
rather unclear at the present time.
340
On the contrary, if one neglects completely this question of the spatial struc-
ture, one is led to consider the fluid motion as described by the solution of a sys-
tem of ordinary differential equations (O.D.E.). Of course, one is mainly interested
in the qualitative properties of these O.D.E., as the fluid equations cannot be
replaced by a fully equivalent system of a finite number of O.D.E. with an explicit
form. This qualitative theory of the O.D.E. has been the subject of detailed investi-
gations (8'9), in particular one may understand quite well how the solutions of such
a system may have the property of mixing.
Recently Ruelle and Takens (I) have drawn attention to the possible connection
between the onset of turbulence in flows and some bifurcation properties of O.D.E. •
Without going into too many details, I will just explain what is presently known
about the onset of turbulence. Following Martin and McLaughlin (]0), one may consider
three different ways for the occurrence of turbulence .
I. The onset o~ turbulence in the Lorenz s~stem
By a drastic reduction of the Oberheck-Boussinesq equations for a flow convect-
ing in a horizontal layer, Lorenz (ll) has obtained the following system of O.D.E. :
|,a
l.b
i.c
d~ d-~ = o (y-x)
dY = - •z + rx - y dt
dz d-~ = xy - bz
where ~ , r and b are numerical parameters. By numerical computations he has shown
that in some range of values of these parameters the motion described by these equa-
tions is chaotic and that the trajectory, after some transient, reaches very rapidly
an "attractor",which does not depend on the initial conditions. This attractor is
very interesting, as it is presumably structurally stable, that is it exists (and
remains attracting) for values of the parameters in open intervals, The idea of
structural stability is actually much more general(12),for O.D.E.,likethe system (1),
it just means (|3) that one may add a small perturbation depending on x , y , z on the
right hand side, without changing the topology of the velocity field defined by this
system .This means that by a homeomorphic mapping of space (i.e. a change of varia-
÷ f(~) that is both continuous and inversible) one may change the trajecto- bles x'=
ries of the perturbed system into the ones of the unperturbed system. The structural
stability is essential, as it means that the properties of the system under considera-
tion do not depend on the detai~ of the equations, and that they remain essentially
the same under any kind of (small enough, but finite) perturbation (actually one
thinks of perturbations arising from a lack of knowledge of the exact form of the
equations).
341
The structure of the Lorenz attractor has been recently studied by a number of
authors (]4), and I will just give a brief (and I hope clear enough) account of
these works. A simple way to describe it is the one of Williams (14). He first consi-
ders a "semi-flow", that is a flow on a surface where two sheets may collapse. This
cannot really represent the solution of O.D.E., as the motion cannot be traced back
unambiguously, as it should be allowed for O.D.E. But this helps to understand the
structure of the Lorenz attractor. I have tried to draw as clearly as possible this
surface on Fig.2 . It has two holes and the line along which the two sheets collapse
is the dashed line. On the right part of the figure is the section of the surface by
the mid -vertical plane.
\/
The trajectories run on this surface approximately as follows :
They revolve around each hole by diverging slowly and if at one of these revolutions
the trajectory cut the dashed line beyond the middle point , it is inserted at the
next turn close to the other hole and revolves by diverging slowly around this "new"
hole. Finally it jumps at random from a hole to the other. To understand why this
motion is non periodic, one considers the so-called Poincar~ transform on the shaded
segment where the two sheets merge. Let us define on this segment a coordinate~ say
x which varies between -] and +] . The Poincar~ transform defines a function f(x) ,
-I ~ f(x) ~ +] if -] ~x~+] : if the trajectory cresses the shaded line at x , its
next crossing will be at f(x). The function f(x) hasa discontinuity at x= 0 , and
looks approximately as represented in Fig.3 .
/ / I
342
If one assumes that its derivative (when it exists) is everywhere larger than I,
then it is clear that the application x ÷ f(x) cannot have any stable ~ixed point
or even any stable period . It is important to understand that this is possible only
because f(x) has a discontinuity, otherwise if f and its derivative were continuous
If'(x)Icould be not everywhere larger than l, if f['l,|] ~[-l,l].
Now the question of the mixing character of the motion is turned into the ques-
tion of the mixing character of the application x ÷ f(x). This mixing property is
rather obvious (if one does not want to get a rigorous proof) : consider two
points{x l, x2}very close to each other, and theirsuccessive transform :{f(xl),f(x2)}:
{f[f(xl)], f[f(x2)] } ..... ; {f(n)(xl) , f(n)(x2) } ....
(By definition f(n)(x) = f[f(n-])(x)] and f(]) E f) . The distance between the tans-
forms~f== x I and x 2 is multiplied after each application of f by a quantity larger than
min l~x 1 that is larger than I, thus it increases at least exponentially. This means
that, at some time, the two image points will be separated by the discontinuity of
f, and" the subsequent trajectories starting from xl or x 2 will be completely diffe-
rent from each other. This is a version of the mixing property : a small fluctuation
in the initial conditions yields,after some time, a huge difference in the arrival
points ; in other terms unless one knows the initial conditions with an infinite
accuracy, the motion is unpredictible after some ~n~te time.
Let us come back now to a more realistic description of the Lorenz attractor
I have already noted that it cannot be considered as a surface in the usual sense,
since two sheets cannot merge owing to the deterministic character of the equations
of the motion. To understand what really happens, it is only necessary to replace the
shaded line where the two sheets collapse by a small surface parallel to this line
(as a thin stick).Now the Poincar~ transform is no longer given by a function of one
variable, but by a plane transform: that is,given a starting point inside the stick,
one wonders what is the next crossing point of the trajectory inside this stick.Essen-
tially (although things are a little bit more complicated), the Poincar~ transform
looks very much like a Baker's transform (15) : the stick is first cut in two
pieces, (this cutting remembers of course very much the discontinuity in f), each
piece is stretched and the two resultant pieces are put together inside the stick
(see fig. 4). The transform of the coordinate parallel to the stick is very similar
to the one dimensional transform represented in fig. 3, but now the Poincar6 trans-
form has been made invertible as the coordinate perpendicular to x allows one to
~Actually Lasota and Yorke (Trans. of the A.M.S. 186, 481 (1973)) show that there
is an absolutely continuous invarlant measure for such f, and f is ergodic for this
measure.
343
distinguish between the arrival points with the same x. By a repeated action of the
transform pictured on Fig. 4, one obtains a section of the attractor, which is the "ob-
ject" stable under an infinite number of applications of Poincar~ transform. If one
cuts this section by a line perpendicular to the axis of the stick, one easily sees
that, after each application of the transform, the central part of the stick is de-
leted, then the central part of the remaining segments is deleted, and so on. This
is precisely the way in which one generates Cantor sets. Accordingly the Lorenz attrac-
tor has the structure of a Cat, tot set perpendicular to its "surface". It is a surface
with a number of sheets which has the power of a continuum.
A
E ~ I F ~ .......... I
Let us notice that these properties of the Lorenz attractor have actually not
been proved from a rigorous study of the system (1), although it is a very reasonable
extrapolation from the computer studies.
Another important feature of the Lorenz attractor is that it appears by an inver-
ted bifurcation from a pair of stable fixed points : in a domain of values of the
parameters (r,~,b), one reaches~from some initial conditlons,one of the stable fixed
points, or/and some other Lorenz attractor.This manner of occurrence of turbulence is
well known, for instance in Poiseuille flow(16); in a range of values of the
Reynolds number the laminar flow is linearly stable, but unstable against perturbations
with a finite amplitude, and at the upper limit of this domain the laminar flow
becomes linearly unstable. It must be stressed that the stability of convection
flows is much less well known than the one of the Poiseuille flow, so that it is not
$44
~lemr if the Lorenz system describes, even in a rough way, the bifurcation toward
turbulence of convective flows. From this point of view the existence (or the non
existence) of two sorts of flows for the same value of the Rayleigh number would be
an important test.
2. The theories of ,La~dau-HoFf and of Ruelle-Takens
Landau (17) and Hopf have given a quite convincing picture of the onset of turbu-
lence. In order to understand their idea, it is necessary to introduce the notion
of quasiperiodic funation. Let us consider a dynamical system with a periodic limit
cycle. The existence of such an oscillatory behaviour is known to occur (18) in con-
vective flows.
The theory (19) shows that, by a certain type of bifurcation (called the Hopf
bifurcation), this limit cycle may give rise to amotionwith two ~ncommensurate frequen-
cies, say ~| and ~2 (these frequencies are incommensurate if no non zero integers P and q
exist such thatp~| = q~2). Then any function of time in this flow should be ~asi-
periodic. To define such a function, let us consider a function of two variables,
say t I and t2, which is periodic of period 27 with respect to each of the~ variables:
f(t I + 2~n, t 2+ 2~m) = f(tl,t2)
whatever the integers n and m are.
From this function we may build the quasiperiodic function ~(t) = f(mlt,~2t). II
ml and ~2 are incommensurate, this function will appear (at least at first sight) as
completely choatic, although it is not chaotic in the sense of the mixing property.
Its frequency spectrum is concentrated at the frequencies m1' ~2' and
more generally at any linear combination p~! + q~2 with integer coefficients.
The extension of this construction to a function that depends periodically with
the period 2~ on any set of variables, say t!, t2,...,tn, allows one to define the
most general quasiperiodic function which has not the mixing property.
One must take care that such a quasiperiodic behaviour is not structurally stable
(except for the case of a single period). If a parameter as, say, the Rayleigh number
varies in the domain of quasiperiodic behaviour, then a periodic (and structurally
stable) limit cycle should be reached every time when ~I and ~2 (which both depend
on the Rayleigh number) are commensurate. The non periodic behaviour is reached for
~solated values of the parameter R only (20). The idea of Landau and Hopf is the a following: when the Rayleigh number increases many new bifurcations occur, which
always correspond to frequencies 1~commensurate with the already existing ones. Then,
one can show that, if the quasiperiodic function depends actually on all these fre-
quencies, then it tends toward a chaotic signal (in the sense of the mixing @roperty)
after an infinite number of frequencies have appeared.
345
One may wonder first about the validity of this theory, as it is unclear whether
an infinite or finite Rayleigh (or Reynolds) number is required for the occurrence
of an infinite number of bifurcations.
Landau and Lifshitz (2|) relate the number of "degre~of freedom" of a turbulent
motion with its Reynolds number R e . Asymptotically this number should increase as
(Re)9/4 , so that the existence of an infinite number of degrees of freedom requires
an infinite Reynolds number. They assume that each "degree of freedom" is actually
connected with the freedom in the choice of the phase at a bifurcation where a new
frequency appears. Although this notion of "degree of freedom" is widely used in
theoretical physics, the way in which the (R)9/4 formula of Landau Lifshitz counts the e
number of bifurcations between R = 0 and a large value of R is rather unclear. e e
On the other hand followlngRuelle and Takens (l) , the Onset of turbulenc% as dese;ibed
by Landau and Lifshitz, cannot be a '~eneric" phenomenon. They show that, after the
occurrence of a few non cormnensurate frequencies, the next bifurcation is toward a
non periodic attractor. This non-perlodic attractor is built as follows : let us
consider the case of a quasiperiodic motion with four ~ncommensurate frequencies, if
one represents the trajectory by the motion of a point in a four dimensional space,
it intersects a 3-dimensional hyperplane (i.e. a usual 3-d space) following a full
torus. Again one considers that the motion describes a one to one application of
this torus into itself (that is, each point inside the torus has an image which is
the next crossing of the trajectory with the hyperplane). It is possible (22) to
find a transformation of the torus into itself that is continuous, invertible , and
which transforms the torus into a strange attractor after an infinite number of
applications (see fig, 5). This attractor is also structurally stable.
Therehave been attempt~ 23) to find if this picture of the onset of turbulence is
valid for Taylor instabilities, it is not completely clear whether the experimental
findings are or not in agreement with this theory.
346
3. Approach of chaos by succesivebifurcationfis
For some values of the parameters, the Lorenz system (I) has one (or two) perio-
dic limit cycle(24~ When a parameter such as r varies in this domain, this limit cycle
becomes very rapidly much more complicated by a mechanism of "cascading bifurcations'~
These bifurcations are rather striking as they describe a smooth transition from a
periodic limit cyclewith a single period toward a strange (i.e. non-periodic) attractor.
For r ~ rl, the period of the limit cycle is, say T ; for r just below rl(bi-
fureation point)the period is 2T ; but as the motion is anharmonic, the amplitude of
the Fourier component of frequency |/2T start from a zero value at rl, then increases
continuously as r becomes smaller than r I . In order to make clear possibility of this
mechanism of frequency division, it is enough to draw a closed trajectory (= the
limit cycle) in the space of thevariables x,y,z (Fig. 6), when r becomes just a little
smaller than rl, this closed curve with singl~ orb becomes a closed curve with two
orbs, as drawn (approximately)in fig. 7 .
i
J
/
347
There is a series of such splitting of the limit cycle when r decreases, so
that the period of the limit cycle becomes 2T, 4T, ... 2nT. There is an infinite num-
ber of such bifurcations when r decreases from r I to, say r . The limit r~ is appro-
ached in a geometric fashion : let r be the value of r which the limit cycle of 2 n
period 2nT becomes unstable giving birth to a limit cycle of period 2n+]T, then the
quantity (r2n - r )/(r2n+l-r~itends to a limit as n increases, which is an universal
number (= independent of the detail of the equations) approximately equal to
0.214I 693. At the end of these bifurcations (i.e. at r , and for lower values ofr )
the period of the motion is infinite,which means that this motion is chaotic. Actually,
the recurrence time for a given point on the limit cycle is the period. It is easy to
see that the autocorrelation function for such a periodic system musttake the
same finite value at separation time of one period, two periods,... N periods, which
forbids it to tend to zero at infinite separation times (so that a periodic system
is obviously not ahaot~c). On the contrary, if the period of the motion is infinite,
the autoeorrelation function may tend to zero for large separation times, and the
motion may be chaotic.
At the present time,these are only theoretical examples for this occurrence of
chaos by an infinite number of bifurcations in a finite domain of variation of the
parameters. However this is probably (25) the best understood case.
CONCLUSION
Even if the proofs of theorems are quite remote, there is some hope at the pre-
sent time for understanding the way in which turbulence may occur in flows (convective
or not). However let us emphasize again that this approach leaves aside the question
of spatial chaos. Further studies in this domain are needed in order to know
whether the spatial chaos Dccurs or not in the infinite Rayleigh (or Reynolds) number
limit,as required by the Landau theory of turbulence.
REFERENCES
I. D. Ruelle, F. Takens, Comm. Math. Phys. 2iO , 167 (I971)
2. P.R. Halmos, Ergodic theory, Chelsea Pub. Comp. New York (1956)
3. R. May, Nature 261, 459 (1976)
4. T.Y. Li and J.A. Yorke, Am. Math. Month. 8_~2, 985 (1975)
5. T. Rikitake, "Electromagnetism and the earth interior" Elsevier (1968)
6. C. Laj, Y. Pomeau, in preparation
7. Monin, Yaglom, "Statistical Fluid Mechanics'~ Vol. I-2, MIT Press (1971)
348
8. I. Kubo, notes from the Nagoya Univ.,
E. Hopf, "Ergodentheorie", Splnger Verlag, Berlin (1937),
M. Smorodinsky , "Ergodic Theory,Entropy", Springer Verlag Lectures Notes in
math. 2 14, Berlin (1971)
9. S. Smale, Bull. AMS 73, 747 (|967)
I0. P.C. Martin and J.B. Me Laughlin, Phys. Rev. Left. 33, 1189 (1974), Phys. Rev°
At_!2 , 186 (1975)
II. E.N. Lorenz, J. Atmo. Sci. 20, 130 (1963)
12. R° Thom "ModUle math~matique de la morphoggn:se", 10-18, Paris (1974)
13. A. Andronov and L. Pontryagin, Dokl. Akad. Nauk. SSSR 14, 247 (1937)
14. D. Ruelle, in "Turbulence and Navier-Stokes equation" Lecture Notes in Math,
565, Springer Verlag, Berlin (]975),
O. Lanford, III lectures at IHES (1975)~
R.B. Williams "On the Structure of the Lorenz Attractors", preprint
15. V. Arnold, A. Avez, "Ergodic Problems of Classical Mechanics", Benjamin, New York
(1969)
]6. R. Betchov and W.O. Criminale Jr. "Stability of Parallel Flows", Acad. Press,
New York (]966)
17. Landau et Lifehitz, "M:canique des Fluides" chap. III, §27, ed. Mir. (Moscou)
(1971)
18. D.R. Caldwell. J. of Fluid Mech. 64, 347 (1974)
]9. N.N. Bogoliubov, J.A. Mitropolskii, A.M. Samo~lenko, "Methods of accelerated
mechanxcs convergence in non linear " " Spinger Verlag, Berlin (1976)
20. V.I. Arnold, Small divisors I, Izv. Akad. Nauk. SSSR Ser. Mat. 25 (1), 21 (J961),
Small divisors II, Usp. Mat. Nauk I8 (5), 13 (]963) ; 18 (6), 9I (]963),
M.R. Hermann, Thesis, Orsay (1976)
21. Reference 19, p. 154
22. Reference 9, p. 788t
M. Shuh, Thesis, Univ. of CaLif. Berkeley (]967)
23° J.P. Gollub, S.L. Hulbert, G.M. Dolny and H.L. Swinnay, to appear in "Photon
correlation, spectroscopy and velocimetry " Ed. E.R. Pyke and H.Z. Cun~ins,
Plenum Press (1976)
24. J.L. Ibanez, Y. Pomeau, to be published
25. Ref. 3, and P. Stefan, preprint IHES (Bures-sur-Yvette) Dec. ;976~
A.N. ~arkovskiy, Urk. Math. t. 16, I (1964),
B. Derrida and Y. Pomeau, to be published
STELLAR CONVECTION
D.O. Gough
Institute of Astronomy, Cambridge, England
The most important function of a convection theory for stellar model building is
to determine the temperature stratification in terms of the heat flux. Apart from in
some recent work by Latour et al. (2), mixing length theory, in one or other of its
guises, still provides the only prescription that is used. Unfortunately this is not
a reliable procedure because of the crude way in which the dynamics is treated. Fur-
thermore, the resulting formulae depend on the mixing length £ which occurs in the
theory as an undetermined function. Work on convection that may one day lead to a more
satisfactory theory is taking place, but none of it has yet reached the point to
warrant displacing the methods currently practised in stellar evolution computations.
The reader is referred to the reviews by Spiegel for a discussion of the astrophysi-
cally relevant work on convection up to 1972 (2,3).
A.ATTEMPTS TO MODEL THERMAL CONVECTION
One of the principal factors inhibiting progress in stellar convection theory is
that conditions in stars are very different from those in the laboratory. Stellar con-
vection is characterized by high values (1020 ) of the Rayleigh number R, which is a
dimensionless measure of the temperature gradient, and low values (10 -9 ) of the Prandtl
number ~, which is the ratio of kinematic viscosity to thermal diffusivity. On the
other hand in the laboratory R is quite low (< 1011 ) by astrophysical standards, and
is of the order or greater than unity. Moreover, stellar convection zones extend typi-
cally over many scale heights of pressure and density, leading to compressible motions,
whereas in the laboratory the depth of a convecting layer is always a minute fraction
of a scale height: the motion is essentially incompressible and can be described by
the Boussinesq approximation (4).
Most of the theoretical work is aimed at mimicking laboratory conditions. A thin
layer of fluid bounded by twQ isothermal planes, the lower boundary being at the higher
temperature, is usually considered. The equations of motion are solved, usually in the
Boussinesq approximation, either numerically at moderate R and ~ (5-]0) or analytically
at low R close to the critical value R c at which a static fluid layer becomes unstable
to convection (;I). At present the computational difficulties are too severe to extend
these calculations to values of R and ~ of astrophysical interest. The main objective
Reprinted from: Trans. (Reports), 16A, Pt II, 169 (1976)
350
is to determine the dependence of the Nusselt number N, a dimensionless measure of
the heat flux, on ~ and R. A plausible extrapolation procedure might then lead to
a better prescription for stellar convection.
The most obvious procedure, one might think, would be to apply mixing-length theo-
ry to laboratory convection and attempt to determine I experimentally. However it is
the astronomer's belief that this is of no use, for whereas in the laboratory eddies
extend across the whole of the convecting layer, in a compressible fluid many scale
heights deep the shear produced by differential expansion and contraction of verti-
cally moving fluid is thought to disrupt the convective motion in about a scale height
(Schwarzschild, (12)). This is a nonlinear argument, and so is not contradicted by the
fact that the most unstable linear modes extend across the entire convectively unstable
region (Spiegel, B~hm, (13-16)). Accordingly, I is presumed to be proportional to a
density or pressure scale height (Opik, Vitense, ]7,18)), usually the latter, the con-
stant of proportionality being determined astronomically, and contact with terrestrial
experience is lost.
In its most usual form the mixlng-length theory provides a local relationship be-
tween the heat flux F and the snperadiabatlc temperature gradient D = V - Vad" There
are many uncertainties in the theory, and consequently there is opportunity to incor-
porate into it several adjustable parameters, though only two are of immediate inte-
rest (]9): ~ ~ 1/H, where H is an appropriate scale height, and a measure 7 of the
radiative losses. The theories are calibrated either by constructing a solar model
and adjusting it to have the correct luminosity and effective temperature at an age
of about 4.7 x ]09 yr (Schwarzschild et el; Sears (20,2])), or by fitting a theoreti-
cal sequence to a young cluster diagram (Demarque and Larson; Copeland, Jensen and
J4rgensen (22,23)). Both methods yield ~ = |, the precise value depending on the de-
tails of the theory adopted, but leave y undetermined. Some comfort is derived
from the observation that this implies an eddy size at the top of the solar convection
zone comparable with the length scale of the granulation (Schwarzschild (12)). The
gross structure of a main sequence stellar model is insensitive to ~, which matters
only near the outer edge of the envelope convection zone. The parameter ~ does affect
the convective envelopes of red giants, however, which have large nonadiabatic regions
(Henyey et al (24); Schwarzschild (25)). Red giant models are probably more sensitive
to other details of how convection is treated, too, especially in the surface layers
where fluctuations are large in magnitude and horizontal extent.
It should be noticed that the mixing-length formalisms used in stellar structure
computations are based on the Boussinesq approximation to incompressible flow. One
would have expected (4) this to have been valid had Z been much less than H, but in-
dications are that it is not a good approximation otherwise (Graham (26), Deupree (27)).
Thus the calibration 1 = H exposes an inconsistency in the theory. However, the evi-
dence for the functional dependence 1 = H is hardly overwhelming, and adopting it no
doubt introduces errors that are just as great. The apparent success of the mixing-
351
length formalism lies in the fact that the gross structure of a main sequence stellar
model is almost independent of the functional form of the relationship between F and D
(Gough and Weiss (19)).
Stellar mlxlng-length theory ignores viscosity. This sounds plausible since the
Reynolds number of the heat-transporting flow is large and ~ is small. It implies
that N depends on ~ and R only in the combination ~R which is independent of visco-
sity. Furthermore, since N increases with R at fixed ~, it must therefore increase
with ~ at fixed R, provided ~ remains small.
The beginnings of an attempt to bridge the gap between laboratory and stellar
conditions, using a trO~cated modal expansion, has recently been reported (Gough,
Spiegel, Toomre (28,29)). Although the analysis is in the Boussinesq approximation,
it can treat with the same assumptlonsthe extreme values of R and ~ typical of stars
and the more moderate values encountered in the laboratory. The procedure can be made
to reproduce some of the features of laboratory convection, but its most obvious draw-
back is that it contains several undetermined parameters. In its simplest form there
are just two such parameters, characterizing the horizontal scale and shape of the
convective eddies. Although this is perhaps an improvement over mixing-length theory,
which depends on an undetermined function Z, an unambiguous calibration by comparison
with laboratory convection has not been possible. The theory has the property that for
Y<< I, N is a function of ~R, provided the convection is three-dlmensional, which
accords with astronomers' prejudices.
It should be pointed out, however, that the ~ dependence of N is not universally
believed. This arises partly because almost all laboratory experience is with fluids
that have ~ ~ 1, and for these both theory and experiment show that N is almost inde-
pendent of ~ at fixed R. Furthermore, numerical solutions of the Boussinesq equations
at moderate R, which until recentlyhave always constrained the flow to be two-dimen-
sional, have predicted almost no ~ dependence~ and even a slight increase of N as
is decreased below unity (Veronis (5); Quon (8); Moore and Weiss (9)), though it has
been argued that this may be a result of constraining the horizontal length scale of
the motion (Lipps and Somerville (6); Willis, Deardorff and Somerville (7)).The sim-
plest modal analysis predicts that N is independent of Y when the motion is two dimen-
sional (28). Analytical expansions of the full Boussinesq equations for R near R c re-
veal only a weak dependence on ~ in that case too (Sehluter~ Lortz and Busse (30)) •
but, like the modal result~ suggest a strong decrease in N at low ~ when the motion is
three-dimenslonal. This led Jones, Moore and Weiss (31) to investigate numerically
axlsyrmmetrieal convection in a cylinder which, though mathematically dependent on only
two space variables, is geometrically three-dimensional. They reproduced the analyti-
cal results for R just above Re, but showed that at moderate R the flow readjusted it-
self to resemble the two-dimensional flows, and produced an N that is independent of
~at high and low ~, and slightly decreasing with ~ in the neighbourhood of ~ = I.
352
The issue is unresolved. Jones et al. suggest that their flow is unstable and that
at sufficiently Nigh R it would become turbulent with N independent of viscosity at
low ~. Moreover, three-dimensional calculations reported recently by Veltishchev and
Zelnin (10) at ~ = 0.7 and ~ = ! suggest that the flow does not adjust itself to the
kind of structure that is preferred when axisynm~etry is imposed, and that N is less
when ~ = 0.7 than it is when o = I. This concurs with the evidence from laboratory
experiments (29), though this is admittedly weak. Finally, if a convection theory
based on eddies of scale I = H and governed by dynamics similar to that exhibited by
the two-dimensional and axisymmetrical Boussinesq computations (and therefore implying
N is independent of ~) were subject to the usual astronomical calibration, the result
would be that I would be but a very small faction of H, which most astronomers would
find unpalatable.
Little study has been made of fully developed convection in a layer of compressible
fluid many scale heights deep. Graham (26) has made some two-dimensional computations
for a perfect gas at moderate R and ~.The property exhibited by similar Boussinesq
calculations that N is a decreasing function of ~ when ~ = | is accentuated as the
layer depth, and the effects of compressibility, are increased. Moreover, no tendency
for eddies to break up on a scale of H was found. Graham's more recent three-dimensio-
nal compressible calculations yield similar results (32). Compressible modal calcula-
tions have also been performed in the anelastic approximation; by van der Borght (33)
with ~ = 1 and by Latour et al. (]) under more realistic stellar conditions modelling
an A star envelope. As with the Boussinesq calculations there are undetermined para-
meters which can be chosen to produce plausible results. Once again, time-dependent
calculations (]) show no tendency for the motion to break up into eddies on the scale
of H.
B. PENETRATION AND OVERSHOOTING
The edges of stellar convection zones are not rigid inpenetrable boundaries as they
are in most laboratory and theoretical investigations. The density stratification chan-
ges from being conveetively unstable to convectively stable, from the point of view of
linear stability analysis, and fluid accelerated in the convectively unstable region
c~n penetrate, Qr overshoot, into the adjacent stable regions.
This phenomenon has been of interest particularly to meteorologists interested in
mixing at the atmospheric inversion (34). D.W. Moore (35) gives a brief account of the
relevant physics. A convective element, or thermal, on reaching the top (or bottom) of
the conveetively unstable region, still has a temperature excess (or deficiency) rela-
tive to its i~mediate surroundings and continues to experience a buoyancy force. If
the element were to maintain its identity and move adiabatically in pressure equili-
brium with its surroundings, buoyancy would not disappear until the level z = z s were
353
reached at which the specific entropy were the same as at the level at which the ele-
ment originated. This point does not necessarily mark the edge of the zone of penetra-
tion, however, because the element still has momentum to carry it on yet further. En-
traiament of stable fluid, on the other hand, retards the motion of the element. Thus
Z = z~ may either overestimate or underestimate the extent of penetration. Observa-
tions of the motions of cloud tops suggest that it is usually an overestimate
though in some circumstances, such as in tropical storms, large plume - like
structures penetrate well above the tropopause (36). An additional complication, usually
ignored by meteorologists in this context, is radiative diffusion, which tends to re-
duce both the buoyant acceleration and retardation by reducing temperature fluctuations.
Buoyant thermals penetrating into the stable layer advect heat upwards, though this is
offset by the induced return flow. Near the outer edge of the penetrated region both
upward and downward moving fluid presumably transport heat counter to the net flux.
Some aspects of the situation can be modelled with the ice-water experiment. This
consists of a layer of water cooled from below at 0=C and with its upper boundary main-
tained above 40C, the temperature of the density maximum. Laboratory experiments show
that the unstable layer extends beyond the limits it would have occupied had there been
no motion (Townsend (37), Adrian (38)), and in addition plume-like motions in the un-
stable region penetrate into the stable layer above. Adjacent layers of convectively
stable and unstable fluid have also been created by inducing spatially varying tempe-
rature gradients in water near room temperature, either by imposing time varying boun-
dary conditions (Krishnamurti (39); Deardorff, Willis and Lilly (40)) or by internal
heating or cooling (Failer and Kaylor (4|); Whitehead and Chen (42)). The nature of
the motions in the stable layer is not entirely clear, but the temperature fluctuations
observed by Townsend (37) seem to be the product of trapped gravity waves. Theoretical
numerical experiments in two-dimensions by Moore and Weiss (43) also exhibit the en-
croachment of the unstable region into the region that would have been stable in the
absence of motion, and the excitation of gravity waves. They also predict weak vis-
cously driven countereells which are not seen in the laboratory, and little evidence
of plumes. Earlier steady one-mode mean-field calculations, which in some sense repre-
sent two-dimensional motion, yielded similar results, without the gravity waves (Mus-
man (44)). Thus some of the observed features of laboratory experiments are reproduced
theoretically; the differences, as Spiegel (3) has pointed out, might result from the
two-dimenslonal constraint imposed on the numerical computations.
Although the ice-water experiment sheds some light on the mechanism of penetration
it seems difficult to generalize to stellar conditions. There is some evidence from the
two-dimensional numerical experiments that penetration increases as Prandtl number de-
creases (D.R. Moore (45)). Modal calculations by Latour (46) et al.(]) modelling three-
dimensional convection in A star envelopes predict greater penetration by the almost
plume-like columns in the eentres of hexagonal cells than by two-dlmensional rolls.
However, this analysis has not been applied to the ice-water problem; there is yet no
354
bridge between stars and laboratory experience.
The various theoretical prescriptions that are usually employed to describe over-
shooting from stellar convection zones are all essentially based on mixing-length theo-
ry (Spiegel (47), Parsons (48), Ulrich (49), Scalo (50), Shaviv and Salpeter (51)). Of
necessity they are nonlocal theories, though they all rely on the Boussinesq approxi-
mation. They have been used, in particular, to model the solar atmosphere which is per-
haps the most sensitive astrophysical testing ground at present, because quite detailed
comparison of theoretical predictions with observations can in principle be made. It
is not easy to deduce the height dependence of the solar atmospheric velocity fluctua-
tions~ nor is it easy to disentangle convective motion from waves, though an attempt -I
has been made (Frazier (52)). It seems likely, however, that velocities of about 2 Pan s
extend well above the photosphere (de Jager (53)), which agrees with a model computed
by Ulrich (54), though the theory does appear to overestimate the overshoot. Travis and
Matsushima (55), using a theory of Spiegel (47), compare their models with limb darke-
ning measurements and conclude also that too great an overshoot is predicted if a mixing
length to scale height ratio ~ of about unity is adopted; they favour ~ ~ 0.35, in
contradiction to the usual calibration. A subsequent investigation by Travis and Matsu-
shlma (56) of the eolours of cool main sequence stars and metal-deficient subdwarfs also
suggested a low value for ~. Nordlund (57), using Ulrich's approach, found overshoot to
a lesser degree for a given ~, and produced a model in better agreement with the Harvard-
Smithsonian Reference Atmosphere (58) and similar to an earlier model built by Parsons
(48) using a convective heat flux calculated from a nonlocal estimate of vertical velo-
city and a local estimate of temperature fluctuations. In contradiction, Edmonds's ana-
lysis (59) of the photospheric velocity and brightness fluctuations favours a greater
degree of overshoot, so the matter seems unresolved. One thing that does seem clear is
that at their present stage of sophistication nonlocal convection theories should not
be relied upon to explain fine details, especially in regions in which the assumptions
on which they are based are not satisfied. All the theories have adjustable parameters
and can no doubt be tuned to rationalize the limb darkening; adjusting the radiative
loss coefficient in Spiegel's theory, for example, could probably lead to an atmosphere
hardly distinguishable from Nordlund's with an ~ consistent with the evolutionary ca-
libration. Indeed Spruit (60) has produced a model with the correct centre to limb
flux variation using a local mixing-length theory with no overshoot at all, though
presumably this does not reproduce the fluctuation measurements discussed by Edmonds
(59). Although there seems to be too much uncertainty in the theories at present to
apply such subtle tests, detailed analyses of inhomogeneous atmospheres must eventually
be undertaken both for theoretical model building and for analysing observational data.
Horizontal temperature fluctuations increase the horizontally averaged opacity, for
example, since opacity is a steeply increasing function of temperature, which leads to
an increase in the actual mean temperature gradient. Furthermore, since the magnitude
355
of the fluctuations decrease with height, the temperature gradient currently inferred
from limb darkening observations (55,57) is overestimated when fluctuations are ignored.
Abundance measurements may be affected. Turbulent Reynolds stresses generated by both
the convection and the gravity waves in the photospheric regions also influence the
stratification.
Another important consequence of overshooting is material mixing, particularly
at the edges of convective cores. Early estimates (Roxhurgh (6]); Saslaw and Schwarz-
schild (62)) which ignored the influence of the convective energy flux on the tempe-
rature stratification, implied negligible mixing rates. But recently Shaviv and Sal-
peter (51) pointed out that the modification to the stratification increases the pe-
netration of the motion into the stable envelope, just as in the case of the ice-water
experiment. Maeder (63) and Cogan (64) independently confirmed this conclusion with
more detailed calculations. The influence of the overshooting on colour-magnitude dia-
grams for old open clusters was subsequently investigated. Of particular interest is
the position and magnitude of the gap at the top of the main sequence, which can be
more accurately reproduced theoretically if an appropriate degree of mixing at the
core boundary is assumed (Maeder (65), Prather and Demarque (66)). Using Shaviv and
Salpeter's prescription for overshooting, Maeder (67) found once again that a value
of ~ somewhat less than unity gives the best results. This too should not be regar-
ded as contradicting the usual calibration, partly because the chemical composition
adopted for the models may not have been appropriate, partly because there lles buried
in the mixing-length formalism an undetermined parameter in the relation between ve-
locity and temperature fluctuations that does not appear in the formula for the heat
flux (|9), partly because the geometry of the core has not been taken into account,
and partly because the ratio of the mixing length to pressure scale height can hardly
be a universal constant.
Calculations by Sugimoto and Nomoto (68) and Iben (69) suggest that theoretical
predictions of nucleosynthesis in post main sequence stars would be significantly
affected by overshooting beneath convective envelopes. It would also have some bea-
ring on the observed lithium abundance in the sun (Spiegel (70)).
C. SUBCRITICAL CONVECTION
In the relatively straightforward case of ordinary convection discussed in ~A,
N is an increasing function of R at fixed ~. That is not necessarily the case when
agents such as rotation, magnetic fields or nonuniformities in composition are pre-
sent to inhibit the motion. The minimum Rayleigh number R o above which convection can
exist is modified by the presence of the stabilizing agent, but it is not always
possible to determine its value by linear stability analysis. It is often the case
that direct convective motion of finite amplitude can adjust itself to reduce the
efficacy of the stabilizing forces~ and so exist at a Rayleigh number below the
356
value R C predicted by linear theory (Veronis (71)). This is called suhcritical con-
vection. Of course such a state can be achieved only if it were approached hy lowering
R from a value greater than R~ or if a metastable state, with R < R < R~, were appro- o
priately perturbed by a finite amount. It seems that the latter is often achieved spon-
taneously because in many circumstances there is a range of R below R e within which
the fluid is overstable, that is to say unstable to infinitesimal oscillations
(Veronis (71), Weiss (72)). Weak experimental evidence exists to support the idea
that such motion might grow to an amplitude great enough to trigger subcritical con-
vection (Turner (73), Shirtcliffe (74), Rossby (75)).
The most widely studied problem of this type, and perhaps the easiest to understand,
is thermohaline convection. Turner (34) summarizes well the present state of knowledge.
The case of interest here is when salt stabilizes a layer of water heated from below.
Veronis (7] , 76) studied the overstability and suhcritical direct convection and gave
a simple physical explanation of why they should occur (71)~ Laboratory experiments
reveal that instability does occur first as a growing oscillation (74), and that con-
vection subsequently organizes itself into a series of superposed shallow layers sepa-
rated by diffusive interfaces (Turner and Stormnel (77)), a configuration that has
been observed to occur naturally (Hoare (78), Neal, Neshyba and banner (79)). The
fluxes of heat and salt appear to be controlled by the diffusive interfaces, and Tur-
ner (80) has observed that their ratio ~, When measured in units of the fluxes that
would have occured had motion been absent, appears to be independent of the ratio X of
the jumps in salinity and temperature across the layer, over a wide range of ~. Indeed,
it has been suggested (Turner (34)) that this value of # depends only on the diffusion
coefficients of the fluid, though recent experimental work indicates that it depends
also on R (Marmorino and Caldwell (81)).
The astrophysical relevance of thermohaline convection is to the edges of convective
cores of stars (Spiegel (82)),where the products of nuclear reactions~ usually helium,
take place of salt. When the usual criterion for convective instability is employed in
a massive stellar model evolving off the main sequence, it is found that once a suffi-
cient, stable discontinuity of composition is built up at the edge of the convective
core, the envelope immediately outside it is also convectively unstable. This has been
considered unacceptable by many astrophysicists, and it is assumed that the disconti-
nuity is somehow smoothed out, usually to precisely the degree that results in no more
than marginal stability immediately beyond the truly convective core (Ledoux (83),
Tayler (84), Schwarzschild and Harm (85)), though other amounts of mixing have been
proposed (Gabriel (86), Sale (87)). Different criteria are used to define marginal
stability, which lead to rather different results, but it does not seem possible to
choose between them by astronomical means (i e,g, Ch~osl and Summa (88) ;
Robertson (89); Swefgart and Demarque (90); Z~6~kows~i (91); Varshavskfi (92);
Sreenivasan and Ziebarth (93); Stothers and Chin (.@4)).
This situation has some similarftles to thermohaline convection set up by heating from
357
below an initially isothermal layer of water stably stratified with salt, but the ana-
logy is not perfect. Gabriel (86) has argued against generalizing from laboratory expe-
rience in this case. However, the idea that at least one shell of ordinary convection
is created outside the convective core does not seem implauslhle, though it is not ob-
vious whether the interface separating the two convecting regions would be stable enough
to survive the disrupting forces of the turbulence. Such a possibility has been pointed
out by Tayler (84, 95) as a mathematically consistent alternative to the conventional
procedure. To determine the structure of the region an understanding of the diffusive
interfaces is required. Had ¢ been independent of % and ~ one might have had some con-
fidence in extrapolating laboratory measurements, especially sinee~ if thin convective
layers are formed, this is one place where the Boussinesq approximation might be valid ;
but it appears that the answer is still out of reach.
Subcritical convection may also be relevant to solar type stars. It has been pointed
out that the stability characteristics of the solar core are potentially similar to
those of the thermohaline situation : overstable to infinitesimal perturbations and
able to sustain direct convection of finite amplitude (Dilke and Gough (96)). An
important difference~ however~ is that whereas the usual saline layer derives its ener-
gy from an externally imposed heat source and will convect so long as that source is
maintained, the sun must derive its extra energy from burning a supply of 3He which is
mixed from the edge of the core. The amount of SHe available is finite and after it
is burnt convection is presumed to cease, and the solar core becomes quiescent again
until a new supply of fuel has accumulated near its edge. If it occurs, this process
may have some bearing on the solar neutrino problem and the occurrence of terrestrial
ice ages. Subsequent more detailed analysis has supported the overstability postulate
(e.g. Noels et al., (98) Unno (99, 97, I00) ), though some computations have cast
doubt on it (Christensen-Dalsgaard and Gough (10])). The likelihood of subcritical
convection is questionable too (Ezer and Cameron (102) ; Ulrich (]03)), though some
evidence for it has been found (Rood (I04)).
D. ROTATION AND MAGNETIC FIELDS
Uniform rotation inhibits convective motion and so increases the critical Rayleigh
number above which convection can take place. At finite amplitude the motion can redis-
tribute the angular momentum so that subcriticai convection can occur (Veronis (IO5)).
Typically the constraint cannot be cancelled entirely and the rotation reduces the heat
flux. This is not always true, however. Rossby found in the laboratory that rotation
sometimes increases N at fixed R, a behaviour seen also in three-dimensional numerical
experiments (Somerville and Lipps (]06)) and a modal analysis (Baker and Spiegel (]07).
It gives fair warning to those who argue that factors inhibiting linear instability
necessarily inhibit subsequent nonlinear development.
358
Astrophysical interest in the interaction between convection and rotation has
been concerned in recent years with the structure of the convection zone in the solar
envelope and the maintenance of the differential rotation. Of particular interest are
the numerical experiments by Gilman (108). The subject has been reviewed recently by
Gilman (]09), Durney (]]0) and Weiss (]]J). Tayler (I12) has discussed convection in
rotating stellar cores.
The solar convection zone will not be understood until it is known how convection
interacts with magnetic fields. It is hard to infer the field strengths beneath the
surface, especially since the topology of the convective motion is such as to submerge
the field (Drobyshevskiand Yuferev (I13)). The formation and decay of magnetic field
concentrations are of obvious interest, and are reviewed in the proceedings of IAU
Symposium n ° 7]. Like uniform rotation, a uniform magnetic field tends to inhibit con-
vective motion. The linear stability characteristics of a plane Boussinesq fluid layer
heated from below are similar to the rotating case with no magnetic field. But Weiss
(114) has pointed out that there are fundamental differences between Lorentz and Coriolis
forces and that care must be taken when comparing the two cases. Weiss found that the
nonlinear development of both overstable and direct infinitesimal motions can be oscil-
latory, provided the magnetic field is not too weak. The final state is not necessarily
one in which there is equipartition between kinetic and magnetic energies (Peckover and
Weiss (I15)). Modal calculations (van der Borght, Murphy and ~piegel (I16)) have
revealed only a decrease in N at fixed R as the magnetic field increases, hut a magnetic
field appears to be able to interact with a rotating fluid in such a way that the
resulting Nusselt number is greater than it would have been in the absence of the field
(van der Borght and Murphy (I]7)).
Of interest recently has been the question of whether convection can be the source
of dynamo action. Childress and Soward (118) demonstrated that the kind of flow encoun-
tered in a rotating convecting fluid is suitable for amplifying magnetic fields, as has
been noticed also by Spiegel (3). Perturbation expansions about the marginal state
(Soward (119) ; Roberts and Stewartson (]20)) and a modal analysis (Baker (121)),
both of which incorporate the forces on the fluid arising from the perturbed magnetic
field, indicate that a convectlng fluid can indeed sustain a magnetic field by induction.
E. TIME-DEPENDENT CONVECTION
New difficulties are encountered when a star is varying globally on a time-scale
comparable with the convective turnover time. This may occur when the star is not in
hydrostatic equilibrium : during gravitational collapse, a nova or supernova explosion,
a flare or envelope ejection, or whenever a star pulsates. It is perhaps for pulsating
stars that an understanding of the time dependence of convection is most urgently needed
because both theory and observations have progressed further than in the studies of other
359
classes of intrinsically variable stars. Many of the gross features of the observations
have been explained, but the position of the red edge of the Cepheid strip, for example,
remains unsolved. This can probably be blamed on an inadequate treatment of convection
in the theoretical models.
Most computations of stellar pulsations have either ignored convection entirely,
ignored perturbations (either Lagrangian or Eulerian) in the convective heat flux indu-
ced by the pulsations, assumed the convection to adjust instantaneously to its changing
environment, or assumed it to relax towards the state given by the usual mixing-length
formulae at a rate proportional to the amount by which it deviates from that state and
inversely proportional to the eddy lifetime. The last of these prescriptions is perhaps
the most credible, and was first used by Cox et al. (]22) to compute model Cepheids.
However, its obvious deficiency is that it contains a free parameter : a constant of
proportionality that determines the rate at which convection readjusts to the pulsation.
This in turn determines the phase difference between the convection and pulsation, upon
which the pulsational stability of the star directly depends.
Attempts have b~en made by Cough (123) and Unno (324) to generalize the mixing-
length theory. Unfortunately there are different ways of formulating the fundamental
postulates. The resulting formulae are essentially the same for hydrostatic stars but
differ when the star is presumed to pulsate. M~reove~ there appear to be no relevant
laboratory experiments with which to compare the various possibilities. Despite these
uncertainties it would be interesting to know how sensitive pulsating stellar models
are to the assumptions behind the convection formalism, and Whether a possible choice
of the uncertain parameters exists that rationalizes the observations. Computations in
the quasiadiabatic approximation suggest this may be so, but apart from misrepresent-
ing nonadiabatic effects these computations are deficient in an important respect :
they do not take due account of the turbulent Reynolds stress.
It is usual to ignore the Reynolds stress when computing stellar models, partly
because ~ pO~t~O~ mixing-length estimates are less than the gas pressure gradient in
all but a thin region at the top of the hydrogen ionization zone. Attempts to include
this stress in nonpulsating stars have been made, notably by Henyey et al. (24) and
Parsons (48) but the formulation adopted is not entirely consistent. The mixing-length
formula for the Reynolds stress adds second derivatives of temperature and pressure to
the hydrostatic equation, raising its order and introducing singular points at the edges
of the convection zones. This has led to numerical instabilities (24) which have been
removed by judiciously ignoring high derivatives. It is claimed that this should not
alter the results substantially. A consistent computation should be done to check.
It is even more important to include the Reynolds stress in pulsating models. The
motion of most of the star is almost adiabatic ; density and pressure perturbations are
almost in phase and the work done in a single cycle is much less than the energy ex-
changed between thermal, gravitational and kinetic forms. There is no reason to suppose
that the turbulent stress is in phase with the density, however, so even though its
360
magnitude may be much less than the gas pressure gradient the work it does might not
be negligible.
The modal approach adopted by Latour et al. (1) includes the Reynolds stress.
Because viscosity is include the equations are not singular, but the low Prandtl num-
ber gives such severe numerical trouble that it has not yet been possible to compute
stellar models with deep convection zones. In principle this method can be used for
pulsating stars, but certain aspects of the turbulent energy transfer are lost in the
modal truncation and once again the results would have to be treated with some caution)
A recently discovered pulsating star of some interest is the sun (Hill and Stebbins
(125) ; Fossat and Ricort (]26) ; Severny, Kotov and Tsap (I27) ; Brookes, Isaak and
van der Raay (128)), which is pulsating in many modes simultaneously. The pulsations
are of too low an amplitude to have a noticeable influence on the structure of the
star, hut they could provide a powerful diagnostic tool. The oscillation periods are
in satisfactory agreement with theoretical estimates (Christensen-Dalsgaard and Gough
(130)) Scnflaire, Gabriel, Noels and Boury (]29)), but how the oscillations are dri-
ven is not yet known. It is unlikely that convection is unimportant. Differences bet-
ween linear analyses which have either ignored convective flux perturbations (e.g.
(97) Shibahashi et al. (100, ]Ol) ) or have taken them into account (Noels (98, 131)
et al.) using Unno's (124) approach suggest that the stability of the modes of oscil-
lations are rather sensitive to the assumptions adopted. In the light of experience
with stars in hydrostatic equilibrium (Gough and Weiss (]9)) perhaps it is too opti-
mistic to hope for an unambiguous solar calibration of time-dependent convection in
the near future.
REFERENCES
I. Latour, J., Spiegel, E. A., Toomre, J. and Zahn, J.-P., (]976), Ap J., 207, 233
and 545
2. Spiegel, E. A., (197|), A~n. Rev. A. Ap , 9, 323
3. Spiegel, E. A., (]972), Ann. Rev. A. AP , ]O) 26I
4. Spiegel, E. A.) and Veronis, G., (]960), Ap J:., ]31, 442 ; (]962) 135, 655
5. Veronis, G., (]966), JFM, 26, 49
6. Lipps, F. B. and Sommerville, R. C. J., (1971), Phys. Fluid s, 14, 759
7. Willis, G. E.) Deardorff, J. W. and Sommerville, R. C. J., (1972), JFM) 54, 35]
8. Quon, C., (1972), Phys. Fluids, ]5, J2
9. Moore, D. R. and Weiss) N. O.) (1973), JFM, 58, 289
]O. Veltishchev, N. F. and Zelnin, A. A., (1975), JFM, 68, 353
]]. Busse, F. H., (197]), Proc. IUTAM S~p., Herrena!b ~ 4]
]2. Schwarzschild) M., (|96|), Ap J., 134, I
13. Spiegel, E. A. and Unno, W., (]962), PASJ, 14, 28
361
]4. Spiegel, E. A., (1964), Ap J., 139, 959
I5. Spiegel, E. A., (1965), Ap J., 141, 1068
16. B~hm, K. -H., (1967), IAU Syrup. NO. 28, p. 366
|7. Dpik, E. J., (]938), Publ. Obs. Astr. U. Tartu, 3__O0, No. 3
18. Vitense, E., (1953), Z. Ap, 32, 135
]9. Gough, D. O. and Weiss, N. O., (1976), MN, ]76, 589
20. Sehwarzschild, M., ~oward, R. and H~rm, R., (1957), Ap J., ]25, 233
21. Sears, R. L., (]959), Ap J., 129, 489
22. Demarque, P. R. and Larson, R. B., (]964), Ap J., 140, 544
23. Copeland, H., Jensen, J. 0. and JMrgensen, H. E., (]970), A. and. Ap, ~, 12
24. Henyey, L,, Vardya, M. S. and Bodenheimer, P., (1965), Ap J., ]42, 84]
25. Sehwarzschild, M., (1975), Ap J., ]9_55, 137
26. Graham, E., (1975), JFM, 70, 689
27. Deupree, R. G., (]975), Ap J., 201, 183
28. Gough, D. 0., Spiegel, E. A. and Toomre, J., (1975), JFM, 68, 695
29. Toomre, J., Gough, D. O. and Spiegel, E. A., (]977), JFM, 79, ]
30. SchSlter, A., Lortz, D. and Busse, F. H., (]965) JFM, 23, ]29
31. Jones, C. A., Moore, D. R. and Weiss, N. O., (]976), JFM, 73, 353
32. Graham, E., (]975) private communication
33. Van der Borght, R., (1975), MN, ]73, 85
34. Turner, J. S., (Cambridge Univ. Press, ]973), Buoyancy effects in fluids
35. Moore, D. W., (]967), IAU Symp. No. 28, 405
36. Saunders, P. M., (1962), Tellus, J4, ]77
37. Townsend, A. A., (]964), Q. J. roy. Met. Soe=, 90, 248
38. Adrian, R. J., (1975), JFM, 69, 735
39. Krishnamurti, R., (1968), JFM, 33, 445 and 457
40. Deardorff, J. W., Willis, G. E. and Lilly, D. K., (|969), JFM, 35, 7
4]. Failer, A. J. and Kaylor, R., (]970), JGR, 75, 52|
42. Whitehead, J. A. and Chen, M. M., (1970), JFM, 40, 549
43. Moore, D. R. and Weiss, N. 0., (]973), JFM, 6_!I , 553
44. Musman, S., (1968), JFM, 3], 343
45. Moore, D, R., (]975), private communication
46. Latour, J., (1973), Thesis, Univ. Nice
47. Spiegel, E. A., (]963), Ap J., 138, 2]6
48. Parsons, S. B., (1969), Ap J. Suppl., ]88, 127
49. Ulrich, R. K., (1970), ~ Sp. Sci., !, 7]
50. Scalo, J. M. and Ulrich, R. K., (1973), Ap.~J. , 183, 151
51. Shaviv, G. and Salpeter, E. E., (1973), Ap J., 184, ]9]
52. Frazier, E. N., (]968), Ap J., 152, 557
53. de Jager, C., (1972) Solar Phys., 25, 71
54. Ulrich, R. K., (1970), Ap Sp. Sci., !, 183
362
55. Travis, L. D. and Matsushima, S., (1973), Ap J., J80, 975
56. Travis, L. D. and Matsushima, S., (1973), Ap J., ]82, 189
57. Nordlund, ~., (|974), A. and Ap, 32, 407
58. Gingerich, O., Noyes, R. W., Kalkofen, W. and Cuny, Y., (J971), Solar Phys., 18, 347
59. Edmonds, F. N., Jr, (1974), Solar Phys., 38, 33
60. Spruit, H. C., (1974), Solar Phys., 34, 277
6]. Roxburgh, I. W., (1965), MN, 130, 223
62. Saslaw, W. C. and Schwarzschild, M., (|965), Ap J., 142, 1468
63. Maeder, A., (]975), A. and Ap, 40, 303
64. Cogan, B. C., (1975), Ap J., 20], 637
65. Maeder, A., (1974), A. and Ap, 3~4, 409
66. Prather, M. J. and Demarque, P. R., (1974), Ap J., 193, 109
67. Maeder, A., (1975), A" and Ap, 43, 61
68. Sugimoto, D. and Nomoto, K., (1974), IAU S ymp.: No. 66, 105
69. Iben, I., Jr~, (1975), Ap J., 196, 525
70. Spiegel, E. A., (1967), Highlights. in Astr. (IAU), ~6l
71. Veronis, G., (1965), J. Marine Res., 23, ]
72. Weiss, N. O., (1964), Phil. Trans., A 2____~56, 99
73. Turner, J. S., (1968), JFM, 33, 183
74. Shlrtcliffe, T. G. L., (1969), JFM, 35, 677
75. Rossby, H. T., (1969), JFM, 36, 309
76. Veronis, G., (]968), JFM, 34, 315
77. Turner, J. S. and Storm~el, H., (1964), Proc. Nat. Acad. Sei., 52, 49
78. Hoare, R. A., (}966), Nature, 210, 787
79. Neal, V. T., Neshyba, S. and Denner, W., (]969), Science, 166, 373
80. Turner, J. S., (]965), Int. J. Heat and Mass Trans., 8, 759
8]. Marmorino, G. O. and Caldwell, D. R., (]976), Deep-Sea Res., 23, 59
82. Spiegel, E. A., (}969), Comm. Ap Sp. Scl., i, 57
83. Ledoux, P., (]947), Ap J., IO_~5, 305
84. Tayler, R. J., (]954), Ph.D. The£i£, Univ. Cambridge
85. Schwarzschild, M. and Harm, R., (1958), Ap J., 128, 348
86. Gabriel, M., (1970), A. and Ap, ~, ]24
87. Saio, H., (]974), Ap Sp. Sc~., 29, 4J
88. Chiosi, C. and Sun~ma, C., (]970), Ap Sp. Sci., 8, 478
89. Robertson, J. W., (]972), Ap J., 177, 473
90. Sweigart, A. V. and Demarque, P., (]972), A. and AP, 20, 445
91. Zi61kowski, J., (1972), Aeta Ast., 22, 327
92. Varshavskii, V. I., (]973), Soy. Ast. AJ, 16, 86]
93. Sreenivasan, S. R. and Ziebarth, K. E., (]974), Ap Sp. Sei., 30, 57
94. Stothers, R. and Chin, C., (1975), Ap J., ]98, 407
95. Tayler, R. J., (]969), FIN, ]44, 231
363
96. Dilke, F. W. W. and Gough, D. O., (1972), Nature, 240, 262
97. Christensen-Dalsgaard, J., Dilke, F. W. W. and Gough, D. O., (]974), FIN, I69, 429
98. Noels, A., Gabriel, M., Boury, A., Scuflaire, R. and Ledoux, P., (1975), M~m. Soc.
Roy..Sci. L~ge, # Ser., 8, 317
99. Unno, W., (1975), PASJ, 27, 81
]00. Shibahashi, H., Osaki, Y. and Unno, W., (]975), PASJ, 27, 4Ol
lO]. Christensen-Dalsgaard, J. and Gough, D. 0., (]975), Mfim. Soc. Roy..Sci. Liage,
6 Set., 8, 309
102. Ezer, D. and Cameron, A. G. W., (1972), Nature phys. Sci. , 240, 180
]03. Ulrich, R. K., (1974), Ap J., 188, 369
]04. Rood, R. J., (1972), Nature Phys. Sci. , 240, 178
]05. Veronis, G.) (1959), JFM, ~, 401 ; (I965), 24, 545
IO6. Sommerville, R. C. and Lipps, F. B., (]973), J. Atmos. Sci.) 30, 590
107. Baker, L. and Spiegel, E. A., (]975), J. Atmos. Sci., 32, ]909
]08. Gilman, P. A., (]976), IAU Symp. No 7], 207
]09. Gilman, P. A., (1974), Ann. Rev. A. Ap, 12) 47
]IO. Durney, B. R., (]976), IAU Symp. No 71, 243
Ill. Weiss, N. 0., (1976), .IAU Symp. No 7|) 229
ll2. Tayler, R. J., (1973), MN, ]65) 39
If3. Drobyshevski, E. M. and Yuferev, V. S., (1974), JFM, 65, 33
114. Weiss, N. O., (1975), Adv. Chem. Phys., 32? ]01
]15. Peckover, R. S. and Weiss, N. 0., (1972), Comp. Phys* Comm., ~, 339
116. Van der Borght, R., Murphy, J. O. and Spiegel, E. A., (]972), Aust. j. Phys.,
25, 703
]17. Van der Borght, R. and Murphy, J. O., (1973), Austr. J. phys., 26, 617
]]8. Childress, S. and Soward, A. M., (1972), Phys. Rev. Left., 29, 837
If9. Soward, A. M., (]974), Phil. Trans., 275, 61!
120. Roberts, P. E. and Stewartson, K., (1975), JFM, 68, 447
]21. Baker, L., (]975), private con~nunication
122. Cox, J. P., Cox, A. N., Olsen, K. H., King, D. S. and Eilers, D. D., (1966), Ap J,,
:,,44, ]o3s
123. Gough, D. 0., (1965), GFD, II (Woods Hole Ocean. Inst.) p. 49
I24. Unno, W., (1967), PASJ, ]9, 140
|25. Hill, H. A. and Stebbins, R. T., (]975), Ann. New York Acad. Sci., 262, 472
126. Fossat, E. and Ricort, G., (]975), A. and Ap, 43, 243
127. Severny, A. B., Kotov, V. A. and Tsap, T. T., (1976), Nature, 259, 87
128. Brookes, J. R., Isaak) G. R. and van der Raay, H. B., (]976), Nature, 259, 92
129. Scuflaire, R., Gabriel, M., Noels, A. and Boury, A., (]975), A. and Ap, 45, ]5
130. Christensen-Dalsgaard) J. and Gough, D. 0., (]976), Nature, 259, 89
131. Gabriel, M., Scuflalre, R., Noels, A. and Boury) A., (1975)) A. and Ap, 40, 33
Selected Issues from
Lecture Notes in Mathematics
Vol. 431 : S~minaire Bourbaki - vol. 1973/74. Exposes 436-452. IV, 347 pages. 1975.
Vol. 433: W. G. Faris, Sslf-Adjoint Operators. VII, 115 pages. 1975.
Vol. 434; P. Brenner, V. Thom(~e, and L. B. Wahlbin, Besov Spaces and Applications to Difference Methods for Initia/Value Problems. II, 154 pages. 1975.
Vol. 440: R. K. Getoor, Markov Processes: Ray Processes and Right Processes. V, 118 pages. 1975.
Vol. 442: C. H. Wilcox, Scattering Theory for the d'Alembert Equation in Exterior Domains. III, 184 pages. 1975.
Vol. 446: Partial Differential Equations and Re}ated Topics. Proceedings 1974. Edited by J. A. Goldstein. IV, 389 pages. 1975.
VoI. 446 : Spectral Theory and Differential Equations. Proceedings 1974. Edited by W. N. Everitt. XlI, 321 pages. 1975.
Vol. 449: Hyperfunctions and Theoretical Physics. Proceedings 1973. Edited by F. Pham. IV, 218 pages. 1975.
Vol. 458: P. Waltera, Ergodic Theory - Introductory Lectures. VI, 198 pages. 1975.
VoL 459: Fourier Integral Operators and Partial Differential Equa- tions. Proceedings 1974. Edited by J. Chazarain. VI, 372 pages. 1975.
Vo]. 461 : Computational Mechanics. Proceedings 1974. Edited by J. T. Oden. VII, 326 pages. 1975.
Vol. 463: H.-H. Kuo, Gaussian Measures in Banach Spaces. Vl, 224 pages. 1975.
Vol. 464 : C. Rockland, Hypoellipticity and Eigenvalue Asymptotics. III, 1 71 pages. 1975.
VoL468: Dynamical Systems- Warwick1974. Proceedings 1973/74. Edited by A. Manning. X, 405 pages. 1975.
Vol. 470: R. Bowen, Equilibrium States and the Ergodic Theory of Anosov Diffeomorphisms. III, 108 pages. 1975.
VoL 474: Seminaire Pierre Lelong (Analyse) Annie 1973/74. Edit~ par P. Lelong. Vl, 182 pages. 1975.
VoI. 484: Differential Topology and Geometry. Proceedings 1974. Edited by G. P. Joubert, R. P. Mcussu, and R. H. Roussarie. IX, 287 pages. 1975.
VoI. 497 : H. M. Reimann und 1". Rychener, Funktionen beschr;~nkter mittlerer Oszillatlon. VI, 141 Seiten. 1975.
Vol. 489: .1. Bair and R. Fourneau, Etude G~cm~rique des Espaces Vectoriels. Une Introduction. VII, 185 pages. 1975.
Vol. 490: The Geometry of Metric and Linear Spaces. Proceedings 1974. Edited by L M. Kelly. X, 244 pages. 1975.
VoL 503: Applications of Methods of Functional Analysis to Problems in Mechanics. Proceedings 1975. Edited by P. Germain and B. Nayroles. XlX, 531 pages. 1976.
VoL 507: M. C. Reed, Abstract Non-Linear Wave Equations. VI, 128 pages. 1975.
V.cI. 509: D. E. Blair, Contact Manifolds in Riemannian Geometry. VI, 146 pages. 1976.
Vol. 515: B,4cklund Transformations. Nashville, Tennessee 1974. Proceedings, Edited by R. M. Miura. VIII, 295 pages. 1976.
VoL 516: M. L. Silverstein, Boundary Theory for Symmetric Markov Processes. XV!, 314 pages. 1976.
Vol. 518: S6minaire de Th~orie du Pctentiel, Proceedings Paris 1972-1974. Edit~ par F. Hirsch et G. MokobodzkL VI, 275 pages. 1976.
VoL 522: C. O. Bloom and N. D. Kazarinoff, Short Wave Radiation Problems in Inhomogeneous Media: Asymptotic Solutions. V, 104 pages. 1975,
Vol. 523: S. A. AIbeverio and R. J. Heegh-Krohn, Mathematical Theory of Feynman Path Integrals. IV, 139 pages. 1976.
Vol. 524: S6minaira Pierre Lelong (Analyse) Ann6e 1974/75. Edit6 par P. Lelong. V, 222 pages. 1976.
Vol. 525: Structural Stability, the Theory of Catastrophes, and Applications in the Sciences. Proceedings 1975. Edited by P. Hilton. VI, 408 pages. 1976.
Vol. 526: Probability in Banach Spaces. Proceedings 1975. Edited byA. Beck. VI, 290 pages. 1976.
Vol. 527: M. Denker, Ch. Grillenberger, and K. Sigmund, Ergodic Theory on Compact Spaces. IV, 360 pages. 1976.
Vol. 532: Th(~orie Ergodique. Proceedings 197311974. Edit~ par J.-P. Conze and M. S. Keane. VIII, 227 pages. 1976.
Vot. 538: G. Fischer, Complex Analytic Geometry. VII, 201 pages. 1976.
Vol. 543: Nonlinear Operators and the Calculus of Variations, Bruxelles 1975. Edited by J. P. Gossez, E. J. Lami Dozo, J. Mawhln, and L. Waelbroeck, Vii, 237 pages. 1976.
Vol. 552: C. G. Gibson, K. Wirthm011er, A. A. du Plessis and E. J. N. Looijenga. Topological Stability of Smooth Mappings. V, 155 pages. 1976.
Vol. 556: Approximation Theory. Bonn 1976. Proceedings. Edited by R. Schaback and K. Scherer. VII, 466 pages. 1976.
Vol. 559: J.-P. Caubet, Le Mouvement Brownlen Relativists. IX, 212 pages. 1976.
Vol. 561: Function Theoretic Methods for Partial Differential Equations. Darmstadt 1976. Proceedings. Edited by V. E. Meister, N. Weok and W. L. Wendland. XVlII, 520 pages. 1976.
Vol. 564:.Ordinary and Partial Differential Equations, Dundee 1976. Proceedings. Edited by W. N. Everitt and B. D. Sleeman. XVIII, 551 pages, 1976.
Vol. 565: Turbulence and Navier Stokes Equations. Proceedings 1975. Edited by R. Tomato. IX, 194 pages. 1976.
Vol. 566: Empirical Distributions and Processes. Oberwolfach 1976. Proceedings. Edited by P, Gaenssler and P. R~v~sz. VIi, 146 pages. 1976.
Vol. 570: Differential Geometrical Methods in Mathematical Physics, Bonn 1975. Proceedings. Edited by K. Bleul~r and A. Reetz.. Vltl, 576 pages. 1977.
VoL 572: Sparse Matrix Techniques, Copenhagen 1976. Edited by V, A, Barker. V, 164 pages. 1977.