06-08-029.pdf

10
Energy Flow and the Organization of Life Harold Morowitz D. Eric Smith SFI WORKING PAPER: 2006-08-029 SFI Working Papers contain accounts of scientific work of the author(s) and do not necessarily represent the views of the Santa Fe Institute. We accept papers intended for publication in peer-reviewed journals or proceedings volumes, but not papers that have already appeared in print. Except for papers by our external faculty, papers must be based on work done at SFI, inspired by an invited visit to or collaboration at SFI, or funded by an SFI grant. ©NOTICE: This working paper is included by permission of the contributing author(s) as a means to ensure timely distribution of the scholarly and technical work on a non-commercial basis. Copyright and all rights therein are maintained by the author(s). It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may be reposted only with the explicit permission of the copyright holder. www.santafe.edu SANTA FE INSTITUTE

Transcript of 06-08-029.pdf

Page 1: 06-08-029.pdf

Energy Flow and theOrganization of LifeHarold MorowitzD. Eric Smith

SFI WORKING PAPER: 2006-08-029

SFI Working Papers contain accounts of scientific work of the author(s) and do not necessarily represent theviews of the Santa Fe Institute. We accept papers intended for publication in peer-reviewed journals or proceedings volumes, but not papers that have already appeared in print. Except for papers by our externalfaculty, papers must be based on work done at SFI, inspired by an invited visit to or collaboration at SFI, orfunded by an SFI grant.©NOTICE: This working paper is included by permission of the contributing author(s) as a means to ensuretimely distribution of the scholarly and technical work on a non-commercial basis. Copyright and all rightstherein are maintained by the author(s). It is understood that all persons copying this information willadhere to the terms and constraints invoked by each author's copyright. These works may be reposted onlywith the explicit permission of the copyright holder.www.santafe.edu

SANTA FE INSTITUTE

Page 2: 06-08-029.pdf

Energy flow and the organization of life

Harold MorowitzKrasnow Institute for Advanced Study, George Mason University, Fairfax, VA 22030

Eric SmithSanta Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501

(Dated: August 7, 2006)

I. INEVITABLE LIFE?

Life is universally understood to require a source of freeenergy and mechanisms with which to harness it. Re-markably, the converse may also be true: the continuousgeneration of sources of free energy by abiotic processesmay have forced life into existence as a means to alle-viate the buildup of free energy stresses. This assertion– for which there is precedent in non-equilibrium sta-tistical mechanics and growing empirical evidence fromchemistry – would imply that life had to emerge on theearth, that at least the early steps would occur in thesame way on any similar planet, and that we should beable to predict many of these steps from first principlesof chemistry and physics together with an accurate un-derstanding of geochemical conditions on the early earth.A deterministic emergence of life would reflect an essen-tial continuity between physics, chemistry, and biology.It would show that a part of the order we recognize asliving is thermodynamic order inherent in the geosphere,and that some aspects of Darwinian selection are expres-sions of the likely simpler statistical mechanics of physicaland chemical self-organization.

The principles that suggest life was inevitable have notyet been applied quantitatively to biochemistry, but theyare commonplace and well understood for simpler sys-tems encountered in meteorology, materials science, andother fields. A lightning bolt is a plasma channel cre-ated when air su!ers dielectric breakdown in response toa charge separation between the upper atmosphere andthe ground. The ionized plasma is kept far from equi-librium with the surrounding air by the driven motionof its own charged particles. Convective storms such ashurricanes constitute non-equilibrium states of transport,evaporation, and condensation sustained by temperaturedi!erences at di!erent places on the globe, particularlyin the oceans.

The common feature of ionized air in a lightning boltand wall storms in a hurricane is that both create chan-nels to transport currents of matter and energy1 between

1 Here we distinguish internal energy, represented in thermody-namics by U , from the free energy U + pV ! TS, which containscontributions from pressure p, volume V , temperature T andentropy S. The increase of entropy attending the exchange ofenergy between two reservoirs can cause a free energy decreaseeven when energy in the reservoirs is conserved.

two reservoirs at di!erent potential. For lightning thepotential is voltage and the current is charge, and forconvective weather the potential is temperature and thecurrent is heat. Without lightning or hurricanes, chargeor heat could still move by di!usion, but the resistanceto their motion through near-equilibrium states is muchgreater and the transport much slower than throughthe channel state. We understand well how voltage ortemperature di!erences can drive these non-equilibriumchannels to form and stabilize them under perturbations,and we have ways to predict the main features of thechannel states from the properties of the systems in whichthey arise.

Whereas weather is a di!use phenomenon primarilyinvolving mass transport and physical state change, lifecreates transport channels in the chemical domain, em-ploying the more concentrated energy flows associatedwith molecular re-arrangements. Two major abiotic pro-cesses lead to free energy sources su"ciently concentratedto maintain chemical order. Most organisms have evolvedto depend primarily on energy from either one process orthe other, creating a dichotomy in the basic strategies forenergy metabolism. Some organisms draw energy exclu-sively from the transfer of electrons from donor to accep-tor molecules2 created by geothermal chemistry. Otherorganisms convert visible solar light to infrared thermalradiation, in the process generating the oxidants and re-ductants necessary to biochemistry. Geochemical redoxenergy originates in nuclear fission reactions, while solarflux results from nuclear fusion. In the absence of life,energy from either source remains to some extent unusedbecause the energy carriers are not easily accessed.

For example, the reaction of molecular hydrogen (areductant) with carbon dioxide (an oxidant), thoughenergy-yielding, does not happen spontaneously at an ap-preciable rate. If these molecules are not consumed byorganisms, they tend to accumulate wherever they aregenerated by geochemical processes. Similarly, either in-elastic absorption of light is prohibited by quantum selec-tion rules (primarily applicable to small molecules in theatmosphere) or it triggers a photo-dissociation that elim-inates the electronic transition responsible for absorp-

2 Electron donors are called reductants, and electron acceptors arecalled oxidants. A donor-acceptor pair that yields free energyfrom the transfer of electrons from donor to acceptor is knownas a redox couple.

Page 3: 06-08-029.pdf

2

tion after a single event. George Wald has suggested [1]that achieving repeated inelastic absorption poses a dif-ficult chemical problem, solved only twice in the courseof evolution with the emergence of the rhodopsins andthe chlorophylls. The major converter of light to heat onearth is multiple weak inelastic scattering in the oceans,which is the primary driver of the global weather sys-tem but which cannot concentrate energy su"ciently toactivate chemical transitions on a large scale.

It has not been quantitatively demonstrated that thesefree energy stresses can force complex chemical order intoexistence or stabilize it under perturbations, nor are wesure which elements of living order could be predictedfrom first principles if this were so. However, guided byintuition from systems like lightning and weather, we canrecognize many features in core intermediary metabolismthat suggest metabolism formed in this way, and that itmay be a unique or at least optimal channel for accessingthese particular sources of energy. We can start witha fresh look at the structure of metabolism, and at itsorganization in ecosystems and along phylogenetic linestoday.

II. STRUCTURE AND UNIVERSALITY OFBIOCHEMISTRY

The first thing to appreciate is that life on earth as awhole, and every ecosystem to the extent that it can beconsidered in isolation, synthesizes and utilizes all nec-essary organic molecules within a closed network of re-actions. In organisms this property of biochemical com-pleteness is called autotrophy, meaning “self-feeding”, soan autotrophic organism is one that requires only in-organic material inputs and a source of energy to live,grow, and reproduce. All self-su"cient ecosystems areby definition autotrophic, while only some organismsare. Organisms not capable of autotrophy are designatedheterotrophic, or “di!erent feeding”, because they mustdraw organic inputs from their environments.

Second, we know from analysis of entire genomes3 thatthe complete metabolic chart of autotrophs has a univer-sal core, based on a set of fewer than 500 small – less than400 Dalton molecular weight– organic molecules. Theseinclude sugars, amino acids, nucleotides, fatty acids, anda few more complex cofactors for handling organic func-tional groups or metal ions, such as Coenzyme A, NAD,pyridoxal phosphate, biotin, and heme.

Within core metabolism, we recognize two major cat-egories of function. Anabolism comprises the set of reac-tions that build organic compounds, while catabolism isthe breaking down of organic compounds for energy ormaterials. Anabolism is essentially a reductive process,meaning that it consumes energy-rich electrons to create

3 Source http://www.genome.ad.jp/kegg/

molecular bonds. It is possible for an organism to existwith anabolic reactions alone, if suitable electron donorsare provided by its environment, and many major cladesof anaerobic organisms that are thought to have very an-cient lineage are autotrophs living on geochemical inputs(called chemo-autotrophs) whose metabolism is almostentirely anabolic [2].

Catabolism is associated with organization into ecosys-tems, in which heterotrophs consume other organisms orproducts thereof, and break down their organic moleculesfor energy or biologically available nitrogen or carbon.Catabolic reactions appear to function as support foranabolism, providing the same inputs as those used byreductive autotrophs but not changing the essential an-abolic reaction network. The existence of a universalanabolic core is the biochemical basis of trophic ecol-ogy. For at least the last two billion years, most inputsto catabolism have originated with photosynthesis, andmost catabolic reactions are either neutral (such as fer-mentation) or oxidizing (such as respiration). Organismsthat are photo-autotrophic, obtaining energy from sun-light, have also come to store organic molecules and toaccess them catabolically.

Only the anabolic core of intermediary metabolismis truly universal, existing in the same form in allecosystems. Catabolic strategies vary among ecosystemsand can be almost absent in communities of chemo-autotrophs. The greater variability of catabolic net-works, together with the isolation of chemo-autotrophicanabolism in the apparently most ancient clades, sug-gests to us that the first living systems were reductivechemo-autotrophs and may have been autotrophic at thelevel of individual cells.

At the core of the anabolic network are the elevencarboxylic acids of the citric acid cycle. Though mostfamiliar as the Krebs cycle in oxidizing heterotrophs,which use it to break down organic molecules to CO2, thesame reactions run in the opposite direction4 in many au-totrophs, reducing environmental CO2 to build organicacids. The biosynthetic pathways for all compounds inthe anabolic core originate in the citric acid cycle, andwhen run in the reductive direction this cycle can dupli-cate its own members from abiotic CO2 and electrons,a property designated network autocatalysis. Thus thereductive citric-acid cycle appears, at the level of thebiosynthetic network, to be a self-contained engine ofsynthesis for all biochemical precursors [3].

The rest of the core anabolic network unfolds from thecitric acid cycle in a kind of cascade. The addition of am-monia and reductant yields the simpler biological amino

4 This reductive direction for the reactions is unfortunately knownas the “reverse citric acid cycle”. As it was probably the firstform of the cycle to exist, and perhaps the first step in the emer-gence of life, it would be more natural to call the reductive cyclethe forward direction, and the later, oxidizing Krebs cycle the“reverse” direction.

Page 4: 06-08-029.pdf

3

acids directly from citric-acid cycle intermediates, manyin five or fewer simple reaction steps. The 3-carbon sug-ars and their 6- and 5-carbon derivatives including riboseare built from the citric-acid cycle intermediate pyruvate,also in relatively few steps. The saturated fatty acidsthat form cell membranes are produced from another in-termediate – acetate – in a recursive reaction that furtherreduces the carbon-oxygen bonds of acetate to hydrocar-bon. More elaborate pathways leading to the complexamino acids, nucleic acids, and cofactors, follow fromthese elementary steps in a dense and surprisingly simpleweb of reactions.

The biosynthetic reactions from environmental CO2

and a reductant such as molecular hydrogen are exer-gonic as a whole [3], though many of the individual stepsrequire input of energy to overcome free energy “hills”if they are to occur at appreciable rates. The energyprovider for most of these reactions, as well as much ofthe re-arrangement chemistry of sugars, is the hydrolysisof the high-energy ester bonds of phosphate polymers.Today these are provided by ATP, but the ubiquity ofphosphate in core metabolism, together with the improb-ability that ATP could have been synthesized until sev-eral other chemicals were available as precursors, leads usto surmise that energetic polyphosphates must have beenubiquitous in the geochemical settings where life began.5

The cascade of metabolism thus functions overall asa relaxation channel for two major sources of geochem-ical energy: electron transfer from reductants and thehydrolysis of phosphates. The network structure sug-gests a channel that could have formed spontaneouslyand unfolded in stages, with some of the simplest (2- to4-carbon) compounds in the citric-acid cycle arising bythermal fluctuations and then self-amplifying by virtueof the autocatalysis of the network. The carbon atomsin citric-acid cycle intermediates are not completely re-duced, and the formation of fatty acids from acetate re-leases another increment of redox energy by producinghydrocarbon groups and water. The formation of sugars,amino acids, and other compounds creates pathways tohydrolize phosphate esters or to incorporate ammonia.Apart from some uncertainty about phosphates, all ofthese species are known to be produced in the hydrother-mal vent environments of the modern earth. Vent en-ergy sources support communities of organisms today [8]in the same way we propose they supported the earliest

5 The status of phosphates on the early earth is still unclear.Polyphosphates are found in volcanic condensates and otherdehydrating settings today [4, 5], but hydrothermal vent flu-ids appear depleted in orthophosphate relative to the seawaterfrom which they form [6], and vent polyphosphates are not re-ported. Christian deDuve has argued [7] that the ubiquitous useof thioesters as intermediates in the formation and hydrolysisof phosphate ester bonds is another ancient feature of biochem-istry, which could have led to the production of pyrophosphateand other phosphate polymers from reduced sulfur minerals in apre-biotic age.

proto-metabolic chemical systems, except that modernorganisms do not require energetic phosphates from theenvironment because they can recycle phosphate inter-nally using redox energy and the machinery of oxidativephosphorylation.

The anabolic network is dense, and leaves no energeticwaste products. Said another way, core metabolism con-verts all carbon from a fully-oxidized abiotic input (CO2),either into its most reduced form (methane), or into con-stituents of biomass such as the fatty acids, sugars, aminoacids, nucleotides, and cofactors – which then provide theplatform for further metabolism. This feature would alsobe expected of a network that was driven into existenceby pressures to transfer electrons from high-energy tolow-energy bonds, and to return phosphate from a high-energy polymer state to low-energy orthophosphate.

If modern metabolism recapitulates the process of bio-genesis – and we will discuss why it should – the energeticcascade through the anabolic network suggests a cascadein the emergence of life, one that began in metabolism.The earliest and simplest stage in the cascade, the reduc-tive citric acid cycle, is most accessible by random fluctu-ations and is also the stage coupled directly to the abioticcarbon source, CO2. This cycle looks simple enough tohave emerged spontaneously in an unsupervised networkof small-molecule reactions, but only captures about onethird of the redox energy available in transforming CO2

to its most-reduced and lowest-energy intermediate, ac-etate. Fatty acid synthesis captures about another thirdof the available energy, but can only occur from a start-ing pool of acetate, and is not network-autocatalytic forcarbon intake. However, it does provide an outlet forcitric-acid-cycle carbon, maintaining the disequilibriumthat drives the cycle.

The remaining third of available redox energy not re-leased by fatty acid synthesis is obtained by convertinghydrocarbons to methane, or more directly by convert-ing acetate to methane as some methanogens do today.Based on the complexity of some of the cofactors nowused and the remarkable carbon isotope selectivity dis-played by methanogens [9], we suspect that methane pro-duction was the last major redox innovation in the reduc-ing world.6 The di"culty of methanogenesis may alsobe suggested by the fact that, when large deposits ofbiomass are trapped in the earth, hydrocarbons accumu-late in fossil fuels rather than being released as methane.The emergence of biochemical intermediates would thenhave been a sort of bootstrap process in which simplenetworks near the input were elaborated to more diversenetworks downstream, as further reductant, ammonia, orphosphate reacted with the starting compounds.

The chemical ensembles in which life exists are struc-tured by kinetics as well as energetics, and the provi-

6 This position is disputed by those who would place methanogen-esis early in the origin of metabolism [10, 11].

Page 5: 06-08-029.pdf

4

sion and structure of catalysis will be central to under-standing how a chemical network could have been or-ganized by energy flows. Our understanding of kineticsis being changed by a growing realization that cataly-sis is not limited to complex polymers of amino acidsor RNA with sequences adapted to highly specific reac-tions. Even the biological monomers and small polymershave been found to posess catalytic activity [12, 13] andwe have proposed [14] that catalysis was continuouslyprovided during the emergence of life by a hierarchy ofmolecules, starting in the monomer world and passingthrough stages of longer and more organized polymeriza-tion.

Presenting metabolism from a perspective of energyflow – that is, starting with reductive autotrophy andadding complexity as it was likely added during the emer-gence of life – yields a remarkably hierarchical, compact,and simple picture, but one limited to descriptive chem-istry. To understand whether the emergence of life was akind of “lightning bolt” through the graph of geochemicalreactions, and whether the spontaneously generated paththrough that graph can be predicted from first principles,requires a quantitative statistical framework for handlingflows through chemical networks.

III. FLUX AND ORDER

Such a framework was suggested in its broadest outlineby E. T. Jaynes, who more than anyone else has arguedthat statistical physics is simply the application of prin-ciples of statistical inference to systems whose boundaryconditions cannot completely specify their internal con-figurations [15]. The entropy of physics and chemistry isthe entropy of information theory, formalizing Laplace’sprinciple of “insu"cient reason”. Entropy maximizationfixes the infinitely many free moments of a distributiononly partly specified by boundary conditions without re-stricting what the conditions can be. The propertiesof the resulting distribution then reflect required conse-quences of the boundary constraints, whatever else mayor may not be knowable about the system. These meth-ods for extracting the consequences of partial constraintclearly are not limited to cases of thermodynamic equi-librium. Entropy can depend on currents as well as onconfigurations. When it does, the principle of free energyminimization for open systems, which is derived fromentropy maximization, can be extended to driven sys-tems. The free energy is then generalized to depend onthe richer boundary conditions responsible for driving.

In principle Jaynes’s observation opens a whole newdomain for statistical mechanics, but in practice few in-stances that usefully reach beyond equilibrium have beenderived. Chemistry is still studied with the classical

state variables and entropy of Gibbs7, and the applica-tion of information theory to life at a system level haslargely been limited to bioinformatics of gene or proteinsequences. The use of equilibrium entropy makes lifeseem paradoxical [16]: the so-called equilibrium entropyof biomass is less than that of the same matter in anabiotic state, and yet the core molecules and pathwaysof life have apparently been stable despite continual per-turbation for almost four billion years. The theory ofstability in statistical mechanics therefore suggests thatan earth with life must be more probable than an earthdistributed according to thermal equilibrium, and thatthe “equilibrium entropy” within the biosphere is not theproper measure of likelihood on a driven earth.

The paradox is not entirely naıve, though Boltzmannsaw already that it was based on an incomplete descrip-tion. Many have recognized that the increase in equilib-rium entropy of the non-living environment can allow theequilibrium “entropy” of biomass to decrease (refrigera-tors do this all the time), but this recognition misses anessential connection and leaves us still unable to calculateor make predictions. Life mediates flows in its environ-ment, which in some cases can correctly be associatedwith rates of entropy change [15, 17]. Yet life is carriedby states of matter, and a proper statistical theory shouldat least allow us to predict the amount of order in a statethat can be stabilized by a given rate of environmentalentropy increase. The answer to this question will tell uswhether life could have formed as a spontaneous chemicalchannel.

The failure of equilibrium entropy to provide a validmaximization principle for driven systems has led to at-tempts [18–20] to formulate principles based on “entropyproduction”, which necessarily invoke dynamical as wellas statistical assumptions. Though these may inheritsome validity from well-formulated entropy maximizationin special cases, they ultimately fail as general principlesbecause they are based on the application of the equilib-rium expression for entropy to driven systems where thatexpression no longer measures the actual uncertainty inthe systems’ states.

Examples have been worked out that avoid these prob-lems simply by enriching the description of a system’sstate space to include measures of order in its processesas well as in its configurations. In classical mechanics,this leads to an entropy that depends on the net driftcurrents created by non-equilibrium boundaries in space,as well as the concentrations of chemical species [21]. Thesame generalization can be made in quantum mechanics– the natural language for describing systems of chemicalbonds and reactions – where the entropy becomes a func-tion of reaction currents as well as configurations [22].The resulting entropy-maximization principles can pre-

7 or their extensions to equilibrium quantum-mechanical ensem-bles for intramolecular properties or reaction transition states

Page 6: 06-08-029.pdf

5

dict the spontaneous formation of currents, whereas theequilibrium entropy is maximized on currentless states.These are the appropriate entropy principles underlyingOhm’s law for small electric currents and Fourier’s lawfor heat flow. The very existence of a current consti-tutes a restriction on the system and hence a decreasein its entropy, but these near-equilibrium models are toosimple to capture the possibility of driving the configu-rations into significantly more ordered states to createbetter channels for energy flow – a feature of life we setout to explain.

Applied to nonlinear systems, these same ideas estab-lish an important mathematical connection. The pres-ence of positive feedback in a current-carrying systemcan create a threshold for the sudden formation of macro-scopic order, and the crossing of this threshold is a phasetransition equivalent in all statistical respects to equilib-rium phase transitions [23, 24]. The ordered state createsa channel between the environment’s input and outputreservoirs with much better conductance than the equi-librium state. Order in turn is maintained by energyextracted from the current between the reservoirs. Theentropy principle used in these examples is not equivalentto maximization of the inter-reservoir current subject tosome constraint, because the system state is not con-strained with respect to current capacity. Rather the or-dered state results from a balance of many contributionsto the true entropy, some of which may be understood ap-proximately as the entropy of the internal system state,some in terms of entropy transport between reservoirs,and some in terms of entropy production from dissipa-tion [22].

The foregoing examples will fall short of an integratedframework for understanding life, because the chemi-cal substrate of life has a complex structure and infor-mational richness exceeding that of other physical sys-tems. However, these models do provide in patchwork aset of appropriate tools for mathematically relating theorigin and robustness of life to simpler classes of self-organization. They show, when the non-equilibrium en-tropy is properly formulated, how the initiation of cur-rents can be consistent with entropy maximization andcan even be required by it in open systems. They alsoshow how positive feedback, leading to configurationalphase transitions that enhance current flow, can arisewhen an internal cycle of matter or energy in the sys-tem is coupled to the transport between its reservoirs.The way such coupling enters these examples of physicalself-organization [23, 24] corresponds closely to the reac-tion structure of chemistry [25] and provides a statisticalfoundation for what is called the cycling theorem [26].

In the attempt to understand organization in drivensystems, particularly regarding their coupling to the en-vironment, the potential forms of order not observed areas informative as those observed. The forms of orderwe do not observe in biochemistry are the many organicmolecules with a complexity and free energy comparableto that of core metabolites, yet which are not produced by

metabolism. The metabolic core comprises a surprisinglysmall and well-ordered collection of molecular species,suggesting limitations in the kinds of organic moleculesrelevant to its emergence, and in the pathways capableof extracting energy from these sources. For instance,the citric acid cycle is distinguished as a simple pathwayfor organosynthesis from CO2 and reductant, but it isof no apparent significance in networks driven by highconcentrations of the reactive molecule formaldehyde –which undergoes aldol condensations to form complexmixtures of diverse sugars [27]. Similarly, the biologi-cal amino acids are among the most natural elaborationsof citric-acid cycle intermediates in an environment ofammonia and reductant [28]. These amino acids are notdistinguished in synthetic reactions from gas-phase freeradicals [29], and they are not distinguished in the deep-space photochemistry of interstellar and cometary ices ormeteors [30].8

IV. THE COLLAPSE TO LIFE

The creation of energy channels by means of phasetransition provides a way to understand how the corebiochemistry of life can have been stable throughout theage of the earth: a state of the geosphere which includeslife becomes more likely than a purely abiotic state. Thenon-living earth would have been metastable under con-ditions of continuous geochemical free energy production.Its “collapse” to greater stability was the emergence oflife.

We must be careful, however, in describing the morestable state to which earth collapsed. It is probablywrong to regard biomass as a thermodynamic “state”of terrestrial matter. A more accurate picture from well-studied systems might be phase separation in binary al-loys [31], in which the whole alloy is in a thermodynamicstate consisting of separated “phases” of di!erent chem-ical composition that interact at interfaces. The phaseseparation of the cellular milieu from its abiotic surround-ing is clearly more complex than phase separation in al-loys, and the interactions between the phases – both atinterfaces and mediated by them – has qualitative di!er-ences because life is among other things a transport phe-nomenon. Still, alloys provide a model for the emergenceof combined spatial and chemical structure – clearly es-sential to understanding metabolism.

Cells use membranes to concentrate organic reagents

8 It appears to be a common assumption in the experimental litera-ture on “prebiotic chemistry” that the production of metabolitesby abiotic processes is the main problem to be overcome, andthat later stages of evolution somehow separated the observeduniversal metabolism from a rich primordial soup. Our explana-tion would isolate more of the observed metabolic pathways atpre-cellular stages, where chemical selection more closely resem-bles thermodynamics than it does cellular evolution.

Page 7: 06-08-029.pdf

6

and exclude toxins. They use them to create pH andvoltage di!erences converting redox to phosphate energyand they use them to segregate polar and non-polar en-vironments in which di!erent biochemical reactions arefavored. It is likely that cells came into existence as themost productive platform for metabolism, after which thecharacteristics of individuality and heritable variation socentral to Darwinian selection emerged as a by-productof cellular form.

The fact that the biosphere must be considered asbut one in a collection of geospheres was appreciated byRankama and Sahama [32], p. 320, who puzzled over howbiomass – so small a fraction of terrestrial matter – canhave exercised such a large influence on the earth’s pro-cesses, particularly in the oxygenation of the atmosphereand oceans. We have already noted that the inelastic ab-sorption of light by photosynthetic organisms provides achannel – independent of other physical processes – forthe transduction of light energy between the visible andthermal infrared bands. An elementary calculation [33]shows that absorption in place of elastic scattering in-creases heat-transporting capacity by a factor of ! 109

for each atom acting as an absorber. In other words, ev-ery atom e!ectively rendered capable of photo-absorptionby organosynthesis becomes a billion times more e"cientas a transporter of energy from sunlight to black-bodyradiation than that same atom in an abiotic state. Theenhancement of this energy capacity is mitigated by thelarge volume of biomass apparently required to createeach photosynthetic absorption site.

An emergence of life by collapse does not imply thatthe collapse was a single event, or even that it has run tocompletion.9 The concept of phase transition unifies theemergence of life with the major transitions in evolutionthat were clearly chemical and apparently irreversible,and it also emphasizes that life is not necessarily a singlestatistical phenomenon.

There were presumably distinct chemical “batteries”in the early geosphere (redox, phosphate), and di!erentmodules within metabolism appear to be stabilized inde-pendently to some degree by chemical potentials from thedi!erent sources. The unification of these essentially geo-chemical processes within cells could have required a laterinnovation, such as oxidative phosphorylation for the re-cycling of phosphates, which are di"cult to transportacross nonpolar cell membranes. Nitrogen fixation ap-pears as a still later innovation in response to changes inocean chemistry [34], apparently occurring after the lastuniversal common ancestor (LUCA) had emerged. Theemergence of the LUCA was likely not even energetically

9 In very large systems capable of highly intricate order, such asthe biosphere, it is di!cult to assign a practical meaning to eventhe concept of “completion”. Presumably most of the variables inorganism and ecosystem structure are nearly neutral with respectto the (scalar) criterion of energy transport, and the evolutionarydynamics of these variables must be understood in other terms.

the end of the collapse of the abiotic earth, though it doesrepresent the synthesis of remarkably many forms of or-der. An anabolic, reductive, non-photosynthetic LUCAcould have incorporated all of the major structures andfunctions needed for cells to evolve and flourish, as re-ductive autotrophs do today.

The innovation of photosynthesis could then have pro-ceeded atop this sophisticated platform of genes, en-zymes, and compartments, with photosynthesis from hy-drogen sulfide preceding that from water [35]. Theamount of free energy available from sunlight relative tothat available from geotectonic chemistry is large enoughthat energetically the collapse to photosynthetic life mayhave created a more significant channel than the emer-gence of cells. This is why photosynthesizing life ap-pears to account for most biomass, either directly, or in-directly through by-products on which other ecosystemsdepend. Other apparent novelties following on photo-synthesis were the reductive pentose phosphate cycle forcarbon fixation, endosymbiosis, and ultimately multicel-lularity – the last two of which seem to have been drivenparticularly by the rise of molecular oxygen in the atmo-sphere and oceans.

This collapse to life is an ongoing process. Its most en-ergetically significant and accessible steps probably fol-lowed one another rapidly in the initial stages, with fur-ther refinements coming later and more sporadically –either because they followed from less likely events orbecause they o!ered less energy-flow advantage. How-ever they occurred, we would expect the truly chemicaltransitions to be distinguished by irreversibility, as theemergence of photosynthesis, endosymbiosis, and multi-cellularity apparently are.

This point of view emphasizes that life is a confederacy– of chemical constituents, pathways, and functions – andthat the modular form of its emergence continues to bereflected in its modern organization. The essentially ther-modynamic succession of a biogeosphere through increas-ingly stable states also explains why metabolism seems toshow a clear and deterministic progression, while speciesand their ecologies appear to have been subject to con-tinual waves of extinction.

V. A NEW UNDERSTANDING OF LIFEWITHIN THE GEOSPHERE

However tightly integrated they are in modern cells thefunctions of life are heterogeneous and retain somewhatindependent dynamics. It may be that we should viewcore metabolism as an outgrowth of geochemistry ratherthan a totally novel innovation of biology. As trees in adry landscape tend to grow along the riverbeds, the struc-tures of life have formed themselves around the pathwaysof primordial metabolism. It is precisely there that sig-nificant innovation was not required. Rather than viewcore metabolism as a subset extracted from the primor-dial soup by later evolution we would characterize extant

Page 8: 06-08-029.pdf

7

life as the downstream elaboration of possibilities createdby a metabolic core much like the present one.

The emergence of cells in this picture was a process ofenfolding and controlling geochemical flows. The premisethat cells produced all organic molecules from abioticprecursors right from the beginning has sometimes beencalled an “autotrophic” theory of the origin of life, inorder to distinguish it from the idea of “heterotrophic”dependence on a primordial soup [36]. The terminologyis unfortunate as it conflates di!erent concepts. We pic-ture the emergence of life as a sort of chemical bootstrap-ping process from abiotic precursors in steady state, asopposed to a process initiated by a primordial soup andthen required to develop anabolic pathways to provideits own inputs as the soup was exhausted. The notionsof autotrophy and heterotrophy emerged later, when cel-lularization and speciation arose to facilitate metabolismand evolution. The dynamics at this stage determinedhow cells and species partitioned the metabolic functionsto form ecological structure.

The metabolism of ecosystems is empirically moreuniversal than their partitioning according to the rolesand interactions of species. This fact makes sense ifmetabolism is the bridge between geochemistry and thenovel elements in cell physiology and trophic ecology.The major chemical or metabolic transitions in evolutionhave a di!erent character from the record of extinctions,invasions, and ecological successions. The latter are gov-erned by the dynamics of repartitioning rather than ofprimary production. Finding a stable ecological divi-sion of labor requires solving a complex network of con-straints. The species making up an ecology must jointlyimplement a universal metabolic chart, while predation,saprophyty, parasitism, and symbiosis, create a flux bal-ance of organic molecules among the individuals. Themagnitudes of metabolic fluxes are in turn constrainedby the body sizes of individuals and by environmentalmass transport. In principle, many distinct ecologies cansatisfy these constraints with comparable overall energythroughput, though they may be transformable into oneanother only by complex avalanches of species replace-ment and trophic restructuring.

Other evidence for the di!erence between species dy-namics and the dynamics of metabolism comes from thepreservation of large parts of the metabolic network dur-ing major chemical transitions in evolution. It is easy toimagine that with the rise of oxygen reductive anabolicpathways might have been so poisoned by this wasteproduct as to bring life to an end [37]. Instead they wereessentially preserved, with core networks like the citric-acid cycle reversing direction to break down compoundsfor energy – which in the reducing world they had syn-thesized – while still providing a complete set of biosyn-thetic precursors. Innovations such as the light/dark cy-cle now used by plants emerged, suppressing anabolic ac-tivity during the sunlight hours when molecular oxygenis produced, and re-activating it at night when oxygenpartial pressure is lower.

The prior existence of thermodynamically orderedpathways for organosynthesis opens the possibility fora natural sequence of steps from proto-metabolism tothe first macromolecular phase of life. Geochemicallyordered primordial pathways would favor the preserva-tion of any randomly formed polymers that increasedpathway fluxes, thus producing more of the raw ma-terials from which the polymers were made. The ca-pacity for molecular replication could have been com-pletely sequence-independent in its very earliest stages.If the first polymers were rare in a world where a nascentmetabolic order already existed, their sequences wouldoriginally have been selected on the basis of their inter-actions with metabolites.10 Only at a later stage, whenmany such sequences were present, would we expect themto have taken on relations to each other that aided intheir replication. Under this order of selective forces thecommitment to metabolism would have been made beforereplication became a coordinated macromolecular pro-cess, and we would expect the resulting metabolism topreserve the prior geochemical pathways. The fixing ofproto-metabolic pathways by coded macromolecules cre-ated initial conditions from which these pathways wouldbe inherited by all subsequent generations of cells.

Darwinian competitive exclusion is rooted in thechemical competitive exclusion of metabolism, whetherthrough di!erential rates of growth or di!erential re-source capture. The congruence of cell physiology withthe statistical paths of least resistance will inevitablybe a determinant of fitness. Through it the forces thatdrove the emergence of life will be expected to have stabi-lized its metabolic core throughout earth’s history. If theemergence and maintenance of metabolism have indeedboth been brought about by the same thermodynamicforces, then universality due to descent from common an-cestors becomes indistinguishable from universality dueto evolutionary convergence.

The organization of energy flow through metabolicpathways allows us to recognize many forms of continu-ity absent in conventional thinking. We have good reasonto believe that the first emergent metabolism was simi-lar in many respects to modern universal core anabolism.Metabolism itself becomes a bridge from driven geochem-istry to the foundations of cell physiology and trophicecology. If our story is correct, the thermodynamic forcesresponsible for the emergence of anabolism within prebi-otic chemistry have ensured its stability throughout theensuing history of life. Energy flow embeds life withinthe geosphere not just mechanistically but conceptuallyas an inevitable form of driven geochemical order.

10 We thus dissociate the source of selective advantage from the pro-cess of molecular replication, to the point where the two couldhave been completely independent. Note that this premise is di-rectly opposite the usual premise adopted for an RNA world [38]:that di"erential ability of macromolecules to self-replicate wasthe first source of selective advantage.

Page 9: 06-08-029.pdf

8

Acknowledgments

We are supported in this work by grant FIBR: TheEmergence of Life: From Geochemistry to the Ge-netic Code Award 0526747, National Science Foundation.DES acknowledges Insight Venture Partners for individ-

ual support. We are immensely indebted to Cormac Mc-Carthy for advice on the manuscript, and to Shelley Cop-ley for shaping our views of biochemistry and catalysis.We also wish to thank Doug Erwin and VijayasarathySrinivasan for comments and input.

[1] George Wald. Fitness in the universe: Choices and ne-cessities. Orig. Life Evol. Biosphere, 5:7–27., 1974.

[2] Joseph W. Lengeler, Gerhart Drews, and Hans G.Schlegel. Biology of the Prokaryotes. Blackwell Science,New York, 1999.

[3] Eric Smith and Harold J. Morowitz. Universality inintermediary metabolism. Proc. Nat. Acad. Sci. USA,101:13168–13173, 2004. SFI preprint # 04-07-024.

[4] Michael R. W. Brown and Arthur Kornberg. Inor-ganic polyphosphate in the origin and survival of species.Proc. Nat. Acad. Sci. USA, 101:16085–16087, 2004.

[5] Y. Yamagata, H. Watanabe, M. Saitoh, and T. Namba.Volcanic production of polyphosphates and its relevanceto prebiotic evolution. Nature, 352:516–519, 1991.

[6] Deborah S. Kelley, John A. Baross, and John R. Delaney.Volcanoes, fluids, and life at mid-ocean ridge spreadingcenters. Annu. Rev. Earth Planet. Sci., 30:385–491, 2002.

[7] Christian de Duve. Singularities. Cambridge U. Press,New York, 2005.

[8] Michael Hugler, Carl O. Wirsen, Georg Fuchs, Craig D.Taylor, and Stefan M. Sievert. Evidence for autotrophicco2 fixation vie the reductive tricarboxylic acid cycle bymembers of the ! subdivision of proteobacteria. J. Bac-teriology, 187:3020–3027, 2005.

[9] Andrea Pruess, Rolf Schauder, and Georg Fuchs. Carbonisotope fractionation by autotrophic bacteria with threedi!erent co2 fixation pathways. Z. Naturforsch., 44:397–402, 1989.

[10] Michael J. Russell and William Martin. The rocky rootsof the acetyl-coa pathway. Trends Biochem. Sci., 29:358–363, 2004.

[11] William Martin and Michael J. Russell. On the originof microbial physiology: Acetogenesis, methanogenesis,carbonyl sulfide, acetyl phosphate, and chemiosmosis atan alkaline hydrothermal origin. Proc. Roy. Soc. London,under review.

[12] Faqing Huang, Charles Walter Bugg, and Michael Yarus.RNA-Catalyzed CoA, NAD, and FAD Synthesis fromPhosphopantetheine, NMN, and FMN. Biochemistry,39:15548–15555, 2000.

[13] Sandra Pizzarello and Arthur L. Weber. Prebiotic aminoacids as asymmetric catalysts. Science, 303:1151, 2004.

[14] Shelley D. Copley, Eric Smith, and Harold J. Morowitz.From proto-metabolism to the rna world. in preparation,2006.

[15] R. D. Rosenkrantz, editor. Jaynes, E. T.: Papers onprobability, statistics and statistical physics. D. Reidel,Dordrecht, Holland, 1983.

[16] E. Schrodinger. What is Life?: the physical aspect of theliving cell. Cambridge U. Press, New York, 1992.

[17] R. J. Tykodi. Thermodynamics of Steady States. Macmil-lan, New York, 1967.

[18] Lars Onsager. Reciprocal Relations in Irreversible Pro-

cesses. I. Phys. Rev., 37:405–426, 1931.[19] Lars Onsager. Reciprocal Relations in Irreversible Pro-

cesses. II. Phys. Rev., 38:2265–2279, 1931.[20] Dilip Kondepudi and Ilya Prigogine. Modern Thermo-

dynamics: from Heat Engines to Dissipative Structures.Wiley, New York, 1998.

[21] Aaron B. Corbet and Harold J. Morowitz. Information-theoretic derivation of a nonequilibrium velocity distri-bution. Phys. Rev. A, 6:2298–2301, 1972.

[22] Eric Smith. Thermodynamic dual structure of linearlydissipative driven systems. Phys. Rev. E, 72:36130, 2005.SFI preprint # 05-08-033.

[23] Eric Smith. Carnot’s theorem as noether’s theorem forthermoacoustic engines. Phys. Rev. E, 58:2818–2832,1998.

[24] Eric Smith. Statistical mechanics of self-driven carnotcycles. Phys. Rev. E, 60:1999, 3633-3645.

[25] Eric Smith, Harold J. Morowitz, and Shelley D. Copley.Core metabolism as a self-organized system. In SteenRasmussen, Norman Packard, and Marc Bedau, editors,Bridging nonliving and living matter, Cambridge, Mass.,to appear. MIT Press.

[26] Harold J. Morowitz. Physical background of cycles inbiological systems. J. Theor. Biol., 13:60–62, 1966.

[27] Arthur L. Weber. Sugars as the optimal biosyntheticcarbon substrate of aqueous life throughout the universe.Orig. Life Evol. Biosphere, 30:33–43, 2000.

[28] Shelley D. Copley, Eric Smith, and Harold J. Mo-rowitz. A mechanism for the association of amino acidswith their codons and the origin of the genetic code.Proc. Nat. Acad. Sci. USA, 102:4442–4447, 2005.

[29] S. L. Miller. Production of amino acids under possibleprimitive earth conditions. Science, 117:528–529, 1953.

[30] Sandra Pizzarello. Chemical evolution and meteorites:An update. Orig. Life Evol. Biosphere, 34:25–34, 2004.

[31] Hiroaki Okamoto. Desk Handbook: Phase Diagrams forBinary Alloys. ASM International, Materials Park, OH,2000.

[32] Kalervo Rankama and Thure Georg Sahama. Geochem-istry,. U. Chicago Press, Chicago, Ill., 1950.

[33] Guy A. Hoelzer, John W. Pepper, and Eric Smith. On thelogical relationship between natural selection and self-organization. J. Evol. Biol., to appear 2006.

[34] Jason Raymond, Janet L. Seifert, Christopher R. Sta-ples, and Robert E. Blankenship. The natural history ofnitrogen fixation. Mol. Biol. Evol., 21:XXX, 2004.

[35] Robert E. Blankenship. Molecular Mechanisms of Pho-tosynthesis. Blackwell Science, Malden, Mass., 2002.

[36] Iris Fry. The Emergence of Life on Earth: A Histor-ical and Scientific Overview. Rutgers U. Press, NewBrunswick, N.J., 2000.

[37] Lynn Margulis and Dorion Sagan. Microcosmos: fourbillion years of evolution from our microbial ancestors.

Page 10: 06-08-029.pdf

9

U. California Press, Berkeley, CA, 1997.[38] Raymond F. Gesteland, Thomas R. Cech, and John F.

Atkins, editors. The RNA World. Cold Spring Harbor

Laboratory Press, Cold Spring Harbor, New York, thirdedition, 2006.