apRoximate - authors.library.caltech.edu · Aprogramfor least squaresanalysis ofreassociation and...

11
Volume 4 Number 6 June 1977 Nucleic Acids Research A program for least squares analysis of reassociation and hybridization data * W.R. Pearson, E.H. Davidson and R. J. Britten Division of Biology, California Institute of Technology, Pasadena, CA 91125, USA Received 21 December 1976 ABSTRACT A computer program is described for the rapid calculation of least squares solutions for data fitted to different functions normally used in reassociation and hybridization kinetic measurements. The equations for the fraction not reacted as a function of Cot follow: First order, exp(-kCot); second order, (1+kCot)Y1; variable order, (1+kCot)-n; apRoximate fraction of DNA sequence remaining single stranded, (1+kCot)F ; and a function describing the pairing of tracer when the rate constant for the tracer (k) is distinct frym the driver rate constant (kd): exptkL1-(1+kdCot) -nJ/Lkd(1-n)J}. Several components may be used for most of these functional forms. The standard deviations of the individual parameters at the solutions are calculated. INTRODUCTION The quantitative examination of reassociation and hybridization kinetic measurements has become increasingly important as the sophistication of the measurements has grown. In this paper we describe a general computer program which can conveniently apply the variety of functions now used to interpret kinetic measurements. We have chosen to use a non-linear least squares method so that the solutions give equal weight to all of the individual measurements and no preliminary assumptions need be made about the initial or terminal values of the reaction. Least squares computer programs have been applied to the problem of resolving repetitive and single copy kinetic components in DNA renaturation experiments carried out on many organisms (e.g. 1,2,3). They are also used to determine rate constants in RNA excess hybridization reactions (e.g. 4,5). The use of cDNA orobes to determine the complexity of RNA Populations by kinetic rather than saturation measurements (e.g. 6,7) also relies heavily on the resolution of abundance classes by accurate determination of their rate constants. The five functions listed in Table 1 are used for the examination of the following kinds of measurements. The second order equation (FINGER) describes © Information Retrieval Limited 1 Falconberg Court London Wl V5FG England 1727

Transcript of apRoximate - authors.library.caltech.edu · Aprogramfor least squaresanalysis ofreassociation and...

Page 1: apRoximate - authors.library.caltech.edu · Aprogramfor least squaresanalysis ofreassociation and ... 1nJ/Lktgacer(1-n)J1 describes rate of reaction ... inal f1 ki f2 k2.200~ 0.300

Volume 4 Number 6 June 1977 Nucleic Acids Research

A program for least squares analysis of reassociation and hybridization data

*

W.R. Pearson, E.H. Davidson and R. J. Britten

Division of Biology, California Institute of Technology, Pasadena, CA 91125, USA

Received 21 December 1976

ABSTRACTA computer program is described for the rapid calculation of least

squares solutions for data fitted to different functions normally used inreassociation and hybridization kinetic measurements. The equations for thefraction not reacted as a function of Cot follow: First order, exp(-kCot);second order, (1+kCot)Y1; variable order, (1+kCot)-n; apRoximate fractionof DNA sequence remaining single stranded, (1+kCot)F ; and a functiondescribing the pairing of tracer when the rate constant for the tracer (k) isdistinct frym the driver rate constant (kd):exptkL1-(1+kdCot) -nJ/Lkd(1-n)J}. Several components may be used for most ofthese functional forms. The standard deviations of the individual parametersat the solutions are calculated.

INTRODUCTIONThe quantitative examination of reassociation and hybridization kinetic

measurements has become increasingly important as the sophistication of the

measurements has grown. In this paper we describe a general computer program

which can conveniently apply the variety of functions now used to interpret

kinetic measurements. We have chosen to use a non-linear least squares method

so that the solutions give equal weight to all of the individual measurements

and no preliminary assumptions need be made about the initial or terminal

values of the reaction.

Least squares computer programs have been applied to the problem of

resolving repetitive and single copy kinetic components in DNA renaturation

experiments carried out on many organisms (e.g. 1,2,3). They are also used

to determine rate constants in RNA excess hybridization reactions (e.g. 4,5).The use of cDNA orobes to determine the complexity of RNA Populations by

kinetic rather than saturation measurements (e.g. 6,7) also relies heavily on

the resolution of abundance classes by accurate determination of their rate

constants.

The five functions listed in Table 1 are used for the examination of the

following kinds of measurements. The second order equation (FINGER) describes

© Information Retrieval Limited 1 Falconberg Court London Wl V 5FG England 1727

Page 2: apRoximate - authors.library.caltech.edu · Aprogramfor least squaresanalysis ofreassociation and ... 1nJ/Lktgacer(1-n)J1 describes rate of reaction ... inal f1 ki f2 k2.200~ 0.300

Nucleic Acids Research

TABLE 1 Functional forms used by the program

Number Name Form Use

1 FINGER fi(l + kiCot)f1 second order reaction: DNArenaturation measured byhydroxyapatite chromatography

2 WHATOR f(1 + kCot)-n variable form reaction: to determinevalues of n when the apparent order ofthe reaction is unknown

3 NUFORM fiexDtkiL1-(kdCot)1nJ/Lk (1-n)J1 describes rate oftgacer reaction when tracer rateconstant k differs from driver rateconstant k

-2d4 EXCESS fiexp(-kiCot) first order function: for RNA excess

experiments

5 WHTCMP f.(1 + k.Cot) 0 44 modified second order function: S-1nuclease assay of hybridization

accurately the form of DNA reassociation kinetics assayed by hydroxyapatite

chromatography (8,9) though for fairly comDlex reasons (10,11). The

pseudo-first- order equation (EXCESS) applies when the nucleic acid drivingthe reaction remains unpaired as in RNA driven reactions. The third function

(WHATOR) has a form which can be varied by changing the value of the exponent,and is useful when the apparent order of the reaction is not known or there is

a need to test the heterogeneity of the reacting comnonents. When the

exponent n=1, it reduces to second order form. When high quality single

component hydroxyapatite measurements are analyzed, the least squares solution

yields n=1.0 (11). As n becomes large this equation approaches first orderand values of n as high as 10 yield a form which is indistinguishable from

first order. The next functional form (WHTCMP) is applicable to DNA

reassociation when the reaction is assayed by S-1 nuclease (11,12) and

expresses the fraction of the length of the DNA sequence present which remains

single stranded. The last equation (NUFORM) applies when the rate of

reassociation of tracer with the driver and the rate of driver renaturation

itself differ. The amount of driver available is assumed to follow the

equation (1+kCot)-n. A value of n=.44 is usually used and gives a good

approximation to the actual capacity of the remaining single stranded regions

to reassociate with tracer molecules (13).

The non-linear least squares program described in this paper is in use on

a PDP-10 timesharing computer. The program provides for interactive input but

1728

Page 3: apRoximate - authors.library.caltech.edu · Aprogramfor least squaresanalysis ofreassociation and ... 1nJ/Lktgacer(1-n)J1 describes rate of reaction ... inal f1 ki f2 k2.200~ 0.300

Nucleic Acids Research

has been designed for ease of conversion to a batch processing system. A

typical run of the program is presented in Appendix I, with user responses

underlined. In a Batch processing environment the underlined inputs would be

submitted on cards.

The algorithm used in the program was developed by Marquardt (10) fornon-linear least squares problems. The implementation of the algorithm

provides two additional facilities: 1) the ability to hold any number or

combination of parameters constant in order to find the best solution with the

remaining variable parameters; and 2) the ability to substitute any function

for the NUFORM function by replacing a subroutine. Parameter fixing can

provide important insight into the uniqueness of the solution parameter set.

The function substitution option allows more complex functions to be fit to

data as more comolex phenomena - e.g. rate retardation for single strands on

duplexed molecules (10) - are studied.

ANALYSIS OF HYBRIDIZATION DATAThe program strategy

This non-linear least squares program is designed to converge on a

solution yielding parameters which minimize the least squares deviation of the

function from a set of data. There are two main concerns: whether the final

solution is biased by the input parameter estimates, and whether further

iterations will improve the solution. A solution is indeoendent of inout

parameter estimates and insensitive to further iterations if it has converged.

Convergence is indicated by the amount of change in the RMS from one iteration

to the next and by the change in the DELMX parameter. Because of the Taylor'sseries approximation technique used by this program, the parameter values

converge very rapidly in the neighborhood of a solution. Although this

neighborhood may be difficult to find for the first few iterations (thealgorithm's strategy uses a "gradient" technique to find the neighborhood),once found, successive iterations will improve the oarameter values by one to

two significant figures with each iteration. This imorovement is reflected in

a 10- to 100- fold decrease in DELMX with successive iterations. This rapiddecrease in DELMX may occur while the RMS changes very little. Attemots to

improve the solution beyond convergence may cause the message CORRECTIVE

ITERATIONS EXCEEDED to be printed.

Possible problemsThis orogram rapidly converges to a unique solution from a wide range of

parameter estimates when the data provide adequate constraints. New parameterestimates are always better (in the least squares sense) than the previous

1729

Page 4: apRoximate - authors.library.caltech.edu · Aprogramfor least squaresanalysis ofreassociation and ... 1nJ/Lktgacer(1-n)J1 describes rate of reaction ... inal f1 ki f2 k2.200~ 0.300

Nucleic Acids Research

values but the parameters may become negative during the process. Inadequate

termination data may allow the FINAL parameter to become negative while the

FRACTION parameter for the slowest component increases without bound. Fixing

the FINAL fraction unreassociated (presumably near zero) or close scrutiny of

the DELMX values to find regions of local convergence may solve the problem.Often when the series of iterations do not immediately converge the

program hesitates in a region of local convergence. When pressed to improve

the RMS the program may go off in a direction (such as negative parameters)where termination is impossible. Usually DELMX starts at 1.0 to 5.0, then

decreases to 0.02 to 0.1 as the fit converges (the largest parameter change is

less than 2-10%). DELMX may then jump by a factor of 50 to 200 and start on

another (possibly unterminated) path. The small DELMX value indicates a local

low RMS region which might provide good values for the parameters but theymust be carefully examined. To determine the quality of the parameter values,plot the solution.

Occasionally the program returns negative rate constants for components

of the reaction. This is usually due to an attempt by the program to remove

that component from the solution. Plotting the solution usually shows why

this was done. Single erroneous points that do not show in the plot should be

searched for and perhaps a new solution should be attempted with fewer

components.

Parameter fixing

Data from other measurements may supply fixed values for parameters in a

solution. For example, chemical measurements of genome size may be used to

establish the rate constant of the single copy component. Slave "mini-cot"experiments (3) often provide the most reliable determinations of repetitive

and single copy rate constants which may be used as fixed values to calculatethe fraction of the genome associated with the different rate components in a

total reassociation curve.

Fixing the single copy rate constant at a value determined from the

genome size or minicot analysis may be particularly important for DNA which

contains a small fraction repeated 2-20 fold. This low repetition class isvirtually indistinguishable from single copy DNA but can increase the apparent

single copy rate constant by a factor of two.Parameter fixing also provides information about the variety of similar

least squares solutions which describe the data equally well. A graph of RMS

vs. a set of fixed values of a parameter can be informative, particularly if

it turns out to be a very shallow curve near the minimum.

1730

Page 5: apRoximate - authors.library.caltech.edu · Aprogramfor least squaresanalysis ofreassociation and ... 1nJ/Lktgacer(1-n)J1 describes rate of reaction ... inal f1 ki f2 k2.200~ 0.300

Nucleic Acids Research

TABLE: EFFECT OF GAUSSIAN NOISE ON PARAMETER VALUES AND ERROR ESTIMATESSuccessive trials with random noise

Trial Number ofpoints

F:

Value Parameters used 0Error to generate data 0

SolutionError

SolutionError

SolutionError

SolutionError

SolutionError

SolutionError

Parameter valuesat solution

inal f1 ki f2 k2

.200~ 0.300 0.1000 0.300 10.00I U LU`

1 40 o:. 1b6 o:25 0.0536 0:82832bg0.017 0.037 o0.0289 0. 38 .382 40 o.19 0o.309 0.106 0.297

0.019 0.038 0.048 0.038 6.3 40 0.212 0.801 0.149 0. 1 lt.

0.015 0. 32 0.058 0.8? 7.44 40 o.208 o 0o061 0 82

0.017 0.040 0.039 0:841 2.475 40 o.180 0.340 0.110 0.302 10.2

0.015 0.035 0.039 0.035 4.2sum of 200 0.200 0.293 0.103 0-.300 9.28

trials above 0.007 0.017 0.020 0.017 1 .81

RMS

0.0350

0.0466

0.0392

0.0382

0.0379

0.0403

a Noise factor equal to EFS described in the text. For this example, F is0.2

Parameter statistics

Parameter standard deviations and correlation coefficients calculated by

the program provide information about the range of parameter values which may

adequately describe the data. These numbers, along with the RMS and DELMX,

indicate the significance of the parameter values obtained at the least

squares solution.

The parameter standard deviation and correlation coefficient calculation

is similar to standard deviation and correlation coefficient calculations in

linear regression analysis. To obtain the standard deviations and correlation

coefficients the data covariance matrix is inverted. The calculation assumes

that the Taylor's series linear approximation is accurate in the neighborhood

of the solution (indicated by a low DELMX at convergence). To examine theusefulness of the parameter standard deviations and correlation coefficients,

artificial test data were generated. Some examples of solutions using the

second order (FINGER) function are shown in Table 2.

The data set for each of these analyses was calculated using a Gaussian

random number generator for the final unreacted quantity as follows:

C/Co = GAUS(F,FSD) + ifi(1+kiCot)1(GAUS is a computer subroutine 'which generates a series of values following a

1731

0.300 0.1000 0.300 10.00 I

Page 6: apRoximate - authors.library.caltech.edu · Aprogramfor least squaresanalysis ofreassociation and ... 1nJ/Lktgacer(1-n)J1 describes rate of reaction ... inal f1 ki f2 k2.200~ 0.300

Nucleic Acids Research

Gaussian distribution with average E and standard deviation ES2.)This method of generating variation in the "data" is a good analogue of

actual fluctuations from measurement to measurement, such as the binding of

DNA to hydroxyapatite. The fluctuations observed in Table 2 for the various

solutions and the standard deviations shown are probably representative ofwhat would happen in repetitions of actual measurements which showed a

comparable RMS. It is clear that the standard deviations of the rate

constants are much larger than those of the component quantities. Whenanalyses of this sort are carried out with components only a factor of ten

apart in rate constant (instead of the factor of 100 for the exampleillustrated in Table 2) the rate constant fluctations are very large. It can

be seen from Table 2 that there is a reasonable quantitative relationshipbetween the fluctuations from set to set and the calculated standard

deviations.

This table exhibits one of the characteristic oroblems of fittingreassociation kinetic data and indicates the insight into the accuracy ofindividual parameter estimates provided by the standard deviation

calculations. It is clear that where two kinetic components are present, even

a factor of one hundred apart, very accurate data are needed to obtain

estimates of rate constants with moderate accuracy.

DISCUSSIONWe have described a powerful and flexible program for the least squares

analysis of the kinetics of reassociation. This program has been used for the

analysis of an extensive series of measurements (3,4,5,10,11,13). Least

squares analysis is necessary for reproducible and clear interpretation ofsuch measurements. We refer to these paoers for examples of use and

interpretation, while in this discussion we focus on oroblems of

over-interpretation or misinterpretation of the solutions. As powerful

analytical tools of this sort are developed it becomes very easy to simplyaccept the "output" and lose touch with its meaning.

It is important to remember that any successful least squares strategy

provides parameter values which "fit the data better" in the "least squares

sense". The solution does not guarantee either physical or biological

reality. ComDonents may be used to fit small peculiarities in the data and

have little physical meaning. This is usually evident from a plot of the fit.Parameter values may also be the fluke of a particular set of data and have

little absolute importance. The parameter standard deviations calculated bythe program provide confidence limits for the Darameter values given the data.

1732

Page 7: apRoximate - authors.library.caltech.edu · Aprogramfor least squaresanalysis ofreassociation and ... 1nJ/Lktgacer(1-n)J1 describes rate of reaction ... inal f1 ki f2 k2.200~ 0.300

Nucleic Acids Research

Where it is known from other measurements that a DNA fraction is

homogeneous or where a single copy fraction has been purified the least

squares analysis provides an excellent means for evaluating its rate constant

and permits a more accurate calculation of its complexity. In general it is

not known that components are homogeneous and often the values of parameters

derived from the least squares solutions may be averages of a set of

unresolved components. In such a case the solution makes an excellent model

of the set of repetitive sequences which may be used in a variety of

calculations even though the true individual components are not known. Where

an important issue rests on the potential heterogeneity of a component other

tools must be used. For example fractionation of double from single standed

DNA could be carried out at the midpoint of its reassociation and the rate of

reassociation of the two fractions carefully compared.

Another consideration is as important as questions of parameter meaning:

solution uniqueness. If a parameter can change over a wide range without

affecting the quality of the fit measured by the RMS or GOODNESS OF FIT,conclusions based on a specific parameter value are suspect. In many cases,

this problem will be indicated by a high parameter standard deviation. More

quantitative insight into the significance of a particular parameter value can

be gained by plotting the RMS or GOODNESS OF FIT criterion as a function of

the best fit using different fixed values for the parameter in question. For

example, if the best fit of the data gives a rate constant of 0.05 for a slow

component with a GOODNESS OF FIT of 0.02, other solutions should be found with

the rate constant fixed at 0.15 and 0.015. If the variation in the GOODNESS

OF FIT is less than 10% over this range of rate constants, no conclusions can

be based on the 0.05 value which would be different for the other values. If

the GOODNESS OF FIT changes by 50- to 100% with variations in the parameter

value the value is probably uniquely determined by the data. While the method

of fixing one parameter value and varying the others to find the best least

squares solution can be used for any parameter, it is particularly importantwhen measuring rate constant parameter values. Data with two rate constants

differing by a factor of 10-30 can usually be described by a very wide range

of rate constants.

In summary, once a fully convergent least squares solution has been

established for a set of data we must consider four issues of interpretation:1. There may be systematic errors in the measurement such as DNA degradation

or unknown fragment size. 2. The individual set of data may be atypical,particularly if only a few measurements are available and the standard

1733

Page 8: apRoximate - authors.library.caltech.edu · Aprogramfor least squaresanalysis ofreassociation and ... 1nJ/Lktgacer(1-n)J1 describes rate of reaction ... inal f1 ki f2 k2.200~ 0.300

Nucleic Acids Research

deviations are then a ooor measure of the possible error. Deviations inreassociation kinetic measurements are typically not due to random statistical

variables such as sampling from a population or radioactive decay, but aremore likely due to variations in assay procedures such as hydroxyapatite batch

or temperature or volume of samples. This weakens the significance of thestandard deviations. 3. The components may not represent true individualrate components but be averages of unresolved sets of components. 4. The

least squares minimum in the region of the solution may be shallow. In that

case, the set of solution parameter values may not uniquely fit the data;other parameter sets with substantially different values might provide equallyvalid interpretations of the data.

In many cases such problems can be shown to be of minor quantitativesignificance. Least squares solutions such as those generated by this programrepresent the best presently known approach to the interpretation of

reassociation and hybridization kinetics.

The program described in this paper is available as a punched card deckfrom the authors. Two versions are available, one for a DECsystem-10timesharing computer and a second for an IBM 370 Batch processor. Both

programs are written in FORTRAN and can be easily modified for other systems;the two programs differ in minor system dependent features. Inquiries shouldbe directed to William Pearson, California Institute of Technology, 101

Dahlia, Corona del Mar, CA 92625.

Also Staff Member, Carnegie Institute of Washington

REFERENCES1 Davidson, E. H. and Britten, R. J., (1973) Quart. Rev. Biol.

46,565-6132 Davidson, E. H., Galau, G. A., Angerer, R. C. and Britten, R. J.,

(1975) Chromosoma 51,253-2593 Britten, R. J., Graham, D. E. and Neufeld, B. R. (1974) in Methods in

Enzymology (L. Grossman and K. Moidave, eds.) Vol. 29 Part E pp.363-416

4 Galau, G. A., Britten, R. J. and Davidson, E. H., (1974) Cell 2,9-205 Galau, G. A., Klein, W. H., Davis, M. M., Wold, B. J. Britten, R. J.

and Davidson, E. H. (1976) Cell 7,487-5056 Bishop, J. O., Morton, .J. G., Roshbash, M. and Richardson, M. (1974)

Nature 250,199-2047 Ryffel, G. U. and McCarthy, B. J. (1975) Biochemistrv 14,1379-13848 Britten, R. J. and Kohne, D. E. (1966) Carnegie Inst. Wash. Yearbook

65,73-1069 Wetmir, J. G. and Davidson, N. (1965) J. Mol. Biol. 31,349-370

10 Britten, R. J. and Davidson, E. H., (1976) Proc. Nat. Acad. Sci. US73,415-419

1734

Page 9: apRoximate - authors.library.caltech.edu · Aprogramfor least squaresanalysis ofreassociation and ... 1nJ/Lktgacer(1-n)J1 describes rate of reaction ... inal f1 ki f2 k2.200~ 0.300

Nucleic Acids Research

11 Smith, M. J., Britten, R. J. and Davidson, E. H. (1975) Proc. Nat.Acad. Sci. US 72,4805-4809

12 Morrow, J. (1974) Ph. D. Thesis, Stanford Universitv13 Davidson, E. H., Hough, B. R., Klein, W. H. and Britten, R. J.

(1975) Cell 4,217-23814 Marquardt, D. W. (1963) J. Soc. Indust. Appl. Math. 11,431-441

1735

Page 10: apRoximate - authors.library.caltech.edu · Aprogramfor least squaresanalysis ofreassociation and ... 1nJ/Lktgacer(1-n)J1 describes rate of reaction ... inal f1 ki f2 k2.200~ 0.300

Nucleic Acids Research

* 00

r

700

O-

00+I3

(LD 9t 0a0 4

*00

. 9*

AEt.- -- O

C_H *

Y 0

00-;000 0

(a0 1E 0@

CC

Cr 4 4. C)0 1 )

* H00

e0 0

4. 0 C O.4.. a IV" 0Q0 00m0:3 4 &- 3

0)C0 Ca

0C0)0.

0. 09C

0 0 04-

0 0 -

0t S -4 10(D-

fi C 3>< *

0v -o0 00

.C j r

0 0 00

-3 CW 3= CIC: ^1Cy

O 0 00-

0 oe C0 0

-0" e

CDO t

_ w 08

:cn Inrn (Ilo C) O OC I I

4-0 C cn a, `D

0 co 0) C)

80 e0 e0 0: + o+ co+ +14 rr mttvc0aCID uln cin I-4 r, LCn Uo Lne * -4 . 0 .

) -C O O O-4 0) -

cst cli r,j -- rMj C\00 0to 00 0o I tic

UE W- CN C\M.-C\j M m .)

a00 00 0

C4 CCO Oc C)9?+ + C + +

0^n CIO Val 0-,I_wD Coco o

W 0 *) c

*

(O+ + rn

_ n- El -

OO a 0 0 D

30r 0 \00CLtf :t t o.Dti -ti:t zr JotN t--

C\-1 Ntv - 'D

00 00 -O 0 O

.n +

L.0w

4.O -0$0 ++OCC\ ++D+¢

CO0 -000

C) 0C)00.C + L ++-t-

I)Ot

m 0080

+0 - 000("

0 0000

00 -00 O Xs

C- + OO U>tFnt

O -000000 Z00000

. .- 00000W

uMazonx o t-o

C

r_

0

0 E0

0

co

CYL

C) C C

t_ 3 P

C00C

_ C4 _

0 0 n£ e z -

rz r- fc s

rnC.

.0e

a

.la

rnncl

-.

F-

01 ru

=C-3

Z ascc0=0

tn -I) rn

C) C) C)

O coO (.> C)

C) C) C)

Of tm rJ

CDUNtt0

ON C) - -

--C) O OOC) 0 C: 11

I: + + t0 ctSrW1 4W )cm oT Ln =>

C) O CO 0 4

o CO Of O >

£ + r-+ + a*CaN J t 0

a .a ea c

C, 0

nyu>+ ( CMrn sD0: I? C!"? I?S00

'd CY,

Of. C

0 in C

td Cy

O.CO

0 0

:J;C

* C\0 00CL

00I,OE4U~CO

+Y71 +yatn C\-.

OC OJC

cl

's]

zo

-3o.U

UE

fn

CC

E*tD r

C.

_ ou

.0cn (L .

a, ooD

(n Lr.\.

C) 0 +

C/) w uo rC4]a Mz

f0'C).+=*0

r^0. ul.0 _ot-

MU)r, *co *

CDoxn

z. 10.

,So a, c-

E:rrS;

*nO

za

:D o

0 0

Oa,Orx,

(1H

CD C) M

O- 0M r

5;Co

Coa

00

_2

00

u O-O>

000nna

-0000

. I %.C*C;(0C;C;0

-OAC,88_oooo7

. .7 . ..o

_o r uot

1736

W W

C\1CM CM

0 C)0

CMLl.C) 0

CM CM0 C)

cn cl.

E4 U3Ln 0C\j --x

C.0 0

W wC), C)

C; C;-r C) ;;OC".0 O

+ +1.1 Ln E.3

Ul-D alo

f C\jLf)1.3 --*

C\j CR cmnoO.

. .9 .

C)o C) C)

cOrO

C

0

a.0i-v)C.t-o-3a.-3oc.E-0-311.L)'Ilif-0-3C6

"icoul

am

QC0iH

-CH1/)

la

C0

4)-4-4t...

0

4)8coC

WZ.-cr

L)>-

E- 6-0u -C

I.- -CC a

E--cn.-q-3

0

Page 11: apRoximate - authors.library.caltech.edu · Aprogramfor least squaresanalysis ofreassociation and ... 1nJ/Lktgacer(1-n)J1 describes rate of reaction ... inal f1 ki f2 k2.200~ 0.300

Nucleic Acids Research

0)C. @3

0. 0.3

.-0

0

0 L.

0-. @30C) >

o0 3.(#3 -

- 0 Ca C0-4n 0 C00 2 Ht4.2

0- 0 - -O F aC

.4 0 3..

2 a

t.r

J a: C

x~~~~

0. 4.4

r +

04 40 040

co~~ ~

;O* 0.~-O

L '-000

@ ~ ~~~~~+ +

0 '-000000

O 000t.00-

-4

a o_-lDo\

o ooao

oaoooo

*tXoozne

C -0000Omt

O1S) O *OD New cn cn M n n (In

0 0 0) 0) 0) 0)bD I I I ICa L. r3 r,] E. r-3 rS3 E.

4. :> LrE U,, Ln LC LCN U)c M cn cn M M

.- 4.* 4. 4. 4. 4. 4

0. 000 000

0 0 0 0 0 0 00.0C 0 0 0 O 0 00.3 4.+ + + + + +

.0C.:3.W(#E3

cr, rM r rn -r0 rS

40 4.1* 4. 4. 4. 4. 4.

O 3-4 404 404OJ J4 O044*.03CO0 CO;. CO C'0340

. 0 0 0 0 0 0

C) 0 00 40 COOD 0

2442 2 4003 0 0 C. O, 003 003s 0 0 o~~~C o

0C4. + + + + + +C.] Wx rW W C.]r

_e 4 t 1

CC)&- C) OC) - -

z + + + + + +tD Mrt C\oJ N CM.n N

00 \0o 0

0

* 'DO o loo co coC) 0OoCX) + 1D + Jo + t. + cn + 4 +

o C.3 NrSa CMSCa S -r cowS£o -t

MO Ln C_ D- -_-D- N-- --r

Z ao 10 040 100 NO00~

ZE4 M mto _-M M- M;V M; M=

atC', m orC) orn

*00

(\ 0

0 ,343_ O

33.3. 0

1.* .3

*.3 00 0

a#30 - -0

..

0 O-

0 L0

E..0 '0'

+ I

0 0-

_0 UNX

0 00

0 .

* 0

L) UG\ < D

n 0O 0

rS. CY--C

(In rn0 0

C) o

cn CY)

C)

O 5

r\e N

Cr) C).0

CO

\0 Oa

o r-

MO ONCV( 5'"'

L'n U", U Ln LEs)rn rn rr} rn rr

CY E- JD flo D

U) LO U) U) Ln

C4 E. . . .3

t- CY, -- - (n(n N\ rn "I rn

W t r r

ur U,% Ln Ln tn

CIO E. Eo EO

O C; C; o O

t0 10 -O t-O _-noO n + D+ -+ a+-c WcrnrS orS Nr

O- O-0 O- O--r00 00 Oa 00 00

_II. CW_ |-Ora oz COI, oXDMtn) 0¢c OsD 0r Mt-NNI( NOJ(\ NNjNNiNOJ\(or r r ocn o.11. C.. . . . .~

00 00 OO OO 00)

000

I +

4.6 C)00

0

0' . 0'-

oo

0 003-401+\3.00

CO 3- o3

*oocE. r.)4S_

0O -~ '-'- 0

=r !o MN O L.043-. _. 3CO

CO * 000L.

C0 '-' la.t

O * + IE. )rS r L.

O - 4)in AiFU3 _ _ W

00 *+ +C

ao * OG 0 C

+ C 4)

_\ 4-3

O%f a.* 000*0j --oa4 L.o

O3

_ _W_ MN|e _ M_ CL.

0

,4t

C0-4X

0C

0 .coE-- C r,-

0040 0O

02

- o

JL r,,

LA rn-

En E

crn

rn

e

0

00

co

H

44

f-

U' ot

__o

-oaa

LrV ooc C,

C-O00I- I0000wU-I

1737

0 0U" LnM cn

E.] cono noN cliU. U.(D 0

CV CV

clicn

Ln U'%

C; C;

Ed0% 0

C;OCD

C)o 00cn + z +loEd C.3C\Ko

00 Oa

CU C\I1.2 1.2O0.3 0 E.3crv- Cy%t-C\I(\j (%JC\I91.1 911.Oa 00

ua:2.M

0

E-OU

MCE-CL.

10CD