Incorporation of parametric factors into multilinear receptor model studies of Atlanta aerosol
-
Upload
eugene-kim -
Category
Documents
-
view
212 -
download
0
Transcript of Incorporation of parametric factors into multilinear receptor model studies of Atlanta aerosol
Atmospheric Environment 37 (2003) 5009–5021
ARTICLE IN PRESS
AE International – North America
*Correspond
E-mail addr
hopkepk@clark
1352-2310/$ - se
doi:10.1016/j.at
Incorporation of parametric factors into multilinear receptormodel studies of Atlanta aerosol
Eugene Kima, Philip K. Hopkeb,*, Pentti Paateroc, Eric S. Edgertond
aDepartment of Civil and Environmental Engineering, Clarkson University, Box 5708, Potsdam, NY 13699, USAbDepartment of Chemical Engineering, Clarkson University, Box 5708, Potsdam, NY 13699, USA
cDepartment of Physical Sciences, University of Helsinki, Helsinki, FIN-00014, FinlanddAtmospheric Research and Analysis, Inc. 3500 Cottonwood Drive, Durham, NC 27707, USA
Received 11 February 2003; accepted 14 August 2003
Abstract
In prior work with simulated data, ancillary variables including time resolved wind data were utilized in a multilinear
model to successfully reduce rotational ambiguity and increase the number of resolved sources. In this study, time
resolved wind and other data were incorporated into a model for the analysis of real measurement data. Twenty-four
hour integrated PM2.5 (particulate matter p2.5mm in aerodynamic diameter) compositional data were measured inAtlanta, GA between August 1998 and August 2000 (662 samples). A two-stage model that utilized 22 elemental
species, two wind variables, and three time variables was used for this analysis. The model identified nine sources:
sulfate-rich secondary aerosol I (54%), gasoline exhaust (15%), diesel exhaust (11%), nitrate-rich secondary aerosol
(9%), metal processing (3%), wood smoke (3%), airborne soil (2%), sulfate-rich secondary aerosol II (2%), and the
mixture of a cement kiln with a carbon-rich source (0.9%). The results of this study indicate that utilizing time resolved
wind measurements aids to separate diesel exhaust from gasoline vehicle exhaust. For most of the sources, well-defined
directional profiles, seasonal trends, and weekend effects were obtained.
r 2003 Elsevier Ltd. All rights reserved.
Keywords: Source apportionment; Receptor modeling; Positive matrix factorization; Multilinear engine; PM2.5
1. Introduction
Advanced source apportionment methods for the
airborne particulate matter will be needed to assist in
State Implementation Plan as a data analysis tool for
identifying and apportioning airborne particulate matter
sources. Positive matrix factorization (PMF) (Paatero,
1997) and Unmix (Henry and Norris, 2002) have been
shown to be powerful alternatives to traditional receptor
modeling of airborne particulate matter (e.g. chemical
mass balance, conventional factor analysis) (Huang
et al., 1999; Willis, 2000; Qin et al., 2002; Maykut
ing author.
esses: [email protected] (E. Kim),
son.edu (P.K. Hopke).
e front matter r 2003 Elsevier Ltd. All rights reserve
mosenv.2003.08.035
et al., 2003). Bilinear PMF (PMF2) has been used to
assess particle source contributions in the Arctic (Xie
et al., 1999a), in Hong Kong (Lee et al., 1999), in
Phoenix (Ramadan et al., 2000), in Thailand (Chueinta
et al., 2000), in Vermont (Polissar et al., 2001), in three
northeastern US cities (Song et al., 2001), and in Atlanta
(Kim et al., 2003a). Unmix has been applied to several
aerosol data sets from Los Angeles (Kim and Henry,
2000b) and Phoenix (Lewis et al., 2003). Also, PMF2
and Unmix were compared in the northern Vermont
aerosol study (Poirot et al., 2001) and in the Seattle
particle size analysis (Kim et al., 2003b).
As pointed out by Henry (1987), there is rotational
ambiguity in the factor analysis problem that makes
infinite number of possible solutions. The incorporation
of additional information can be useful to improve the
d.
ARTICLE IN PRESS
Fig. 1. Location of the Jefferson Street monitoring site in
Atlanta, GA and major point sources contributing to the
monitoring site. (1): cement kiln, (2): rail yard, (3): metal
recycling, (4): bus station and (5): metal processing.
E. Kim et al. / Atmospheric Environment 37 (2003) 5009–50215010
solution. A more flexible multivariate method, the
multilinear engine (ME, Paatero, 1999), can fit any
model that can be expressed as a sum of products. It has
been used to analyze Arctic aerosol in a multiway model
(Xie et al., 1999b; Yli-Tuomi et al., 2003) and has been
used to fit the standard bilinear factor analysis model
(Ramadan et al., 2003). In a recent work, hourly
measured wind variables and other factors were utilized
in an expanded model to reduce rotational ambiguity.
This model was successfully tested using simulated data
developed by the US EPA (Paatero and Hopke, 2002)
and measured PM2.5 (particulate matter p2.5mm in
aerodynamic diameter) mass concentrations data
(Paatero et al., 2003).
The objective of this study is to examine the use of
such model with an actual particle composition data set.
The model incorporates time resolved wind measure-
ments into ME to enhance solution of multilinear
receptor model. In the present study, ME was applied
to PM2.5 compositional data of daily samples collected
during a two-year period at a monitoring site in Atlanta,
Georgia. The ME resolved PM2.5 sources are compared
with previous study for the same data set using PMF2
(Kim et al., 2003a). The directional profiles, hourly
patterns, seasonal trends, and weekend effects are
discussed.
2. Sample collection and chemical analysis
The PM2.5 compositional data analyzed in this study
consisted of measurements taken at Jefferson Street
monitoring site located 4 km northwest of downtown
Atlanta. This monitoring site is located in an industrial
and commercial area shown in Fig. 1. The prevailing
winds are from the east, southeast, southwest, and
northwest in summer. In winter, winds are mostly from
east and northwest. Daily integrated PM2.5 samples were
collected using the particulate composition monitor
(PCM, Atmospheric Research and Analysis, Inc.,
Durham, NC) that has three independent sampling lines
(air flow rate 16.7 l/min) 5m above ground. Each
sampling line has a 10 mm-cyclone (URG, Carrboro,NC) followed by WINS impactor (URG, Carrboro,
NC) which has a 2.5 mm-cutoff size (D50) in particleaerodynamic diameter (Vanderpool et al., 2001; Peters
et al., 2001). The PCM permits simultaneous sampling
on a 3-stage filter pack (Teflon, Nylon, and cellulose
filter; diameter 47mm), a Nylon filter (diameter 47mm),
and a quartz filter (diameter 37mm). The PCM includes
carbonate denuders and citric acid denuders upstream of
both the 3-stage filter and the Nylon filter. The quartz
filter includes an upstream carbon denuder (BYU,
Provo, UT) to remove gaseous organic materials
(Eatough et al., 1993). The Teflon filters of the 3-stage
filter pack samples were measured for mass concentra-
tions and analyzed via energy dispersive X-ray fluores-
cence (XRF) (Dzubay et al., 1988) by Chester LabNet,
Inc. (Tigard, OR). The nitrate (NO3�) and ammonium
(NH4+) mass loss on the Teflon filter was measured by
the following Nylon and cellulose filters of the 3-stage
filter pack samples that were analyzed via ion chroma-
tography (IC) for NO3� and NH4
+ (Appel et al., 1981;
Pszenny et al., 1993). The measured mass concentrations
have been increased to account for the volatilized
ammonium nitrate. The Nylon filters of the independent
sampling line were analyzed via IC for sulfate (SO42�),
NO3�, and NH4
+. The quartz filter was analyzed via
thermal optical reflectance/Interagency Monitoring of
Protected Visual Environments (IMPROVE) protocol
(Chow et al., 1993) for organic carbon (OC) and element
carbon (EC) (Desert Research Institute, Reno, NV). In
addition, wind speed and wind direction are measured
hourly at the monitoring site: geometric mean of wind
speed was 1.37m/s.
In this study, 662 daily samples collected between
August 1998 and August 2000 and 22 species were used.
Daily samples in which wind speed or wind direction
was missing were excluded from this analysis. XRF S
and SO showed excellent correlations (slope=3.1,
r2 ¼ 0:99), therefore only IC SO42� was used in this
analysis. The analysis of the compositional data revealed
a mass closure problem. The measured PM2.5 mass
concentrations by 3-stage filter were compared with the
summations of PM2.5 particle species. Approximately
34% of the measured PM2.5 mass concentrations were
smaller than the summations of species concentrations
in this comparison. This mass closure problem (sum of
species >PM2.5 mass) is thought to be caused by the loss
of semivolatile OC (Van Vaeck et al., 1984) since the
ARTICLE IN PRESSE. Kim et al. / Atmospheric Environment 37 (2003) 5009–5021 5011
mass of the volatilized ammonium nitrate has been
already added to the measured mass concentration. It
also represents a problem for the multilinear regression
analysis that has generally been used in the ME analysis.
Therefore, an alternative approach described in next
section was employed instead. In this study, species for
which more than 90% of values were below the
detection limit were excluded. Summaries of PM2.5
speciation data used in this study are shown in Table 1.
3. Data analysis
The ordinary receptor modeling problem can be
stated in terms of the contribution from p independent
sources to all chemical species in a given sample (Miller
et al., 1972; Hopke, 1985, 1991) as follows:
X ¼ GFT þ E; ð1Þ
where X is a n � m data matrix with nmeasurements and
m number of elements. E is a n � m matrix of residuals
that are not fit by the model. G is a n � p source
contribution matrix with p sources, and F is a m � p
source profile matrix. For the consistent notation, F is
transposed in this study. The corresponding component
Table 1
Summary of PM2.5 and 21 species mass concentrations used for ME
Species Concentration (ng/m)3
Geometric meana Arithmetic mean Minimum
PM2.5 15871 18029 1930
SO42� 4488 5579 526
NH4+ 2485 2920 298
NO3� 877 1124 127
Cl 82 108 21
EC 1643 1982 227
OC 4024 4495 878
As 1.0 1.5 0.47
Br 2.9 4.2 0.26
Cu 2.0 3.8 0.59
Mn 1.4 2.0 0.37
Pb 3.7 6.6 1.1
Sb 2.0 3.5 1.9
Se 0.93 1.4 0.32
Sn 3.9 4.4 3.2
Ti 3.6 5.0 2.0
Zn 12 17 0.42
Al 11 29 5.7
Si 191 253 23
K 65 81 7.7
Ca 61 78 4.6
Fe 102 132 13
aData below the limit of detection were replaced by half of the repobBelow detection limit.
equation is
xij ¼Xp
k¼1
gikfjk þ eij ; ð2Þ
where xij is the jth species concentration measured in the
ith sample, gik is the particulate mass concentration
from the kth source contributing to the ith sample, fkj is
the jth species mass fraction from the kth source, eij is
residual associated with the jth species concentration
measured in the ith sample, and p is the total number of
independent sources.
In the expanded ME analysis, the bilinear model
shown in Eq. (2) is augmented by additional complex
equation that contains modeling information. The most
basic form of the this equation is
xij ¼XP
k¼1
Dðdi; kÞVðvi; kÞfjk þ e0ij ; ð3Þ
where D and V represent matrices, consisting of
unknown values to be estimated during the model fitting
process. The known index value di and ni indicate wind
direction and wind speed of the ith day for the kth
source, respectively. The indices are shown in parenth-
eses, not as subscripts for the typographic reasons. In
this study, the index value di is obtained from the
analysis
Number of BDLb
values (%)
Number of missing
values (%)
Maximum
49264 0 0
20851 0 5 (0.8)
10314 0 6 (0.9)
6014 0 44 (6.6)
613 115 (17.4) 44 (6.6)
10234 0 22 (3.3)
22089 0 21 (3.2)
11 337 (50.9) 35 (5.3)
204 29 (4.4) 34 (5.1)
42 214 (32.3) 34 (5.1)
13 166 (25.1) 34 (5.1)
78 195 (29.5) 34 (5.1)
105 321 (48.5) 34 (5.1)
10 284 (42.9) 34 (5.1)
17 551 (83.2) 34 (5.1)
55 363 (54.8) 34 (5.1)
211 3 (0.5) 34 (5.1)
1700 406 (61.3) 34 (5.1)
3966 0 34 (5.1)
996 0 34 (5.1)
702 1 (0.2) 34 (5.1)
1502 0 34 (5.1)
rted detection limit values for the geometric mean calculations.
ARTICLE IN PRESS
Table 2
Classification of auxiliary variables used for ME analysis
Index Auxiliary variable
Wind direction (degree) Wind speed(m/s) Calm wind (m/s) Time-of-day Time-of-year Weekend
1 0–35 1.0–1.5 0–1.0 3–6 Jan–Feb Weekend
2 35–73 1.5–2.1 7–9 Mar–Apr
3 73–92 2.1–2.8 10–13 May–Jun
4 92–112 >2.8 14–16 Jul–Aug
5 112–142 17–20 Sep–Oct
6 142–166 Nov–Dec
7 166–194
8 194–217
9 217–230
10 230–242
11 242–254
12 254–271
13 271–290
14 290–304
15 304–318
16 318–331
17 331–343
18 343–360
E. Kim et al. / Atmospheric Environment 37 (2003) 5009–50215012
classification of the wind direction of the ith day into 18
indices as shown in Table 2. For example, if the wind
direction is 170� for the ith day, then di ¼ 7 for suchday. Four classifications were used for the value ni: Theclassifications were chosen to make each index interval
have approximately the same population.
In Eq. (3), other information on the sources of
variation in the concentration that might aid the
separation of the sources can be incorporated. In this
study, wind direction, wind speed, time of day, time of
year, and weekend/weekday were used. For the wind
direction and wind speed, hourly averaged values were
used. The complete expanded model consists of the basic
bilinear equation and a multilinear equation specifying
the physical model
xij ¼XP
k¼1
gikfjk þ eij ; ð4Þ
xij ¼XP
k¼1
SðZi; kÞWðoi; kÞX24h¼1
Dðdih; kÞVðvih; kÞ
Rðeih; kÞTðlih; kÞfjk þ e0ij ; ð5Þ
where SðZi; kÞ is the element of matrix S with the indexvalues Zi corresponding to the time-of-year classification
of the ith day for the kth source. Time-of-year was
classified into six two-month periods (or season) as
shown in Table 2. For the value i ¼ 1–60 (January andFebruary), Zi ¼ 1: Wðoi; kÞ is the element of matrix Wwith the index values oi corresponding to weekend/
weekday factor of the ith day for the kth source. The
weekend effect matrix W has dimension 1 by p: In thisstudy, the weekday coefficients have been fixed at unity
so that only the weekend coefficients are variable. The
elements of matrix W specify the average strength of
each factor on weekend relative to the strength in
weekday. Dðdih; kÞ is the element of matrix D with theindex values dih for the wind direction during hour h of
the ith day for the kth source. Vðnih; kÞ is the element ofmatrix V with the index values nih for the wind speed
during hour h of the ith day for the kth source. Rðeih; kÞis the element of matrix R with the index values eih for
the calm wind (o1m/s) during hour h of the ith day for
the kth source. Because of isotropic wind direction, calm
wind was separated as an separate matrix R in this
analysis instead of being included in the wind speed
index matrix V. Also, the wind direction of calm wind
was not incorporated in the wind direction index matrix
D. Tðlih; kÞ is the element of matrix T with the indexvalues lih for the time-of-day during hour h of the ith
day for the kth source. The matrices, S,W, D, V, R, and
T contain unknown values to be estimated in the fitting
process. The specific factor elements used to fit a
particular data point are selected based on the hourly
or daily values of the corresponding auxiliary variables.
Therefore, these auxiliary variables are not fitted, but
served to determine the indices to the values to be fitted.
ME provides a solution that minimizes the value of Q;based upon uncertainty estimates for each observation
(Paatero, 1997, 2000) while the values of the unknown
matrices G, F, S,W, D, V, R, and T are to be determined
so that the model fit the data as well as possible. The Q
ARTICLE IN PRESSE. Kim et al. / Atmospheric Environment 37 (2003) 5009–5021 5013
value is defined as
Q ¼Xn
i¼1
Xm
j¼1
eij
sij
� �2þXn
i¼1
Xm
j¼1
e0ij
s0ij
!2; ð6Þ
where sij is an uncertainty estimate for the bilinear
model and s0ij is an uncertainty estimate for the
multilinear model in Eq. (5) in the jth element measured
in the ith day.
The application of the ME depends on the estimated
uncertainties for each of the data values. The uncer-
tainty estimation provides a useful tool to decrease the
weight of missing and below detection limit data in the
solution as well as account for the variability in
the source profiles. The effective error estimates should
be specified so that they include all uncertainties which
produce a deviation between the fit and the observed
values. The procedure of Polissar et al. (1998) was used
to assign measured data and the associated uncertainties
as the input data to the ME. The sum of the analytical
uncertainty and 1/3 of the detection limit value was used
as the uncertainty assigned to each measured value.
Values below the detection limit were replaced by half of
the detection limit values and their uncertainties were set
at 5/6 of the detection limit values. Missing values were
replaced by the geometric mean of the measured values
and their accompanying uncertainties were set at four
times of this geometric mean value. In addition, the
estimated uncertainties of measured species that had
scaled residuals larger than72 were increased to reducetheir residuals (Paatero, 2000; Hopke and Paatero,
2003). Large uncertainties were assigned to several
elements so that their scaled residuals were smaller than
72: PM2.5 mass concentration, As, Se, Ti, Si, K, and Ca
(two times of its accompanying uncertainties); OC, EC,
Cl, Mn, Sb, Sn, and Al (3� ); SO42�, NH, Br and Fe
(4� ); Cu (5� ).In Eq. (5), the multilinear equation is one of multiple
possible models depending on the understanding of the
system under study while the mass balance in the
bilinear equation should be much more applicable.
Because the variability of the index factors is restricted
by the model, Eq. (5) will produce a significantly poorer
fit to the data than the bilinear equation (Eq. (4)).
Therefore, the uncertainty estimates corresponding to
the multilinear equation must be larger than those
corresponding to the bilinear equation to decrease the
weight of the multilinear equation in the solution. In this
study, from the experiments with different uncertainties,
estimated uncertainties of multilinear equation were set
at nine times of estimated uncertainties of bilinear
equation.
Generally, the results of ME are normalized by a
scaling constant that is determined by regressing total
PM mass concentrations against estimated source
contribution values, so that the quantitative source
contributions as well as profiles for each source are
obtained (Kim et al., 2003c). In this study, because of
the mass closure problem noted above, the measured
PM2.5 mass concentrations were included as an inde-
pendent variable in the ME analysis to directly obtain
the mass apportionment without the usual multilinear
regression. When the measured PM2.5 mass concentra-
tion is used as a variable, the ME apportions a mass
concentration for each source according to its time
variation. The results of ME analysis were then normal-
ized by the apportioned particle mass concentrations so
that the quantitative source contributions for each
source were obtained. Specifically
xij ¼XP
k¼1
ðckgikÞfjk
Ck
� �; ð7Þ
where ck is directly apportioned mass concentration by
ME for the kth source.
To reduce the influence of extreme values on the ME
solution, the Robust mode was used for this study. A
data point was classified as an extreme value if the model
residual exceeds four times the estimated uncertainty.
The estimated uncertainty values of those extreme
values were then increased so that the influence of the
outliers were reduced.
The six independent variables in the multilinear model
are correlated. Also, this analysis is based on daily
integrated speciation data and hourly measured meteo-
rological data. In order to avoid spurious results and
obtain unique smooth solution, it is necessary to
regularize the model at the expense of possibly losing
some detail. In this study, the wind speed index matrix
V, time-of-day index matrix T, and weekend effect
matrixW were pulled towards unity as a regularization,
such as
Vðr; kÞ ¼ 1 ðr ¼ 1;y; 5; k ¼ 1;y; pÞ; ð8Þ
where Vðr; kÞ is the element of matrix V with the rth
index value for the kth source. In addition, smooth
equation was used for wind directional index matrix D
Dðr; kÞ ¼ Dðr þ 1; kÞ ðr ¼ 1;y; 17; k ¼ 1;y; pÞ: ð9Þ
The uncertainty estimates corresponding to these
auxiliary equations were specified so that the contribu-
tions to the value of Q from these equations were
relatively small to the contributions from the bilinear
equation (Eq. (4)). The regularization removed spurious
variations from factors that were not essential for
achieving a good model fit.
4. Results and discussion
To determine the number of sources, it is reasonable
to test different numbers of sources and use the one that
both adequately fits the data and provides the most
ARTICLE IN PRESSE. Kim et al. / Atmospheric Environment 37 (2003) 5009–50215014
physically meaningful results. As pointed out by Henry
(1987), rotational ambiguity exists in factor analysis
modeling. Therefore, ME was run hundreds times to
determine the optimal solution in the range within which
the objective function QðEÞ values in Eq. (6) remainsrelatively constant. The selected final ME solutions were
determined by trial and error with different number of
sources as well as different uncertainty estimates, and
based on the evaluation of the resulting factor profiles.
Nine sources were obtained from fitting the expanded
model. Within the range of relatively constant QðEÞvalues, the factor profiles were relatively stable. In the
eight-source model, gasoline and diesel exhaust profiles
were combined to make a motor vehicle profile. In ten-
source model, an additional source profile that had zero
mass concentration was deduced indicating that nine-
source model was appropriate. The sensitivity analyses
were conducted by excluding each index matrix from the
multilinear equation (Eq. (5)). Diesel exhaust was not
separated from gasoline exhaust without one of six index
matrices: Diesel and gasoline exhausts were shown as a
motor vehicle source or two OC high sources in the
results.
Fig. 2. Source profiles.
Fig. 2 presents the identified source profiles and Fig. 3
shows time series plots of estimated daily contributions
to PM2.5 mass concentrations from each source. A
comparison of the daily reconstructed PM2.5 mass
contributions from all sources with measured PM2.5
mass concentrations is shown in Fig. 4. When the
uncertainties associated with this data set are consid-
ered, the squared correlation coefficient of 0.90 indicates
that the resolved sources effectively account for most of
the variation in the particle mass concentration. Also,
this squared correlation coefficient 0.90, linear regres-
sion coefficient 0.84, and intercept 1.84 show improve-
ment in model predictions when they are compared with
those from PMF2 analysis (0.83, 0.68, and 3.54,
respectively). The average source contributions of each
source to PM2.5 mass concentration are compared
between PMF2 and ME in Table 3.
The sulfate-rich secondary aerosols I and II are
characterized by high concentrations of SO42� and
NH4+. The sulfate-rich secondary aerosol I has the
highest source contribution to PM2.5 mass concentra-
tions (54%). The sulfate-rich secondary aerosol II that
has higher OC concentration than sulfate-rich secondary
Fig. 3. Time series plot of source contributions.
ARTICLE IN PRESSE. Kim et al. / Atmospheric Environment 37 (2003) 5009–5021 5015
aerosol I accounts for 2% of the PM2.5 mass concentra-
tion. OC typically becomes associated with the second-
ary sulfate aerosol. For the sulfate-rich secondary
aerosol I (NH4)2SO4 accounts for 46% of the PM2.5
mass concentration and remaining 8% is accounted by
mostly OC. This OC association is consistent with
previous Phoenix (Ramadan et al., 2000) and north-
eastern US (Song et al., 2001) aerosol studies.
In studies of northeastern US particle source, PMF2
separated the sulfur into a high photochemistry source
and a low photochemistry source with seasonal differ-
ences of the Se/S concentrations (Polissar et al., 2001).
In previous study for the same Atlanta data set (Kim
et al., 2003a), PMF2 could not separate to two sulfate
sources due to the poor correlation between S and Se. In
contrast to this previous study, the expanded model
Table 3
Comparison of average source contributions (%) to PM2.5 mass conc
Average source contribution (
PMF2a
Sulfate-rich secondary aerosol I 55.7 (1.3)
Motor vehicle exhaust 22.2 (0.7)
Gasoline exhaust
Diesel exhaust
Nitrate-rich secondary aerosol 7.3 (0.2)
Metal processing
Metal recycling 0.5 (0.02)
Wood smoke 10.8 (0.3)
Airborne soil 1.2 (0.07)
Sulfate-rich secondary aerosol II
Cement Kiln/Carbon-rich 2.0 (0.06)
Bus/Metal processing 0.3 (0.02)
aKim et al. (2003a).bThis study.
Fig. 4. Measured versus predicted PM2.5 mass concentration.
extracted two different sulfate-rich secondary aerosol
sources. The deduced profiles of the sulfate-rich
secondary aerosol I are very similar to those of previous
study (Kim et al., 2003a). The sum of these mass
contributions (56%) is consistent with 56% of previous
study using PMF2 and with the study of three north-
eastern US cities which identified secondary sulfate
source contributions of 47%, 55%, and 51% to the
PM2.5 mass concentration (Song et al., 2001).
The next profiles are gasoline and diesel exhaust that
were not separated in previous PMF2 analysis (Kim
et al., 2003a). By using the expanded model, they were
separated because of the directionality, seasonal and
time-of-day variation, and weekday/weekend difference.
They are represented by high OC and EC whose
abundances differ between the sources (Lowenthal et al.,
1994; Watson et al., 1994, 2001; Watson and Chow,
2001). Gasoline and diesel exhaust account for 15% and
11% of the PM2.5 mass concentration, respectively. The
sum of both mass contributions (26%) is consistent with
22% of previous PMF2 analysis (Kim et al., 2003a). The
ratio of OC/EC is 3.27 for gasoline vehicle exhaust and
0.88 for diesel exhaust show reasonable agreement with
typical 2.05 in fresh gasoline exhaust and 0.72 in diesel
exhaust for PM10 (Cadle et al., 1999).
The nitrate-rich secondary aerosol is identified by its
high concentration of NO3�. This source accounts for
9% of the PM2.5 mass concentration. Similarly, PMF2
deduced 7% contributions from this source. The metal
processing is characterized by Zn, Si, Fe, and Cl (Small
et al., 1981; US EPA, 2002) accounting for 3% of the
PM2.5 mass concentration. A metal recycling facility
located about 0.7 km east of the site is mainly used for
storage, grinding, shredding, and loading onto railcars.
Also, several metal processing facilities are located
about 6 km southeast of the site. The high OC
entrations between PMF2 and ME
standard error) Pearson correlation coefficient ðrÞ
MEb
54.3 (1.7) 0.97
14.6 (0.4)
11.2 (0.5)
8.9 (0.4) 0.95
3.1 (0.2)
2.9 (0.2) 0.56
2.2 (0.1) 0.97
1.9 (0.1)
0.9 (0.04) 0.96
ARTICLE IN PRESSE. Kim et al. / Atmospheric Environment 37 (2003) 5009–50215016
concentration may be from lubrication oil used for the
grinding and shredding although there could be adsorp-
tion of low vapor pressure organic compounds on the
primary particles. Previously, PMF2 apportioned 0.5%
from metal processing and 0.3% from the mixed source
of bus and metal processing.
Wood smoke containing OC, EC, and K (Watson
et al., 2001) contribute 3% to the PM2.5 mass
concentration in contrast to the 11% contribution
estimated from PMF2. Also, Table 3 shows poor
correlation (r ¼ 0:56) between PMF2 estimated andME estimated wood smoke contributions. The 8%
difference is explained by increased contributions mostly
from gasoline vehicle/diesel exhaust (3.5% increase),
metal processing (2.3% increase), nitrate-rich secondary
aerosol (1.7% increase), and sulfate-rich secondary
aerosol I/II (0.5% increase). This difference shows that
the expanded model separates sources differently when
more information about sources such as directional,
hourly, and seasonal variation is incorporated. July 4
Fig. 5. Wind directional indice
fireworks in 1999 and 2000 that produce high K
concentration appear as wood burning contributions in
Fig. 3. A similar effects was seen in the Underhill, VT
results (Polissar et al., 2001).
Airborne soil consists of Si, Fe, Ca, Al, and K
(Watson and Chow, 2001; Watson et al., 2001). The
particles of these crustal elements could be produced by
unpaved roads, construction sites, and wind-blown soil
dust. This source contributes 2% to the PM2.5 mass
concentration consisting with 1% estimated by PMF2.
The cement kiln/carbon-rich source is characterized by
OC, EC, and Ca (US EPA, 2002). It is likely to include
contributions from a cement kiln located about 7 km
northwest of the site and an unknown carbon-rich
source located in the same direction. This source
contributes 0.9% to the PM2.5 mass concentration.
Previously, PMF2 deduced 2% contribution from this
source.
Fig. 5 presents the wind directional factor values in
which the matrix D values are plotted on polar
s for each of the source.
ARTICLE IN PRESSE. Kim et al. / Atmospheric Environment 37 (2003) 5009–5021 5017
coordinates. The point sources show strong direction-
ality that agree well with the locations of known sources.
There is clear directionality for all of identified sources
except for Sulfate-rich secondary aerosol I. The sulfate-
rich secondary aerosol II that has high OC concentra-
tion points to the southeast in the direction of the
junction of highways I-75 and I-20 and downtown
Atlanta. The polar plot of gasoline vehicle exhaust
factor also points to the southeast toward the highway
junction and downtown Atlanta.
The diesel exhaust appear to have contributions from
all directions and weakly points to the northeast where
highways I-75 intersects I-85. There is railroad traffic
around the monitoring site and rail yard is located about
2 km northwest of the site. Also, a bus station is located
only about 200m southeast of the site. These diesel
vehicle exhaust seem to be mixed with other diesel
exhaust. The nitrate-rich secondary aerosol also points
Fig. 6. Wind speed indices f
to the northeast highway junction. For the metal
processing, there are indications of higher contributions
from the direction of east and southeast. Those may
show the contributions from a metal recycling facility
located east of the site and several metal processing
facilities located southeast of the site. The plot for wood
smoke points to the southeast and southwest where a
residential area is located. Airborne soil has contribu-
tions from the southwest and northwest. The direction
of the cement kiln situated about 7 km northwest of the
monitoring site is clearly shown in Fig. 5.
Fig. 6 shows the wind speed values. The general trend
of the nitrate-rich secondary aerosol is that the values
decrease with increasing wind speed indicating a dilution
effect: the emitted mass is diluted with increasing wind
speed so that the concentration decreases. The nitrate-
rich secondary aerosols were trapped near the ground
before being diluted by higher wind speeds. For other
or each of the source.
ARTICLE IN PRESS
Fig. 7. Time-of-day pattern for each of the source.
E. Kim et al. / Atmospheric Environment 37 (2003) 5009–50215018
eight sources, the wind speed factors do not show strong
or clear trend indicating that their contributions to the
monitoring site were not much affected by wind speed.
Fig. 7 shows the time-of-day factor values. The
nitrate-rich secondary aerosol is high in the evening
through morning and decreases in the afternoon. This is
likely due to lower temperatures and higher relative
humidity (Mangelson et al., 1997). The other sources do
not show strong diurnal variations indicating that the
variations of their hourly contributions were weak. It is
thought to be caused by high source strengths in the
daytime and high concentrations at night due to the
temperature inversion.
Fig. 8 shows the seasonal factor values. The sulfate-
rich secondary aerosol I shows strong seasonal variation
with high concentrations in summer time when the
photochemical activity is highest (Polissar et al., 2001;
Song et al., 2001). In contrast, the sulfate-rich secondary
aerosol II that has high OC concentration shows slightly
higher concentrations in winter when the vapor pressure
of OC is lower and therefore the secondary organic
aerosol is more easily formed. The 13 variation observed
in the time-of-year indices may be due to variation in
source strength or in transport condition or in both. The
gasoline exhaust has the lowest contributions between
September and October. The diesel exhaust contributed
the lowest in summer and the highest between Septem-
ber and October. It is thought to be caused by seasonal
change of prevailing wind directions: decreased wind
frequency from the southeast where the highway
junction and downtown are located made low contribu-
tions of gasoline exhaust and increased wind frequency
from the northwest where the rail yard is located made
high contribution of diesel exhaust in September and
October. The nitrate-rich secondary aerosol has seasonal
variation with maxima in the winter time. This indicates
that lower temperatures and higher relative humidity
help the formation of nitrate aerosols in Atlanta. This is
ARTICLE IN PRESS
Fig. 8. Seasonal profile for each of the source.
Fig. 9. Weekend effect for each of the source.
E. Kim et al. / Atmospheric Environment 37 (2003) 5009–5021 5019
consistent with results from three northeastern US sites
(Song et al., 2001).
The metal processing source has a winter-high
seasonal pattern. The wood smoke source has seasonal
trend with high in winter time indicating residential
wood burning. The airborne soil shows strong seasonal
variation with high concentrations in the drier season,
especially the highest concentration in spring agricultur-
al tilling time. The mixture of cement kiln and carbon-
rich source has seasonal trend with high in summer.
These seasonal patterns are consistent with previous
study using PMF2 (Kim et al., 2003a).
Weekend effects are presented in Fig. 9. The factor
values are the average strength of each source on
weekend relative to the strength on weekday. There were
reduced contributions from the diesel exhaust and the
mixture of cement kiln and carbon-rich source on
weekends. The strong weekday high variations of the
ARTICLE IN PRESSE. Kim et al. / Atmospheric Environment 37 (2003) 5009–50215020
diesel exhaust demonstrate that the main sources of
these are likely to be commuter buses, railroad traffic,
delivery trucks or closely located bus station.
5. Conclusion
The ordinary bilinear factor analysis model augmen-
ted by an expanded multilinear model that contains time
resolved meteorological measurements reduced the
rotational ambiguity in the solution and aided in
identifying the sources. A total of 662 samples, 22
elemental species, two wind variables, and three time
variables were used for this analysis. Nine sources were
identified and directional profiles, wind speed depen-
dencies, hourly patterns, seasonal trends, and weekend
effects were obtained. As different from previous study
for the same data set using PMF2 (Kim et al., 2003a),
two sulfate-rich secondary aerosol sources were ex-
tracted and the diesel exhaust was separated from
gasoline exhaust by using the expanded model. This
study demonstrated that the time resolved wind and
other data were successfully incorporated into ME for
the analysis of real measurement data and could
significantly improve source apportionment study.
Acknowledgements
This study was supported by the Southern Company.
References
Appel, B.R., Tokiwa, Y., Haik, M., 1981. Sampling of nitrates
in ambient air. Atmospheric Environment 15 (3), 283–289.
Cadle, S.H., Mulawa, P.A., Hunsanger, E.C., Nelson, K.E.,
Ragazzi, R.A., Barrett, R., Gallagher, G.L., Lawson, D.R.,
Knapp, K.T., Snow, R., 1999. Composition of light-duty
motor vehicle exhaust particulate matter in the Denver,
Colorado area. Environmental Science and Technology 33,
2328–2339.
Chow, J.C., Watson, J.G., Pritchett, L.C., Pierson, W.R.,
Frazier, C.A., Purcell, R.G., 1993. The DRI thermal/optical
reflectance carbon analysis system: description evaluation
and applications in US air quality studies. Atmospheric
Environment 27A (8), 1185–1201.
Chueinta, W., Hopke, P.K., Paatero, P., 2000. Investigation of
sources of atmospheric aerosol at urban and suburban
residential area in Thailand by positive matrix factorization.
Atmospheric Environment 34, 3319–3329.
Dzubay, T.G., Stevens, R.K., Gordon, G.E., Olmez, I.,
Sheffield, A.E., Courtney, W.J., 1988. Composite receptor
method applied to Philadelphia aerosol. Environmental
Science and Technology 22 (1), 46–52.
Eatough, D.J., Wadsworth, A., Eatough, D.A., Crawford,
J.W., 1993. A multiple-system, multi-channel diffusion
denuder sampler for the determination of fine-particulate
organic material in the atmosphere. Atmospheric Environ-
ment 27A (8), 1213–1219.
Henry, R.C., 1987. Current factor analysis models are ill-posed.
Atmospheric Environment 21, 1815–1820.
Henry, R.C., Norris, G.A., 2002. EPA Unmix 2.3 user guide.
Hopke, P.K., 1985. Receptor Modeling in Environmental
Chemistry. Wiley, New York.
Hopke, P.K., 1991. Receptor Modeling for Air Quality
Management. Elsevier, Amsterdam, The Netherlands.
Hopke, P.K., Paatero, P., 2003. Discarding or downweighting
high-noise variables in factor analytic models. Analytica
Chimica Acta 224271, 1–13.
Huang, S., Rahn, K.A., Arimoto, R., 1999. Testing and
optimization two factor-analysis techniques on aerosol at
Narragansett Rhode Island. Atmospheric Environment 33,
2169–2185.
Kim, B.M., Henry, R.C., 2000. Application of the SAFER
model to Los Angeles PM10 data. Atmospheric Environ-
ment 34, 1747–1759.
Kim, E., Hopke, P.K., Edgerton, E.S., 2003a. Source identifica-
tion of Atlanta aerosol by Positive Matrix Factorization.
Journal of the Air and Waste Management Association 53,
731–739.
Kim, E., Hopke, P.K., Larson, T.V., Covert, D.S., 2003b.
Analysis of ambient particle size distributions using Unmix
and Positive Matrix Factorization. Environmental Science
and Technology, in press.
Kim, E., Larson, T.V., Hopke, P.K., Slaughter, C., Sheppard,
L.E., Claiborn, C., 2003c. Source identification of PM2.5 in
an arid northwest US city by positive matrix factorization.
Atmospheric Research 66, 291–305.
Lee, E., Chun, C.K., Paatero, P., 1999. Application of positive
matrix factorization in source apportionment of particulate
pollutants. Atmospheric Environment 33, 3201–3212.
Lewis, C.W., Norris, G.A., Henry, R.C., Conner, T.L., 2003.
Source apportionment of Phoenix PM2.5 aerosol with the
Unmix receptor model. Journal of the Air and Waste
Management Association 53, 325–338.
Lowenthal, D.H., Zielinska, B., Chow, J.C., Watson, J.G.,
1994. Characterization of heavy duty diesel vehicle emis-
sions. Atmospheric Environment 28 (4), 731–743.
Mangelson, N.F., Lewis, L., Joseph, J.M., Cui, W., Machir, J.,
Eatough, D.J., Rees, L.B., Wilkerson, T., Jensen, D.T.,
1997. The contribution of sulfate and nitrate to atmospheric
fine particles during winter inversion fogs in Cache Valley,
Utah. Journal of the Air and Waste Management Associa-
tion 47, 167–175.
Maykut, N.N., Lewtas, J., Kim, E., Larson, T.V., 2003. Source
apportionment of PM2.5 at an urban IMPROVE site in
Seattle, WA. Environmental Science and Technology, in
press.
Miller, M.S., Friedlander, S.K., Hidy, G.M., 1972. A chemical
element balance for the Pasadena aerosol. Journal of
Colloid and Interface Science 39, 165–176.
Paatero, P., 1997. Least square formulation of robust non-
negative factor analysis. Chemometrics and Intelligent
Laboratory Systems 37, 23–35.
Paatero, P., 1999. The Multilinear Engine-A table driven, least
square program for solving multilinear problems, including
the n-way parallel factor analysis model. Journal of
Computational and Graphical Statistics 8 (4), 854–888.
ARTICLE IN PRESSE. Kim et al. / Atmospheric Environment 37 (2003) 5009–5021 5021
Paatero, P., 2000. User’s guide for positive matrix factorization
programs PMF2 and PMF3, Part 1: tutorial.
Paatero, P., Hopke, P.K., 2002. Utilizing wind direction and
wind speed as independent variables in multilinear receptor
modeling studies. Chemometrics and Intelligent Laboratory
Systems 60, 25–41.
Paatero, P., Hopke, P.K., Hoppenstock, J., Eberly, S.I., 2003.
Advanced factor analysis of spatial distribution of PM2.5 in
the eastern United States. Environmental Science and
Technology 37 (11), 2460–2476.
Peters, T.M., Vanderpool, R.W., Wiener, R.W., 2001. Design
and calibration of the EPA PM2.5 Well Impactor Ninety-Six
(WINS). Aerosol Science and Technology 34, 389–397.
Poirot, R.L., Wishinski, P.R., Hopke, P.K., Polissar, A.V.,
2001. Comparative application of multiple receptor meth-
ods to identify aerosol sources in northern Vermont.
Environmental Science and Technology 35 (23), 4622–4636.
Polissar, A.V., Hopke, P.K., Paatero, P., Malm, W.C., Sisler,
J.F., 1998. Atmospheric aerosol over Alaska 2. Elemental
composition and sources. Journal of Geophysical Research
103 (D15), 19045–19057.
Polissar, A.V., Hopke, P.K., Poirot, R.L., 2001. Atmospheric
aerosol over Vermont: Chemical composition and sources.
Environmental Science and Technology 35, 4604–4621.
Pszenny, A., Fischer, C., Mendez, A., Zetwo, M., 1993. Direct
comparison of cellulose and quartz fiber filters for sampling
submicrometer aerosols in the marine boundary layer.
Atmospheric Environment 27 (2), 281–284.
Qin, Y., Oduyemi, K., Chan, L.Y., 2002. Comparative testing
of PMF and CFA models. Chemometrics and Intelligent
Laboratory Systems 61, 75–87.
Ramadan, Z., Song, X.H., Hopke, P.K., 2000. Identification of
sources of Phoenix aerosol by positive matrix factorization.
Journal of the Air and Waste Management Association 50,
1308–1320.
Ramadan, Z., Eickhout, B., Song, X.H., Buydens, L., Hopke,
P.K., 2003. Comparison of Positive Matrix Factorization
(PMF) and Multilinear Engine (ME-2) for the source
apportionment of particulate pollutants. Chemometrics
and Intelligent Laboratory Systems 66 (1), 15–28.
Small, M., Germani, M.S., Small, A.M., Zoller, W.H., Moyers,
J.L., 1981. Airborne plume study of emissions from the
processing of copper ores in southeastern Arizona. Envi-
ronmental Science and Technology 15, 293–299.
Song, X.H, Polissar, A.V., Hopke, P.K., 2001. Source of fine
particle composition in the northeastern US. Atmospheric
Environment 35, 5277–5286.
US EPA, 2002. SPECIATE version 3.2. US Environmental
Protection Agency, Research Triangle Park, NC.
Vanderpool, R.W., Peters, T.M., Natarajan, S., Tolocka, M.P.,
Gemmill, D.B., Wiener, R.W., 2001. Sensitivity analysis of
the USEPA WINS PM2.5 separator. Aerosol Science and
Technology 34, 465–476.
Van Vaeck, L., Van Cauwenberghe, K., Janssens, J., 1984. The
gas-particle distribution of organic aerosol constituents:
measurement of the volatilization artifact in hi-vol cascade
impactor sampling. Atmospheric Environment 18, 417–430.
Watson, J.G., Chow, J.C., 2001. Source characterization of
major emission sources in the Imperial and Mexicali Valleys
along the US/Mexico border. The Science of the Total
Environment 276, 33–47.
Watson, J.G., Chow, J.C., Lowenthal, D.H., Pritchett, L.C.,
Frazier, C.A., 1994. Differences in the carbon composition
of source profiles for diesel and gasoline powered vehicles.
Atmospheric Environment 28 (15), 2493–2505.
Watson, J.G., Chow, J.C., Houck, J.E., 2001. PM2.5 chemical
source profiles for vehicle exhaust, vegetative burning,
geological material, and coal burning in northwestern
Colorado during 1995. Chemosphere 43, 1141–1151.
Willis, R.D., 2000. Workshop on Unmix and PMF as applied
to PM2.5. EPA 600-A-00-048.
Xie, Y.L., Hopke, P.K., Paatero, P., Barrie, L.A., Li, S.M.,
1999a. Identification of source nature and seasonal varia-
tions of Arctic aerosol by positive matrix factorization.
Journal of Atmospheric Sciences 56, 249–260.
Xie, Y.L., Hopke, P.K., Paatero, P., Barrie, L.A., Li, S.M.,
1999b. Identification of source nature and seasonal varia-
tions of Arctic aerosol by the multilinear engine. Atmo-
spheric Environment 33, 2549–2562.
Yli-Tuomi, T., Hopke, P.K., Paatero, P., Basumia, M.S.,
Landsberger, S.L., Viisanen, Y., Paatero, J., 2003. Atmo-
spheric aerosol over Finnish arctic: Source analysis by the
multilinear engine and the potential source contribution
function. Atmospheric Environment, in press.