FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of...
-
date post
19-Dec-2015 -
Category
Documents
-
view
218 -
download
0
Transcript of FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of...
FREQUENCY ANALYSIS
• Basic Problem:
To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions.
FREQUENCY ANALYSIS
• Basic Assumptions:
(a) Data analyzed are to be statistically independent
& identically distributed - selection of data (Time
dependence, time scale, mechanisms).
(b) Change over time due to man-made (eg.
urbanization) or natural processes do not alter the
frequency relation - temporal trend in data
(stationarity).
FREQUENCY ANALYSIS
• Practical Problems:(a) Selection of reasonable & simple distribution.
(b) Estimation of parameters in distribution.
(c) Assessment of risk with reasonable accuracy.
REVIEW OF BASIC CONCEPTSProbabilistic
Outcome of a hydrologic event (e.g., rainfall amount & duration; flood peak discharge; wave height, etc.) is random and cannot be predicted with
certainty.
Terminologies Population The collection of all possible outcomes relevant to the process of interest. Example: (1) Max. 2-hr rainfall depth: all non-negative real numbers; (2) No. of storm in June: all non-negative integer numbers.
Sample A measured segment (or subset) of the population.
REVIEW OF BASIC CONCEPTSTerminologies - Random Variable A variable describable by a probability distribution which
specifies the chance that the variable will assume a particular value.
Convention: Capital letter for random variables (say, X) whereas the lower case letter (say, x) for numerical realization that the random variable X will take.
Example: X = rainfall amount in 2 hours (a random variable) ; x = 100.2 mm/2hr (realization).- Random variables can be - discrete (eg., no. of rainy days in June) or - continuous (eg., max. 2-hr rainfall amount, flood discharge)..
REVIEW OF BASIC CONCEPTSTerminologies Frequency & Relative Frequency
o For discrete random variables:o Frequency is the number of occurrences of a specific event. Relative frequency
is resulting from dividing frequency by the total number of events. e.g.
n = no. of years having exactly 50 rainy days; N = total no. of years.
Let n=10 years and N=100 years. Then, the frequency of having exactly 50 rainy days is 10 and the relative frequency of having exactly 50 rainy days in 100 years is n/N = 0.1.
o For continuous random variables:o Frequency needs to be defined for a class interval.o A plot of frequency or relative frequency versus class intervals is called
histogram or probability polygon.o As the number of sample gets infinitely large and class interval length
approaches to zero, the histogram will become a smooth curve, called probability density function.
REVIEW OF BASIC CONCEPTSTerminologies
Probability Density Function (PDF) – • For a continuous random variable, the PDF must satisfy
and f(x) ≥ 0 for all values of x.
• For a discrete random variable, the PDF must satisfy
and 1≥ p(x) ≥ 0 for all values of x.
-
( ) 1f x dx
( ) 1all x
p x
REVIEW OF BASIC CONCEPTSTerminologies
- Cumulative Distribution Function -
For a continuous random variable,
For a discrete random variable, by
-
Pr( ) ( ) ox
oX x f x dx
Pr( ) = ( )i o
o iall x x
X p xx
Statistical Properties of Random Variables
• Population - Synonymous to sample space, which describes the complete assemblage of all the values representative of a particular random process.
• Sample - Any subset of the population.
• Parameters - Quantities that are descriptive of the population in a statistical model. Normally, Greek letters are used to denote statistical parameters.
• Sample statistics (or simply statistics): Quantities calculated
on the basis of sample observations.
Statistical Moments of Random Variables
• Descriptors commonly used to show statistical properties of a RV are those indicative
(1) Central tendency; (2) Dispersion; (3) Asymmetry.
• Frequently used descriptors in these three categories are related to statistical moments of a RV.
• Two types of statistical moments are commonly used in hydrosystem engineering applications:
(1) product-moments and (2) L-moments.
Product-Moments • rth-order product-moment of X about any reference point X=xo is defined, for continuous case,
as
whereas for discrete case,
where E[] is a statistical expectation operator. • In practice, the first three moments (r=1, 2, 3) are used to describe the central tendency,
variability, and asymmetry. • Two types of product-moments are commonly used:
– Raw moments: µr'=E[Xr] rth-order moment about the origin; and – Central moments: µr=E[(Xµx)r] = rth-order central moment – Relations between two types of product-moments are:
. where Cn,x = binomial coefficient = n!/(x!(n-x)!)• Main disadvantages of the product-moments are:
(1) Estimation from sample observations is sensitive to the presence of outliers; and (2) Accuracy of sample product-moments deteriorates rapidly with increase in the order of the moments.
r r r
o o x o x
- -
E X x = x x f x dx = x x dF x
1
Kr r
o k o x kk=
E X x = x x p x
0
1r
i i 'r,i x r ir
i=
= μμ C 0
r' i
r,ir x r ii=
μ = μC
Mean, Mode, Median, and Quantiles• Expectation (1st-order moment) measures central tendency of random variable X
– Mean () = Expectation = 1 = location of the centroid of PDF or PMF.– Two operational properties of the expectation are useful:
• in which k=E[Xk] for k = 1,2, …, K.
• For independent random variables,
• Mode (xmo) - the value of a RV at which its PDF is peaked. The mode, xmo, can be obtained by solving
• Median (xmd) - value that splits the distribution into two equal halves, i.e,
• Quantiles - 100pth quantile of a RV X is a quantity xp that satisfies P(X xp) = Fx(xp) = p • A PDF could be uni-modal, bimodal, or multi-modal. Generally, the mean, median, and mode
of a random variable are different, unless the PDF is symmetric and uni-modal.
1x x x x
- -
E X = μ = x f x dx = x dF x = F x dx
1 1
K K
k k k kk= k=
E a X = a
1 1
K K
k kk= k=
E X =
0 5mdx
x md xF x = f x dx = .
0
mo
x
x x
f x=
x
Uni-modal and bi-modal distributions
x
(a) Uni-modal distribution
fx(x)
x
(b) Bi-modal distribution
fx(x)
Variance, Standard Deviation, and Coefficient of Variation
• Variance is the second-order central moment measuring the spreading of a RV over its range,
• Standard deviation (x) is the positive square root of the variance. • Coefficient of variation, x=x/x, is a dimensionless measure; useful for
comparing the degree of uncertainty of two RVs with different units. • Three important properties of the variance are:
– (1) Var[c] = 0 when c is a constant.– (2) Var[X] = E[X2] E2[X]– For multiple independent random variables,
where ak =a constant and k = standard deviation of Xk, k=1,2, ..., K.
2 2
1 1
K K
k k k kk= k=
Var a X = a
2 222 x x x x
-
Var X = = E X μ = x μ f x dx
Skewness Coefficient • Measures asymmetry of the PDF of a random variable• Skewness coefficient, x, defined as
• The sign of the skewness coefficient indicates the degree of symmetry of the probability distribution function.
• Pearson skewness coefficient –
• In practice, product-moments higher than 3rd-order are less used because they are unreliable and inaccurate when estimated from a small number of samples
• See Table for equations to compute the sample product-moments.
3
31 5 32
x
x .x
E X μμ = =
μ
1mox
x
μ x=
Relative locations of mean, median, and mode for positively-skewed, symmetric, and negatively-skewed distributions.
x x mo x md
(a) Positively Skewed, x>0
fx(x)
x
xxmo=xmd
(b) Symmetric, x=0
fx(x)
x
x md x mo x
(c) Negatively skewed, x<0
fx(x)
x
Product-moments of random variables
Moment Measure of Definition Continuous Variable Discrete Variable Sample Estimator
First Central Location
Mean, Expected value E(X)=x
( )x xx f x dx
( )x k kall x's
μ x p x ix x / n
Second Dispersion
Variance, Var(X)=2= x2
Standard deviation, x
Coefficient of variation,x
2 2( ) ( )x x xx μ f x dx
( )x Var X
x = xx
22
'k x x kx P x
x all x s
( )x Var X
x = xx
2 211 is x x
n
211 is x x
n
vC s x
Third Asymmetry
Skewness Skewness coefficient, x
33 ( ) ( )x xμ x μ f x dx
x = 3 / x
3
33 ( ) ( )k x x k
all x's
μ x μ p x x = 3 / x
3
3
3 ( 1)( 2) i
nm x x
n n
g = m3 / s
3
Fourth Peakedness
Kurtosis, x
Excess coefficient, x
44 ( ) ( )x xμ x μ f x dx
x = 4 / x
4
xx
44 ( ) ( )k x x k
all x's
μ x μ p x x = 4 / x
4
xx
4
4
( 1)
( 1)( 2)( 3) i
n nm x x
n n n
k = m4 / s
4
Kurtosis (x)
• Measure of the peakedness of a distribution. • Related to the 4th central product-moment as
• For a normal RV, its kurtosis is equal to 3. Sometimes, coefficient of excess, x=x3, is used.
• All feasible distribution functions, skewness coefficient and kurtosis must satisfy
4
42 42
x
xx
E X μμ = =
μ
2 1x x
Some Commonly Used Distributions
• NORMAL DISTRIBUTION
Standardized Variable:
Z has mean 0 and standard deviation 1.
X
Z
x < < ,
σ
μx -
π =x f
x
x
x
xxN for2
1exp
2
1 ,|
2
2
Some Commonly Used Distributions
• STANDARD NORMAL DISTRIBUTION:
0
0.1
0.2
0.3
0.4
0.5
-3 -2 -1 0 1 2 3
z
( z )
Some Commonly Used Distributions • LOG-NORMAL DISTRIBUTION
0.0
0.2
0.40.60.8
1.01.2
1.41.6
0 1 2 3 4 5 6x
fLN(x)
x=0.6 x=1.3
x=0.3
(a) x = 1.0
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0 1 2 3 4 5 6x
fLN(x)
x =1.65
x =2.25
x =4.50
(b) x = 1.30
2
2 lnlnln
lnln
ln( ) -1 1 | , = exp - , > 0
22x
xxLNxx
xx xf
x
Some Commonly Used Distributions
• Gumbel (Extreme-Value Type I) Distribution
0
0.1
0.2
0.3
0.4
-4 -3 -2 -1 0 1 2 3 4
y
Max
Min
fEV1(y)
1 | , exp exp for maxima
1 exp exp for minima
EVx
x = F
x ξ = +
β
1
1| , exp exp for maxima
1exp exp for minima
EV
x x x = f
x x = + +
Some Commonly Used Distributions • Log-Pearson Type 3 Distribution
e
β
ξx
Γβ = xf x-
α
P
/
1
3
1,,|
0.00
0.05
0.10
0.15
0.20
0.25
0.30
0 2 4 6 8 10 12 14
x
fG(x)
Some Commonly Used Distributions • Log-Pearson Type 3 Distribution
e
β
ξx
Γβ x = xf x-
α
LP
/)ln(
1
3
ln1,,|
with >0, xe when >0 and with >0, xe when <0