Random Variables and Probability Distribution: Purnomo Jurusan Teknik Mesin UGM
Random Variables and Probability Distribution: Purnomo Jurusan Teknik Mesin UGM
Probability Distribution
Purnomo
Jurusan Teknik Mesin
UGM
1
Random Variables
A random variable X is a numerical valued
function defined on a sample space.
A number X(e), providing a measure of
characteristic of interest, is assigned to
each simple event e in the sample
space
Contoh dadu :
X = 1, 2, 3, 4, 5, 6
2
Two balls are drawn in succession from a box that contains 4
red balls and 3 blue balls. The possible outcomes and the values
y of the random variable Y, where Y is the number of red balls
is
Sample space y
RR 2
RB 1
BR 1
BB 0
3
Illustration
Two products A and B are judge by four consumer who then
expressed a preference for A and B. The outcome when the first
and third consumers prefer A and the other consumers prefer B
is denoted by ABAB. The number of outcomes is 24 = 16.
4
Illustration
Suppose that the products are alike in quality and that the
consumers express their preference independently. Then the 16
simple events in the sample space are equally likely, and each
has a probability of 1/16. Let a random variable X be devined as
X= number of person preffering A to B.
Probability distribution :
Distinct value of X 0 1 2 3 4
Probability 1/16 4/16 6/16 4/16 1/16
5
Probability Distribution
6
Graphic Presentation
f(x)
Line diagram
6/16
4/16
2/16
x
0 1 2 3 4 7
Histogram of probability
Histogram
Value x 1 2 3 4
f(x) 1/8 ¼ ½ 1/8
4/8
Area = 0.5
2/8
8
1 2 3 4 5
Properties of relative frequency histogram
9
Expectation
Expected value or Expectation of X
E ( X ) xi f ( xi )
X 0 1 2 3 4 5 Total
f(x) 0.1 0.1 0.2 0.3 0.2 0.1 1
xf(X) 0 0.1 0.4 0.9 0.8 0.5 2.7
E(X) = 2.7
11
Variance and standard
Deviation
Variance of X (= σ2 = σx2)
Var(X) = E[(X – μ)2] = E(X2) – μ2
Standard deviation ( = σ = σx )
12
Standardized random variable
X k
Z has E(Z) = 0 and Var(Z) = 1
k
Random variable Z has a mean of 0 and
variance of 1
Bentuk ini akan banyak digunakan pada applikasi 13
PROBABILITY MODELS FOR CONTINUOUS
RANDOM VARIABLES
14
PROBABILITY DENSITY CURVE
15
Properties of Probability Density Function, f(x)
P a X b
area under the
density curve between a and b
f(x) is positive or zero
For continuous random variable, the
probability that X=x is always 0 (X is only
meaningful when X lies in an interval
16
Density Curves
Measuring center and spread for density curves
Density curves describe the overall shape of a
distribution
Ideal patterns that are accurate enough for
practical purposes
Faster to draw and easier to use
Areas or proportions under the curve represent
counts or percents of observations
17
Features of a Continuous Distribution
18
Center of a Density Curve
The mode of a distribution is the point where
the curve is highest
The median is the point where half of the
area under the curve lies on the left and the
other half on the right. Equal Areas Point
Quartiles can be found by dividing the area
under the curve into four equal parts
¼ of the area is to the left of the 1st quartile
3 of the area is to the left of the 3rd quartile
4
The mean is the balance point.
19
Percentiles
20
The Normal distribution
Discovered by Abraham de Moivre around 1720. Around 1870,
Adolph Quetelet realised that the normal curve could be used to
compare histograms of data.
Chest measurements of 5738 Scottish soldiers by Belgian
scholar Lambert Quetelet (1796-1874)
Pierre Laplace dan Carl Gauss : bell-shaped distribution
Gauss derived the normal distribution mathematically as the
probability distribution of the error of measurements, which is
called “normal law of error”
Gaussian Distribution
21
Normal Distributions
Symmetric
Single-peaked (unimodal)
Bell-shaped
The mean, median, and mode are the same
The points where there is a change in
curvature is one standard deviation on either
side of the mean.
The mean and standard deviation completely
specify the curve
22
Normal Distribution
N ( , )
23
Normal Distribution
The probability of the
interval extending
P X 0.683
P 2 X 2 0.954
P 3 X 3 0.997
24
The Empirical Rule
68% of the observations fall within one
standard deviation of the mean
95% of the observations fall within two
standard deviation of the mean
99.7% of the observations fall within
three standard deviation of the mean
25
Example: Young Women’s
Height
The heights of young women are approximately
normal with mean = 64.5 inches and std.dev. = 2.5
inches.
26
The normal distribution is the most important distribution
in Statistics. Typical normal curves with different sigma
(standard deviation) values are shown below.
27
Examples with approximate
Normal distributions
Height
Weight
IQ scores
Standardized test scores
Body temperature
Repeated measurement of same quantity
…
28
FACTS
Universality of the normal distribution is
only a myth, and examples of quite
nonnormal distribution abound in any
virtually every field of study
Still, the normal distribution plays a
central role in statistics (make things
easier)
29
Standardizing and z-Scores
One case, one curve --- too
complicated
Solution -- standardization
normalization
non-dimensionalization
---- z-Scores
---- All cases, one curve (or table)
30
Standardizing and z-Scores
an observation x comes from a distribution with
mean µ and standard deviation σ
The standardized value of x is defined as
x
z ,
which is also called a z-score.
A z-score indicates how many standard deviations
the original observation is away from the mean,
and in which direction.
31
The Standard Normal Curve
N(0,1)
32
Standardizing and z-Scores
N ,
X
Z ,
X b b
P X b P P Z
a X b a b
P a X b P P Z
Za Zb
33
The Standard Normal Table
The Normal Table is a table of areas under the
standard normal density curve. The table entry for each
value z is the area under the curve to the left of z.
34
The Standard Normal Table
The Normal Table can be used to find the proportion of
observations of a variable which fall to the left of a specific
value z if the variable follows a normal distribution.
35
36
37
38
Use of The Normal Table
39
Use of The Normal Table
Symmetry
P Z 0 0.5
P Z z 1 P Z z
40
Use of The Normal Table
If z>0
P Z z 0.5 P 0 Z z
P Z z 0.5 P 0 Z z
41
Use of The Normal Table
Calculate z
Find the area to the left of z in Standard
Normal Probability Table
Other calculations obey the properties
of the Standardized Normal Curve
42
Example : random variable
43
Example : Expectation
44
45
46
47
48