OP5205: BUSINESS STATISTICS (2 Credits) Session 21-24 - Confidence Interval & Hypothesis Testing
OP5205: BUSINESS STATISTICS (2 Credits) Session 21-24 - Confidence Interval & Hypothesis Testing
• Statement 2: We are 99% confident that population mean is in the interval [449, 551].
This conveys much more information about the possible value of population mean.
• Statement 3: We are 90% confident that population mean is in the interval [400, 700].
This interval conveys less information about the possible value of population mean, both because it
is wider and because the level of confidence is lower.
This is also the same as saying we are 95% confident that the true population parameter will
be in the interval
• Confidence interval for population mean when population standard deviation is unknown:
Student t-distribution
Answer: Comcast can be 95% confident that the average family of four within its population of
subscribers will watch an average daily number of online television shows between about 5.87 and
7.13.
• The t distribution is characterized by its degrees-of-freedom parameter df. For any integer value df
= 1, 2, 3, . . . , there is a corresponding t distribution. The t distribution resembles the standard
normal distribution Z: it is symmetric and bell-shaped. The t distribution, however, has wider tails
than the Z distribution.
[Confidence Interval Formula – Small Sample <=30]
• The mean of a t distribution is zero. For df > 2, the variance of the t distribution is equal to df/(df - 2).
• The t distribution thus reflects the uncertainty in two random variables, sample mean and SD, while
Z reflects only an uncertainty due to sample mean.
• As df increases, the t distribution Lecture
approaches the Z distribution
Presentation | Dr. Piyush Kumar
Source: Complete Business Statistics (4e) (TMH)
For Large Sample – Where Pop SD is unknown – t Converges to Z-distribution
Since the sample size is n =15, we need to use the t distribution with n -1 = 14 degrees of freedom. In
Table 3, in the row corresponding to 14 degrees of freedom and the column corresponding to a right-
tail area of 0.025 (this is alpha/2), we find that t(0.025) = 2.145
Thus, the analyst may be 95% sure that the average annualized return on the stock is anywhere from
8.43% to 12.31%.
• A null hypothesis is an assertion about the value of a population parameter. It is an assertion that
we hold as true unless we have sufficient statistical evidence to conclude otherwise.
• For example, a null hypothesis might assert that the population mean is equal to 100. Unless we
obtain sufficient evidence that it is not 100, we will accept it as 100. We write the null hypothesis
compactly as
• The alternative hypothesis, denoted by Ha, is the opposite of what is stated in the null hypothesis.
• The hypothesis testing procedure uses data from a sample to test the two competing statements
indicated by H0 and Ha. Examples - In a court case you look at the evidence, and convict the
person only if there is enough evidence that they are guilty. Your spam filter looks at each email and
rejects it if there is sufficient evidence that it is junk mail. When you give blood they test for HIV and
throw out the blood if there is evidence it is infected.
• In addition, a hypothesis may assert that the parameter in question is at least or at most some
value.
• For example, the null hypothesis may assert that the population proportion p is at least 40%. In this
case, the null and alternative hypotheses are:
• Generally what the statistician aims to prove is the alternative hypothesis, the null hypothesis
standing for the status quo, do-nothing situation.
In such cases, it is often best to begin with the alternative hypothesis and make it the
conclusion that the researcher hopes to support.
The conclusion that the research hypothesis is true is made if the sample data provide
sufficient evidence to show that the null hypothesis can be rejected.
Statement 1: A new teaching method is developed that is believed to be better than the current
method
Null Hypothesis: The new method is no better than the old method.
Statement 2: A new sales force bonus plan is developed in an attempt to increase sales.
Alternative Hypothesis: The new sales force bonus plan will increase sales
Null Hypothesis: The new sales force bonus plan is no better than the old method.
The claim is the null hypothesis and the suspicion is the alternative hypothesis. Thus, with
denoting the average time to fill an order,
The claim is the null hypothesis and the suspicion is the alternative hypothesis. Thus, with
denoting the average time to fill an order,
The claim is the null hypothesis. Thus, with denoting the variance,
The claim is the null hypothesis. Thus, with denoting the variance,
• P-value
• Optimal alpha and the Compromise between Type I and Type II Errors
• Sample Size
• As long as such decisions are made based on evidence that does not provide 100% confidence,
there will be chances for error.
Point 2: The maximum probability of type I error we set for ourselves. Since is the maximum p-value at
which we reject H0, it is the maximum probability of committing a type I error. In other words, setting 5%
means that we are willing to put up with up to 5% chance of committing a type I error.
Point 3: Increasing the value of alpha will decrease the probability of type II error. Example, increasing
from 5% to 10% means that in those instances with a p-value in the range 5% to 10% the H0 that would not
have been rejected before would now be rejected.
• Requires knowing the cost of each type of error. Cost estimation is difficult. Hence, normal approach
involves intuitive assigning of one of the three standard values, 1%, 5%, and 10%, to alpha.
• For example, Suppose we are testing the average tensile strength of a large batch of bolts produced by
a machine to see if it is above the minimum specified.
Type I error => Rejecting a good batch of bolts and the cost of the error is roughly equal to the
cost of the batch of bolts.
Case 1: Type II error => Accepting a bad batch of bolts and its cost can be high or low depending
on how the bolts are used. If the bolts are used to hold together a structure, then the cost is high
because defective bolts can result in the collapse of the structure, causing great damage.
In this case, we should strive to reduce the probability of type II error more than that of type I error
by keeping a large value of alpha, namely, 10%.
Case 2: Else, if the bolts are used to secure the lids on trash cans, then the cost of type II error is
not high and we should strive to reduce the probability of type I error more than that of type II
error. In such cases where type I error is more costly, we keep a small value for alpha, namely, 1%.
Case 3: Then there are cases where we are not able to determine which type of error is more
costly. If the costs are roughly equal, or if we have not much knowledge about the relative costs of
the two types of errors, then Lecture
keep alpha at 5%.| Dr. Piyush Kumar
Presentation
Source: Complete Business Statistics (4e) (TMH)
Type I & II Errors – Avoidance Options`- Beta & Sample Size
Power of Beta
• The symbol used for the probability of type II error is Beta
• The complement of Beta (1 - Beta) is known as the power of the test.
• The power of a test is the probability that a false null hypothesis will be detected by the test.
• On the basis of p value, we determine whether or not to reject H0 (based upon alpha).
• In this scenario, since calculation of Z determines the p-value, Z is also known as test statistic .
• A test statistic is a random variable calculated from the sample evidence, which follows a well-known
distribution and thus can be used to calculate the p-value.
• Similarly, t value, Chi-Square value, and F value are also known as test statistic as those can also be
used to determine the p-value.
• Suppose the population standard deviation is 5 and the sample size n is 100
1(a) The area to the left of -1.86, from the tables, is 0.5 - 0.4686 = 0.0314, or the p-value is 3.14%.
1(b) The area to the right of -1.86, from the tables, is 0.5 + 0.4686 = 0.9686, or the p-value is 96.86%. (Such a
large p-value means that the evidence greatly favors H0, and there is no basis for rejecting H0.)
1(c) The value -1.86 falls on the left tail. The area to the left of -1.86 is 3.14%. Multiplying that by 2, we get
6.28%, which is the p-value.
2. Only in the case of a left-tailed test does the p-value fall below the alpha of 5%. Hence that is the only
case where H0 will be rejected.
Here, The normality of the population may be established by direct tests or the normality may be
assumed based on the nature of the population. Z is calculated as:
Note: Since the t table provides only the critical values, it cannot be used to find exact p-values. For
example, if the calculated value of t is 2.000 and the degrees of freedom are 24, we see from the tables that
t(0.05) is 1.711 and t(0.025) is 2.064. Thus, the one-tailed p-value corresponding to t = 2.000 must be
somewhere between 0.025 and 0.05, but we don’t know its exact value..
Lecture Presentation | Dr. Piyush Kumar
Source: Complete Business Statistics (4e) (TMH)
Test Population Means
2. Assume that the population is normally distributed with the same population SD of 1.30 cm3.
Assume that the sample size is only 20 but the sample mean is the same 1,999.6 cm3. Conduct the
test once again at an of 5%.
3. If there is a difference in the two test results, explain the reason for the difference.
2. Assume that the population is normally distributed with the same population SD of 1.30 cm3.
Assume that the sample size is only 20 but the sample mean is the same 1,999.6 cm3. Conduct
the test once again at an of 5%.
3. If there is a difference in the two test results, explain the reason for the difference.
When the binomial distribution is used, the number of successes X serves as the test statistic. The
p-value is the appropriate tail area, determined by X (follows a discrete distribution), of the binomial
distribution defined by n and the hypothesized value of population proportion p.
A coin is to be tested for fairness. It is tossed 25 times and only 8 heads are observed. Test if the coin
is fair at alpha = 5%
0.053876 =BINOM.DIST(8,25,0.5,TRUE)
Chi-Sq = (n-1)s^2/sigma^2
= (31-1)*1.62/1
= 48.6