SOCR EduMaterials Activities Central Limit Theorem Chi square examples

From SOCR
Revision as of 19:14, 23 May 2007 by Create90215 (talk | contribs)
Jump to: navigation, search
  • Example 1
  • Supply responses true or false with an explanation to each of the following:
  • a. The standard deviation of the sample mean \(\overline{x}\)increases as the sample increases.
  • b. The Central Limit Theorem allows us to claim, in certain cases, that the distribution of the sample mean \(\overline{x}\) is normally distributed.
  • c. The standard deviation of the sample mean \(\overline{x}\) is usually approximately equal to the unknown population \(\sigma\).
  • d. The standard deviation of the total of a sample of n observations exceeds the standard deviation of the sample mean.
  • e. If \(X \sim N(8,\sigma)\) then \(P(\overline{x} > 4)\) is less than \(P(X > 4).\)
  • Answer:
  • a. false, the standard deviation of the sample mean is \(\frac{\sigma}{\sqrt n}\). Thus as the sample size increases, n increases, and as n increases, the standard deviation decreases.
  • b. True
  • c. False, standard deviation of the sample mean is \(\frac{\sigma}{\sqrt n}\)
  • d. True, the standard deviation of the total of a sample of n observations is \(n\sqrt \sigma\); but the standard deviation of the sample mean is \(\frac{\sigma}{\sqrt n}.\)Unless n is one, the standard deviation of the total of a sample of n observations exceeds the standard deviation of the sample mean.
  • e. False, let's assume \(\sigma=2\) and \(n=2\). In this case, the z-score for \(P(\overline{X} > 4)\) would be -2.828 while the z-score for \(P(X>4)\) would be -2. \(P(Z>-2.828) > P(Z>-2) \). Therefore the statement is false.
  • Example 2

A selective college would like to have an entering class of 1200 students. Because not all students who are offered admission accept, the college admits 1500 students. Past experience shows that 70% of the students admitted will accept. Assuming that students make their decisions independently, the number who accept X, follows the binomial distribution with n = 1500 and p=0.70.

  • a.Write an expression for the exact probability that at least 1000 students accept.
  • Answer:
  • \(P(X \ge 1000)= P(X=1000)+P(X=1001)+....+P(X=1500)= \sum_{x=1000}^{1500}(\frac{7}{10})^x (\frac{3}{10})^{1500-x}\).
  • b. Approximate the above probability using the normal distribution.
  • Answer:
  • We can use the normal approximation to binomial\[\mu = np = 1500 \times 0.70 = 1050.\] and \(\sigma = \sqrt npq = \sqrt1500 \times 0.7 \times 0.3= 17.748.\)

\(P(X \ge 1000)= P(Z> \frac{999.5-1050}{17.748}=P(Z>-2.845)=.9977\)

  • Below you can see a snapshot for this approximation:
SOCR Activities CLT Christou example2.jpg
  • Example 3

An insurance company wants to audit health insurance claims in its very large database of transactions. In a quick attempt to assess the level of overstatement of this database, the insurance company selects at random 400 items from the database (each item represents a dollar amount). Suppose that the population mean overstatement of the entire database is $8, with population standard deviation $20.

  • a. Find the probability that the sample mean of the 400 would be less than $6.50.
  • Answer:
  • \(\overline{X} \sim N(8, \frac{20}{\sqrt400}). P(\overline{X} <6.50) =P(Z<-1.5)=.0667\)
  • Below you can see a snapshot for this part:
SOCR Activities CLT Christou example3 a.jpg
  • b. Why can we use the normal distribution in obtaining an answer to part (a)?
  • Answer:
  • The central limit theorem states that the sample mean approaches the normal distribution as the sample size gets bigger. Usually, if \( n \ge 30\) we can assume that \(\overline{X} \sim N(\mu,\frac{\sigma}{\sqrt n})\).In this case \(n=400\), so n is large enough.
  • d. For what value of w can we say that P(p-w < X < p+w) is equal to 80%?
  • Answer:
  • \(\overline{X} \sim N(8,1).\)According to the snapshot below, the middle 80% of this distribution is (6.721,9.279). Therefore \(w=8-6.721 =1.29\)
SOCR Activities Normal Christou example3 d.jpg
  • e. Let T be the total overstatement for the 400 randomly selected items. Find the number b so that P(T > b) = 0.975.
  • Answer:
  • \(T \sim N(n\mu,\sigma\sqrt n).\) In this case, \(T \sim N(3200,400).\) We know that \(P(T>b) =.975.\)So now we need to find the 97.5th percentile of this distribution using SOCR. According to the SOCR snapshot below, the 97.5th percentile of this distribution is 3984. Therefore b=3984.
SOCR Activities Normal Christou example3 e.jpg
  • Example 4

A telephone company has determined that during nonholidays the number of phone calls that pass through the main branch office each hour follows the normal distribution with mean \(\mu\)= 80000 and standard deviation \(\sigma\) = 35000. Suppose that a random sample of 60 nonholiday hours is selected and the sample mean of the incoming phone calls is computed.

  • a. Describe the distribution of X.
  • Answer:
  • \(\overline{X} \sim N(\mu, \frac{\sigma}{\sqrt n })\). In this case, \(\overline{X} \sim N(80000, 4518.48)\).
  • b. Find the probability that the sample mean X of the incoming phone calls for these 60 hours is larger than 91970.
  • Answer:
  • We can find the answer using SOCR. The answer is 0.004032. Please see snapshot below:
SOCR Activities Normal Christou example4 .jpg

c. Is it more likely that the sample average \(\overline{x}\)will be greater than 75000 hours, or that one hour's incoming calls will be?

  • Answer:
  • We can find the answer right away using SOCR. Please see snapshots below:
This is the distribution for \(X\)
SOCR Activities Normal Christou example4 c.jpg
This is the distribution for \(\overline{X}\)
SOCR Activities Normal Christou example3 c2.jpg
  • The probabilities are 55.6% for one hour vs. 86.6% for sample mean. Therefore the sample mean is more likely to be greater than 75000 hours.
  • Example 5

Assume the daily S&P return follows the normal distribution with mean \(\mu\) = 0.00032 and standard deviation \(\sigma\) = 0.00859.

  • a. Find the 75th percentile of this distribution.
  • Answer:
  • According to the SOCR snapshot below, the 75th percentile is 0.006115.
SOCR Activities Normal Christou example5 aa.jpg
  • b. What is the probability that in 2 of the following 5 days, the daily S&P return will be larger than 0.01?
  • Answer:
  • \(P(X>.01)=.13.\)We can see this in the snapshot below:
SOCR Activities Normal Christou example5 bb.jpg
  • \(5 \choose 2 .13^2 .87^3= 0.11128. \)
SOCR Activities Normal Christou example5 bc.jpg
  • c. Consider the sample average S&P of a random sample of 20 days.
  • i. What is the distribution of the sample mean?
  • Answer:
  • \(X \sim N(.00032,.00192)\)
  • ii. What is the probability that the sample mean will be larger than 0.005?
  • Answer:
  • \(P(\overline{X}>.005)=.0074\)
SOCR Activities Normal Christou example5 cc ii.jpg
  • iii. Is it more likely that the sample average S&P will be greater than 0.007, or that one day's S&P return will be?
  • Answer:
  • One day's return is more likely to be greater than .007. The probabilities are 0.21 for \(X\) vs. .00022 for \(\overline{X}\).
This is the snapshot for \(P(X>.007)\)
SOCR Activities Normal Christou example5 cc iii 1.jpg
This is the snapshot for \(P(\overline{X}>.007)\)
SOCR Activities Normal Christou example5 cc iii 2.jpg
  • Example 6
  • If Y has a \(\chi^2\) distribution with n degrees of freedom, then Y could be represented by

Y=\(\sum_{i=0}^{n}X_i\) where \(X_i\)'s are independent, each having a distribution of \(\chi^2\) with 1 degree of freedom. A machine in a heavy-equipment factory produces steel rods of length Y, where Y is a normal random variable with \(\mu = 6 \) inches and \(\sigma^2 = 0.2\). The cost C of repairing a rod that is not exactly 6 inches in length is proportional to the square of the error and is given, in dollars, by \(C = 4(Y-\mu)^2\). If 50 rods with independent lengths are produced in a given day, approximate the probability that the total cost for repairs for that day exceeds $48.

  • Answer:

\(\sum_{i=1}^{50}\frac{Y-\mu}{\sigma^2} \sim \chi^2\)

\(P(\chi_2>60)= *Below you can see a snapshot for this example: <center>[[Image: SOCR_Activities_Normal_Christou_example6_1.jpg|600px]]</center> *We could have also used the Z distribution to approximate the answer. The mean for <math>\chi^2\) is n, and the standard deviation is \(\sqrt 2n \). Therefore the probability would be

P(Z>\frac{60-50){10})=P(Z>1)
  • Using the snapshot below, we find the answer, which is . This is close to the exact address using the \(\chi^2\)distribution.