Difference between revisions of "SOCR EduMaterials Activities GCLT Applications"
(→Application 1) |
|||
Line 10: | Line 10: | ||
First start by readying carefully the [[SOCR EduMaterials Activities GeneralCentralLimitTheorem]]. Then go over each of the CLT applications that are worked out in this activity supplement. You may find it useful going back abd forth between these applications and the formal CLT activity. | First start by readying carefully the [[SOCR EduMaterials Activities GeneralCentralLimitTheorem]]. Then go over each of the CLT applications that are worked out in this activity supplement. You may find it useful going back abd forth between these applications and the formal CLT activity. | ||
− | ===Application 1=== | + | ===Application 1 (Poisson)=== |
Suppose a call service center expects to get 20 calls a minute for questions regarding each of 17 different vendors that rely on this call center for handling their calls. What is the probability that in a 1-minute interval they receive less than 300 calls in total? | Suppose a call service center expects to get 20 calls a minute for questions regarding each of 17 different vendors that rely on this call center for handling their calls. What is the probability that in a 1-minute interval they receive less than 300 calls in total? | ||
Let <math>X_i</math> be the random variable representing the number of calls received about the <math>i^{th}</math> vendor within a minute, then <math>X_i \sim Poisson (20)</math>, as <math>X_i</math> is the number of arrivals within a unit interval and the mean arrival count is given to be 20. The distribution of the total number of calls <math>T = \sum_{i=1}^{17} \sim Poisson(17 \times 20)</math>. By CLT, <math>T \sim Normal(\mu = 17 \times 20, \sigma^2 = 17\times20)</math>, as an approximation of the exact distribution of the total sum. Using the [http://www.socr.ucla.edu/htmls/SOCR_Distributions.html SOCR Distribution applet] one can compute exactly the <math>P(T<300 | T \sim Poisson(17 \times 20))= 0.014021</math>. On the other hand side, one may use the CLT to compute a Normal approximation probability of the same event, <math>P(T<300 | T \sim Normal(\mu = 17\times20, \sigma^2 = 17\times20)) = 0.014896</math>. The last quantity is obtained again using the SOCR Distributions applet, without using continuity correction. Using continuity correction the approximation improves, <math>P(T<300 | T \sim Normal(\mu = 17\times20, \sigma^2 = 17\times20)) = 0.0140309</math>. Arguably, the CLT-based calculation is less intense and more appealing to students and trainees, compared to computing the exact probability. | Let <math>X_i</math> be the random variable representing the number of calls received about the <math>i^{th}</math> vendor within a minute, then <math>X_i \sim Poisson (20)</math>, as <math>X_i</math> is the number of arrivals within a unit interval and the mean arrival count is given to be 20. The distribution of the total number of calls <math>T = \sum_{i=1}^{17} \sim Poisson(17 \times 20)</math>. By CLT, <math>T \sim Normal(\mu = 17 \times 20, \sigma^2 = 17\times20)</math>, as an approximation of the exact distribution of the total sum. Using the [http://www.socr.ucla.edu/htmls/SOCR_Distributions.html SOCR Distribution applet] one can compute exactly the <math>P(T<300 | T \sim Poisson(17 \times 20))= 0.014021</math>. On the other hand side, one may use the CLT to compute a Normal approximation probability of the same event, <math>P(T<300 | T \sim Normal(\mu = 17\times20, \sigma^2 = 17\times20)) = 0.014896</math>. The last quantity is obtained again using the SOCR Distributions applet, without using continuity correction. Using continuity correction the approximation improves, <math>P(T<300 | T \sim Normal(\mu = 17\times20, \sigma^2 = 17\times20)) = 0.0140309</math>. Arguably, the CLT-based calculation is less intense and more appealing to students and trainees, compared to computing the exact probability. | ||
− | ===Application 2=== | + | ===Application 2 (Exponential)=== |
It is believed that life-times, in hours, of light-bulbs are Exponentially distributed, say <math>Exp({1\over{2,000}})</math>, mean expected life of 2,000 hours. Recall that the Exponential distribution is called the Mean-Time-To-Failure distribution. You can find more about it from the [http://www.socr.ucla.edu/htmls/SOCR_Distributions.html SOCR Distributions applet]. Suppose a University wants to purchase 100 of these light-bulbs and estimate the average life-span of these light bulbs. What is a CLT-based estimate of the probability that the average life-span exceeds 2,200 hrs? Let <math>X_i \sim Exp({ 1\over{2,000}})</math> and | It is believed that life-times, in hours, of light-bulbs are Exponentially distributed, say <math>Exp({1\over{2,000}})</math>, mean expected life of 2,000 hours. Recall that the Exponential distribution is called the Mean-Time-To-Failure distribution. You can find more about it from the [http://www.socr.ucla.edu/htmls/SOCR_Distributions.html SOCR Distributions applet]. Suppose a University wants to purchase 100 of these light-bulbs and estimate the average life-span of these light bulbs. What is a CLT-based estimate of the probability that the average life-span exceeds 2,200 hrs? Let <math>X_i \sim Exp({ 1\over{2,000}})</math> and | ||
<math>\overline{X}= \sum_{i=1}^{100}{X_i}</math>. Notice that in this case the exact distribution of <math>\overline{X}</math> is (generally) not Exponential, even though the density may be computed in closed form (Khuong & Kong, 2006). If we use the CLT, however, we can approximate the probability of interest | <math>\overline{X}= \sum_{i=1}^{100}{X_i}</math>. Notice that in this case the exact distribution of <math>\overline{X}</math> is (generally) not Exponential, even though the density may be computed in closed form (Khuong & Kong, 2006). If we use the CLT, however, we can approximate the probability of interest | ||
<center> | <center> | ||
− | <math>P(\overline{X} > 2,200) \sim P(\overline{X} > 2,200 | \overline{X} \ | + | <math>P(\overline{X} > 2,200) \sim P(\overline{X} > 2,200 | \overline{X} \sim N(\mu_{\overline{X}}=2,000, \sigma_{\overline{X}}^2 = {2,000^2 \over 100})),</math> |
</center> | </center> | ||
as we know that the mean and the standard deviation of <math>X_i</math> are <math>{1\over {\lambda}} =2,000</math> and the standard deviation of <math>\overline{X}</math> is <math>{1\over {\sqrt{100} \times \lambda}} =200</math>. Therefore, <math>P(\overline{X} >2,200) \approx 0.158655</math>, using the CLT approximation and the SOCR Distributions calculator, see figure below. | as we know that the mean and the standard deviation of <math>X_i</math> are <math>{1\over {\lambda}} =2,000</math> and the standard deviation of <math>\overline{X}</math> is <math>{1\over {\sqrt{100} \times \lambda}} =200</math>. Therefore, <math>P(\overline{X} >2,200) \approx 0.158655</math>, using the CLT approximation and the SOCR Distributions calculator, see figure below. | ||
Line 25: | Line 25: | ||
− | ===Application 3=== | + | ===Application 3 (Exponential)=== |
+ | A weekly TV talk show from broadcaster U invites viewers to call to express their opinions about the program. Many people call, which sometimes results in quite a long wait time until the host replies. The time it takes the host to respond tends to follow an ''exponential distribution'' with mean of 50 seconds. A competing TV network W has another similar talk show and would like to respond to callers faster than broadcaster U. To do that W executives need to know how long U takes to respond. So, W personnel make 25 calls per week to the U show (for 50 weeks) and measure how long it took the U host to respond. Then W executives compute the average length for their weekly samples of size 25. At the end of the year they plot the distribution of the sample means. What do you think are the center, spread and shape of this distribution? Find out using the SOCR CLT applet. Approximately, what proportion of time is the average weekly wait time for the U broadcaster exceeding 45 seconds? | ||
+ | |||
+ | The Figure below shows the corresponding sampling simulation. Notice the differences in the summary statistics between the native distribution, the sample distribution and the sampling distribution for the mean. The answer of this application may then be computed using the [http://www.socr.ucla.edu/htmls/SOCR_Distributions.html SOCR Distributions], | ||
+ | <math>P(\overline{X} > 45) \approx P(\overline{X} > 45 | \overline{X} \sim N(\mu_{\overline{X}}=50, \sigma_{\overline{X}}^2 = {50^2 \over 25}))=0.691463.</math> | ||
+ | This chance may also be computed empirically by counting the number of weekly samples that generate an average wait time over 45 seconds and dividing this number by 50 (the total number of weeks in this survey). | ||
+ | |||
===Application 4=== | ===Application 4=== |
Revision as of 14:33, 2 April 2007
Contents
SOCR Educational Materials - Activities - Applications of the General Central Limit Theorem (CLT)
This is a component of the activity is based on the SOCR EduMaterials Activities GeneralCentralLimitTheorem.
Goals
The aims of this activity are to demonstrate several practical applications of the general CLT. There are many practical examples of using CLT to solve real-life problems. Here are some examples which may be solved using the SOCR CLT resources.
The SOCR CLT Experiment
First start by readying carefully the SOCR EduMaterials Activities GeneralCentralLimitTheorem. Then go over each of the CLT applications that are worked out in this activity supplement. You may find it useful going back abd forth between these applications and the formal CLT activity.
Application 1 (Poisson)
Suppose a call service center expects to get 20 calls a minute for questions regarding each of 17 different vendors that rely on this call center for handling their calls. What is the probability that in a 1-minute interval they receive less than 300 calls in total?
Let \(X_i\) be the random variable representing the number of calls received about the \(i^{th}\) vendor within a minute, then \(X_i \sim Poisson (20)\), as \(X_i\) is the number of arrivals within a unit interval and the mean arrival count is given to be 20. The distribution of the total number of calls \(T = \sum_{i=1}^{17} \sim Poisson(17 \times 20)\). By CLT, \(T \sim Normal(\mu = 17 \times 20, \sigma^2 = 17\times20)\), as an approximation of the exact distribution of the total sum. Using the SOCR Distribution applet one can compute exactly the \(P(T<300 | T \sim Poisson(17 \times 20))= 0.014021\). On the other hand side, one may use the CLT to compute a Normal approximation probability of the same event, \(P(T<300 | T \sim Normal(\mu = 17\times20, \sigma^2 = 17\times20)) = 0.014896\). The last quantity is obtained again using the SOCR Distributions applet, without using continuity correction. Using continuity correction the approximation improves, \(P(T<300 | T \sim Normal(\mu = 17\times20, \sigma^2 = 17\times20)) = 0.0140309\). Arguably, the CLT-based calculation is less intense and more appealing to students and trainees, compared to computing the exact probability.
Application 2 (Exponential)
It is believed that life-times, in hours, of light-bulbs are Exponentially distributed, say \(Exp({1\over{2,000}})\), mean expected life of 2,000 hours. Recall that the Exponential distribution is called the Mean-Time-To-Failure distribution. You can find more about it from the SOCR Distributions applet. Suppose a University wants to purchase 100 of these light-bulbs and estimate the average life-span of these light bulbs. What is a CLT-based estimate of the probability that the average life-span exceeds 2,200 hrs? Let \(X_i \sim Exp({ 1\over{2,000}})\) and \(\overline{X}= \sum_{i=1}^{100}{X_i}\). Notice that in this case the exact distribution of \(\overline{X}\) is (generally) not Exponential, even though the density may be computed in closed form (Khuong & Kong, 2006). If we use the CLT, however, we can approximate the probability of interest
\(P(\overline{X} > 2,200) \sim P(\overline{X} > 2,200 | \overline{X} \sim N(\mu_{\overline{X}}=2,000, \sigma_{\overline{X}}^2 = {2,000^2 \over 100})),\)
as we know that the mean and the standard deviation of \(X_i\) are \({1\over {\lambda}} =2,000\) and the standard deviation of \(\overline{X}\) is \({1\over {\sqrt{100} \times \lambda}} =200\). Therefore, \(P(\overline{X} >2,200) \approx 0.158655\), using the CLT approximation and the SOCR Distributions calculator, see figure below.
Application 3 (Exponential)
A weekly TV talk show from broadcaster U invites viewers to call to express their opinions about the program. Many people call, which sometimes results in quite a long wait time until the host replies. The time it takes the host to respond tends to follow an exponential distribution with mean of 50 seconds. A competing TV network W has another similar talk show and would like to respond to callers faster than broadcaster U. To do that W executives need to know how long U takes to respond. So, W personnel make 25 calls per week to the U show (for 50 weeks) and measure how long it took the U host to respond. Then W executives compute the average length for their weekly samples of size 25. At the end of the year they plot the distribution of the sample means. What do you think are the center, spread and shape of this distribution? Find out using the SOCR CLT applet. Approximately, what proportion of time is the average weekly wait time for the U broadcaster exceeding 45 seconds?
The Figure below shows the corresponding sampling simulation. Notice the differences in the summary statistics between the native distribution, the sample distribution and the sampling distribution for the mean. The answer of this application may then be computed using the SOCR Distributions, \(P(\overline{X} > 45) \approx P(\overline{X} > 45 | \overline{X} \sim N(\mu_{\overline{X}}=50, \sigma_{\overline{X}}^2 = {50^2 \over 25}))=0.691463.\) This chance may also be computed empirically by counting the number of weekly samples that generate an average wait time over 45 seconds and dividing this number by 50 (the total number of weeks in this survey).
Application 4
- Back to SOCR CLT Activity
- SOCR Home page: http://www.socr.ucla.edu
- SOCR CLT Activity at MERLOT
- SOCR CLT Activity at CAUSEweb
Translate this page: