Business Statistics and Analysis-1
Course materials are from Rice University's Business Statistics and Analysis
I. Introduction to Data Analysis Using Excel
- Pivot Table
- Line, Bar Graphs
- Pie Charts
- Pivot Charts
- Scatter Plots
II. Basic Data Descriptors, Statistical Distributions, and Application to Business Decisions
Week 1 – Basic Data Descriptors
1. Descriptive Statistics
- Box Plot
- Standard Deviation
Rule of Thumb
For roughly bell-shaped data, approximately 68% of the data lie within one standard deviation of the mean, and approximately 95% lie within two standard deviations of the mean.
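The rule of thumb can be checked against an exactly normal distribution. The sketch below is only an illustration, assuming the data follow a standard normal curve; it uses `statistics.NormalDist` from the Python standard library, and the variable names are made up for this example.

```python
from statistics import NormalDist

# Standard normal distribution: mean 0, standard deviation 1.
z = NormalDist(mu=0.0, sigma=1.0)

# Probability of landing within k standard deviations of the mean.
within_1_sd = z.cdf(1) - z.cdf(-1)
within_2_sd = z.cdf(2) - z.cdf(-2)

print(f"within 1 standard deviation:  {within_1_sd:.4f}")
print(f"within 2 standard deviations: {within_2_sd:.4f}")
```

For real data that are only approximately bell-shaped, the observed shares will differ somewhat from these values.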
Week 2 – Descriptive Measures of Association, Probability, and Statistical Distributions
1. Descriptive Measures of Association
The covariance measure is sensitive to the units of measurement: we can arbitrarily inflate or deflate the covariance through the choice of units.
• Range of the correlation coefficient: -1 to 1
• Not affected by the units of measurement.
Loosely speaking, correlations > +0.5 or < -0.5 are considered indicative of a strong positive or strong negative relationship between two variables.
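A small numerical sketch can make the contrast between covariance and correlation concrete. The height and weight figures below are hypothetical, and the two helper functions are written out by hand purely for illustration (they use the sample formulas that divide by n - 1).

```python
from math import sqrt

def covariance(xs, ys):
    """Sample covariance (divides by n - 1)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)

def correlation(xs, ys):
    """Pearson correlation: covariance divided by both standard deviations."""
    return covariance(xs, ys) / sqrt(covariance(xs, xs) * covariance(ys, ys))

# Hypothetical data: heights in metres and weights in kilograms.
height_m = [1.60, 1.65, 1.70, 1.75, 1.80]
weight_kg = [55.0, 61.0, 64.0, 72.0, 78.0]

# The same heights expressed in centimetres.
height_cm = [h * 100 for h in height_m]

cov_m = covariance(height_m, weight_kg)
cov_cm = covariance(height_cm, weight_kg)    # 100 times larger than cov_m
corr_m = correlation(height_m, weight_kg)
corr_cm = correlation(height_cm, weight_kg)  # unchanged by the unit switch

print(f"covariance:  {cov_m:.4f} (metres) vs {cov_cm:.2f} (centimetres)")
print(f"correlation: {corr_m:.4f} (metres) vs {corr_cm:.4f} (centimetres)")
```

Changing the unit of height rescales the covariance by the same factor, while the correlation stays the same and remains between -1 and 1.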
Probability is a numerical measure of the frequency of occurrence of an event. It is measured on a scale from 0 to 1. An event with probability 0 will definitely not occur. An event with probability 1 will occur with certainty.
Viewing a business process as a random experiment with an associated random variable is helpful in characterizing it and in making predictions about its outcome.
3. Statistical Distributions
It is common in business applications to use a continuous distribution such as the Normal (the bell curve) for discrete data.
t-distribution
Week 3 – The Normal Distribution
1. Probability Mass Function VS Probability Density Function
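A probability mass function (for a discrete random variable) or a probability density function (for a continuous random variable) is a rule that assigns probabilities to the possible values, or ranges of values, that a random variable takes when it is approximated by a particular statistical distribution.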
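The difference between the two kinds of function can be sketched with one discrete and one continuous example. Both examples (a fair six-sided die and a standard normal variable) are assumptions chosen for illustration and do not come from the course slides.

```python
from statistics import NormalDist

# Discrete case: PMF of a fair six-sided die (each face has probability 1/6).
die_pmf = {face: 1 / 6 for face in range(1, 7)}
p_die_is_3 = die_pmf[3]               # probability of one exact value
pmf_total = sum(die_pmf.values())     # the probabilities of a PMF sum to 1

# Continuous case: PDF of a standard normal variable.
z = NormalDist(0, 1)
density_at_0 = z.pdf(0)               # height of the curve, not a probability
p_between = z.cdf(0.5) - z.cdf(-0.5)  # probability = area under the curve on an interval

print(f"P(die = 3)          = {p_die_is_3:.4f}")
print(f"density at z = 0    = {density_at_0:.4f}")
print(f"P(-0.5 < Z < 0.5)   = {p_between:.4f}")
```

A PMF value is itself a probability, whereas a PDF value only becomes a probability once it is accumulated over an interval.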
2. The Normal Distribution
Normal Distribution, aka the Bell Curve
Week 4 – Working with Distributions (Normal, Binomial, Poisson), Population and Sample Data
1. Applications of the Normal Distribution
A fast-food restaurant sells ‘falafel’ sandwiches. On a typical weekday, the demand for these sandwiches can be approximated by a normal distribution with a mean of 313 sandwiches and a standard deviation of 57 sandwiches.
Ques: What is the probability that on a particular day the demand for falafel sandwiches is less than 300 at the restaurant?
Demand ~ Normal(313 sandwiches, 57 sandwiches)
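One way to answer the falafel question is to evaluate the normal cumulative distribution function at 300. The sketch below uses Python's `statistics.NormalDist` rather than Excel, which is a substitution made for this example; the variable names are illustrative.

```python
from statistics import NormalDist

# Demand ~ Normal(mean = 313 sandwiches, standard deviation = 57 sandwiches)
demand = NormalDist(mu=313, sigma=57)

# P(Demand < 300) is the area under the curve to the left of 300.
p_less_than_300 = demand.cdf(300)

# Equivalent z-score: how many standard deviations 300 lies from the mean.
z_score = (300 - 313) / 57

print(f"z-score         = {z_score:.3f}")
print(f"P(Demand < 300) = {p_less_than_300:.3f}")  # roughly 0.41
```

Because 300 is only slightly below the mean, the probability comes out a little under one half.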
John can take either of two roads to the airport from his home (Road A or Road B). Owing to varying traffic conditions, the travel times on the two roads are not fixed. On a Friday around midday, the travel times on these roads can be well approximated by normal distributions as follows:
Road A: mean = 54 minutes, std = 3 minutes
Road B: mean = 60 minutes, std = 10 minutes
Ques: Which road should he choose if on midday Friday he must be at the airport within 50 minutes to pick up his spouse?
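The road question comes down to comparing the probability of a travel time of at most 50 minutes on each road. The sketch below is one way to compute it, again using `statistics.NormalDist` in place of Excel; the variable names are illustrative.

```python
from statistics import NormalDist

road_a = NormalDist(mu=54, sigma=3)    # Road A travel time in minutes
road_b = NormalDist(mu=60, sigma=10)   # Road B travel time in minutes

deadline = 50  # minutes available to reach the airport

# Probability of arriving within the deadline on each road.
p_a = road_a.cdf(deadline)   # z = (50 - 54) / 3, roughly 0.09
p_b = road_b.cdf(deadline)   # z = (50 - 60) / 10, roughly 0.16

better = "Road B" if p_b > p_a else "Road A"

print(f"P(on time via Road A) = {p_a:.4f}")
print(f"P(on time via Road B) = {p_b:.4f}")
print(f"Better choice: {better}")
```

Road B has the longer average travel time, but its larger spread makes a 50-minute trip more likely than on Road A, so it is the better choice under this deadline.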
2. Population and a Sample
A sample is a subset of the relevant population and is used to make inferences about the population.
3. The Central Limit Theorem
In plain language:
For sufficiently large samples, sample averages are approximately normally distributed irrespective of the distribution of the population the sample came from. Not only are they approximately normally distributed but, more importantly, they are distributed with mean equal to the population mean.
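A short simulation can illustrate the theorem. The sketch below assumes a strongly skewed population (an exponential distribution with mean 2) and hypothetical sample sizes; none of these numbers come from the course. It also compares the spread of the sample averages with the population standard deviation divided by the square root of the sample size, a standard result not stated in these notes.

```python
import random
from math import sqrt
from statistics import mean, stdev

random.seed(0)  # fixed seed so the run is repeatable

POPULATION_MEAN = 2.0   # exponential population: skewed, far from bell-shaped
SAMPLE_SIZE = 50        # observations per sample
NUM_SAMPLES = 5000      # number of samples drawn

def sample_average(n):
    """Average of n draws from the exponential population."""
    return sum(random.expovariate(1 / POPULATION_MEAN) for _ in range(n)) / n

sample_means = [sample_average(SAMPLE_SIZE) for _ in range(NUM_SAMPLES)]

center = mean(sample_means)    # should be close to the population mean
spread = stdev(sample_means)   # should be close to sigma / sqrt(n)
# For an exponential distribution the standard deviation equals the mean.
expected_spread = POPULATION_MEAN / sqrt(SAMPLE_SIZE)

# Share of sample averages within 2 standard deviations of their centre.
share_within_2_sd = sum(
    abs(m - center) <= 2 * spread for m in sample_means
) / NUM_SAMPLES

print(f"mean of sample averages: {center:.3f}")
print(f"std of sample averages:  {spread:.3f} (theory: {expected_spread:.3f})")
print(f"share within 2 std:      {share_within_2_sd:.3f}")
```

Even though individual observations are heavily skewed, the sample averages cluster around the population mean, and about 95% of them fall within two standard deviations of it, as a normal curve would suggest.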
4. The Binomial Distribution
Two Popular Discrete Distributions
Consider a situation where there are n independent trials, where the probability of success on each trial is p and the probability of failure is 1-p. Define the random variable X to denote the number of successes in n trials. Then this random variable is said to have a Binomial distribution.
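The definition above can be turned into numbers with the standard Binomial probability formula, P(X = k) = C(n, k) p^k (1-p)^(n-k), which is not spelled out in these notes. The sketch below is a minimal illustration; the scenario of 10 sales calls with a success probability of 0.3 is hypothetical.

```python
from math import comb

def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

# Hypothetical example: 10 independent sales calls,
# each succeeding with probability 0.3.
n, p = 10, 0.3

pmf = [binomial_pmf(k, n, p) for k in range(n + 1)]

p_exactly_3 = pmf[3]          # P(X = 3)
p_at_most_3 = sum(pmf[:4])    # P(X <= 3) = P(0) + P(1) + P(2) + P(3)
expected_successes = sum(k * pk for k, pk in enumerate(pmf))  # equals n * p

print(f"P(X = 3)  = {p_exactly_3:.4f}")
print(f"P(X <= 3) = {p_at_most_3:.4f}")
print(f"E[X]      = {expected_successes:.2f}")
```

The probabilities over k = 0, ..., n add up to 1, and the expected number of successes works out to n times p.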