Overview of Some Discrete Probability Distributions (Binomial,Geometric,Hypergeometric,Poisson,NegB) - YouTube

Channel: unknown

[1]
Let's look at a quick overview of some discrete probability distributions and their relationships.
[5]
I intend this video to be used as a recap
[8]
after having been introduced to these distributions,
[10]
but it could possibly be used as an introductory overview.
[14]
I don't any calculations in this video,
[16]
nor do I discuss how to calculate the probabilities.
[19]
I simply discuss how these different distributions arise,
[21]
and the relationships between them.
[25]
The Bernoulli distribution is the distribution of the number of successes on a single Bernoulli trial.
[31]
In a Bernoulli trial we get either a success or a failure.
[34]
It's like an answer to a yes or no question.
[37]
A Bernoulli random variable can take on only the values 0 and 1.
[42]
For example, we can use the Bernoulli distribution to answer questions like:
[46]
if a single coin is tossed once, what is the probability it comes up heads?
[50]
Or, if a single adult American is randomly selected,
[54]
what is the probability they are a heart surgeon?
[58]
Some other important distributions are built on the notion of independent Bernoulli trials,
[63]
where we have a series of trials,
[64]
and each one results in a success or a failure.
[67]
An important one is the binomial distribution,
[70]
which is the distribution of the number of successes in n independent Bernoulli trials.
[75]
For example, with the binomial distribution we can answer a question like:
[79]
if a coin is tossed 20 times, what is the probability heads comes up exactly 14 times?
[85]
And since the binomial distribution is the distribution of
[88]
the number of successes in n independent Bernoulli trials,
[92]
the Bernoulli distribution is a special case of the binomial distribution with n=1, a single trial.
[100]
Continuing on with the theme of independent Bernoulli trials,
[103]
the geometric distribution is the distribution of the number of trials needed to get the first success.
[109]
For example, with the geometric distribution we can answer a question like:
[113]
if a coin has repeatedly tossed, what is the probability
[116]
the first time heads appears occurs on the 8 toss?
[121]
The negative binomial distribution is a generalization of the geometric distribution.
[126]
The negative binomial distribution is the distribution of the number of trials
[130]
needed to get a certain number of successes in repeated independent Bernoulli trials.
[136]
So the negative binomial distribution can help us answer questions like:
[139]
if a coin has repeatedly tossed, what is the probability
[143]
the third time heads appears occurs on the ninth trial?
[149]
The way the binomial distribution and the negative binomial distribution arise
[153]
can sound similar, and they can sometimes be confused.
[156]
They differ in what the random variable is.
[159]
In the binomial distribution, the number of trials is fixed,
[163]
and the number of successes is a random variable.
[166]
For instance, we're tossing a coin a fixed number of times,
[169]
and the number of heads that comes up is a random variable.
[172]
In the negative binomial distribution, the number of successes is fixed,
[177]
and the number of trials required to get that number of successes is the random variable.
[182]
For instance, we might be tossing a coin until we get heads 4 times.
[186]
And the number of tosses required to get heads 4 times is the random variable.
[193]
Now I'll talk about two distributions that are related to the binomial,
[197]
but aren't based on independent Bernoulli trials.
[200]
The hypergeometric distribution is similar to the binomial distribution
[204]
in that we're interested in the number of successes in n trials,
[208]
but it's different because the trials are not independent.
[211]
The hypergeometric distribution is the distribution of the number of successes
[216]
when we are drawing without replacement from a source that contains
[219]
a certain number of successes and a certain number of failures.
[223]
For example, we can use the hypergeometric distribution to answer a question like:
[228]
if 5 cards are drawn without replacement from a well shuffled deck,
[231]
what is the probability exactly 3 hearts are drawn?
[235]
It's different from the binomial because the probability of success,
[238]
the probability of getting a heart, would change from card to card,
[242]
depending on what happened before.
[244]
However, if the cards are drawn with replacement,
[247]
meaning the card was put back in and reshuffled before the next card was drawn,
[251]
then the trials would be independent and we would use the binomial distribution instead.
[258]
If we are sampling only a small fraction of objects without replacement from a large population
[263]
then the trials are still not independent, but that dependency has only a small effect,
[268]
and the binomial distribution closely approximates the hypergeometric distribution.
[272]
So there are times when a problem is in its nature a hypergeometric problem,
[276]
but we use the binomial distribution as an approximation.
[280]
This can make our life a little bit easier sometimes.
[284]
Another distribution related to the binomial is the Poisson distribution.
[288]
But this one's a little harder to explain.
[290]
The Poisson distribution is the distribution of the number of events
[294]
in a given time or length, or area, or volume etc.,
[298]
if those events are occurring randomly and independently.
[302]
There's a bit more to it than that, and I go into this in much greater detail in my Poisson videos.
[307]
But that's the gist of it. So we might use the Poisson distribution to answer a question like:
[312]
what is the probability there will be exactly 4 car accidents
[315]
on a certain university campus in a given week?
[320]
There is a relationship between the Poisson distribution and the binomial distribution.
[325]
The Poisson distribution closely approximates the binomial distribution
[329]
if n, the number of trials, in the binomial, is large
[332]
and p, the probability of success, is very small.
[336]
So suppose we have a question like:
[338]
what is the probability that in a random sample 100,000 births,
[341]
there is at least one case of progeria?
[344]
Progeria is an extremely rare disease that causes premature aging,
[348]
and it occurs in about 1 in every eight million births.
[352]
This is truly a binomial problem. But we have a binomial problem
[356]
with a very large n, 100,000,
[358]
and a very small probability of success, 1 in eight million or so,
[362]
because progeria such a rare disease.
[365]
And so this could be very well approximated by the Poisson distribution.
[370]
I look into all of these concepts discussed in this video in greater detail
[373]
in the videos for these specific distributions.