馃攳
Wisdom of the Crowds competition - the answer - YouTube
Channel: unknown
[0]
hello everyone first of all let me
[2]
introduce you to David speaker holder
[4]
who we've had on before with our
[6]
non-transitive dice video which is one
[8]
of the prices on offer in our
[10]
competition today but first of all did
[12]
you not get the memo did you know we
[14]
were going to dress up for this I chose
[16]
for the for the people that houses my
[18]
mathematical trance
[19]
mrs. Hughes dressed up is any choice
[20]
okay said David tell us about the
[22]
competition with it what in his last
[24]
video James asked you to guess how many
[26]
jelly beans were in this jar and he's
[30]
had over a thousand entries we've had
[32]
over a thousand responses we're going to
[34]
analyze the results and we've had a
[37]
winner which is in this envelope yes
[41]
you're right ago yes the winner is in
[44]
this envelope the correct answer I did
[46]
get a dick count all the jelly beans
[48]
myself but we will reveal this at the
[51]
end of the video so the last time I told
[54]
you about the wisdom of the crowd so a
[56]
mathematician called Sir Francis Galton
[58]
wants to know that crowd of people could
[61]
decide what the correct weight was of a
[64]
butchered ox now David what do you think
[66]
of wisdom of the crowds is it real or is
[68]
it bulk well I worked in Goldman's case
[71]
because he was at a country fair where
[72]
people knew quite a lot about meat and
[74]
about what an animal look like I've had
[76]
been chopped up and laid out and what
[78]
that actual quantity of meat would come
[80]
to when they added it all up but when
[82]
you try to estimate how many beans and
[84]
something like that it's quite tricky
[85]
because you're trying to estimate how to
[86]
sense your volume from just a
[87]
two-dimensional view of it people could
[90]
have some systematic biases in other
[92]
words they could on average even gets
[95]
too high or too low and I'm going to be
[97]
really interested to see what results
[99]
are so the wisdom is important exactly
[102]
so unfortunately we did have to reject
[104]
some of the outliers which I just
[106]
thought were typos really and to be fair
[109]
Sir Francis Galton had to do this
[111]
himself when he did it in his original
[113]
problem so I felt justified doing that
[115]
the were as well some obvious joke
[118]
answers if there does turn out to be a
[121]
googol of jelly beans inside the jar
[123]
you'll still win don't worry you're
[125]
still in the competition
[126]
francis galton actually was charging six
[129]
pence which he thought would be enough
[131]
to deter joke answers
[133]
like that so in future that's what we'll
[135]
do we'll start charging six months we
[136]
could make 60 quid out of this and but
[139]
there might as well be some bias because
[143]
in this case people could see other
[145]
people's answers as well and we're not
[148]
sure well but David take us through what
[150]
happened take us through the data let's
[152]
have a look okay here is the
[157]
distribution of the judgments that were
[160]
made the guesses that were made we see
[161]
here they range from you know very low
[163]
right up to nearly ten thousand some
[165]
people thought their new 10,000
[166]
jellybeans in that jar most of them
[168]
around about bit below between two
[170]
thousand and two thousand and we see a
[172]
peak here but a long tail with some
[174]
people thinking 3,000 4,000 5,000 jail
[177]
beans in the jar so we got is a very non
[180]
symmetric distribution of judgments the
[183]
problem when we've got a distribution
[184]
like this we have a sort of skew
[186]
distribution is it's not at all clear
[188]
what measure we should use when we won't
[190]
say the average opinion there's a number
[192]
of different averages we could choose
[193]
right so we could use the most common
[196]
answer right which is called the mode
[197]
average and in this case the most common
[200]
answer was 1337 budget net what else
[206]
could we use we could use the mean the
[208]
arithmetic mean that means you add up
[209]
all the guesses that were made and
[211]
divided by the number of people who
[213]
enter the competition and if you do that
[215]
you get a considerably higher value
[216]
about 2059 jelly beans and the point
[221]
about the mean is that it's very
[222]
strongly influenced by these few people
[225]
who thought there were 8,000 jelly beans
[227]
in the jar so instead we might want to
[229]
use another type of average called the
[230]
median average so if I listed all the
[233]
guesses in order and I took the middle
[235]
value
[236]
I actually got 1729 Hays Ramanujan's
[241]
number and in this case the median could
[244]
be considered the average person's
[246]
judgment rather than the average
[249]
judgment so instead of looking at the
[251]
data directly we decided to take a look
[252]
at the log of the data so why is a log
[255]
the log is the power of 10 needed to
[258]
make that number so a thousand would
[260]
have a power of three it's three powers
[263]
of 10 10
[264]
times ten times ten is a thousand so the
[267]
log of a thousand is three the log of
[269]
10,000 is four it's four powers of ten
[273]
10 times 10 times 10 times 10 is 10,000
[276]
and that's what we did we took the data
[278]
and we took the log of the data and this
[281]
is what we got when you do this you end
[283]
up with a really nice symmetric
[285]
distribution and in fact then we can fit
[288]
a normal curve to that where they
[290]
standard old bell-shaped curves and it
[292]
fits the data rather well and that means
[295]
if we fed a normal distributions of data
[296]
then we know how to take the looking for
[299]
the average we should just take a sample
[300]
mean of those logarithms of the
[303]
observations in order to estimate the
[305]
center of this distribution so this is
[308]
what we did we took the average of the
[310]
log of the data and we found that the
[312]
average power of 10 was three point two
[315]
three so we take 10 to the power of
[318]
three point two three we get our average
[320]
now as 1680 and this is called the
[325]
geometric mean it's the mean of the log
[328]
of the data and well what's so good
[330]
about it the geometric mean is a good
[333]
measure to use if you think the sort of
[335]
mistakes people make are proportional
[336]
mistakes in other words they're just as
[338]
likely to get double the answer as half
[340]
the answer and if that if you think that
[342]
and that's what gives rise to shapes
[343]
like this as a distribution then you
[346]
should use the geometric mean so I think
[347]
if I really believed there wasn't any
[349]
systematic bias in people's judgments I
[351]
would say one six eight oh would be on
[354]
our best judgment
[356]
so now let's reveal the answer in this
[359]
envelope which david hasn't seen so
[362]
David you're going to be revealing this
[363]
answer soon honorable mentions I think
[366]
to these three people here who actually
[368]
got the median average of 1729 so these
[373]
are average people right here another
[376]
honorable mention to virtual noodles who
[378]
was closest to our average value of 1680
[382]
he actually got 1681 but now to reveal
[388]
the actual answer which I'll give to
[389]
David also so excited and the answer is
[393]
one sick
[395]
one six 1616 which is acting within
[401]
about four percent of our average which
[406]
is pretty good initially amazing how you
[408]
can work on with actual real data we've
[411]
actually gotten pretty good answer
[414]
congratulations to our winner
[416]
didi ss6 who actually got it on the nose
[420]
1616 and I'll make a butt pad for you
[423]
it's going to include this Top Trumps
[425]
mathematicians and non-transitive dyes
[427]
whatever I think of so congratulations
[429]
again and if you have been thanks for
[432]
watching well it's amazing
Most Recent Videos:
You can go back to the homepage right here: Homepage





