Is there really 5.4% alcohol in that beer brand?
Many beer brands print on their label that the alcohol level is 5.4%. Let's say we collected the alcohol by volume for those brands: we sampled bottles at random and measured the alcohol level ourselves.
So we believe that the actual alcohol level should be 5.4%, but as beer consumers we sometimes feel it is not.
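One common way to frame this question is a one-sample t-test of the measured values against the labelled 5.4%. The sketch below is only an illustration, not necessarily the method used in the post: the function name `one_sample_t` is hypothetical and the measurements are made-up placeholder numbers, not real data.

```python
import math
import statistics


def one_sample_t(sample, mu0):
    """Return the t statistic and degrees of freedom for H0: mean == mu0."""
    n = len(sample)
    mean = statistics.fmean(sample)
    sd = statistics.stdev(sample)      # sample standard deviation (n - 1)
    se = sd / math.sqrt(n)             # standard error of the mean
    return (mean - mu0) / se, n - 1


# Made-up measurements (percent alcohol by volume), for illustration only.
measured_abv = [5.1, 5.3, 5.2, 5.4, 5.0]

t_stat, dof = one_sample_t(measured_abv, mu0=5.4)
print(f"t = {t_stat:.3f} with {dof} degrees of freedom")
```

For these placeholder numbers the statistic is about -2.83. With 4 degrees of freedom the two-sided 5% critical value is about 2.776, so a sample like this would just reject the claim that the mean is 5.4%. Real measurements could of course point the other way.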
Police Data Challenge: Winner Recommendations February 1, 2018
The Police Data Challenge contest brought together talented high school and undergraduate students from across the nation to show their passion for the good statistics can do.
Thanks to the Police Foundation's efforts to make the information available, the 70 teams used real crime data sets from the Baltimore, Seattle and Cincinnati police departments to look for the best possible solutions for safer communities.
Check out below how the winning teams analyzed the best way to fight crime through statistics:
When to give up? Exploration vs Exploitation
A lot of hard-working students don't end up being selected for scholarships. I should know, because I lost 3 years trying.
Now I turn the experience into an information-theoretic game to find out when I should have quit the whole process.
Assumption: Your best score will get you a scholarship if you are one of the sufficiently prepared students.
Say the entrance exams are the games.
Average for group vs Individual Inspection Paradox
Buses and trains are supposed to arrive at constant intervals, but in practice some intervals are longer than others. This means the buses do not follow the schedule exactly; there is always some randomness. With your luck, you might think you are more likely to arrive during a long interval. It turns out you are right: a random arrival is more likely to fall in a long interval because, well, it's longer.
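The effect is easy to check with a small simulation. This is a minimal sketch under an assumption that is not in the post: the gaps between buses are taken to be uniform between 5 and 15 minutes, and the function name `simulate_inspection_paradox` is hypothetical. A rider arriving at a random time lands in a gap with probability proportional to that gap's length, which is what the weighted draw models.

```python
import random


def simulate_inspection_paradox(n_gaps=50_000, n_riders=50_000, seed=0):
    """Compare the average gap with the average gap seen by random riders."""
    rng = random.Random(seed)
    # Assumed for illustration: gaps between buses are uniform on 5..15 minutes.
    gaps = [rng.uniform(5.0, 15.0) for _ in range(n_gaps)]
    mean_gap = sum(gaps) / len(gaps)

    # A rider arriving at a uniformly random time falls into a gap with
    # probability proportional to the gap's length (length-biased sampling).
    experienced = rng.choices(gaps, weights=gaps, k=n_riders)
    mean_experienced = sum(experienced) / len(experienced)
    return mean_gap, mean_experienced


mean_gap, mean_experienced = simulate_inspection_paradox()
print(f"average gap from the operator's view: {mean_gap:.2f} min")
print(f"average gap from the riders' view:    {mean_experienced:.2f} min")
```

Under this assumed distribution the operator's average is close to 10 minutes, while riders experience roughly E[X²]/E[X] ≈ 10.83 minutes, so the riders' average is the larger of the two.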
Empirical rule and Chebyshev's theorem
Let's talk about a really simple but powerful concept: data distributions. A data distribution is an abstract concept (a function) that gives the possible values of the data and how often each value is generated. When you want to talk about all the data from your experiments at once, talk about the data distribution. It gives us the probability that a given value will be the output if we keep repeating the experiment.
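The two results named in the title can be compared numerically. The empirical rule says that for roughly normal data about 68%, 95% and 99.7% of values fall within 1, 2 and 3 standard deviations of the mean, while Chebyshev's theorem guarantees at least 1 - 1/k² within k standard deviations for any distribution. The sketch below uses simulated samples; the helper names and the choice of a normal and an exponential sample are assumptions made for illustration.

```python
import random
import statistics


def fraction_within(data, k):
    """Fraction of values within k population standard deviations of the mean."""
    mu = statistics.fmean(data)
    sigma = statistics.pstdev(data)
    return sum(abs(x - mu) <= k * sigma for x in data) / len(data)


def chebyshev_bound(k):
    """Chebyshev's lower bound on the fraction within k standard deviations."""
    return 1.0 - 1.0 / k**2


rng = random.Random(0)
# Simulated data for illustration: one bell-shaped sample, one skewed sample.
normal_sample = [rng.gauss(0.0, 1.0) for _ in range(50_000)]
skewed_sample = [rng.expovariate(1.0) for _ in range(50_000)]

for k in (1, 2, 3):
    print(
        f"k={k}: normal={fraction_within(normal_sample, k):.3f} "
        f"skewed={fraction_within(skewed_sample, k):.3f} "
        f"Chebyshev bound={chebyshev_bound(k):.3f}"
    )
```

The normal sample should track the empirical rule closely, while the skewed sample does not follow it but still stays above the Chebyshev bound, which is the weaker yet more general guarantee.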