Data Science

Data Dashboard for StockX Contest

StockX Data Contest 2019: The StockX Challenge is a call for data and sneaker nerds to have fun. Source: StockX. The basic idea is this: they give you a bunch of original StockX sneaker data, then you crunch the numbers and come up with the coolest, smartest, most compelling story you can tell. It can be literally anything you want: a theory, an insight, even just a really original data visualization.

Winona Area Public Schools: Community Contribution

Winona Area Public Schools Data Visualization. Introduction: This project addresses the need to communicate public school data to community members in a meaningful way, and to make the data available to the general public in a proper and usable format. There has been a wider discussion regarding the budget issue in Winona area schools. Here is the article. Primarily, this project focused on cleaning and visualizing the Enrollment, Expenditures, and Staffing History reports of the Winona Area Public Schools district (WAPS), available publicly through the Minnesota Department of Education. Data Center Link:http://education.

Animation: Internet Usage

How is the internet eating the world? Internet Usage animation. Internet usage is a World Bank development indicator. In this project I grabbed the World Bank dataset (linked below). Link to the Tableau worksheet

Testing Alcohol level

Is there really 5.4% alcohol in that beer brand? We all see that a lot of brands print on their labels that the alcohol level is 5.4%. Let’s say we collected the alcohol-by-volume percentages for those brands: we sampled randomly and measured the alcohol level ourselves. We are told the actual alcohol percentage should be 5.4%, but as beer consumers we sometimes feel it is not.
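One way to frame this question is a one-sample t-test of the sample mean against the claimed 5.4%. The excerpt does not say which test the post uses, so treat the sketch below as an assumption. The measurements are made up for illustration, and `one_sample_t` is a hypothetical helper name. The critical value 2.262 is the standard two-sided t table value for a 5% significance level with 9 degrees of freedom.

```python
import math
import statistics


def one_sample_t(sample, claimed_mean):
    """Return the t statistic for H0: population mean == claimed_mean."""
    n = len(sample)
    sample_mean = statistics.mean(sample)
    sample_sd = statistics.stdev(sample)      # sample standard deviation (n - 1)
    standard_error = sample_sd / math.sqrt(n)
    return (sample_mean - claimed_mean) / standard_error


# Hypothetical alcohol-by-volume measurements (in percent), NOT real data.
measured = [5.1, 5.3, 5.4, 5.2, 5.0, 5.5, 5.3, 5.2, 5.1, 5.4]
CLAIMED = 5.4
T_CRIT = 2.262  # two-sided, alpha = 0.05, df = len(measured) - 1 = 9

t_stat = one_sample_t(measured, CLAIMED)
reject = abs(t_stat) > T_CRIT

print(f"t statistic: {t_stat:.3f}")
print("Reject the 5.4% claim" if reject else "No evidence against the 5.4% claim")
```

With real measurements, a statistic whose absolute value exceeds the critical value would suggest the label does not match the average content. A smaller value means the sample is consistent with the claim.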

Verifying empirical rule and Chebyshev's theorem

Empirical rule and Chebyshev’s theorem. Let’s talk about a really simple but powerful concept: data distributions. A data distribution is an abstract concept (a function) that gives the possible values of the data and how often each value is generated. When you want to talk about all the data of your experiment at once, you talk about its data distribution. A data distribution gives us the probability of how often a given value will be the output if we keep repeating the experiment.
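Both results named in the title can be checked numerically. The sketch below uses simulated data, not the post's own dataset. It measures the fraction of values within k standard deviations of the mean and compares it with Chebyshev's lower bound of 1 - 1/k². The helper `fraction_within`, the sample sizes, and the choice of a normal and an exponential sample are assumptions made for illustration.

```python
import math
import random
import statistics


def fraction_within(data, k):
    """Fraction of values within k population standard deviations of the mean."""
    mu = statistics.fmean(data)
    sigma = math.sqrt(statistics.fmean((x - mu) ** 2 for x in data))
    inside = sum(1 for x in data if abs(x - mu) <= k * sigma)
    return inside / len(data)


random.seed(0)  # fixed seed so the run is repeatable
normal_data = [random.gauss(0, 1) for _ in range(50_000)]         # bell-shaped
skewed_data = [random.expovariate(1.0) for _ in range(50_000)]    # not bell-shaped

for k in (1, 2, 3):
    chebyshev_bound = max(0.0, 1 - 1 / k**2)
    print(
        f"k={k}: normal={fraction_within(normal_data, k):.3f}, "
        f"skewed={fraction_within(skewed_data, k):.3f}, "
        f"Chebyshev lower bound={chebyshev_bound:.3f}"
    )
```

The normal sample should land near the empirical rule's 68%, 95%, and 99.7%. The skewed sample need not match those figures, but it still stays at or above the Chebyshev bound, which holds for any distribution with a finite variance.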