Using natural language to predict Spam messages

Ever wonder how Google decides what goes in your spam folder? The answer is a little bit of language processing, an example of which is shown in this project. Here we take thousands of text messages and process them to come up with a model that can 

Movie rating predictor

MovieLens case study: predictive modeling with logistic regression. Tasks: import the three datasets; create a new dataset named “master_data” with the following columns (MovieID, Title, UserID, Age, Gender, Occupation, Rating); explore the datasets using visual representations (graphs or tables), also include your comments on the 

Leading causes of death in California

Do you live in Cali? Do you have heart disease, or are you genetically predisposed to it? You might want to stay away from that Double-Double at In-N-Out! This brief analysis plots the leading causes of death in 2012 for any state in America, 

Big Mart outlet sales prediction

Is there a relationship between item visibility and sales? Working with the Big Mart data was challenging and fun. I think this rich, full-bodied retail dataset demonstrates some fun aspects of human psychology, namely spatial habits of mind and spending patterns. With Big Mart you’re 

Predicting the cost of health care

Everyone feels the pinch of health care costs, even when they are subsidized by the government or an employer. With this dataset we can see how smoking, BMI, age and other factors relate to the actual cost of health care.