
Studying Association using AddHealth Dataset

This data analysis case study was a course assignment. The goal was to learn data analysis with Python libraries through a case-study approach: I had to choose a hypothesis and work through every step of the analysis, with the main focus on studying association.


I decided to work with the AddHealth codebook and study the association between social acceptance and self-esteem.


Libraries used:

Pandas

NumPy

Seaborn

Matplotlib.pyplot

SciPy

Statsmodels


I chose variables related to social acceptance and self-esteem. I then performed a literature review, studied the frequency distributions, carried out data munging and visualization, plotted the variables to discover associations, and finished with hypothesis testing.
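The steps above can be sketched roughly as follows. This is only an illustration, not the code from the assignment: the column names `social_acceptance` and `self_esteem`, the 1 to 5 response scale, the "high"/"low" cut-off and the synthetic data are all assumptions made for the example, and the chi-square test and Pearson correlation are just two common ways to test for association.

```python
import numpy as np
import pandas as pd
from scipy import stats

# Synthetic stand-in data. The column names and values are hypothetical;
# they are NOT real AddHealth variables or responses.
rng = np.random.default_rng(0)
n = 500
acceptance = rng.integers(1, 6, size=n)      # responses on an assumed 1..5 scale
noise = rng.integers(-1, 2, size=n)          # -1, 0 or 1
esteem = np.clip(acceptance + noise, 1, 5)   # built to be related to acceptance
df = pd.DataFrame({"social_acceptance": acceptance, "self_esteem": esteem})

# Frequency distribution of one variable
acceptance_counts = df["social_acceptance"].value_counts().sort_index()

# Data munging: collapse each scale into two categories (assumed cut-off at 4)
df["acceptance_group"] = np.where(df["social_acceptance"] >= 4, "high", "low")
df["esteem_group"] = np.where(df["self_esteem"] >= 4, "high", "low")

# Hypothesis test for two categorical variables: chi-square test of independence
table = pd.crosstab(df["acceptance_group"], df["esteem_group"])
chi2, p_value, dof, expected = stats.chi2_contingency(table)

# Hypothesis test for two quantitative variables: Pearson correlation
r, r_p_value = stats.pearsonr(df["social_acceptance"], df["self_esteem"])

print(acceptance_counts)
print(table)
print(f"chi2={chi2:.2f}, p={p_value:.4g}, dof={dof}")
print(f"pearson r={r:.2f}, p={r_p_value:.4g}")
```

A small p-value here would suggest rejecting the null hypothesis of no association. Because the data above are generated to be related, the result says nothing about the real AddHealth responses.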



Link to Blog


Code Link
