Descriptive Statistics - Exploratory Data Analysis | Coursera

Descriptive Statistics

Video placeholder

Loading...

Data Analysis with Python

4.7 (17,767 ratings)

|

430K Students Enrolled

Course 7 of 10 in the IBM Data Science Professional Certificate

Enroll for Free

Analyzing data with Python is an essential skill for Data Scientists and Data Analysts. This course will take you from the basics of data analysis with Python to building and evaluating data models. Topics covered include: - collecting and importing data - cleaning, preparing & formatting data - data frame manipulation - summarizing data - building machine learning regression models - model refinement - creating data pipelines You will learn how to import data from multiple sources, clean and wrangle data, perform exploratory data analysis (EDA), and create meaningful data visualizations. You will then predict future trends from data by developing linear, multiple, polynomial regression models & pipelines and learn how to evaluate them. In addition to video lectures you will learn and practice using hands-on labs and projects. You will work with several open source Python libraries, including Pandas and Numpy to load, manipulate, analyze, and visualize cool datasets. You will also work with scipy and scikit-learn, to build machine learning models and make predictions. If you choose to take this course and earn the Coursera course certificate, you will also earn an IBM digital badge.

Skills You'll Learn

Model Selection, Data Analysis, Python Programming, Data Visualization, Predictive Modelling

Reviews

4.7 (17,767 ratings)

5 stars
75.98%
4 stars
18.61%
3 stars
3.69%
2 stars
0.93%
1 star
0.76%

UA

Jul 28, 2020

AN excellent course. Hands-on training on the cloud makes an individual really involved. So far the best online course I have ever taken, and I have learned Python programming a lot from this course.

AB

Feb 12, 2020

Great introduction to data manipulation and analysis for common problems that arise in data science. Also allows you to gain a further understanding of Python syntax, specifically the pandas library.

From the lesson

Exploratory Data Analysis

In this module, you will learn what is meant by exploratory data analysis, and you will learn how to perform computations on the data to calculate basic descriptive statistical information, such as mean, median, mode, and quartile values, and use that information to better understand the distribution of the data. You will learn about putting your data into groups to help you visualize the data better, you will learn how to use the Pearson correlation method to compare two continuous numerical variables, and you will learn how to use the Chi-square test to find the association between two categorical variables and how to interpret them.

Exploratory Data Analysis1:28

Descriptive Statistics4:47

GroupBy in Python3:29

Correlation2:39

Correlation - Statistics2:45

Taught By

Joseph Santarcangelo
Ph.D., Data Scientist at IBM

Try the Course for Free

Explore our Catalog

Join for free and get personalized recommendations, updates and offers.