Data Science Specialization

Data Science Specialization

Launch Your Career in Data Science. A ten-course introduction to data science, developed and taught by leading professors.

Instructors: Roger D. Peng, PhD

499,943 already enrolled

Included with Coursera Plus

Learn more

10 course series

Get in-depth knowledge of a subject

4.5

(38,804 reviews)

Beginner level

Recommended experience

7 months

at 10 hours a week

Flexible schedule

Learn at your own pace

10 course series

Get in-depth knowledge of a subject

4.5

(38,804 reviews)

Beginner level

Recommended experience

7 months

at 10 hours a week

Flexible schedule

Learn at your own pace

What you'll learn

Use R to clean, analyze, and visualize data.
Navigate the entire data science pipeline from data acquisition to publication.
Use GitHub to manage data science projects.
Perform regression analysis, least squares and inference using regression models.

Skills you'll gain

Details to know

Shareable certificate

Add to your LinkedIn profile

Taught in English

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Advance your subject-matter expertise

Learn in-demand skills from university and industry experts
Master a subject or tool with hands-on projects
Develop a deep understanding of key concepts
Earn a career certificate from Johns Hopkins University

Specialization - 10 course series

Ask the right questions, manipulate data sets, and create visualizations to communicate results.

This Specialization covers the concepts and tools you'll need throughout the entire data science pipeline, from asking the right kinds of questions to making inferences and publishing results. In the final Capstone Project, you’ll apply the skills learned by building a data product using real-world data. At completion, students will have a portfolio demonstrating their mastery of the material.

The Data Scientist’s Toolbox

Course 117 hours

What you'll learn

Set up R, R-Studio, Github and other useful tools
Understand the data, problems, and tools that data analysts use
Explain essential study design concepts
Create a Github repository

Skills you'll gain

Category: Sampling (Statistics)

Category: Data Analysis

Category: Probability

Category: Probability & Statistics

Category: Probability Distribution

Category: Sample Size Determination

Category: Statistical Modeling

Category: Bayesian Statistics

Category: Statistical Inference

Category: Statistical Analysis

Category: Statistical Hypothesis Testing

Category: Statistical Methods

Category: Statistics

R Programming

Course 257 hours

What you'll learn

Understand critical programming language concepts
Configure statistical programming software
Make use of R loop functions and debugging tools
Collect detailed information using R profiler

Skills you'll gain

Category: Supervised Learning

Category: Data Processing

Category: Regression Analysis

Category: Classification And Regression Tree (CART)

Category: Machine Learning

Category: Data Collection

Category: Decision Tree Learning

Category: Predictive Modeling

Category: Feature Engineering

Category: Random Forest Algorithm

Category: Statistical Machine Learning

Category: Applied Machine Learning

Category: Machine Learning Algorithms

Category: R Programming

Getting and Cleaning Data

Course 319 hours

What you'll learn

Understand common data storage systems
Apply data cleaning basics to make data "tidy"
Use R for text and date manipulation
Obtain usable data from the web, APIs, and databases

Skills you'll gain

Category: Development Environment

Category: Rmarkdown

Category: Github

Category: Data Analysis

Category: Data Science

Category: Version Control

Category: Git (Version Control System)

Category: Integrated Development Environments

Category: Statistical Programming

Category: Big Data

Category: Software Installation

Category: R Programming

Exploratory Data Analysis

Course 454 hours

What you'll learn

Understand analytic graphics and the base plotting system in R
Use advanced graphing systems such as the Lattice system
Make graphical displays of very high dimensional data
Apply cluster analysis techniques to locate patterns in data

Skills you'll gain

Category: GitHub

Category: User Interface (UI)

Category: Leaflet (Software)

Category: Shiny (R Package)

Category: Hypertext Markup Language (HTML)

Category: Statistical Reporting

Category: Plotly

Category: Web Applications

Category: Rmarkdown

Category: Data Presentation

Category: Package and Software Management

Category: Interactive Data Visualization

Category: Data Visualization

Category: Data Visualization Software

Category: R Programming

Reproducible Research

Course 57 hours

What you'll learn

Organize data analysis to help make it more reproducible
Write up a reproducible data analysis using knitr
Determine the reproducibility of analysis project
Publish reproducible web documents using Markdown

Skills you'll gain

Category: Data Cleansing

Category: Data Presentation

Category: Data Analysis

Category: Exploratory Data Analysis

Category: Natural Language Processing

Category: Data Science

Category: Data Manipulation

Category: Predictive Modeling

Category: Data Storytelling

Category: Statistical Analysis

Category: Machine Learning

Category: Data Collection

Category: R Programming

Statistical Inference

Course 654 hours

What you'll learn

Understand the process of drawing conclusions about populations or scientific truths from data
Describe variability, distributions, limits, and confidence intervals
Use p-values, confidence intervals, and permutation tests
Make informed data analysis decisions

Skills you'll gain

Category: Rmarkdown

Category: Data Analysis

Category: Knitr

Category: Data Validation

Category: Statistical Reporting

Category: Statistical Analysis

Category: Verification And Validation

Category: Technical Documentation

Category: Data Sharing

Category: R Programming

Regression Models

Course 753 hours

What you'll learn

Use regression analysis, least squares and inference
Understand ANOVA and ANCOVA model cases
Investigate analysis of residuals and variability
Describe novel uses of regression models such as scatterplot smoothing

Skills you'll gain

Category: Program Development

Category: Data Analysis

Category: Performance Tuning

Category: Debugging

Category: Computer Programming Tools

Category: Data Structures

Category: Statistical Programming

Category: Statistical Analysis

Category: Data Import/Export

Category: Simulations

Category: R Programming

Practical Machine Learning

Course 88 hours

What you'll learn

Use the basic components of building and applying prediction functions
Understand concepts such as training and tests sets, overfitting, and error rates
Describe machine learning methods such as regression or classification trees
Explain the complete process of building prediction functions

Skills you'll gain

Category: Data Analysis

Category: Scatter Plots

Category: Box Plots

Category: Color Theory

Category: Unsupervised Learning

Category: Histogram

Category: Dimensionality Reduction

Category: Exploratory Data Analysis

Category: Ggplot2

Category: Graphing

Category: Plot (Graphics)

Category: Data Visualization

Category: Statistical Analysis

Category: Data Visualization Software

Category: R Programming

Developing Data Products

Course 910 hours

What you'll learn

Develop basic applications and interactive graphics using GoogleVis
Use Leaflet to create interactive annotated maps
Build an R Markdown presentation that includes a data visualization
Create a data product that tells a story to a mass audience

Skills you'll gain

Category: Web Scraping

Category: Data Management

Category: Data Manipulation

Category: File Management

Category: Data Collection

Category: Data Import/Export

Category: SQL

Category: Data Cleansing

Category: Data Quality

Category: Exploratory Data Analysis

Category: Data Access

Category: MySQL

Category: Data Transformation

Category: Application Programming Interface (API)

Category: Data Integration

Category: Data Wrangling

Category: R Programming

Data Science Capstone

Course 105 hours

What you'll learn

Create a useful data product for the public
Apply your exploratory data analysis skills
Build an efficient and accurate prediction model
Produce a presentation deck to showcase your findings

Skills you'll gain

Category: Data Science

Category: Predictive Modeling

Category: Regression Analysis

Category: Probability & Statistics

Category: Statistical Modeling

Category: Statistical Inference

Category: Statistical Analysis

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Roger D. Peng, PhD

Johns Hopkins University

37 Courses1,642,560 learners

Brian Caffo, PhD

Johns Hopkins University

30 Courses1,669,970 learners

Jeff Leek, PhD

Johns Hopkins University

32 Courses1,703,174 learners

Offered by

Johns Hopkins University

Industry partners

Why people choose Coursera for their career

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

New to Data Analysis? Start here.

Open new doors with Coursera Plus

Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Learn more

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Explore degrees

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy

Learn more

Frequently asked questions

Time to completion can vary based on your schedule, but most learners are able to complete the Specialization in 3-6 months.

Each course in the Specialization is offered monthly.

Some programming experience (in any language) is recommended. We also suggest a working knowledge of mathematics up to algebra (neither calculus or linear algebra are required).

Yes! To get started, click the course card that interests you and enroll. You can enroll and complete the course to earn a shareable certificate, or you can audit it to view the course materials for free. When you subscribe to a course that is part of a Specialization, you’re automatically subscribed to the full Specialization. Visit your learner dashboard to track your progress.

Data Science Specialization

What you'll learn

Skills you'll gain

Details to know

See how employees at top companies are mastering in-demand skills

Advance your subject-matter expertise

Specialization - 10 course series

What you'll learn

Skills you'll gain

What you'll learn

Skills you'll gain

What you'll learn

Skills you'll gain

What you'll learn

Skills you'll gain

What you'll learn

Skills you'll gain

What you'll learn

Skills you'll gain

What you'll learn

Skills you'll gain

What you'll learn

Skills you'll gain

What you'll learn

Skills you'll gain

What you'll learn

Skills you'll gain

Earn a career certificate

Instructors

Offered by

Industry partners

Why people choose Coursera for their career

New to Data Analysis? Start here.

Open new doors with Coursera Plus

Advance your career with an online degree

Join over 3,400 global companies that choose Coursera for Business

Frequently asked questions

How long does it take to complete the Specialization?

How often is each course in the Specialization offered?

What background knowledge is necessary?

More questions