This course provides an introduction of some important concepts and tools on a very important aspect of data science: cleaning and organizing data before any analysis. A must for any data scientist.
Easy, mostly instructive Course. The Assignments and quizzes are quite good, and illustrates the lessons very well.\n\nSee the videos for general presentation, but use the energy on the excersizes.
par Marcelo S•
There is a lot of room for improvement. In an ironic twist, since the course is about "cleaning data," we are left to our own devices figuring out a lot of this very outdated material, broken links, codes that don't work, etc, so we have to google and search StackOverflow and forums to fill in the gaps and create a better course. I was subsequently asked to be a Mentor in the course, but I would rather the author of the course revise it, instead of having us work for free trying to help people get through outdated material. All the help is in the discussion forums already anyway, so I'm not sure why they need more Mentors. The saving grace of this course is that you will learn, if you are desperate to learn, and it is part of a greater Specialization that is worth your time.
par Marc F•
I believe this course suffers from neglect. Rarely did I see any of the mentors participating in the group discussions even though there were plenty of questions. Furthermore, some of the quiz questons seemed incomplete or confusing. The project was no better. I feel like the course was recorded a few years ago, and not much done after that to fix flaws, even though they are probably well known. The material is useful, but it would be nice to have a set of notes or a text to go with the lectures. You will spend a lot of time searching the internet to compelte the assignments. Sometimes that is good, but other times a guide geared to the course would have been better.
par Thaer Z•
I am done with this course. every week is the same thing. the lectures are a long list of references to other references. The quiz questions can not be answered without spending hours troubleshooting RStudio or searching the forum for help and hints to find out why the loaded packages or functions are not found. The quiz recommends to load packages that don't work or have dependencies that are no longer valid. I wanted to take this specialization to learn new data analysis techniques. if I wanted to spend my time searching the internet for answers I can do that without paying monthly fees. Good luck everyone. I am done. I will try a different course or field of interest.
par Greg R•
The methodology of getting and cleaning data was good but the course materials were lacking and really outdated. Some of the material is 5+ years old and reference deprecated packages and functions or includes links to sites that have been long updated or no longer exist. I found myself spending a lot of time doing my own research on what packages to use. There is value in that.
The quizzes and assignments cover good topics but the instructions are pretty unclear as to what the ask actually is. It takes a lot of independent research and combing through the forums to gain clarity. It is very time consuming.
par Willie C•
Not a great course. The lecture videos were dull and not very informative, and did not do a good job of preparing you for the quizzes at the end of each week. The lecture videos mentioned and linked to a number of external resources, but you couldn't click on the links through the videos, so that wasn't useful. The forums were much more helpful than the lecture videos when it came to teaching you what you needed to know. I understand why a course like this is essential to the Data Science specialization, but I feel like this content could've been covered in a much more engaging and instructive manner.
par Matt B•
Have to say, very disappointed when comparing this to the first course. The first course teaches you the concepts and the quizzes/projects give you a great environment to learn new concepts while proving knowledge of the previous ones. This course so far has 20is minutes of videos per week that teach you 60% of what you need for the quizzes, especially true for the second week. Save time and use another resource for learning about APIs and other data resources.
par Lyn S•
Not bad, but certainly not good. I cannot believe there is a style of teaching where you never get to see the best way to do something. I can slog thru the programming, but I doubt it's the best way to do something, but I never get to see how something should have been done. It's odd we have no feedback from prof and just 'grading' from other students who also are slogging thru without ever seeing the best or even some good ways to have done something.
par Alex F•
The content on downloading files needs to be explained much better. Including more practice with the different file types would have been great. Also needs an demonstration and lecture on what makes a good codebook and readme file. The content with dplyr was really well done though. For something so important in data science I would expect this course to have been done so much better.
par ALEXEY P•
The instructor cares very little about the ability of his students to keep up with his explanations. The pace at which the material is presented is horrible, the amount of details is just the bare minimum. I do not think it would be too much work for the instructor to double or maybe even triple the length of the course videos. But he just does not seem to care.
par Valentin D•
Instructor reads lectures in monotonic voice. The lectures themselves are just a series of cases of some R functions usage with no basics of Why you need to clean the data or real cases with complete examples how and where to get your data and what steps you can do to make it useful.
The course has a lot of links for tutorials in R. That's a plus.
par Shawn L•
The project at the end requires actions that data scientists should know but does not actually talk about the items. For example the project "book". You hear about it but are not actually taught the right way to make one. At best case you are taking a guess and at worst you are learning bad habits or missing out on what should be in it.
par Chris M•
Didn't really cover how to deal with messy data, e.g. if you need to join to datasets and have orphans, or you have no foreign keys between two datasets and you need to use fuzzy matching.
Basic validation was also not covered (i.e. making sure that your data covers all that you expect).
par Jason R H M•
The explication in every lesson is really bad, and the exercise need more thigs that they explain, you must search the most of the tools in the course, if they make some videos or examples with all tools in the program, maybe can be better but in this moment is not good course
par Jonathan O•
I saw two main issues with this course: 1) dated lecture videos, oftentimes with R code that can't be replicated using up-to-date packages, and 2) lack of thoughtful design: example after example after example after example doesn't really teach you anything.
par James O•
The class is getting stale. The instructors didn't respond to questions on the discussion forums about quiz items, the majority of assessment items seem to be available on Google and 50% of the peer reviewed assessment I checked used plagiarized solutions.
par Izabela L•
The code for the final assignment is peer reviewed which doesn't make sense. It should be reviewed by either a TA or some kind of application than can verify what you've done. Also, the assignments were a bit of a leap from the video tutorials at times.
par Daan v d V•
Although this course is on a very interesting topic, it is quite outdated. Its lectures and examples are quite outdated; some web scraping examples are incompatible or don't exist anymore, and the described techniques are mostly (outdated) R libraries.
par Stephen S•
The videos did not teach anything that was going to be on the quiz so it was like answering 5 questions at random using google. The lesson plans and project were very vague and too much time was spent trying to figure out what was even being asked.
par Shashank M•
This is a very crucial part of the data science specialisation and I feel more hands-on exercises and quizzes should have been there. Small practice quizzes for testing incremental learning within a week should be there.
par Eduardo S B•
In my opinion the structure of the course is not the best. I mainly dislike the fact that some libraries, packages, etc. (e.g. MySQL) are not trivial to install.
Still I learnt quite a lot, so I wouldn't say it's bad.
par le M N•
the instructor of this course, unlike the other 1, is quite unclear about what needed to be done. a lots of the packages of the course are not up to date.
more quiz and exercise would be highly beneficial
par Jason Y•
Mediocre presentation of tidy data, which is probably the most critical topic. Otherwise, its mostly just walking through what commands to use in R to load in various file formats.
par Patti M•
This class needs more content, more explanation. It is clearly a very important aspect of Data Science, but the assignments were more complex than the given course content.
par Sheila B•
I learned a lot but my usually happy & grateful attitude was sorely challenged by the fact that so many facts in the videos and obvious course material was, well, wrong.
par James K•
Out of date material. Many links broken. Some of the functions taught are sunset. Week 2 was too surface level to do anything useful. Weeks 3 and 4 better than week 2.