Chevron Left
Retour à Big Data Integration and Processing

Big Data Integration and Processing, Université de Californie à San Diego

4.4
1,290 notes
279 avis

À propos de ce cours

At the end of the course, you will be able to: *Retrieve data from example database and big data management systems *Describe the connections between data management operations and the big data processing patterns needed to utilize them in large-scale analytical applications *Identify when a big data problem needs data integration *Execute simple big data integration and processing on Hadoop and Spark platforms This course is for those new to data science. Completion of Intro to Big Data is recommended. No prior programming experience is needed, although the ability to install applications and utilize a virtual machine is necessary to complete the hands-on assignments. Refer to the specialization technical requirements for complete hardware and software specifications. Hardware Requirements: (A) Quad Core Processor (VT-x or AMD-V support recommended), 64-bit; (B) 8 GB RAM; (C) 20 GB disk free. How to find your hardware information: (Windows): Open System by clicking the Start button, right-clicking Computer, and then clicking Properties; (Mac): Open Overview by clicking on the Apple menu and clicking “About This Mac.” Most computers with 8 GB RAM purchased in the last 3 years will meet the minimum requirements.You will need a high speed internet connection because you will be downloading files up to 4 Gb in size. Software Requirements: This course relies on several open-source software tools, including Apache Hadoop. All required software can be downloaded and installed free of charge (except for data charges from your internet provider). Software requirements include: Windows 7+, Mac OS X 10.10+, Ubuntu 14.04+ or CentOS 6+ VirtualBox 5+....

Meilleurs avis

par AA

Mar 06, 2018

It was a good course, it could have been better if some examples of Spark were also provided in other Languages like Java, people without having background of python may find it difficult.

par DC

Oct 08, 2017

Very Interactive course. Theatrical classes are nicely drafted. Hands On exercises are interesting and some are challenging too. Overall very interesting course. Happy learning

Filtrer par :

267 avis

par Ievgenii Martynenko

Feb 07, 2019

Final assignment is not working properly. Whatever you choose - you are right.

There is no enough information how to dump tweets into file and how to use that in assignment.

This course doesn't worth any cent and should be either reworked or excluded.

par AJIT MENTA

Feb 05, 2019

Great Course! Provides a good exposure to the tools and utilities. The database part has been done well. The part with the Spark needs some more info on how to use the data frames.

par Mahamat Nour Ali Mai

Jan 31, 2019

it's really useful course for data integration. and you will understand the basic of data integration and processing which is really important part of big data as well.

par Gustavo Ide Maciel

Jan 20, 2019

Too easy until week 5. And in the week 6 disproportionate evaluations

par vasudha kirthi

Jan 16, 2019

amazing course great assignments

par Hector Grande Ráez

Jan 10, 2019

Pretty well course

par Zeinab Takbiri

Dec 30, 2018

Very good explanation of Spark layers and processes and how it differs from MapReduce. Thank you.

par Prashant N Negandhi

Dec 28, 2018

This course was very informative and provided some very good hands on exercises

par To Phung Huy

Dec 24, 2018

Too many software issues/installation bugs hampering the learning process. The setup procedures for every quiz takes up around 80% of the time and only 20% actually answering the quiz. Please reduce the number of quiz or consolidate them for learners do that we only need to do setup once. Mentor/Instructor presence in various discussions in which students encounter setup/installation issues are next to full absence and many sudents are left figuring out the problems themselves

par Jorge Viera

Dec 23, 2018

This has been one of the most exciting courses I've done. The final project makes a good job on making you apply a Big Data Processing Pipeline to solve a common task these days with SparkSQL: analyzing data on social media.