Big Data - Capstone Project
This course is part of Big Data Specialization
Instructors: Ilkay Altintas +1 more
Instructor ratings
We asked all learners to give feedback on our instructors based on the quality of their teaching style.
Skills you'll gain
There are 7 modules in this course
Welcome to the Capstone Project for Big Data! In this culminating project, you will build a big data ecosystem using tools and methods form the earlier courses in this specialization. You will analyze a data set simulating big data generated from a large number of users who are playing our imaginary game "Catch the Pink Flamingo". During the five week Capstone Project, you will walk through the typical big data science steps for acquiring, exploring, preparing, analyzing, and reporting. In the first two weeks, we will introduce you to the data set and guide you through some exploratory analysis using tools such as Splunk and Open Office. Then we will move into more challenging big data problems requiring the more advanced tools you have learned including KNIME, Spark's MLLib and Gephi. Finally, during the fifth and final week, we will show you how to bring it all together to create engaging and compelling reports and slide presentations. As a result of our collaboration with Splunk, a software company focus on analyzing machine-generated big data, learners with the top projects will be eligible to present to Splunk and meet Splunk recruiters and engineering leadership.
Acquiring, Exploring, and Preparing the Data
Data Classification with KNIME
Clustering with Spark
Graph Analytics of Simulated Chat Data With Neo4j
Reporting and Presenting Your Work
Final Submission
Explore more from Data Analysis
©2025 ementorhub.com. All rights reserved