
INDIA : +91 9999 608 499

 

Email Us :  sales@eduvantage.co.in



BIG DATA & HADOOP 

Become a Big Data & Hadoop Expert by mastering MapReduce, YARN, Pig, Hive, HBase, Oozie, Flume and Sqoop while working on industry-based use cases and projects.


About The Course:

The Big Data and Hadoop training course is designed to provide the knowledge and skills needed to become a successful Hadoop Developer. The course covers core concepts in depth, along with their implementation on varied industry use cases.

 

Course Objectives

At the end of the course, participants should be able to: 

1. Master the concepts of HDFS and MapReduce framework 

2. Understand Hadoop 2.x Architecture 

3. Set up a Hadoop cluster and write complex MapReduce programs

4. Learn the data loading techniques using Sqoop and Flume 

5. Perform Data Analytics using Pig, Hive and YARN 

6. Implement HBase and MapReduce Integration 

7. Implement Advanced Usage and Indexing 

8. Schedule jobs using Oozie 

9. Implement best practices for Hadoop development

10. Work on a Real Life Project on Big Data Analytics

Module 1. Understanding Big Data and Hadoop

Learning Objectives - In this module, you will understand Big Data, the limitations of the existing solutions to the Big Data problem, how Hadoop solves the Big Data problem, the common Hadoop ecosystem components, Hadoop Architecture, HDFS, the Anatomy of File Write and Read, and Rack Awareness.


Topics -  Big Data, Limitations and Solutions of existing Data Analytics Architecture, Hadoop, Hadoop Features, Hadoop Ecosystem, Hadoop 2.x core components, Hadoop Storage: HDFS, Hadoop Processing: MapReduce Framework, Anatomy of File Write and Read, Rack Awareness.

Module 2. Hadoop Architecture and HDFS

Learning Objectives - In this module, you will understand Hadoop MapReduce framework and the working of MapReduce on data stored in HDFS. You will learn about YARN concepts in MapReduce.

 

Topics - MapReduce Use Cases, Traditional way Vs MapReduce way, Why MapReduce, Hadoop 2.x MapReduce Architecture, Hadoop 2.x MapReduce Components, YARN MR Application Execution Flow, YARN Workflow, Anatomy of MapReduce Program, Demo on MapReduce.

Module 3. Hadoop MapReduce Framework - I

Learning Objectives - In this module, you will understand Hadoop MapReduce framework and the working of MapReduce on data stored in HDFS. You will learn about YARN concepts in MapReduce.

 

Topics - MapReduce Use Cases, Traditional way Vs MapReduce way, Why MapReduce, Hadoop 2.x MapReduce Architecture, Hadoop 2.x MapReduce Components, YARN MR Application Execution Flow, YARN Workflow, Anatomy of MapReduce Program, Demo on MapReduce.
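To make the "Anatomy of MapReduce Program" topic concrete, here is a minimal sketch of the map, shuffle and reduce steps for a word count. It is written in plain Python rather than against the Hadoop API, and every function name in it (map_phase, shuffle_phase, reduce_phase, word_count) is a hypothetical name chosen for illustration, not part of the course material.

```python
from collections import defaultdict


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group emitted values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reducer: sum the counts collected for one word."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in sequence on an in-memory list of lines."""
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())
```

In a real Hadoop job the mapper and reducer run on different nodes and the framework performs the shuffle; the sketch only mirrors the data flow on a single machine.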

Module 4. Hadoop MapReduce Framework - II

Learning Objectives - In this module, you will understand concepts like Input Splits in MapReduce, Combiner & Partitioner and Demos on MapReduce using different data sets.

 

Topics - Input Splits, Relation between Input Splits and HDFS Blocks, MapReduce Job Submission Flow, Demo of Input Splits, MapReduce: Combiner & Partitioner, Demo on de-identifying Health Care Data set, Demo on Weather Data set.
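The roles of the Combiner and the Partitioner can be sketched in a few lines. The example below is a plain-Python simulation under stated assumptions, not Hadoop code: combine, partition and run_job are hypothetical names, and CRC32 stands in for the key hash so that the result is stable across runs.

```python
import zlib
from collections import Counter, defaultdict


def combine(mapper_output):
    """Combiner: pre-aggregate (word, count) pairs on the mapper side to shrink shuffle traffic."""
    local = Counter()
    for word, count in mapper_output:
        local[word] += count
    return list(local.items())


def partition(key, num_reducers):
    """Partitioner: choose a reducer for a key using a stable hash (CRC32 is an assumption)."""
    return zlib.crc32(key.encode("utf-8")) % num_reducers


def run_job(mapper_outputs, num_reducers):
    """Combine each mapper's output, then route every key to exactly one reducer."""
    reducers = [defaultdict(int) for _ in range(num_reducers)]
    for output in mapper_outputs:
        for word, count in combine(output):
            reducers[partition(word, num_reducers)][word] += count
    return [dict(reducer) for reducer in reducers]
```

Because the partition depends only on the key, all counts for a given word arrive at the same reducer, which is what lets each reducer produce a final total independently.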

Module 5. Advanced MapReduce

Learning Objectives - In this module, you will learn Advanced MapReduce concepts such as Counters, Distributed Cache, MRunit, Reduce Join, Custom Input Format, Sequence Input Format and how to deal with complex MapReduce programs.

 

Topics - Counters, Distributed Cache, MRunit, Reduce Join, Custom Input Format, Sequence Input Format.
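As an illustration of the Reduce Join topic, the sketch below joins two small in-memory datasets the way a reduce-side join does: records are tagged with their source, grouped by the join key, and paired in the reduce step. It is plain Python, not Hadoop code, and the function name, the tags "C" and "O", and the customer/order fields are all hypothetical.

```python
from collections import defaultdict


def reduce_side_join(customers, orders):
    """Inner-join (id, name) records with (id, amount) records on id."""
    grouped = defaultdict(lambda: {"C": [], "O": []})
    # Map phase: tag each record with its source so the reducer can tell them apart.
    for cust_id, name in customers:
        grouped[cust_id]["C"].append(name)
    for cust_id, amount in orders:
        grouped[cust_id]["O"].append(amount)
    # Reduce phase: for each key, pair every customer record with every order record.
    joined = []
    for cust_id in sorted(grouped):
        for name in grouped[cust_id]["C"]:
            for amount in grouped[cust_id]["O"]:
                joined.append((cust_id, name, amount))
    return joined
```

Keys that appear in only one of the two datasets produce no output, so the sketch behaves as an inner join.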

Module 6. Pig

Learning Objectives - In this module, you will learn about Pig, the types of use cases where Pig can be used, the tight coupling between Pig and MapReduce, and Pig Latin scripting.

 

Topics - About Pig, MapReduce Vs Pig, Pig Use Cases, Programming Structure in Pig, Pig Running Modes, Pig components, Pig Execution, Pig Latin Program, Data Models in Pig, Pig Data Types.

Pig Latin : Relational Operators, File Loaders, Group Operator, COGROUP Operator, Joins and COGROUP, Union, Diagnostic Operators, Pig UDF, Pig Demo on Healthcare Data set.

Module 7. Hive

Learning Objectives - This module will help you in understanding Hive concepts, Loading and Querying Data in Hive and Hive UDF. 

 

Topics - Hive Background, Hive Use Case, About Hive, Hive Vs Pig, Hive Architecture and Components, Metastore in Hive, Limitations of Hive, Comparison with Traditional Database, Hive Data Types and Data Models, Partitions and Buckets, Hive Tables(Managed Tables and External Tables), Importing Data, Querying Data, Managing Outputs, Hive Script, Hive UDF, Hive Demo on Healthcare Data set.

Module 8. Advanced Hive and HBase

Learning Objectives - In this module, you will understand Advanced Hive concepts such as UDFs and Dynamic Partitioning. You will also acquire in-depth knowledge of HBase, the HBase Architecture and its components.

 

Topics - Hive QL: Joining Tables, Dynamic Partitioning, Custom Map/Reduce Scripts, Hive : Thrift Server, User Defined Functions.

HBase: Introduction to NoSQL Databases and HBase, HBase v/s RDBMS, HBase Components, HBase Architecture, HBase Cluster Deployment.

Module 9. Advanced HBase

Learning Objectives - This module will cover Advanced HBase concepts. We will see demos on Bulk Loading and Filters. You will also learn what ZooKeeper is, how it helps in monitoring a cluster, and why HBase uses ZooKeeper.

 

Topics - HBase Data Model, HBase Shell, HBase Client API, Data Loading Techniques, ZooKeeper Data Model, ZooKeeper Service, ZooKeeper, Demos on Bulk Loading, Getting and Inserting Data, Filters in HBase.

Module 10. Oozie and Hadoop Project

Learning Objectives - In this module, you will understand how multiple Hadoop ecosystem components work together in a Hadoop implementation to solve Big Data problems. We will discuss multiple data sets and the specifications of the project. This module will also cover a Flume & Sqoop demo and the Apache Oozie Workflow Scheduler for Hadoop jobs.

 

Topics - Flume and Sqoop Demo, Oozie, Oozie Components, Oozie Workflow, Scheduling with Oozie, Demo on Oozie Workflow, Oozie Co-ordinator, Oozie Commands, Oozie Web Console, Hadoop Project Demo.


Course Curriculum

EduVantage Certification:

EduVantage certifies that you have successfully completed its skill assessment course. At the end of your course, you will receive a problem statement and work on a real-time project.

Once you successfully complete the project, which will be reviewed by an expert, you will be awarded a certificate with a performance-based grading.

If your project is not approved on the first attempt, you can get extra assistance with any of your doubts to understand the concepts better and reattempt the project free of cost.

BIG DATA & HADOOP FAQs

Why learn Big Data & Hadoop?

A study by Forrester predicts that CIOs who are late to the Hadoop game will finally make the platform a priority in 2015. Hadoop has evolved into a must-know technology and has been a reason for better career, salary and job opportunities for many professionals.

Who should attend?

Predictions say 2015 will be the year Hadoop finally becomes a cornerstone of your business technology agenda. To stay ahead in the game, Hadoop has become a must-know technology for the following professionals:

1. Analytics Professionals 

2. BI /ETL/DW Professionals 

3. Project Managers

4. Testing Professionals 

5. Mainframe Professionals 

6. Software Developers and Architects 

7. Graduates aiming to build a career in Big Data

What are the pre-requisites for this Course?

Knowledge of core Java concepts is the prerequisite for this course.

Who are the Instructors?

All our instructors are working professionals from the industry and have at least 10-12 years of relevant experience in their respective domains. They are subject matter experts and are trained by Eduvantage to deliver training so that participants get a great learning experience.

What if I miss a class?

If the same session is available in other classes in the next few weeks, you can be accommodated in another class, depending on the seats available in that class.

I have paid the enrolment fee but I am unable to continue in the present batch. Can I reschedule it to a future date?

Yes, you can do that, subject to the availability of seats in the upcoming batches.

Do you provide placement assistance?

Eduvantage is a startup education company, and many recruitment firms contact us for our students' profiles from time to time. Since there is a big demand for this skill, we help our certified students get connected to prospective employers. We also help our customers prepare their resumes, work on real-life projects and prepare for interviews. Having said that, please understand that we don't guarantee any placements. However, if you go through the course diligently and complete the project, you will have a very good chance of securing a job of your choice.

Can I attend a Demo Session?

We have a limited number of participants in a live session to maintain quality standards, so participation in a live class without enrollment is not possible. However, you can go through the sample class recording, which will give you a clear insight into how the classes are conducted, the quality of the instructors and the level of interaction in the class.

 

What are the payment options?

You can pay by Credit Card, Debit Card or NetBanking from all the leading banks. We use the CCAvenue Payment Gateway. For USD payments, you can pay by PayPal. We also have EMI options available.

Is the course material available to the students even after the course training is over?

Yes, the course material is provided permanently to the students and they can refer to it whenever they need. We also provide you free access to relevant online modules.

What if I have queries after I complete this course?

Our instructors will be available to you by phone, and you can also meet them in person at any time, even after you have completed the course.

What if I have more queries?

You can call us at +91 9999 608 499 or email us at sales@eduvantage.co.in



BIG DATA & HADOOP Course Features

Online Training

The training will comprise 10 live modules of 3 hours each and will be delivered by industry experts.

Assignment

Assignments will be an integral part of the assessment.

Certification

After completing the course, and based on your Project Report, you will receive the EduVantage Certification of Qualification. You will also be graded based on your performance in the course.

Project

Aspirants will be evaluated on their performance in the project, which will be given at the end of the training capsule.

24 x 7 Support

Aspirants will get access to the support team (available 24 x 7) to resolve queries during and after the course.

Lifetime Access

Aspirants will be given free lifetime access to our training resources.

