Hadoop And Spark


27-28 FEBRUARY 2016

About The Course

The Big Data, Hadoop and Spark training course from Tarah Technologies is designed to enhance your knowledge and skills to become a successful Hadoop and Spark developer. The course covers core concepts in depth, along with their implementation on varied industry use cases.

Course Contents

  1. Introduction to Scala for Apache Spark
  2. OOPS and Functional Programming in Scala
  3. Introduction to Big Data and Apache Spark
  4. Spark Common Operations
  5. Playing with RDDs
  6. Spark Streaming and MLlib
  7. GraphX, SparkSQL and Performance Tuning in Spark
  8. A complete project on Apache Spark
  9. Introduction to Big Data and Hadoop
    1. Big Data Introduction and Relevance
    2. History of Google’s Paper on MapReduce and Big Table
    3. Introduction to Hadoop
    4. Hadoop Eco-System
    5. Problems with Hadoop and New Framework
  10. Hadoop Distributed File System
    1. Introduction to Distributed File System and Problem with RAID
    2. HDFS Overview and Architecture
    3. HDFS File System Shell
  11. Introduction to Map Reduce
    1. Map and Reduce Paradigm
    2. Overview of Map Reduce in Hadoop
    3. Task Tracker and Job Tracker
    4. Job Configuration, Job Submission and Input Output Formats
  12. Hadoop v 2.X
    1. Problems with Hadoop v 1.X
    2. Container System and Scheduler
    3. YARN Introduction
    4. How Jobs can be configured on YARN
    5. Tez an Introduction

Who should go for this course?

Today, Hadoop and Spark have become cornerstone skills for business technology professionals. To stay ahead in the game, they are must-know technologies for the following professionals:

  1. Analytics professionals
  2. BI/ETL/DW professionals
  3. Project managers
  4. Testing professionals
  5. Mainframe professionals
  6. Software developers and architects
  7. Graduates aiming to build a successful career around Big Data

Why learn Big Data and Hadoop?

CIOs are making Hadoop and Spark their platforms of choice in 2015. For better career prospects, bigger job opportunities and financial growth, Hadoop and Spark are must-know technologies.

What are the pre-requisites for this Course?

You can master Hadoop and Spark, irrespective of your IT background. While basic knowledge of Core Java and SQL might help, it is not a pre-requisite for learning Hadoop.

1. A complete project on Apache Spark

Learning Objectives – In this module, you will work on a live Spark project, apply what you learned in the previous modules hands-on, and solve a real-time use case.

Topics – Introduction to Scala for Apache Spark, OOPS and Functional Programming in Scala, Introduction to Big Data and Apache Spark, Spark Common Operations, Playing with RDDs, Spark Streaming and MLlib, GraphX, SparkSQL and Performance Tuning in Spark

2. Introduction to Big Data and Hadoop

Learning Objectives – In this module, you will understand Big Data, the limitations of existing solutions to the Big Data problem, how Hadoop solves that problem, and the common Hadoop ecosystem components.

Topics – Big Data Introduction and Relevance, History of Google’s Paper on MapReduce and Big Table, Introduction to Hadoop, Hadoop Eco-System, Problems with Hadoop and New Framework

3. Hadoop Distributed File System

Learning Objectives – In this module, you will learn about the Hadoop Distributed File System, its overview and architecture, and the important configuration files in a Hadoop cluster.

Topics – Introduction to Distributed File System and Problem with RAID, HDFS Overview and Architecture, HDFS File System Shell

4. Introduction to Map Reduce

Learning Objectives – In this module, you will understand the Hadoop MapReduce framework and how MapReduce works on data stored in HDFS.

Topics – Map and Reduce Paradigm, Overview of Map Reduce in Hadoop, Task Tracker and Job Tracker, Job Configuration, Job Submission and Input Output Formats
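
To give a feel for the Map and Reduce paradigm named above, here is a minimal single-machine sketch in plain Python. It is not Hadoop code and is not taken from the course material: the function names (map_phase, shuffle, reduce_phase, word_count) and the sample input are hypothetical, chosen only to illustrate the map, group-by-key and reduce steps of a word count.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group emitted values by key, the step a framework performs between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values collected for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle and reduce in sequence on an in-memory list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    # Hypothetical sample input, for illustration only.
    sample = ["big data big ideas", "data beats ideas"]
    print(word_count(sample))
```

In a real cluster the map and reduce steps would run in parallel on many machines over data in HDFS; this sketch only shows the shape of the computation.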

5. Hadoop v 2.X

Learning Objectives – In this module, you will learn about the problems with Hadoop v 1.X, the Hadoop 2.x cluster architecture, and YARN concepts in MapReduce.

Topics – Problems with Hadoop v 1.X, Container System and Scheduler, YARN Introduction, How Jobs can be configured on YARN, Tez an Introduction

Do you provide any Certification? If yes, what is the Certification process?

Yes, we provide Tarah Technologies Certification. At the end of the course, you will work on a real-time project. You will receive a problem statement along with a dataset to work on. Once your project has been reviewed and approved by an expert, you will be awarded a certificate with performance-based grading. If your project is not approved on the first attempt, you can get extra assistance with any doubts to understand the concepts better, and re-attempt the project free of cost.

Will I get help from Tarah Technologies during the Certification Project?

Yes. Tarah Technologies will help you at every stage of your learning, and our 24/7 expert support team will ensure that you don’t get stuck. Once you submit the project, our subject matter experts will review it and share feedback to optimize it, if required.

About the Trainer(s)

1. Dr. Srinivas Padmanabhuni

He is the President of ACM India. Prior to co-founding Tarah Technologies, he was an Associate Vice President at Infosys, where he headed the research lab until October 2015. He has more than 15 years of experience in the IT and services industry. He has given invited expert talks at universities in the US, China, Australia, Canada, Singapore, the UK and India, including leading universities such as CMU, Purdue and RUC. He is a prolific, astute researcher who seeks out new challenges. He has six granted patents, around 15 filed patents, one book published by Wiley, one book in progress, several book chapters, and multiple journal and conference papers to his credit, in addition to marquee invited talks and editorial positions. He obtained a doctorate in Artificial Intelligence from the University of Alberta, Edmonton, Canada. Before his Ph.D., he earned his B.Tech and M.Tech in computer science from two premier institutes in India: the Indian Institute of Technology (IIT), Kanpur and IIT, Mumbai respectively.

2. Naveen Kulkarni

He is a researcher and practitioner with 15 years of experience in building frameworks, with a special interest in service-based architecture and mining software artifacts. He has actively published in international conferences on software engineering, software architecture and services computing. He has also spoken at many technical events and conducted workshops at international conferences. He is actively involved in peer reviewing and is part of the IEEE software board. Naveen is enrolled in the PhD program at IIITH, Hyderabad.

3. Trainer 3

He works as a Technical Architect in a leading IT services firm. He has over 9 years of experience in the software industry across various technologies and is a proficient leader. He has worked on GWT, Eclipse Plugin development, Lucene, Solr, NoSQL databases etc. He has been invited as a speaker to developer events like ACM Compute, Indic Threads and Dev Camps. In addition, he has teaching experience at renowned colleges like IIIT Bangalore and IIIT Jabalpur. He has 3 granted patents. He has presented research papers at well-known international conferences on Hadoop, Data Mining, Data Analytics, Machine Learning etc.

Founder of Tarah Technologies

Neelima Vobugari is the founder of Tarah Technologies, http://www.tarahtech.com. She is a certified CRM consultant and a certified Data Scientist. She is an alumna of Johns Hopkins University, Maryland, where she completed her specialization in Data Sciences. She has worked on interesting real-time data science and customer-centric CRM projects. Before starting Tarah Technologies, she worked for large IT companies including IBM. She was invited by the Chief Minister of Karnataka to the state's pre-budget session to represent the women entrepreneurs of Karnataka, where she suggested measures to encourage more women entrepreneurs. She also represented India at the International Women Entrepreneur Conference held in Minneapolis, USA in 2013.
