Hadoop Course Content
INTRODUCTION
- Big Data
- 3Vs
- Role of Hadoop in Big data
- Hadoop and its ecosystem
- Overview of other Big Data Systems
- Requirements in Hadoop
- UseCases of Hadoop
HDFS
- Design
- Architecture
- Data Flow
- CLI Commands
- Java API
- Data Flow Archives
- Data Integrity
- WebHDFS
- Compression
MAPREDUCE
- Theory
- Data Flow (Map – Shuffle – Reduce)
- Programming [Mapper, Reducer, Combiner, Partitioner]
- Writables
- InputFormat
- Outputformat
- Streaming API
ADVANCED MAPREDUCE PROGRAMMING
- SCounters
- SCustomInputFormat
- SDistributed Cache
- SSide Data Distribution
- SJoins
- SSorting
- SToolRunner
- SDebugging
- SPerformance Fine tuning
ADMINISTRATION – Information required at Developer level
- Hardware Considerations – Tips and Tricks
- Schedulers
- Balancers
- NameNode Failure and Recovery
HBase
- NoSQL vs SQL
- CAP Theorem
- Architecture
- Configuration
- Role of Zookeeper
- Java Based APIs
- MapReduce Integration
- Performance Tuning
HIVE
- Architecture
- Tables
- DDL – DML – UDF – UDAF
- Partitioning
- Bucketing
- Hive-Hbase Integration
- Hive Web Interface
- Hive Server
String Handling
- Overview of String in C
- Reading String from Terminal
- Writing String to console screen
- String Handling Functions - string.h
- gets() & puts() functions
OTHER HADOOP ECOSYSTEMS
- Pig (Pig Latin , Programming)
- Sqoop (Need – Architecture ,Examples)
- Introduction to Components (Flume, Oozie,ambari)
Benefits of Hadoop Training
- Complete code explanation and implementation
- Course Starts from installation of technology to deployment of product
- Trainers from Industry with good hand on experience
- You can develop your own programs after understanding the basics with our experienced Faculties
- Weekdays, fast track and weekend Batches
- Certificate after Successful completion of Training
- Online and Offline material support for better learning
- Software and Installation support will be provided
- Regular Machine Test for better understandings
- Free Live Project Support to all participants
- Industry Exposure via Live Troubleshooting
- Guaranteed placement to meritorious students
Required Software/ Platforms for hadoop Training
- Any OS
- Java Oracle JDK 1.6
- Server: GlassFish, Tomcat