Fundamentals of Apache Hive

With this comprehensive course on Apache Hive, get comfortable with Joins, Views and different File Formats with Hive. It will help you be hands-on with querying Big Data effectively using concepts of Bucketing and Partitioning.

Enroll Now

Your Journey to Master Apache Hive Starts Here

As we know, data is the Oil for the current generation. Through data, companies can understand their customers, hence make products and services as per their requirements. For this, you can use various distributed storage and processing frameworks. The most commonly and widely used framework is Hadoop.

If you are working with Hadoop, it might get difficult for you to code directly in MapReduce for even simple tasks. But Hive will resolve this problem, as it makes it simple for you to code directly in a SQL like querying language.

This course will primarily focus on Apache Hive. You will learn about processing data in Hadoop’s MapReduce processing engine using Hive’s Query Language. Also, we will be covering Joins, Views, File Formats, and querying Big Data effectively using concepts of Bucketing and Partitioning.

The world is changing rapidly. This changing environment is creating demands of Big Data and Hive professionals. Don’t lose the opportunity to be a part of this revolution. Here are a few Professional IT groups that are continuously enjoying the benefits and perks of moving into the Big Data domain.

Developers and Architects
BI /ETL/DW Professionals
Senior IT Professionals
Testing Professionals
Mainframe Professionals
Freshers
Big Data Enthusiasts
Software Architects, Engineers, and Developers
Data Scientists and Analytics Professionals

Pre-Requisites

Good to have knowledge of any programming language like Python, Java, Scala etc.
Knowledge of Structured Query Language
Overview of Hadoop

Key Takeaways from the Hive Course

Basic Commands in Hive
Hive Query Language
Joins in Hive
File Formats
Views in Hive
Partitioning and Bucketing

What I need to start the Apache Hive Course?

A working laptop/desktop
A working Internet connection
Basic knowledge of SQL

Course curriculum

1

Introduction to the Course
- Introduction to the Course
- Course Handouts
- AI&ML Blackbelt Plus Program (Sponsored)
- Itversity Credentials
2

Introducing Hive
- What is Hive
- Features of Hive
- Working of Hive
- Quiz : Introducing Hive
3

Basic Hive Commands
- Module Overview
- Connecting to Hive
- Creating Database
- Hive Data Types
- File Encoding of Data Values
- Creating Tables in Hive
- Loading data in Hive Tables
- Managed vs External Tables
- Creating External Table
- Creating Tables from existing tables
- Dropping Tables
- Altering Tables
- Quiz : Basic Hive Commands
4

Hive Query Language
- Module Overview
- Reading Records in Hive
- Filtering Data in Hive
- Grouping Data in Hive
- Ordering Records in Hive
- ORDER BY vs SORT BY
- Distributing Data in Hive
- Built-in Functions in Hive
- Quiz : Hive Query Language
5

Joins in Hive
- Joins in Hive
- Introducing the Dataset
- Inner Join
- Outer Joins
- Map-Side Join
- Quiz : Joins in HIve
6

Partitioning and Bucketing
- What is Partitioning
- Introducing the Dataset
- Creating Partitioned Table
- Querying Partitioned Table
- Types of Partitioning
- Static Partitioning in Hive
- Dynamic Partition Properties
- Altering Partitioned Table
- Bucketing in Hive
- Bucketing Table
- Quiz : Partitioning and Bucketing
7

File Formats
- Module Overview
- File Formats in Hive
- ORC File Format
- What are SerDes
- CSV SerDe
- Customising SerDes
- Quiz : File Formats
8

Views in Hive
- What are Views
- Creating View
- Altering View
- Quiz : Views in Hive

A working laptop/desktop

A working Internet connection

Basic knowledge of SQL