Apache Drill Performance and Debugging

Apache Drill Performance and Debugging

Use SQL queries on a variety of data types, including structured data in a Hive table, semi-structured data in HBase or Data Fabric-DB, and complex data file types such as Parquet and JSON.

About this Course

In this course, you will write SQL queries on a variety of data types, including structured data in a Hive table, semi-structured data in HBase or MapR-DB, and complex data file types such as Parquet and JSON. You will also learn the different services involved at each step, and how Drill optimizes a query for distributed SQL execution.

What's Covered

Course Lessons Lab Activities

4: Apache Drill Architecture

Drill Execution Process
Drill Architecture Components

 

Order Query Process Steps
Sketch Drillbit Architecture

5: Query Plans and Optimization

About Physical Query Plans
Examine a Physical Query Plan
Optimize Queries

 

Examine Physical Query Plans
Create a Partitioned Table

6: Apache Drill Performance and Debugging

Analyze Drill Error Messages
Configure Log File Settings
Troubleshoot Apache Drill
Examine Drill Log Files
 

Prerequisites

  • Completion of DA 400 - Intro to SQL Analytics with Apache Drill, recommended
  • Linux skills, including familiarity with command-line options such as ls, cd, cp, and su
  • Beginning to intermediate proficiency with SQL
  • Basic Hadoop knowledge

Curriculum

  • Lesson 4 - Apache Drill Architecture
  • Quiz 4
  • Lesson 5 - Apache Drill Query Plans and Optimization
  • Quiz 5
  • Lesson 6 - Apache Drill Performance and Debugging
  • Quiz 6
  • Course Materials
  • Lab Guide
  • Lab Environment Connection Guide

About this Course

In this course, you will write SQL queries on a variety of data types, including structured data in a Hive table, semi-structured data in HBase or MapR-DB, and complex data file types such as Parquet and JSON. You will also learn the different services involved at each step, and how Drill optimizes a query for distributed SQL execution.

What's Covered

Course Lessons Lab Activities

4: Apache Drill Architecture

Drill Execution Process
Drill Architecture Components

 

Order Query Process Steps
Sketch Drillbit Architecture

5: Query Plans and Optimization

About Physical Query Plans
Examine a Physical Query Plan
Optimize Queries

 

Examine Physical Query Plans
Create a Partitioned Table

6: Apache Drill Performance and Debugging

Analyze Drill Error Messages
Configure Log File Settings
Troubleshoot Apache Drill
Examine Drill Log Files
 

Prerequisites

  • Completion of DA 400 - Intro to SQL Analytics with Apache Drill, recommended
  • Linux skills, including familiarity with command-line options such as ls, cd, cp, and su
  • Beginning to intermediate proficiency with SQL
  • Basic Hadoop knowledge

Curriculum

  • Lesson 4 - Apache Drill Architecture
  • Quiz 4
  • Lesson 5 - Apache Drill Query Plans and Optimization
  • Quiz 5
  • Lesson 6 - Apache Drill Performance and Debugging
  • Quiz 6
  • Course Materials
  • Lab Guide
  • Lab Environment Connection Guide