Apache Drill Performance and Debugging

Use SQL queries on a variety of data types, including structured data in a Hive table, semi-structured data in HBase or Data Fabric-DB, and complex data file types such as Parquet and JSON.

About this Course

In this course, you will write SQL queries on a variety of data types, including structured data in a Hive table, semi-structured data in HBase or MapR-DB, and complex data file types such as Parquet and JSON. You will also learn which services are involved at each step of query execution, and how Drill optimizes a query for distributed SQL execution.

What's Covered

Course Lessons and Lab Activities

4: Apache Drill Architecture

  • Lesson: Drill Execution Process
  • Lesson: Drill Architecture Components
  • Lab: Order Query Process Steps
  • Lab: Sketch Drillbit Architecture

5: Query Plans and Optimization

  • Lesson: About Physical Query Plans
  • Lesson: Examine a Physical Query Plan
  • Lesson: Optimize Queries
  • Lab: Examine Physical Query Plans
  • Lab: Create a Partitioned Table

6: Apache Drill Performance and Debugging

  • Analyze Drill Error Messages
  • Configure Log File Settings
  • Troubleshoot Apache Drill
  • Examine Drill Log Files

Prerequisites

  • Completion of DA 400 - Intro to SQL Analytics with Apache Drill, recommended
  • Linux skills, including familiarity with commands such as ls, cd, cp, and su
  • Beginning to intermediate proficiency with SQL
  • Basic Hadoop knowledge

Curriculum

  • Lesson 4 - Apache Drill Architecture
  • Quiz 4
  • Lesson 5 - Apache Drill Query Plans and Optimization
  • Quiz 5
  • Lesson 6 - Apache Drill Performance and Debugging
  • Quiz 6
  • Course Materials
  • Lab Guide
  • Lab Environment Connection Guide
