Название: Getting Started with Big Data Query using Apache Impala Автор: Agus Kurniawan Издательство: PE Press Год: 2021 Страниц: 120 Язык: английский Формат: pdf, epub Размер: 15.5 MB
This book provides alternative approach to get started with Big Data Query using Apache Impala. This book describes how to work with Apache Impala and to perform queries inside Apache Impala.
Apache Impala is a modern, open source, distributed SQL query engine for Apache Hadoop. With Impala, we can query data, whether stored in HDFS, Apache Hive or Apache HBase – including SELECT, JOIN, and aggregate functions. You can find the official project on this link, In this book, we learn how to perform queries on Apache Impala.
You can set up Apache Impala with Cloudera Manager or own Linux. For demo, I use Apache Impala on Cloudera environment. I deployed Apache Impala on Ubuntu Linux.
Apache Hue is a web tool that can be used to perform queries on Apache Impala. We can say Apache Hue like MySQL Workbench in MySQL or SQL Server Management Studio in SQL Server. We can use Apache Hue to write queries to Apache Impala easily. This tool has a form in web application so we only need a browser to access. If you have Cloudera platform, you can install Apache Hue using Cloudera Manager. Add a new service on your existing Cloudera Manager. Click Hue and the install.
In this chapter, we learn how to access Apache Impala from a program. We will use Java application for a sample of client application. To access Apache Impala, we can use ODCB and JDBC drivers from Cloudera. We will JDBC 4.2 driver for Java application. Next, we create Java application project. You can create Java application using any editor tool. In this book, I use Jetbrain IntelliJ IDEA. This tool is available for community edition. Now we can create a new project using IntelliJ IDEA. You can select Java application with project template.
The following is a list of highlight topics:
* Introduction to Apache Impala * Working with Apache Impala Shell * SQL Querying with Apache Hue and Apache Impala * Loading Dataset to Apache Impala * Basic SQL Query for Apache Impala * Joining Query and Subquery on Apache Impala * Partition Data on Apache Impala * Apache Impala Database Programming with Java
Скачать Getting Started with Big Data Query using Apache Impala
Внимание
Уважаемый посетитель, Вы зашли на сайт как незарегистрированный пользователь.
Мы рекомендуем Вам зарегистрироваться либо войти на сайт под своим именем.