Cloudera Data Analyst Training

Course Description

Cloudera Educational Services' four-day Data Analyst Training training will teach you how to apply traditional data analytics and business intelligence skills to big data. This course provides data professionals with the tools they need to access, manipulate, transform, and analyze complex data sets using SQL and common scripting languages.

Prerequisites

SQL knowledge is assumed, as is basic Linux command-line familiarity. It is not necessary to have prior knowledge of Apache Hadoop.

Audience Profile

This course is intended for data analysts, business intelligence specialists, developers, system architects, and database administrators.

Learning Objectives

Through instructor-led discussion & interactive, hands-on exercises, participants will navigate the ecosystem, learning:

  • How the open source big data ecosystem addresses challenge that traditional RDBMSs do not.
  • Using Apache Hive & Apache Impala to provide SQL access to data
  • Hive & Impala syntax & data formats, including functions & subqueries
  • Make, modify, & delete tables, views, & databases; load data; & store results of queries
  • Make & use partitions & different file formats
  • Using JOIN or UNION, as appropriate, to combine two or more datasets
  • What analytic & windowing functions are, & how to use them
  • Store & query complex or nested data structures
  • Process & analyze semi-structured & unstructured data
  • Techniques for optimizing Hive & Impala queries
  • Extending the capabilities of Hive & Impala using parameters, custom file formats & SerDes, & external scripts
  • How to decide whether Hive, Impala, an RDBMS, or a combination of these is the best choice for a given task.

Content Outline

Lessons 

  • The Motivation for Hadoop
  • Hadoop Overview
  • Data Storage: HDFS
  • Distributed Data Processing: YARN, MapReduce, & Spark
  • Data Processing & Analysis: Pig, Hive, & Impala
  • Database Integration: Sqoop
  • Other Hadoop Data Tools
  • Exercise Scenario Explanation  

Lessons  

  • What Is Hive?
  • What Is Impala?
  • Why Use Hive & Impala?
  • Schema & Data Storage
  • Comparing Hive & Impala to Traditional Databases
  • Use Cases 

Lessons 

  • Databases & Tables
  • Basic Hive & Impala Query Language Syntax
  • Data Types
  • Using Hue to Execute Queries
  • Using Beeline (Hive's Shell)
  • Using the Impala Shell 

Lessons

  • Operators
  • Scalar Functions
  • Aggregate Functions

Lessons

  • Data Storage
  • Making Databases & Tables
  • Loading Data
  • Altering Databases & Tables
  • Simplifying Queries with Views
  • Storing Query Results

Lessons

  • Partitioning Tables
  • Loading Data into Partitioned Tables
  • When to Use Partitioning
  • Choosing a File Format
  • Using Avro & Parquet File Formats

Lessons

  • UNION & Joins
  • Handling NULL Values in Joins
  • Advanced Joins

Lessons

  • Using Common Analytic Functions
  • Other Analytic Functions
  • Sliding Windows

Lessons

  • Complex Data with Hive
  • Complex Data with Impala

Lessons

  • Using Regular Expressions with Hive & Impala
  • Processing Text Data with SerDes in Hive
  • Sentiment Analysis & n-grams

Lessons

  • Understanding Query Performance
  • Bucketing
  • Hive on Spark

Lessons

  • How Impala Executes Queries
  • Improving Impala Performance

Lessons

  • Custom SerDes & File Formats in Hive
  • Data Transformation with Custom Scripts in Hive
  • User-Defined Functions
  • Parameterized Queries

Lessons

  • Comparing Hive, Impala, & Relational Databases
  • Which to Choose?

Certification

Required exams: CCA Data Analyst Exam (CCA159)

 

FAQs

A: To attend the training session, you should have operational Desktops or Laptops with the required specifications and a good internet connection to access the labs.

A: We recommend you attend the live session to practice & clarify the doubts instantly & get more value from your investment. However, if, due to some contingency if you have to skip the class, Radiant Tech learning will help you with the recorded session of that particular day. However, those recorded sessions are not meant only for personal consumption & NOT for distribution or commercial use.

 

A: Radiant Tech learning has a data center containing a Virtual Training environment for participants' hand-on-practice. Participants can easily access these labs over Cloud with the help of a remote desktop connection. Radiant virtual labs allow you to learn from anywhere in the world & in any time zone

A: The learners will be enthralled as we engage them the real-world & Oriented industry projects during the training program. These projects will improve your skills & knowledge, & you will gain a better experience. These real-time projects will help you a lot in your future tasks & assignments.

 

A: You can request a refund if you do not wish to enroll in the course.

 

A: Yes, you can.

 

A: We adhere to the highest Internet security standards. Any data that is kept is never shared with third parties.

 

A: It is recommended but optional. Being acquainted with the primary course material will enable students & the trainer to move at the desired pace during classes. You can access courseware for most vendors.

 

A: You can buy online from the page by clicking on "Buy Now ."You can view alternate payment methods on the payment options page.

 

A: Yes, students can pay from the course page.

 

A: The course completion certification will be awarded to all the professionals who have completed the training program & the project assignment given by your instructor. Using the certificate in your future job interviews will help you to l& your dream job.

A: Radiant believes in a practical & creative approach to training & development, which distinguishes it from other training & developmental platforms. Moreover, training courses are undertaken by experts with a range of experience in their domain.

 

A: Radiant team of experts will be available at e-mail support@radianttechlearning.com to answer your technical queries after the training program.

A: Yes, Radiant will provide you most updated, high, value-relevant real-time projects & case studies in each training program.

 

A: Technical issues are unpredictable & might occur with us as well. Participants must ensure access to the required configuration with good internet speed.

 

A: Radiant Techlearning offers training programs on weekdays, weekends & combination of weekdays & weekends. We provide you with complete liberty to choose the schedule that suits your need.

 

A: Radiant has highly intensive selection criteria for Technology Trainers & Consultants who deliver training programs. Our trainers & consultants undergo rigorous technical & behavioral interview & assessment processes before they are boarded in the company.

Our Technology experts/trainers & consultants carry deep-dive knowledge in the technical subject & are certified by the OEM.

Our training programs are practically oriented with 70% – 80% hands-on training technology tools. Our training program focuses on one-on-one interaction with each participant, the latest content in the curriculum, real-time projects & case studies during the training program.

Our faculty will provide you with the knowledge of each course from the fundamental level in an easy way & you are free to ask your doubts any time from your respective faculty.

Our trainers have the patience & ability to explain complex concepts simplistically with depth & width of knowledge.

To ensure quality learning, we provide a support session even after the training program.

Send a Message.


  • Enroll
    • Learning Format: ILT
    • Duration: 80 Hours
    • Training Level : Beginner
    • Jan 29th : 8:00 - 10:00 AM (Weekend Batch)
    • Price : INR 25000
    • Learning Format: VILT
    • Duration: 50 Hours
    • Training Level : Beginner
    • Validity Period : 3 Months
    • Price : INR 6000
    • Learning Format: Blended Learning (Highly Interactive Self-Paced Courses +Practice Lab+VILT+Career Assistance)
    • Duration: 160 Hours 50 Hours Self-paced courses+80 Hours of Boot Camp+20 Hours of Interview Assisstance
    • Training Level : Beginner
    • Validity Period : 6 Months
    • Jan 29th : 8:00 - 10:00 AM (Weekend Batch)
    • Price : INR 6000

    This is id #d