Cloudera Data Governance with SDX

Course Description

This course helps customers use the Cloudera Data Platform to address data governance tasks, motivated by the need for compliance with regulations such as the United State's Health Insurance Portability & the European Union's (GDPR) General Data Protection Regulation & Accountability Act (HIPAA).

Prerequisites

Install:

  • Demonstrate a knowledge of the installation process for Cloudera Manager, CDH, & the ecosystem projects.
  • Set up a local CDH repository
  • Perform OS-level configuration for Hadoop installation
  •  Addition of the new node to an existing cluster
  • Install the Cloudera Manager server & agents
  • Install CDH using Cloudera Manager

          

Adding a service using Cloudera Manager

Configure:

  • Perform basic & advanced configurations needed to administer a Hadoop cluster effectively.
  • Configure a service using Cloudera Manager
  • Make an HDFS user's home directory
  • Configure NameNode HA
  • Configure ResourceManager HA
  • Configure proxy for Hiveserver2/Impala

 

Manage:

  • Maintain & modify the cluster to support day-to-day operations in the enterprise.
  • Rebalance the cluster
  • Set up alerts for excessive disk fill
  • Define & install a rack topology script
  • Install a new type of I/O compression library in the cluster
  • Revising YARN resource assignment based on user feedback
  • Commission/decommission a node

 

Secure:

  • Enable relevant services & configure the cluster to meet goals defined by security policy; demonstrate understanding of basic security practices.
  • Configure HDFS ACLs
  • Install & configure Sentry
  • Configure Hue user authorization & authentication
  • Enable/configure log & query redaction
  • Make encrypted zones in HDFS

 

Test:

  • Benchmark the cluster operational metrics and test system configuration for operation & efficiency.
  • Execute file system commands via HTTPFS
  • Efficiently copy data within a cluster/between clusters
  • Make/restore a snapshot of an HDFS directory
  • Get and set ACLs for a file or directory structure
  • Benchmark the cluster (I/O, CPU, network)

 

Troubleshoot:

  • Demonstrate ability to find the root cause of a problem, optimize inefficient execution, & resolve resource contention scenarios.
  • Resolve errors/warnings in Cloudera Manager
  • Resolve performance problems/errors in cluster operation
  • Determine the reason for application failure
  • Set up the Fair Scheduler to resolve application delays

Audience Profile

This course is best suited for data stewards & others who are responsible for or are interested in implementing regulatory compliance or performing typical data governance activities using the Cloudera Data Platform.

Learning Objectives

Through instructor-led discussion, demonstrations, & hands-on exercises, you will learn how to:

Identify which tools in Cloudera Data Platform (CDP) to use for key data governance activities

Organize data objects using classifications & business glossary terms

  • Find access history for data objects & Policies
  • Use Data Catalog Profilers in CDP to assist in organizing data objects
  • Use Data Catalog to foster collaboration with colleagues
  • View & interpret a data object's lineage
  • Make & apply resource- & tag-based access control policies
  • Make policies for data masking & row-level filtering
  • Prerequisites 
  • Familiarity with basic data governance concepts is helpful but optional.

Content Outline

Lessons  

  • What Is Data Governance?
  • Basic Concepts
  • SDX: Data Governance in CDP

Lessons  

  • Searching for Objects by Type
  • Classifications
  • Glossary Terms

Lessons 

  • Auditing Overview
  • Viewing Audit Information

Lessons

  • Data Catalog Overview
  • Sensitive Data Profiler
  • Defining & Monitoring Data Quality
  • Preparing for Audits Using Data Catalog
  • Collaborating

Lessons

  • Inspecting Lineage
  • Propagation & Lineage in Atlas
  • Inspecting Lineage in Atlas

Lessons

  • Apache Ranger Basics
  • Creating Users & Roles
  • Resource-Based Policies
  • Tag-Based Policies
  • Securing Metadata Objects
  • Providing Partial Access

Lessons

Governing the Data Lifecycle

 

Certification

Required exams: CCA Administrator Exam (CCA131)

 

FAQs

A: To attend the training session, you should have operational Desktops or Laptops with the required specifications and a good internet connection to access the labs.

 

A: We recommend you attend the live session to practice & clarify the doubts instantly & get more value from your investment. However, if, due to some contingency if you have to skip the class, Radiant Tech learning will help you with the recorded session of that particular day. However, those recorded sessions are not meant only for personal consumption & NOT for distribution or commercial use.

 

A: Radiant Tech learning has a data center containing a Virtual Training environment for participants' hand-on-practice. Participants can easily access these labs over Cloud with the help of a remote desktop connection. Radiant virtual labs allow you to learn from anywhere around the world & in any time zone.

 

A: The learners will be enthralled as we engage them the real-world & Oriented industry projects during the training program. These projects will improve your skills & knowledge, & you will gain a better experience. These real-time projects will help you a lot in your future tasks & assignments.

 

A: You can request a refund if you do not wish to enroll in the course.

 

A: Yes, you can.

A: We adhere to the highest Internet security standards. Any data that is kept is not disclosed to outside parties.

A: It is recommended but optional. Being acquainted with the primary course material will enable students & the trainer to move at the desired pace during classes. You can access courseware for most vendors.

 

A: You can buy online from the page by clicking on "Buy Now ."You can view alternate payment methods on the payment options page.

A: Yes, students can pay from the course page.

 

A: The course completion certification will be awarded to all the professionals who have completed the training program & the project assignment given by your instructor. Using the certificate in future job interviews will help you land your dream job.

 

A: Radiant believes in a practical & creative approach to training & development, which distinguishes it from other activity & developmental platforms. Moreover, training courses are undertaken by experts with a range of experience in their domain.

 

A: Radiant team of experts will be available at e-mail support@radianttechlearning.com to answer your technical queries after the training program.

 

A: Yes, Radiant will provide you most updated, high, value-relevant real-time projects & case studies in each training program.

 

A: Technical issues are unpredictable & might occur with us as well. Participants must ensure access to the required configuration with good internet speed.

 

A: Radiant Techlearning offers training programs on weekdays, weekends & combination of weekdays & weekends. We provide you with complete liberty to choose the schedule that suits your need.

 

A: Radiant has highly intensive selection criteria for Technology Trainers & Consultants who deliver training programs. Our trainers & consultants undergo rigorous technical & behavioral interview & assessment processes before they are boarded in the company.

Our Technology experts/trainers & consultants carry deep-dive knowledge in the technical subject & are certified by the OEM.

Our training programs are practically oriented with 70% – 80% hands-on training technology tools. Our training program focuses on one-on-one interaction with each participant, the latest content in the curriculum, real-time projects & case studies during the training program.

Our faculty will provide you with the knowledge of each course from the fundamental level in an easy way & you are free to ask your doubts any time from your respective faculty.

Our trainers have the patience & ability to explain complex concepts simplistically with depth & width of knowledge.

To ensure quality learning, we provide a support session even after the training program.

 

Send a Message.


  • Enroll