Data warehousing on AWS

Course Description

Data Warehousing on AWS introduces professionals to concepts, strategies, & ideal practices for designing a cloud-based data warehousing solution using Amazon Redshift, the petabyte-scale data warehouse in AWS. This course demonstrates how to collect, store, & prepare data for the data warehouse using AWS services such as Amazon DynamoDB, Amazon EMR, Amazon Kinesis, & Amazon S3. This course demonstrates how to use Amazon QuickSight to analyze your data.

Prerequisites

We recommend that participants of this course have the following:

  • Taken AWS Technical Essentials (or equivalent experience with AWS)
  • Familiarity with relational databases & database design concepts

Target Audience

This course is intended for:                                                                                                                                                           

  • Database Architects
  • Database Administrators
  • Database Developers
  • Data Analysts
  • Data Scientists

Course Objectives

In this course, you will:

  • Discuss the core concepts of data warehousing, & the intersection between data warehousing & big data solutions
  • Launch an Amazon Redshift cluster & use the components, features, & functionality to implement a data warehouse in the cloud
  • Use other AWS data & analytic services, such as Amazon DynamoDB, Amazon EMR, Amazon Kinesis, & Amazon S3, to contribute to the data warehousing solution
  • Architect the data warehouse
  • Identify performance issues, optimize queries, & tune the database for better performance
  • Use Amazon Redshift Spectrum to analyze data directly from an Amazon S3 bucket
  • Use Amazon QuickSight to perform data analysis & visualization tasks against the data warehouse

Content Outline

  • Relational databases
  • Data warehousing concepts
  • The intersection of data warehousing & big data
  • Overview of data management in AWS
  • Hands-on lab 1: Introduction to Amazon Redshift
  • Conceptual Overview
  • Real-world use cases
  • Hands-on lab 2: Launching an Amazon Redshift cluster
  • Building the cluster
  • Connecting to the cluster
  • Controlling access
  • Database security
  • Load data
  • Hands-on lab 3: Optimizing database schemas
  • Schemas & data types
  • Columnar compression
  • Data distribution styles
  • Data sorting methods
  • Data sources overview
  • Amazon S3
  • Amazon DynamoDB
  • Amazon EMR
  • Amazon Kinesis Data Firehose
  • AWS Lambda Database Loader for Amazon Redshift
  • Hands-on lab 4: Loading real-time data into an Amazon Redshift database
  • Preparing Data
  • Loading data using COPY
  • Maintaining tables
  • Concurrent write operations
  • Troubleshooting load issues
  • Hands-on lab 5: Loading data with the COPY command
  • Amazon Redshift SQL
  • User-Defined Functions (UDFs)
  • Factors that affect query performance
  • The EXPLAIN command & query plans
  • Workload Management (WLM)
  • Hands-on lab 6: Configuring workload management
  • Amazon Redshift Spectrum
  •  Configuring data for Amazon Redshift Spectrum
  •  Amazon Redshift Spectrum Queries
  •  Hands-on lab 7: Using Amazon Redshift Spectrum
  •  Audit logging
  •  Performance monitoring
  •  Events & notifications
  • Power of visualizations
  • Building dashboards
  • Amazon QuickSight editions & features

FAQs

A data lake is a centralized & secured repository that collects all your data, both in its original form & prepared for analysis.

There are three methods of data storage, namely: –

  • Object storage
  • File storage
  • Block storage

EC2 is a service that enables business clients to run application programs in the computing environment.

AWS security provides opportunities to protect the data, check out security-related activity & receive automated responses.

Radiant believes in a practical & creative approach to training & development, which distinguishes it from other training & development platforms. Moreover, training courses are undertaken by some experts with a vast range of experience in their domain.

Radiant team of experts will be available at e-mail support@radianttechlearning.com to answer your technical queries after the training program.

Yes, Radiant will provide you most updated, high, value & relevant real-time projects & case studies in each training program.

Technical issues are unpredictable & might occur with us as well. Participants must ensure access to the required configuration with good internet speed.

Radiant Techlearning offers training programs on weekdays, weekends & combination of weekdays & weekends. We provide you with complete liberty to choose the schedule that suits your needs.

Send a Message.


  • Enroll
    • Learning Format: ILT
    • Duration: 80 Hours
    • Training Level : Beginner
    • Jan 29th : 8:00 - 10:00 AM (Weekend Batch)
    • Price : INR 25000
    • Learning Format: VILT
    • Duration: 50 Hours
    • Training Level : Beginner
    • Validity Period : 3 Months
    • Price : INR 6000
    • Learning Format: Blended Learning (Highly Interactive Self-Paced Courses +Practice Lab+VILT+Career Assistance)
    • Duration: 160 Hours 50 Hours Self-paced courses+80 Hours of Boot Camp+20 Hours of Interview Assisstance
    • Training Level : Beginner
    • Validity Period : 6 Months
    • Jan 29th : 8:00 - 10:00 AM (Weekend Batch)
    • Price : INR 6000

    This is id #d