Data Science & Big Data Analytics v2

Course Description

This course provides practical foundation-level training that enables immediate & effective participation in Big Data & other analytics projects. It includes an introduction to Big Data & the analytics lifecycle to address business challenges leveraging Big Data. The course provides grounding in basic & advanced analytic methods & an introduction to Big Data analytics technology & tools, including MapReduce & Hadoop. Labs offer opportunities for students to understand how these methods & tools may be applied to real-world business challenges by a practicing data scientist. 

The course takes an “open” or technology-neutral approach & includes a final lab that addresses a big data analytics challenge by applying the concepts taught in the system in the context of the data analytics lifecycle. The course prepares the student for the Dell EMC Proven™ Professional Data Scientist Associate (EMCDSA) certification exam. 

Prerequisites

To complete this course successfully & gain the maximum benefits from it, a student should have the following knowledge & skill sets: 

  •  A robust quantitative background with a solid understanding of basic statistics, as would be found in a statistics 101-level course 
  •  Experience with scripting languages like Java, Perl, or Python (or R). Many of the lab examples taught in the course use R (with an RStudio GUI), which is an open-source statistical tool & programming  
  •  Experience with SQL 

Audience Profile

This course is intended for individuals seeking to develop an understanding of Data Science from the perspective of a practicing Data Scientist, including: 

  •  Managers of teams of business intelligence, analytics, & prominent data professionals 
  •  Current Business & Data Analysts want to add big data analytics to their skills.
  •   Data & database professionals looking to exploit their analytic skills in a big data environment 
  •  Recent college graduates & graduate students with academic experience in a related discipline looking to move into the world of data science & big data
  •   Individuals seeking to take advantage of the EMC Proven™ Professional Data Scientist Associate (EMCDSA) certification

Learning Objectives

Upon successful completion of this course, participants should be able to: 

  •  Immediately participate as a data science team member 
  •  Work with large data sets & generate insights 
  •  Build predictive & classification models  
  •  Manage a data analytics project throughout the entire lifecycle

Content Outline

  •  Big Data & its characteristics Lesson 
  •  Business value from Big Data 
  •  Data scientist 
  •  Data analytics lifecycle overview 
  •  Discovery phase 
  •  Data preparation phase 
  •  Model planning phase 
  •  Model building phase  
  •  Communicate results phase 
  •  Operationalize phase 
  •  Introduction to the R programming language 
  •  Analyzing & exploring data 
  •  Statistics for model building & evaluation
  •  Introduction to advanced analytics—theory & methods 
  •  K-means clustering 
  •  Association rules 
  •  Linear regression 
  •  Logistic regression 
  •  Text analysis 
  •  Naïve Bayes 
  •  Decision trees 
  •  Time series analysis
  •  Introduction to advanced analytics—technology & tools 
  •  Hadoop ecosystem  
  •  In-database analytics SQL essentials  
  •  Advanced SQL & Madlib
  •  Preparing to operationalize 
  •  Preparing project presentations 
  •  Data visualization techniques

Certification

Associate – Data Science Version 2.0 (DCA-DS)

FAQs

A: This course provides practical foundation-level training that enables immediate & effective participation in Big Data & other analytics projects. It includes an introduction to Big Data & the analytics lifecycle to address business challenges leveraging Big Data.

 

A: To attend the training session, you should have operational Desktops or Laptops with the required specification and a good internet connection to access the labs. 

 

A: We recommend you attend the live session to practice & clarify the doubts instantly & get more value from your investment. However, if, due to some contingency, you have to skip the class, Radiant Techlearning will help you with the recorded session of that particular day. However, those recorded sessions are not meant only for personal consumption & NOT for distribution or commercial use.

 

A: Radiant Techlearning has a data center containing a Virtual Training environment for participants’ hand-on-practice. 

Participants can easily access these labs over Cloud with the help of a remote desktop connection. 

Radiant virtual labs allow you to learn from anywhere in the world & in any time zone. 

 

A: The learners will be enthralled as we engage them the real-world & industry Oriented projects during the training program. These projects will improve your skills & knowledge & you will gain a better experience. These real-time projects will help you a lot in your future tasks & assignments.

Send a Message.


  • Enroll