The IBM InfoSphere Big Match on Hadoop training program will introduce the professionals to the Probabilistic Matching Engine (PME) and the ways in which it can be used to resolve and find out entities across multiple data sets in Hadoop.  

Professionals will acquire the basic knowledge of a PME algorithm including data model configuration, standardization, weight generation, as well as threshold, comparison and bucketing functions.

During the exercises, the professionals will work on large cases that would be in use, where they will apply their knowledge of Big Match to discover relationships be two data sets that can be used to understand the full view of the member data.


Radiant Techlearning offers the InfoSphere BigMatch v11.4 for Apache Hadoop training program in Classroom & Virtual Instructor Led / Online mode.


Duration: 2 days


Learning Objective


  • Acquiring the knowledge about the capabilities of the Probabilistic Matching Engine
  • Acquiring an understanding about the ways in which the Probabilistic Matching engine is used with Big Insights to solve certain use cases.
  • Acquiring an understanding about the technical framework of the Big Match solution and the ways in which member data is derived, bucketed and compared to produce a complete entity from multiple data sets.
  • Designing a project as well as data model, by using the Big Match Console
  • Configuring the HBase tables which will be used in a Big Match solution
  • Configuring an algorithm with the help of the Big Match console which will include Standardization, Comparison and Bucketing functions.
  • Set up Strings for Anonymous value, Frequency values, Equivalency values, along with the character maps, by using the Big Match console
  • Set up as well as run the Weight Generation process
  • Evaluate, as well as set the thresholds for the algorithm
  • Deploying a new algorithm to Big Match
  • Evaluating Entity results and reconfigure algorithm based on evaluation.  E.g. Large Entities, Large Buckets, Member, those who are not belonging to any buckets, etc


  • Basic knowledge about Java developments and XML concepts can be helpful, but are not necessary. 


Audience profile

The training program is designed for the technical audience who will be setting up a custom algorithm for the Probabilistic Matching Engine to utilize the Big Match on Apache Hadoop to compare, match and search for the member records across multiple data sets.

Course content

1. Introduction to Big Match for Apache Hadoop


  • What is Big Match
  • How Big Match Works
  • Big Match Components
  • Big Match Architecture


2. Big Match Data Model Definition


  • Members
  • Attribute Types
  • Member Attributes
  • Sources
  • Information Sources


3. PME Algorithm


  • Standardization
  • Bucketing
  • Comparison Functions


4. Bucket Analysis


  • Bucket Optimization
  • Bucket Concerns


5. Weights


  • String Weights
  • Numeric Weights
  • Multi-dimensional Weights
  • Solving Weights


6. Base Tables


  • HBase concepts
  • Big Match commands
  • Big Match Tables (.pmebktidx, .pmemdmidx, .pmeentidx)
  • Best Practices


7. BigMatch Applications


  • PME Derive
  • PME Compare
  • PME Link
  • PME Analysis



Q: What is big match in InfoSphere?


A: IBM InfoSphere Big Match is a Certified Technology Partner


Q: What is InfoSphere Information Server?


A: IBM InfoSphere Information Server is a market-leading data integration platform which contains a family of products that allow you to understand, cleanse, monitor, transform and deliver data, as well as to collaborate to bridge the gap among business as well as IT.


Q: How big match works?


A: With the web-based Big Match Console, you can make as well as configure the algorithms that you want to utilize to process the data in your HBase tables. In the realm of IBM InfoSphere MDM, an algorithm is a step-by-step procedure that compares as well as scores the similarities and also the differences of member attributes.


Q: What is the main purpose behind designing this course?


A: This training program is designed to allow a trained programmer to develop and maintain easy RPG IV programs written with the use of the latest features and techniques available in the Version 7 compiler.


Q: How does the infosphere MDM works?


A: IBM InfoSphere Master Data Management (MDM) manages all aspects of your critical enterprise data, no matter what system or model, and delivers it to your application users in a single, trusted view.


Q: For whom this course is recommended?


A: The course is designed for a technical audience that will be setting up a custom algorithm for the Probabilistic Matching Engine to use Big Match on Apache Hadoop to relate, match as well as search member records across multiple data sets.


Q: How is the Radiant Techlearning verified certificate awarded?


A: Radiant awards course completion certificate to all the participants who have completed the training program which includes various real time projects, assignments, quizzes and some other tasks.  Once the course is done you would be assigned with a project which you would have to submit in 2 weeks’ time. 

Radiant Techlearning experts will be evaluating the project on various parameter. To be eligible for the verified certificate you would have to score more than 60% marks.

 Only after completion of these criteria you would be awarded with Radiant verified certificate and which the participants can use for their future job purpose. Participants will be awarded with grades according to the following criteria:

90% – 100% – AAA+

80% – 90% – AA+

70% – 80% – A+

60% – 70% – A


Q: Is there any job assistant guarantee?


A: No. These training program are helpful to improve your skills & knowledge on the technology which would help you to land in your dream job by learning them. Our training program will maximise your ability and chances of getting a successful job. You have to select job according to your convenience. Your performance in the training program and interview is crucial for getting good job.


Q: Does my employer can pay the fees of my courses?


A: Yes, your employer can pay your fees. 


Q: Is there any EMI option?


A: Yes you can easily choose an EMI option through your credit card or Debit card.


Q: What is the mode of payment?


A: You can submit payment to Radiant by:

  • Debit or credit card
  • Bank transfer 
  • Google pay

Unble To Find a Batch..?

Request a Batch