Project Overview & Challenges
Our client is a multinational e-learning company that offers educational content, technology, and services for the higher education, K-12, professional, and library markets across the globe. The client runs numerous e-learning platforms and has set up various kinds of analytics events to support its business decisions and to track and understand user needs.
To run analytics on segregated data, we used a Snowflake warehouse. Raw data was then cleansed and transformed so that stakeholders and business analysts could use it to make business decisions.
The client wanted the data to be clean enough to satisfy their business requirements. However, the collected data had data quality (DQ) issues such as duplicates in the source, duplicates in the target after extraction and loading, null values, data integrity issues, and invalid patterns.
Codoid built a customized automation testing framework in Python using Behave to address the data quality issues we encountered while testing the data.
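The kinds of checks such a framework runs against the issue types listed above can be sketched roughly as follows. This is a minimal illustration in plain Python, not the actual framework: the column names (`event_id`, `user_email`), the e-mail pattern, the sample rows, and all function names are assumptions made for the example. In practice the rows would come from warehouse queries and each check would typically sit behind a Behave step.

```python
import re
from collections import Counter

# Hypothetical pattern: a simple e-mail shape, used only for illustration.
EMAIL_PATTERN = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def find_duplicates(rows, key):
    """Return the key values that occur more than once in rows."""
    counts = Counter(row[key] for row in rows)
    return sorted(value for value, n in counts.items() if n > 1)


def find_nulls(rows, columns):
    """Return (row_index, column) pairs where a required column is None or empty."""
    return [
        (i, col)
        for i, row in enumerate(rows)
        for col in columns
        if row.get(col) in (None, "")
    ]


def find_invalid_patterns(rows, column, pattern):
    """Return indexes of rows whose non-null value does not match pattern."""
    return [
        i
        for i, row in enumerate(rows)
        if row.get(column) not in (None, "") and not pattern.match(row[column])
    ]


def find_missing_in_target(source_rows, target_rows, key):
    """Return keys present in the source but absent from the target."""
    target_keys = {row[key] for row in target_rows}
    return sorted({row[key] for row in source_rows} - target_keys)


# Made-up sample rows standing in for source and target query results.
source = [
    {"event_id": 1, "user_email": "a@example.com"},
    {"event_id": 2, "user_email": None},            # null value
    {"event_id": 2, "user_email": "not-an-email"},  # duplicate key, bad pattern
    {"event_id": 3, "user_email": "c@example.com"},
]
target = [
    {"event_id": 1, "user_email": "a@example.com"},
    {"event_id": 2, "user_email": None},
    {"event_id": 1, "user_email": "a@example.com"},  # duplicated during load
]

if __name__ == "__main__":
    print("duplicates in source:", find_duplicates(source, "event_id"))
    print("duplicates in target:", find_duplicates(target, "event_id"))
    print("null values:", find_nulls(source, ["user_email"]))
    print("invalid patterns:", find_invalid_patterns(source, "user_email", EMAIL_PATTERN))
    print("missing in target:", find_missing_in_target(source, target, "event_id"))
```

Each function returns the offending keys or row indexes rather than a plain pass/fail flag, so a failing check can report exactly which records need attention.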
With this solution, invalid data was identified early and reported to the development team and product owners. Many false positives were also located, which in turn helped the business owners make more precise decisions.