What Is Data Profiling? Steps and Types of Data Profiling
Data profiling is the analysis of source data to determine how it is structured, what it contains, and how it relates to other data, and to identify projects that could benefit from that data.

Data Profiling Procedures

Step 1: Profile the data at the start of a project to establish whether it is suitable for analysis and whether the project should continue.
Step 2: Before loading the source data into the target database, identify and resolve any quality problems in it.
Step 3: As the data moves from source to target, look for data quality issues that can be fixed during Extract-Transform-Load (ETL). Profiling can also show whether additional manual processing is necessary.
Step 4: Identify unexpected business rules, hierarchical structures, and foreign key / primary key relationships, and use them to refine the ETL process.

Data Profiling Types

Content Discovery: An individual