Ensure Data Integrity with Automated Data Screening in Omniscope

Modified on Tue, 8 Jul at 2:20 PM

For anyone invested in robust data governance, maintaining the cleanliness and reliability of the datasets is paramount. A common challenge arises when integrating fresh data with existing, clean historical records: how do you prevent anomalies from polluting your valuable reports? This is where automated data screening in Omniscope becomes indispensable.


The Use Case: Preventing Data Pollution

Imagine a scenario where you have a pristine historical dataset powering your critical reports. New data arrives daily, and while most of it is accurate, some inevitably contains errors or inconsistencies. Adding this "fresh" data directly to your historical records without validation is a recipe for data quality issues. Anomalies can skew your analysis, leading to flawed insights and misinformed decisions.

Omniscope offers a powerful solution: an instant data screening app that acts as your first line of defense. This app can:

  • Evaluate the data schema: Ensuring all necessary fields are present and correctly formatted, preventing structural inconsistencies.

  • Check cell values: Validating data against predefined rules, catching outliers or incorrect entries.

  • Verify record counts and missing data: Identifying unexpected data volumes or significant gaps that could indicate issues.

By performing these checks, users gain instant insight and can explore the data to pinpoint anomalies. These issues can then be addressed proactively, preventing them from ever reaching and polluting your clean historical data and subsequent reports. This is a critical component of any effective data governance strategy.


Your Instant Data Quality Screening Tool in Omniscope


Users can easily build a powerful and readily adjustable template that can serve as a go-to data quality screening tool. This template isn't bundled with Omniscope but is provided as a downloadable file alongside this article.

It allows you to instantly spot data quality issues, interactively explore your data, and diagnose any problems before they impact your analysis.

To get started, simply download the provided IOZ file below and upload it to your Omniscope Templates folder. Then, import and open the project within your Omniscope installation.



Applying the Template: Easy as 1-2-3


Applying this template to any new datasets is straightforward:

  1. Set Your Data Path: Open the first File input block. Navigate to your target data file (the one you wish to use to create screening model) and copy its file path. Next, open the workflow’s Parameters menu (accessible via the three-dot menu) and paste the copied path to replace the existing "File path" parameter value.

  2. Configure Data Checks: In the Data Validate block, click on the “Reset to Input” option to rebuild the data schema check based on your new data. You can also customise this block further by creating additional criteria to screen actual data cell values or to verify the number of records coming through.

  3. Visualise and Diagnose: Now, refresh the Report block to visualise your data along with diagnostics. The template is now fully configured and available on your Omniscope projects page. Users can simply drag and drop any new dataset onto this template to have it instantly checked for anomalies.

By leveraging this automated data screening capability in Omniscope, you can maintain high standards of data quality and ensure that your reports are always based on clean, reliable information.




Was this article helpful?

That’s Great!

Thank you for your feedback

Sorry! We couldn't be helpful

Thank you for your feedback

Let us know how can we improve this article!

Select at least one of the reasons
CAPTCHA verification is required.

Feedback sent

We appreciate your effort and will try to fix the article