Configure parameters for quality rules applied to data tables, metrics, and real-time meta-tables.
Data table parameter configuration
Data table rule configuration
|
Template type |
Description |
|
Completeness/Uniqueness |
Completeness-Null Value Validation/Empty String Validation Uniqueness-Uniqueness Validation/Field Group Count Validation/Duplicate Value Count Validation:
|
|
Timeliness |
|
|
Validity |
|
|
Consistency |
|
|
Stability |
|
|
Custom SQL |
Note
If you select a custom rule template for a Custom SQL rule, the configuration section automatically parses the template's variable fields as property values. Configure these values based on the descriptions of the template variables. |
Data table validation configuration description
|
Template type |
Configuration item |
Description |
|
Completeness |
Abnormal rows/Normal rows/Abnormal rate/Normal rate |
|
|
Uniqueness |
Abnormal rows/Normal rows/Abnormal rate/Normal rate |
|
|
Statistical value |
Refers to the unique value definition, which is the data after a |
|
|
Statistical value (Duplicate rows/Duplication rate) |
|
|
|
Timeliness, Validity |
Abnormal rows/Normal rows/Abnormal rate/Normal rate |
|
|
Consistency |
Statistical difference, Statistical difference rate (%) |
Statistical difference: Validation field - Comparison field. Statistical difference rate: Validation field/Comparison field. |
|
Stability |
Statistical value (1-day volatility, 7-day volatility, 30-day volatility) |
Compares the table row count with those collected 1 day, 7 days, and 30 days ago, compares the volatility, and then compares with the set threshold. If any one does not meet the rule, an alarm is triggered. |
|
Custom SQL |
Abnormal rows/Normal rows/Abnormal rate/Normal rate |
|
|
Statistical value (1-day volatility, 7-day volatility, 30-day volatility) |
Compares the table row count with those collected 1 day, 7 days, and 30 days ago, compares the volatility, and then compares with the set threshold. If any one does not meet the rule, an alarm is triggered. |
Metric parameter configuration
Metric rule configuration
|
Template type |
Description |
|
Uniqueness |
Field Group Count Validation/Duplicate Value Count Validation: Data filtering needs to be configured. Data Filtering: Disabled by default. After it is enabled, you can configure filter conditions, partition filters, or regular data filters for the validation table. The filter conditions will be directly appended to the validation SQL statement. To filter partitions for the validation table, we recommend that you configure a partition filter expression in the scheduling configuration. After the configuration, the quality report will be viewed at the minimum granularity of the validation partition. Enter the data filtering content, for example:
|
|
Stability |
Column Stability Validation/Column Volatility Validation:
|
Metric validation configuration
|
Template type |
Configuration item |
Description |
|
Uniqueness |
Number of field groups |
Compares the count of this field after grouping with the specified static field. |
|
Statistics (duplicate rows/duplication rate) |
|
|
|
Stability |
Statistical value |
The unique value count, which is obtained from a |
|
Statistical values (1-day volatility, 7-day volatility, 30-day volatility) |
Compares the number of table rows collected with those collected 1 day, 7 days, and 30 days ago. Compares the fluctuation rates with the set threshold. If any of these comparisons does not comply with the rule, an alarm is triggered. |
|
|
Mean fluctuation detection (7-day fluctuation, 30-day fluctuation) |
The baseline value is the average number of table rows in the last 7 days or 30 days, compared with the fluctuation rate of the average value over the last 7 days or 30 days. |
|
|
Statistical values (fluctuation rate compared to the 1st day of the current month, fluctuation rate compared to the previous month, fluctuation rate compared to the previous year) |
Compares the number of table rows collected with those from the 1st day of the current month, the previous month, and the previous year to calculate fluctuation rates. These rates are then compared with the set threshold. An alarm is triggered if any of the comparisons does not comply with the rule. |
Real-time meta-table parameter configuration
Offline link comparison parameter configuration
When real-time and offline data share the same statistical ingest endpoint logic, this quality rule detects differences between them. Significant differences may indicate data quality issues.
|
Parameter |
Description |
|
Validation Field |
Select the field that needs to be validated. |
|
Metric Operator |
Select the algorithm for the data. |
|
Object Form |
Select Single Value Data and Multiple Value Data. |
|
Time Constraint Condition |
Select the field for time constraint. |
|
Enable Condition Constraint |
Select Enable or Shutdown condition constraint. |
|
Offline Data |
Select offline data table from the dropdown. |
|
Offline Data Retrieval |
The default is Shutdown. When enabled, you can configure data retrieval from offline data tables through SQL statements. |
|
Time Zone Setting |
Select time zone from the dropdown. |
Multi-path comparison parameter configuration
For scenarios requiring strong data guarantees, real-time dual-link or triple-link quality rules monitor data across multiple paths. If abnormalities occur, O&M personnel can promptly switch or back up data. These rules detect issues such as data retention and statistical drift.
|
Parameter |
Description |
|
Validation Field |
Select the field that needs to be validated. |
|
Metric Operator |
Select the algorithm for the data. |
|
Object Form |
Select Single-value Data and Multi-value Data. |
|
Time Limit Condition |
Select the time-limited field. |
|
Enable Condition Restriction |
Select Enable or Shutdown for condition restriction. |
|
Number Of Comparison Links |
Select the number of comparison links for the quality rule. The system supports selecting Real-time Three-link Comparison and Real-time Two-link Comparison. |
|
Comparison Trace 1/Comparison Trace 2 |
Select a real-time meta table as the comparison trace data:
|
|
Time Zone Settings |
Select a time zone from the dropdown list. |
