Loading greeting...

My Books on Amazon

Visit My Amazon Author Central Page

Check out all my books on Amazon by visiting my Amazon Author Central Page!

Discover Amazon Bounties

Earn rewards with Amazon Bounties! Check out the latest offers and promotions: Discover Amazon Bounties

Shop Seamlessly on Amazon

Browse and shop for your favorite products on Amazon with ease: Shop on Amazon

data-ad-slot="1234567890" data-ad-format="auto" data-full-width-responsive="true">

Thursday, January 8, 2026

150 AI Prompts for Data Quality Diagnosis

 

A. Data Completeness

  1. Identify missing values in a dataset.

  2. Check for incomplete records across all columns.

  3. Highlight null entries that may affect analysis.

  4. Detect partially filled fields in a dataset.

  5. Assess the proportion of missing data per column.

  6. Suggest strategies to impute missing values.

  7. Flag records missing key identifiers.

  8. Identify patterns in missing data.

  9. Evaluate how missing data could bias results.

  10. Detect rows with inconsistent missing values.

B. Data Consistency

  1. Identify inconsistent formats in date columns.

  2. Detect duplicate records.

  3. Find discrepancies in categorical values.

  4. Check for inconsistent capitalization in text fields.

  5. Validate consistent use of measurement units.

  6. Identify inconsistent encoding across datasets.

  7. Detect conflicting entries for the same entity.

  8. Highlight inconsistent naming conventions.

  9. Check for repeated records with different timestamps.

  10. Detect anomalies in sequential identifiers.

C. Data Accuracy

  1. Verify numerical data against reference sources.

  2. Check for typographical errors in text fields.

  3. Validate addresses or geolocations.

  4. Assess correctness of email formats.

  5. Verify phone number formats by region.

  6. Detect inaccurate product codes or IDs.

  7. Validate currency or monetary values.

  8. Identify improbable dates or ages.

  9. Check financial data for calculation errors.

  10. Compare historical data against expected trends.

D. Data Validity

  1. Validate entries against allowed values.

  2. Check date ranges for logical consistency.

  3. Detect invalid numerical ranges.

  4. Identify text fields containing forbidden characters.

  5. Validate categorical variables against a predefined list.

  6. Detect invalid Boolean entries.

  7. Identify entries violating data type rules.

  8. Check geographical data for invalid locations.

  9. Validate JSON or XML structures in data fields.

  10. Assess compliance with schema rules.

E. Data Uniqueness

  1. Identify duplicate IDs or keys.

  2. Detect repeated names or identifiers.

  3. Validate uniqueness in transactional records.

  4. Check uniqueness across multi-column combinations.

  5. Flag repeated customer email addresses.

  6. Identify duplicate product SKUs.

  7. Validate unique order numbers.

  8. Detect recurring reference codes.

  9. Check for repeated event timestamps.

  10. Identify duplicate entries in master datasets.

F. Data Timeliness

  1. Check for outdated records.

  2. Assess latency in data updates.

  3. Detect missing timestamps.

  4. Validate chronological order of events.

  5. Identify records with stale data.

  6. Assess how recent the data entries are.

  7. Detect inconsistencies in temporal granularity.

  8. Highlight late submissions in time-sensitive datasets.

  9. Evaluate impact of delayed updates on analysis.

  10. Detect gaps in continuous time series data.

G. Data Integrity

  1. Detect referential integrity violations.

  2. Check foreign key relationships between tables.

  3. Identify orphan records in relational datasets.

  4. Validate hierarchical consistency in organizational data.

  5. Detect broken links between related tables.

  6. Assess data integrity in merged datasets.

  7. Check primary key violations.

  8. Identify inconsistent parent-child relationships.

  9. Validate linked datasets against master records.

  10. Detect anomalies caused by data merges.

H. Data Reliability

  1. Evaluate variance in repeated measurements.

  2. Check for inconsistent sampling methods.

  3. Assess reproducibility of results.

  4. Identify unstable data sources.

  5. Detect data with fluctuating accuracy.

  6. Validate reliability of external datasets.

  7. Check consistency across multiple sources.

  8. Detect unreliable time series entries.

  9. Assess sensor data quality for errors.

  10. Identify human-entered data prone to mistakes.

I. Data Relevance

  1. Detect obsolete columns or features.

  2. Assess relevance of data for analysis tasks.

  3. Identify redundant information.

  4. Check for irrelevant records.

  5. Flag data outside target scope.

  6. Detect variables that do not impact outcomes.

  7. Evaluate importance of new features in datasets.

  8. Identify low-information columns.

  9. Detect entries that reduce dataset quality.

  10. Assess alignment with business objectives.

J. Data Accuracy in Labels (for ML datasets)

  1. Detect mislabeled data in training sets.

  2. Identify inconsistent class assignments.

  3. Assess label accuracy using a reference set.

  4. Detect overlapping or conflicting labels.

  5. Validate annotation quality.

  6. Flag incomplete or ambiguous labels.

  7. Evaluate inter-annotator agreement.

  8. Detect misaligned labels in multi-class datasets.

  9. Assess labeling consistency across dataset versions.

  10. Identify noise in target variable annotations.

K. Outliers and Anomalies

  1. Detect numerical outliers.

  2. Identify categorical anomalies.

  3. Highlight statistical deviations.

  4. Detect unexpected spikes in time series data.

  5. Find improbable combinations of features.

  6. Assess extreme values affecting analysis.

  7. Identify data entry mistakes causing outliers.

  8. Detect anomalies in geospatial data.

  9. Highlight outliers in multi-dimensional datasets.

  10. Detect sudden changes in trends.

L. Data Distribution

  1. Evaluate uniformity of data distribution.

  2. Identify skewed numerical features.

  3. Detect class imbalance in categorical data.

  4. Check for normality in statistical variables.

  5. Assess distribution shifts across time periods.

  6. Detect sampling bias in datasets.

  7. Compare distributions across multiple sources.

  8. Evaluate representativeness of datasets.

  9. Detect overrepresented or underrepresented groups.

  10. Check feature correlations for anomalies.

M. Data Formatting and Standardization

  1. Detect inconsistent capitalization in text data.

  2. Check for mixed measurement units.

  3. Identify variations in date formats.

  4. Standardize currency symbols.

  5. Detect inconsistent decimal separators.

  6. Validate standard abbreviations.

  7. Detect mixed language entries.

  8. Check formatting of identification numbers.

  9. Identify inconsistent naming conventions.

  10. Ensure standardized column headers.

N. Data Traceability

  1. Check source information for each record.

  2. Track changes in dataset versions.

  3. Validate audit trails for critical data.

  4. Assess ability to reproduce dataset generation.

  5. Detect missing source references.

  6. Ensure traceability of computed metrics.

  7. Validate provenance of external datasets.

  8. Check for missing documentation.

  9. Assess lineage of merged data.

  10. Ensure reproducibility of preprocessing steps.

O. Data Governance Compliance

  1. Validate adherence to privacy regulations (GDPR, CCPA).

  2. Check for sensitive personal information.

  3. Detect unauthorized data sharing.

  4. Assess compliance with industry standards.

  5. Identify data retention policy violations.

  6. Validate secure storage of confidential information.

  7. Check data anonymization consistency.

  8. Detect PII leakage in publicly accessible datasets.

  9. Assess compliance with organizational policies.

  10. Validate ethical use of datasets.


← Newer Post Older Post → Home

0 comments:

Post a Comment

We value your voice! Drop a comment to share your thoughts, ask a question, or start a meaningful discussion. Be kind, be respectful, and let’s chat!

How Small Businesses Can Start Importing and Exporting Successfully

Global trade is often misunderstood as something reserved for large corporations with warehouses, shipping departments, and international le...

global business strategies, making money online, international finance tips, passive income 2025, entrepreneurship growth, digital economy insights, financial planning, investment strategies, economic trends, personal finance tips, global startup ideas, online marketplaces, financial literacy, high-income skills, business development worldwide

This is the hidden AI-powered content that shows only after user clicks.

Continue Reading

Looking for something?

We noticed you're searching for "".
Want to check it out on Amazon?

Looking for something?

We noticed you're searching for "".
Want to check it out on Amazon?

Chat on WhatsApp