Data Integrity
Data integrity ensures data is accurate, complete, and consistent over time, remaining trustworthy and protected from unauthorized modification or deletion.
Data integrity is the discipline of preserving data accuracy, completeness, and consistency across all stages of its lifecycle. It encompasses techniques and controls that prevent, detect, and correct unauthorized modifications, corruption, or loss of data. Core mechanisms include: data validation and schema/constraint enforcement to prevent invalid states; preservation of referential, entity, and domain integrity to maintain correct relationships and valid values; checksums, cryptographic hashes, and digital signatures to detect tampering; immutable storage, versioning, and audit trails to enable traceability and recovery; data cleansing, reconciliation, and anomaly detection to address inconsistencies; and robust access control, authentication, authorization, and change-management processes. While encryption protects confidentiality, data integrity is primarily achieved through hashing, signatures, validation, and controlled change processes. Integrity is assessed via metrics such as validation success rates and error rates in data pipelines.
graph LR
Center["Data Integrity"]:::main
Rel_data_recovery["data-recovery"]:::related -.-> Center
click Rel_data_recovery "/terms/data-recovery"
Rel_data_obfuscation["data-obfuscation"]:::related -.-> Center
click Rel_data_obfuscation "/terms/data-obfuscation"
Rel_data_replication["data-replication"]:::related -.-> Center
click Rel_data_replication "/terms/data-replication"
classDef main fill:#7c3aed,stroke:#8b5cf6,stroke-width:2px,color:white,font-weight:bold,rx:5,ry:5;
classDef pre fill:#0f172a,stroke:#3b82f6,color:#94a3b8,rx:5,ry:5;
classDef child fill:#0f172a,stroke:#10b981,color:#94a3b8,rx:5,ry:5;
classDef related fill:#0f172a,stroke:#8b5cf6,stroke-dasharray: 5 5,color:#94a3b8,rx:5,ry:5;
linkStyle default stroke:#4b5563,stroke-width:2px;
🧒 Простыми словами
Generated ELI5 content
🤓 Expert Deep Dive
Generated expert content
❓ Частые вопросы
What is data integrity?
Data integrity is the accuracy, completeness, and consistency of data throughout its lifecycle, ensuring trustworthiness.
What are the main types of data integrity?
Entity integrity (unique row keys), domain integrity (valid values), referential integrity (valid relationships), and transactional integrity (atomic operations).
How do you ensure data integrity?
Through validation, constraints, checksums, hashes, digital signatures, versioning, auditing, and controlled change management.
Is encryption part of data integrity?
Encryption protects confidentiality. Integrity relies on hashing and signatures; authenticated encryption can provide integrity guarantees when used appropriately.
What are common threats to data integrity?
Unauthorized modifications, data corruption, software bugs, data integration mistakes, and malware; mitigations include access controls, validation, and monitoring.
How does data integrity relate to data quality and security?
Data integrity is a core dimension of data quality and is essential for reliable security; they address different risk aspects but are interrelated.