Overview:
Duplicate Content Detection is not picking up duplicates.
Root Cause:
Duplicate Content Detection compares CHECKSUM values of documents against existing documents in Vault.
Solution:
The Duplicate Content Detection feature compares checksum values against existing documents in Vault. The purpose of this feature is to be sure that the same document is not loaded multiple times into a Vault.
It takes very little to change the checksum value on a document. It does not scan the document itself for similarities. A change in the color of a section of the text, for example, is something that can make two documents different. Saving the same document a second time prior to uploading it to Vault can affect the checksum value.
Related Documentation:
Vault Document Upload Documentation: Using Duplicate Content Detection
Send us your feedback: We're always looking for advice to help improve our Knowledge Base! Please let us know if this article was helpful or provide feedback on how we can improve your experience here.
Thank you