Detect duplicates with exact and fuzzy matching on encrypted match indexes

US11748320B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11748320-B2
Application numberUS-202117184697-A
CountryUS
Kind codeB2
Filing dateFeb 25, 2021
Priority dateMay 31, 2018
Publication dateSep 5, 2023
Grant dateSep 5, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed herein are system, method, and computer program product embodiments for detecting duplicates with exact and fuzzy matching on encrypted match indexes using an encryption key in a cloud computing platform. An embodiment operates by determining a match rule index value upon reception of a new record. The embodiment encrypts the match index rule value using the customer's encryption key and a deterministic encryption method and stores the encrypted match rule index value. Duplicate detection may be later performed by using the same deterministic encryption method to determine a cypher text for a candidate entry and comparing the ciphertext to the stored encrypted match indexes.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: receiving, by one more processors, a match rule from a user of a customer relationship management tool, wherein the match rule comprises a unique identifier, a table in the customer relationship management tool, and one or more fields of applicability; and creating, by the one or more processors, a match index column in the table comprising encrypted match indexes using the unique identifier as an initialization vector in an encryption scheme, wherein the match index column is used when performing duplicate detection to determine if a new record duplicates an encrypted match index in the encrypted match indexes. 2. The method of claim 1 , further comprising: in response to receiving the new record for the table, calculating, by the one or more processors, a match index for the new record based on the one or more fields of applicability; and when encryption is enabled on any of the one or more fields of applicability, encrypting the match index for the new record using the encryption scheme to derive a new encrypted match index. 3. The method of claim 2 , further comprising: comparing, by the one or more processors, the new encrypted match index to the encrypted match indexes in the match index column; and when no duplicate is found, adding the new record to the table. 4. The method of claim 2 , further comprising: comparing, by the one or more processors, the new encrypted match index to the encrypted match indexes in the match index column; and when a duplicate is found, displaying an error message. 5. The method of claim 1 , wherein the match rule is specified by the user using Boolean logic. 6. The method of claim 1 , where the encryption scheme is a deterministic encryption scheme. 7. The method of claim 1 , wherein the encryption scheme is a probabilistic encryption scheme. 8. A system, comprising: a memory; and at least one processor coupled to the memory and configured to: receive a match rule from a user of a customer relationship management tool, wherein the match rule comprises a unique identifier, a table in the customer relationship management tool, and one or more fields of applicability; and create a match index column in the table comprising encrypted match indexes using the unique identifier as an initialization vector in an encryption scheme, wherein the match index column is used when performing duplicate detection to determine if a new record duplicates an encrypted match index in the encrypted match indexes. 9. The system of claim 8 , the at least one processor further configured to: in response to receiving the new record for the table, calculate a match index for the new record based on the one or more fields of applicability; and when encryption is enabled on any of the one or more fields of applicability, encrypt the match index for the new record using the encryption scheme to derive a new encrypted match index. 10. The method of claim 9 , the at least one processor further configured to: compare the new encrypted match index to the encrypted match indexes in the match index column; and when no duplicate is found, add the new record to the table. 11. The method of claim 9 , the at least one processor further configured to: compare the new encrypted match index to the encrypted match indexes in the match index column; and when a duplicate is found, display an error message. 12. The method of claim 8 , wherein the match rule is specified by the user using Boolean logic. 13. The method of claim 8 , where the encryption scheme is a deterministic encryption scheme. 14. The method of claim 8 , wherein the encryption scheme is a probabilistic encryption scheme. 15. A non-transitory computer-readable device having instructions stored thereon that, when executed by at least one computing device, cause the at least one computing device to perform operations comprising: receiving a match rule from a user of a customer relationship management tool, wherein the match rule comprises a unique identifier, a table in the customer relationship management tool, and one or more fields of applicability; and creating a match index column in the table comprising encrypted match indexes using the unique identifier as an initialization vector in an encryption scheme, wherein the match index column is used when performing duplicate detection to determine if a new record duplicates an encrypted match index in the encrypted match indexes. 16. The non-transitory computer-readable device of claim 15 , the operations further comprising: in response to receiving the new record for the table, calculating a match index for the new record based on the one or more fields of applicability; and when encryption is enabled on any of the one or more fields of applicability, encrypting the match index for the new record using the encryption scheme to derive a new encrypted match index. 17. The non-transitory computer-readable device of claim 16 , the operations further comprising: comparing the new encrypted match index to the encrypted match indexes in the match index column; and when no duplicate is found, adding the new record to the table. 18. The non-transitory computer-readable device of claim 16 , the operations further comprising: comparing the new encrypted match index to the encrypted match indexes in the match index column; and when a duplicate is found, displaying an error message. 19. The non-transitory computer-readable device of claim 15 , wherein the match rule is specified by the user using Boolean logic. 20. The non-transitory computer-readable device of claim 15 , where the encryption scheme is a deterministic encryption scheme or a probabilistic encryption scheme.

Assignees

Inventors

Classifications

  • G06F16/215Primary

    Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors · CPC title

  • Vectors, bitmaps or matrices · CPC title

  • Ensuring data consistency and integrity · CPC title

  • Fuzzy queries · CPC title

  • Providing cryptographic facilities or services · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11748320B2 cover?
Disclosed herein are system, method, and computer program product embodiments for detecting duplicates with exact and fuzzy matching on encrypted match indexes using an encryption key in a cloud computing platform. An embodiment operates by determining a match rule index value upon reception of a new record. The embodiment encrypts the match index rule value using the customer's encryption key …
Who is the assignee on this patent?
Salesforce Com Inc, Salesforce Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/215. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 05 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).