Conversion of input text strings
US-10133737-B2 · Nov 20, 2018 · US
US10466965B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10466965-B2 |
| Application number | US-201715439218-A |
| Country | US |
| Kind code | B2 |
| Filing date | Feb 22, 2017 |
| Priority date | Feb 22, 2017 |
| Publication date | Nov 5, 2019 |
| Grant date | Nov 5, 2019 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A computer system creates a plurality of indexes from a first plurality of records, wherein each index corresponds to an attribute of a plurality of attributes. The computer system detects a record of a second plurality of records, wherein the record includes a value corresponding to each of the plurality of attributes. The computer system determines a first set of values from a first index of the plurality of indexes that corresponds to a first attribute. The computer system determines a plurality of individual similarity scores for the first set of values by utilizing a similarity function. The computer system determines an overall similarity score for each record of at least a portion of the first plurality of records and based on the overall similarity scores, determines a record of the first plurality of records that corresponds to the record of the second plurality of records.
Opening claim text (preview).
What is claimed is: 1. A system, comprising: one or more processors and one or more computer-readable memories, the one or more processors configured to execute the program instructions via at least one of the one or more computer-readable memories to cause the system to perform the operations comprising: creating a plurality of indexes from a first plurality of records, wherein each index of the plurality of indexes corresponds to an attribute of a plurality of attributes; detecting a record of a second plurality of records, wherein the record includes a value corresponding to each of the plurality of attributes; determining a first set of values from a first index of the plurality of indexes, the first index corresponding to a first attribute of the plurality of attributes, by utilizing an indexing function to compare each value in the first index to a value in the record corresponding to the first attribute; determining a plurality of individual similarity scores for the first set of values by utilizing a similarity function to compare each value of the first set of values with a corresponding value of the record of the second plurality of records; determining an overall similarity score for each record of at least a portion of the first plurality of records based on at least one of the plurality of individual similarity scores for the first set of values; and determining a record of the first plurality of records that corresponds to the record of the second plurality of records based on the determined overall similarity scores. 2. The system of claim 1 , the operations further comprising: based on determining that an individual similarity score for a first value of the first set of values does not exceed a first threshold score, providing an indication that an overall similarity score for a first record of the first plurality of records that corresponds to the first value is not to be calculated; and wherein the first record of the first plurality of records is not included in the at least portion of the first plurality of records. 3. The system of claim 1 , the operations further comprising determining to utilize the indexing function for the first index based on cross-referencing the first attribute with a database containing one or more attributes and one or more corresponding indexing functions. 4. The system of claim 1 , the operations further comprising: determining a second set of values from a second index of the plurality of indexes, the second index corresponding to a second attribute of the plurality of attributes, by utilizing a second indexing function to compare each value in the second index to a value in the record corresponding to the second attribute; determining a plurality of individual similarity scores for the second set of values by utilizing a second similarity function to compare each value of the second set of values with a corresponding value of the record of the second plurality of records; and wherein the determining the overall similarity score for each record of the at least a portion of the first plurality of records is further based on at least one of the plurality of individual similarity scores for the second set of values. 5. The system of claim 1 , further comprising: based on determining that an overall similarity score associated with the record of the first plurality of records does not exceed an overall threshold score, providing an indication that a record from the first plurality of records that corresponds to the record of the second plurality of records was not found. 6. The system of claim 1 , further comprising: based on determining that an overall similarity score associated with the record of the first plurality of records exceeds an overall threshold score, creating an association between the determined record of the first plurality of records and the record of the second plurality of records. 7. The system of claim 1 , wherein the first plurality of records are in a first database format and the second plurality of records are in a second database format. 8. A method comprising: creating a first index and a second index from a first plurality of records, wherein the first index corresponds to a first attribute and the second index corresponds to a second attribute; detecting a record of a second plurality of records, wherein the record includes a first value corresponding to the first attribute and a second value corresponding to the second attribute; determining a first set of values from the first index by utilizing a first indexing function to compare each value in the first index to the first value in the record of the second plurality of records; determining a first plurality of individual similarity scores for the first set of values by utilizing a first similarity function to compare each value of the first set of values with the first value of the second plurality of records; determining a second set of values from the second index by utilizing a second indexing function to compare each value in the second index to the second value in the record of the second plurality of records; determining a second plurality of individual similarity scores for the second set of values by utilizing a second similarity function to compare each value of the second set of values with the second value of the second plurality of records; determining an overall similarity score for each record of the first plurality of records based on at least one of the first plurality of individual similarity scores and at least one of the second plurality of individual similarity scores; and determining a record of the first plurality of records that corresponds to the record of the second plurality of records based on the determined overall similarity scores. 9. The method of claim 8 , the operations further comprising: based on determining that an individual similarity score for a first value of the first set of values does not exceed a first threshold score, providing an indication that an overall similarity score for a first record of the first plurality of records that corresponds to the first value is not to be calculated; and wherein the first record of the first plurality of records is not included in the at least portion of the first plurality of records. 10. The method of claim 8 , the operations further comprising determining to utilize the first indexing function for the first index based on cross-referencing the first attribute with a database containing one or more attributes and one or more corresponding indexing functions. 11. The method of claim 8 , wherein the determining the overall similarity score for each record of the first plurality of records further comprises determining a first weight value that corresponds with the first plurality of individual similarity scores and a second weight value that corresponds with the second plurality of individual similarity scores by cross-referencing a database containing one or more weight values and one or more corresponding attributes. 12. The method of claim 8 , further comprising: based on determining that an overall similarity score associated with the record of the first plurality of records does not exceed an overall threshold score, providing an indication that a record from the first plurality of records that corresponds to the record of the second plurality of records was not found. 13. The method of claim 8 , further comprising: based on determining that an overall similarity score associated with the record of the first plurality of records exceeds an overall threshold score, creating an association between the determined record
Query execution · CPC title
using phonetics · CPC title
Indexing structures · CPC title
Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor · CPC title
Comparing digital values (G06F7/06, {G06F7/22,} G06F7/38 take precedence) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.