Systems and methods for correcting incorrect product information in an electronic data catalog

US11062365B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11062365-B2
Application numberUS-201715467854-A
CountryUS
Kind codeB2
Filing dateMar 23, 2017
Priority dateMar 23, 2017
Publication dateJul 13, 2021
Grant dateJul 13, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods including one or more processing modules and one or more non-transitory storage modules storing computing instructions configured to run on the one or more processing modules and perform acts of determining an accuracy score for existing product information using a first set of rules that compares the existing product information for the product with product information of other internal or external sources, determining if the accuracy score exceeds a predetermined accuracy threshold, automatically replacing incorrect product information in the existing product information with correct product information from the other sources if the accuracy score does not exceed the predetermined accuracy threshold, coordinating displaying of the existing product information with the correct product information replacing the incorrect product information on a webpage for the product on a website of the online retailer.

First claim

Opening claim text (preview).

What is claimed is: 1. A system comprising: one or more processors; and one or more non-transitory storage devices storing computing instructions configured to run on the one or more processors and perform: accessing existing product information for a product in a data catalog of an online retailer; determining a respective confidence score for each respective source of one or more internal sources internal to the online retailer and one or more external sources external to the online retailer; determining an accuracy score for the existing product information using a first set of rules that: comprises: a binary function that outputs a value based on a variance of an attribute of the product in the existing product information and the one or more internal sources or the one or more external sources; a weight given to either the one or more internal sources or the one or more external sources; and a number of the one or more internal sources or the one or more external sources; and compares the existing product information for the product with (1) internal product information of the one or more internal sources internal to the online retailer when the respective confidence score for a respective one of the one or more internal sources is above a predetermined threshold or (2) external product information of the one or more external sources external to the online retailer when the respective confidence score for a respective one of the one or more external sources is above the predetermined threshold; when the accuracy score exceeds a predetermined accuracy threshold, determining that product information in the existing product information is correct product information; when the accuracy score does not exceed the predetermined accuracy threshold, automatically replacing incorrect product information in the existing product information with the correct product information from the external product information; and coordinating displaying the existing product information with the correct product information replacing the incorrect product information on a webpage for the product on a website of the online retailer. 2. The system of claim 1 , wherein determining the accuracy score for the existing product information using the first set of rules comprises: determining when the existing product information matches the internal product information in a database of the online retailer using machine-learned attribute extraction algorithms; and automatically marking the existing product information when the existing product information does not match the internal product information. 3. The system of claim 2 , wherein the internal product information in the database of the online retailer comprises the internal product information (1) stored in the database of the online retailer and (2) collected by the online retailer from one or more sellers of the product or suppliers of the product. 4. The system of claim 1 , wherein determining the accuracy score for the existing product information using the first set of rules comprises: crawling a website of one or more first external sources of the one or more external sources to collect first external product information of the external product information; and requesting second external product information of the external product information from one or more second external sources of the one or more external sources. 5. The system of claim 1 , wherein the one or more non-transitory storage devices storing the computing instructions are further configured to run on the one or more processors and perform: matching product identifiers of the product in: the existing product information; the internal product information; and the external product information. 6. The system of claim 1 , wherein the first set of rules comprises: Accuracy ⁢ ⁢ Score ⁢ ⁢ ( X ) = ∑ i n ⁢ wi * σ ⁢ ⁢ x ∑ i n ⁢ wi , where: x is the attribute of the product in the existing product information; wi is the weight given to either the one or more internal sources or the one or more external sources; σx is the binary function that outputs the value based on the variance of the attribute in the existing product information and the one or more internal sources or the one or more external sources; and n is the number of the one or more internal sources or the one or more external sources. 7. The system of claim 6 , wherein: σx is also equal to a length of a longest matching substring of the one or more internal sources or the one or more external sources divided by a length of a substring in the existing product information. 8. The system of claim 1 , wherein: determining the accuracy score for the existing product information using the first set of rules comprises: determining when the existing product information matches the internal product information in a database of the online retailer using machine-learned attribute extraction algorithms; automatically marking the existing product information when the existing product information does not match the internal product information; crawling a website of one or more first external sources of the one or more external sources to collect first external product information of the external product information; and requesting second external product information of the external product information from one or more second external sources of the one or more external sources; the internal product information in the database of the online retailer comprises the internal product information (1) stored in the database of the online retailer and (2) collected by the online retailer from one or more sellers of the product or suppliers of the product; the one or more non-transitory storage devices storing the computing instructions are further configured to run on the one or more processors and perform: matching product identifiers of the product in the existing product information, the internal product information, and the external product information; the first set of rules comprises: Accuracy ⁢ ⁢ Score ⁢ ⁢ ( X )

Assignees

Inventors

Classifications

  • Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors · CPC title

  • utilising user interfaces specially adapted for shopping · CPC title

  • Machine learning · CPC title

  • using ranking · CPC title

  • Presentation of query results · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11062365B2 cover?
Systems and methods including one or more processing modules and one or more non-transitory storage modules storing computing instructions configured to run on the one or more processing modules and perform acts of determining an accuracy score for existing product information using a first set of rules that compares the existing product information for the product with product information of o…
Who is the assignee on this patent?
Wal Mart Stores Inc, Walmart Apollo Llc
What technology area does this patent fall under?
Primary CPC classification G06Q30/0603. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 13 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).