Method, medium, and system for intelligent online personal assistant with image text localization

US2025117839A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2025117839-A1
Application numberUS-202418986611-A
CountryUS
Kind codeA1
Filing dateDec 18, 2024
Priority dateNov 11, 2016
Publication dateApr 10, 2025
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems, methods, and computer program products for identifying a candidate product in an electronic marketplace based on a visual comparison between candidate product image visual text content and input query image visual text content. Unlike conventional optical character recognition (OCR) based systems, embodiments automatically localize and isolate portions of a candidate product image and an input query image that each contain visual text content, and calculate a visual similarity measure between the respective portions. A trained neural network may be re-trained to more effectively find visual text content by using the localized and isolated visual text content portions as additional ground truths. The visual similarity measure serves as a visual search result score for the candidate product. Any number of images of any number of candidate products may be compared to an input query image to enable text-in-image based product searching without resorting to conventional OCR techniques.

First claim

Opening claim text (preview).

1 - 20 . (canceled) 21 . A method comprising: receiving an input query image; analyzing the input query image using a machine learning model to identify input query image visual text content; determining a visual similarity measure between a candidate product image visual text content and the input query image visual text content based on image signatures associated with the candidate product image and the input query image; recommending a candidate product based on the visual similarity measure; and causing presentation of the recommended candidate product on a graphical user interface of a client device. 2 . The method of claim 1 , wherein the recommending a candidate product further comprises ranking the candidate product in a product list based on the visual similarity measure. 3 . The method of claim 2 , wherein the candidate product has a highest ranking in the candidate product list. 4 . The method of claim 3 , wherein the causing presentation of the recommended candidate product comprises presenting the candidate product at a top position of the candidate product list. 5 . The method of claim 1 , wherein the candidate product image is associated with an electronic marketplace. 6 . The method of claim 1 , wherein the machine learning model comprises a neural network. 7 . The method of claim 6 , further comprising retraining the neural network. 8 . A non-transitory computer-readable storage medium having embedded therein a set of instructions which, when executed by one or more processors of a computer, causes the computer to execute operations comprising: receiving an input query image; analyzing the input query image using a machine learning model to identify input query image visual text content; determining a visual similarity measure between a candidate product image visual text content and the input query image visual text content based on image signatures associated with the candidate product image and the input query image; recommending a candidate product based on the visual similarity measure; and causing presentation of the recommended candidate product on a graphical user interface of a client device. 9 . The non-transitory computer-readable storage medium of claim 8 , wherein the recommending a candidate product further comprises ranking the candidate product in a product list based on the visual similarity measure. 10 . The non-transitory computer-readable storage medium of claim 9 , wherein the candidate product has a highest ranking in the candidate product list. 11 . The non-transitory computer-readable storage medium of claim 10 , wherein the causing presentation of the recommended candidate product comprises presenting the candidate product at a top position of the candidate product list. 12 . The non-transitory computer-readable storage medium of claim 8 , wherein the candidate product image is associated with an electronic marketplace. 13 . The non-transitory computer-readable storage medium of claim 8 , wherein the machine learning model comprises a neural network. 14 . The non-transitory computer-readable storage medium of claim 13 , wherein the operations further comprise retraining the neural network. 15 . A system comprising: at least one processor; and memory encoding computer-executable instructions that, when executed by the at least one processor, cause the system to perform operations comprising: receiving an input query image; analyzing the input query image using a machine learning model to identify input query image visual text content; determining a visual similarity measure between a candidate product image visual text content and the input query image visual text content based on image signatures associated with the candidate product image and the input query image; recommending a candidate product based on the visual similarity measure; and causing presentation of the recommended candidate product on a graphical user interface of a client device. 16 . The system of claim 15 , wherein the recommending a candidate product further comprises ranking the candidate product in a product list based on the visual similarity measure. 17 . The system of claim 16 , wherein the candidate product has a highest ranking in the candidate product list. 18 . The system of claim 17 , wherein the causing presentation of the recommended candidate product comprises presenting the candidate product at a top position of the candidate product list. 19 . The system of claim 15 , wherein the candidate product image is associated with an electronic marketplace. 20 . The system of claim 15 , wherein the machine learning model comprises a neural network.

Assignees

Inventors

Classifications

  • Supervised learning · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title

  • Industrial image inspection · CPC title

  • Industrial image inspection · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2025117839A1 cover?
Systems, methods, and computer program products for identifying a candidate product in an electronic marketplace based on a visual comparison between candidate product image visual text content and input query image visual text content. Unlike conventional optical character recognition (OCR) based systems, embodiments automatically localize and isolate portions of a candidate product image and …
Who is the assignee on this patent?
Ebay Inc
What technology area does this patent fall under?
Primary CPC classification G06Q30/0625. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Apr 10 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).