Systems and methods for utilizing a convolutional neural network architecture for visual product recommendations

US10275820B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10275820-B2
Application numberUS-201715420885-A
CountryUS
Kind codeB2
Filing dateJan 31, 2017
Priority dateJan 31, 2017
Publication dateApr 30, 2019
Grant dateApr 30, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods including one or more processing modules and one or more non-transitory storage modules storing computing instructions configured to run on the one or more processing modules and perform acts of accessing an online catalog for an online retailer comprising a plurality of digital images of a plurality of items for sale by the online retailer, training a two-branch a Siamese convolutional neural network (CNN) model to determine a similarity between two digital images of the plurality of digital images, receiving one or more digital images of a new item for the online catalog, determining, using the two-branch Siamese CNN model and the one or more digital images of the new item, a similar item of the plurality of items to which the new item is most similar, and coordinating a display of the new item on a webpage based on a ranking of the similar item.

First claim

Opening claim text (preview).

What is claimed is: 1. A system comprising: one or more processors; and one or more non-transitory storage devices storing computing instructions configured to run on the one or more processors and perform acts of: accessing an online catalog for an online retailer, the online catalog comprising a plurality of digital images of a plurality of items for sale by the online retailer; training a two-branch Siamese convolutional neural network (CNN) model, using the plurality of digital images and user session data from a plurality of users of the online catalog, to determine a similarity between two digital images of the plurality of digital images; receiving one or more digital images of a new item for the online catalog; determining, using the two-branch Siamese CNN model and the one or more digital images of the new item, at least one similar item of the plurality of items to which the new item is most similar; and coordinating a display of the new item on a webpage based on a ranking of the at least one similar item, wherein: each branch of the two-branch Siamese CNN model comprises a plurality of layers that transform each digital image of the plurality of digital images into a 100-dimensional vector; and training the two-branch Siamese CNN model comprises: inputting a plurality of pairs of digital images of the plurality of digital images into the two-branch Siamese CNN model, each pair of the plurality of pairs of digital images being labeled as similar or dissimilar before being input into the two-branch Siamese CNN model; determining a distance between the 100-dimensional vector of each digital image of each pair of the plurality of pairs of digital images; and determining a contrastive loss for each pair of the plurality of pairs of digital images using a first set of rules, the contrastive loss comprising a number indicating the similarity of a pair of digital images. 2. The system of claim 1 , wherein: an input for the two-branch Siamese CNN model comprises the pair of digital images from the plurality of digital images; and an output of the two-branch Siamese CNN model comprises the number indicating the similarity of the pair of digital images. 3. The system of claim 2 , wherein the plurality of layers of each branch of the two-branch Siamese CNN model comprise, in order of application to the input: a first convolutional layer, a first maximum pool layer, a first normalization layer, a second convolutional layer, a second maximum pool layer, a second normalization layer, a third convolutional layer, a fourth convolutional layer, a fifth convolutional layer, a first fully connected layer, a second fully connected layer, a third fully connected layer, a dropout layer, a rectified linear unit layer, and a fourth fully connected layer. 4. The system of claim 1 , wherein the first set of rules for determining the contrastive loss of each pair of the plurality of pairs of digital images comprise: L (θ)=Σ (x p, x q ) L 2 ( x p ,x q ) 2 +Σ (x m, x n ) (margin −L 2 ( x m ,x n )) 2 where L(θ) is the contrastive loss, Σ (x p , x q) L 2 (x p , x q ) 2 is a first penalty for one or more first pairs of digital images x p and x q of the plurality of pairs of digital images labeled as similar but determined by the two-branch Siamese CNN model to be dissimilar, Σ (x m, x n) (margin−L 2 (x m , x n )) 2 is a second penalty for one or more second pairs of digital images x m and x n of the plurality of digital images labeled as dissimilar but determined by the two-branch Siamese CNN model to be similar, margin is 1, and L 2 is a Euclidian distance. 5. The system of claim 1 , wherein the one or more non-transitory storage devices storing the computing instructions are further configured to run on the one or more processors and perform acts of: combining the similarity between the two digital images of the plurality of digital images with one or more content-based features and one or more feedback-based features to determine at least one additional similar item of the plurality of items similar to an additional item of the plurality of items; and coordinating a display of the additional item on the webpage based on a ranking of the at least one additional similar item. 6. The system of claim 5 , wherein combining the similarity between two digital images of the plurality of digital images with the one or more content-based features and the one or more feedback-based features comprises: indexing each 100-dimensional vector of the plurality of digital images onto an index; determining that the 100-dimensional vector of one or more digital images of the plurality of digital images is within a predetermined distance on the index of the 100-dimensional vector of a digital image of the plurality of digital images associated with a particular item of the plurality of items; constraining the particular item with each item of the plurality of items associated with the one or more digital images of the plurality of digital images within the predetermined distance on the index to form one or more constrained pairs; accessing additional user session data for a predetermined period of time relating to each of the one or more constrained pairs, the additional user session data comprising the one or more content-based features and the one or more feedback-based features; creating training data using the additional user session data for the predetermined period of time relating to each of the one or more constrained pairs; and inserting the training data into an item ranking model configured to rank one or more items of the plurality of items related to the particular item. 7. The system of claim 1 , wherein: an input for the two-branch Siamese CNN model comprises the pair of digital images from the plurality of digital images; an output of the two-branch Siamese CNN model comprises the number indicating the similarity of the pair of digital images; the plurality of layers of each branch of the two-branch Siamese CNN model comprise, in order of application to the input: a first convolutional layer, a first maximum pool layer, a first normalization layer, a second convolutional layer, a second maximum pool layer, a second normalization layer, a third convolutional layer, a fourth convolutional layer, a fifth convolutional layer, a first fully connected layer, a second fully connected layer, a third fully connected layer, a dropout layer, a rectified linear unit layer, and a fourth fully connected layer; the first set of rules for determining the contrastive loss of each pair of the plurality of pairs of digital images comprise: L (θ)=Σ (x p, x q ) L 2 ( x p ,x q ) 2 +Σ (x m, x n ) (margin −L 2 ( x m ,x n )) 2 where L(θ) is the contrastive loss, Σ (x p, x q ) L 2 (x p , x q ) 2 is a first penalty for one or more first pairs of digital images x p and x q of the plurality of pairs of digital images labeled as similar but determined by the two-branch Siamese CNN model to be dissimilar, Σ (x m, x n ) (margin−L 2 (x m , x n )) 2 is a second penalty for one or more second pairs of digital images x m and x n of the plurality of digital images labeled as dissimilar but determined by the two-branch Siamese CNN model to be similar, margin is 1, and L 2 is a Euclidian distance; the one or more non-transitory storage devices storing the computing instructions are further configured to run on the one or more processors and perform acts of: combining the similarity between two digital images of the plurality of digital images with one or more content-based features and one or more feedback-bas

Assignees

Inventors

Classifications

  • Matching criteria, e.g. proximity measures · CPC title

  • Combinations of networks · CPC title

  • using ranking · CPC title

  • Search customisation based on user profiles and personalisation · CPC title

  • Recommending goods or services · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10275820B2 cover?
Systems and methods including one or more processing modules and one or more non-transitory storage modules storing computing instructions configured to run on the one or more processing modules and perform acts of accessing an online catalog for an online retailer comprising a plurality of digital images of a plurality of items for sale by the online retailer, training a two-branch a Siamese c…
Who is the assignee on this patent?
Wal Mart Stores Inc, Walmart Apollo Llc
What technology area does this patent fall under?
Primary CPC classification G06Q30/0631. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 30 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).