Who is the assignee on this patent?

Cognizant Tech Solutions U S Corporation

What technology area does this patent fall under?

Primary CPC classification G06N3/084. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Feb 02 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Content embedding using deep metric learning algorithms

US10909459B2 · US · B2

Patent metadata
Field	Value
Publication number	US-10909459-B2
Application number	US-201715619299-A
Country	US
Kind code	B2
Filing date	Jun 9, 2017
Priority date	Jun 9, 2016
Publication date	Feb 2, 2021
Grant date	Feb 2, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The technology disclosed introduces a concept of training a neural network to create an embedding space. The neural network is trained by providing a set of K+2 training documents, each training document being represented by a training vector x, the set including a target document represented by a vector xt, a favored document represented by a vector xs, and K>1 unfavored documents represented by vectors xiu, each of the vectors including input vector elements, passing the vector representing each document set through the neural network to derive an output vectors yt, ys and yiu, each output vector including output vector elements, the neural network including adjustable parameters which dictate an amount of influence imposed on each input vector element to derive each output vector element, adjusting the parameters of the neural network to reduce a loss, which is an average over all of the output vectors yiu of [D(yt,ys)−D(yt, yiu)].

First claim

Opening claim text (preview).

The invention claimed is: 1. A method of training a neural network to create an embedding space including a catalog of documents, the method comprising: providing a plurality of training sets of K+2 training documents to a computer system, K being an integer greater than 1, each training document being represented by a corresponding training vector x, each set of training documents including a target document represented by a vector x t , a favored document represented by a vector x s , and K unfavored documents represented respectively by vectors x i u , where i is an integer from 1 to K, and each of the vectors including a plurality of input vector elements; for each given one of the training sets, passing, by the computer system, the vector representing each document of the training set through a neural network to derive a corresponding output vector y t a corresponding output vector y s , and corresponding output vectors y i u , each of the output vectors including a plurality of output vector elements, the neural network including a set of adjustable parameters which dictate an amount of influence that is imposed on each input vector element of an input vector to derive each output vector element of the output vector; adjusting the parameters of the neural network so as to reduce a loss L, which is an average over all of the output vectors y i u of [D(y t ,y s )−D(y t ,y i u )], where D is a distance wherein the vectors, wherein the loss L is log(1+Σ i=1 K e D(y t ,y s )−D(y t ,y i u ) ); and for each given one of the training sets, passing the vector representing each document of the training set through the neural network having the adjusted parameters to derive the output vectors. 2. The method of claim 1 , wherein the parameters of the neural network include weights and the weights of the neural network are adjusted by back propagation as a function of the loss L. 3. The method of claim 1 , wherein for each given one of the training sets, K+2 identical neural networks are implemented, such that each document of a respective training set passes through a corresponding neural network of the K+2 identical neural networks. 4. The method of claim 3 , wherein the parameters of the neural networks include weights and the weights of each respective neural network, of the K+2 neural networks, are adjusted by back propagation as a function of the loss L and in dependence on the output vector output from the respective neural network. 5. The method of claim 1 , wherein for each given one of the training sets, K+2 neural networks are implemented, such that each document of a respective training set passes through a corresponding neural network of the K+2 identical neural networks. 6. The method of claim 5 , wherein the parameters of the neural networks include weights and the weights of each respective neural network, of the K+2 neural networks, are adjusted by back propagation as a function of the loss L and in dependence on the output vector output from the respective neural network. 7. The method of claim 1 , further comprising: repeatedly passing each given one of the training sets through the neural network to adjust the parameters until a value of the loss L is satisfactory and identify the neural network as a production model; obtaining raw data representing documents from a particular data domain for which the production model has been trained, the documents represented by the raw data being unlabeled with no information regarding a measure of dissimilarity between any the documents; and passing the raw data through the production model to create a production embedding of documents. 8. The method of claim 1 , wherein each training set of K+2 documents is obtained by: providing, to a user, the K+2 training documents including the target document; receiving, from the user, a selection of the favored document determined to most closely match the target document; and identifying the unfavored documents of the K+2 training documents as the unfavored documents. 9. The method of claim 1 , wherein each training set of K+2 documents is obtained by: providing, to a model replicating user behavior, the K+2 training documents including the target document; receiving, from the model, a selection of the favored document determined to most closely match the target document; and identifying the unfavored documents of the K+2 training documents as the unfavored documents. 10. A method of training a neural network to create an embedding space including a catalog of documents, the method comprising: obtaining a set of K+2 training documents, K being an integer greater than 1, the set of K+2 documents including a target document represented by a vector x t , a favored document represented by a vector x s and unfavored documents represented by vectors x i u , where i is an integer from 1 to K; passing each of the vector representations of the set of K+2 training documents through a neural network to derive corresponding output vectors, including vector y t derived from the vector x t , vector y s derived from the vector x s and vectors y i u respectively derived from vectors x i u ; and repeatedly adjusting parameters of the neural network through back propagation until a sum of differences calculated from (i) a distance between the vector y t and the vector y s and (ii) distances between the vector y t and each of the vectors y i u satisfies a predetermined criteria, wherein the sum of differences corresponds to a likelihood that the favored document will be selected over the unfavored documents and further wherein the calculated sum of differences is a loss L function calculated as log(1+Σ i=1 K e D(y t ,y s )−D(y t ,y i u ) ) and wherein the parameters of the neural network include weights and the weights of the neural network are adjusted by back propagation as a function of the loss L. 11. The method of claim 10 , wherein the obtaining of the set of training documents includes: providing, to a user, the K+2 training documents including the target document; receiving, from the user, a selection of the favored document determined to most closely match the target document; and identifying the unfavored documents of the K+2 training documents as the unfavored documents. 12. The method of claim 10 , wherein the obtaining of the set of training documents includes: providing, to a model replicating user behavior, the K+2 training documents including the target document; receiving, from the model, a selection of the favored document determined to most closely match the target document; and identifying the unfavored documents of the K+2 training documents as the unfavored documents. 13. A non-transitory computer readable storage medium impressed with computer program instructions to train a neural network to create an embedding space including a catalog of documents, the instructions, when executed on a processor, implement a method comprising: providing a plurality of training sets of K+2 training documents to a computer system, K being an integer greater than 1, each training document being represented by a corresponding training vector x, each set of training documents including a target document represented by a vector x t , a favored document represented by a vector x s , and K>1 unfavored documents represented respectively by vectors x i u , where i is an integer from 1 to K, and each of the vectors including a plurality of input vector elements; for each given one of the training sets, passing, by the computer system, the vector represent

Assignees

Cognizant Tech Solutions U S Corporation

Inventors

Classifications

G06N3/045
Combinations of networks · CPC title
G06N3/0455
Auto-encoder networks; Encoder-decoder networks · CPC title
G06N3/09
Supervised learning · CPC title
G06N3/0464
Convolutional networks [CNN, ConvNet] · CPC title
G06N3/084Primary
Backpropagation, e.g. using gradient descent · CPC title

Patent family

Related publications grouped by family.

View patent family 60572740

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10909459B2 cover?: The technology disclosed introduces a concept of training a neural network to create an embedding space. The neural network is trained by providing a set of K+2 training documents, each training document being represented by a training vector x, the set including a target document represented by a vector xt, a favored document represented by a vector xs, and K>1 unfavored documents represented …
Who is the assignee on this patent?: Cognizant Tech Solutions U S Corporation
What technology area does this patent fall under?: Primary CPC classification G06N3/084. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Feb 02 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).