Method for object segmentation in videos tagged with semantic labels
US-2016379371-A1 · Dec 29, 2016 · US
US10909459B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10909459-B2 |
| Application number | US-201715619299-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jun 9, 2017 |
| Priority date | Jun 9, 2016 |
| Publication date | Feb 2, 2021 |
| Grant date | Feb 2, 2021 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
The technology disclosed introduces a concept of training a neural network to create an embedding space. The neural network is trained by providing a set of K+2 training documents, each training document being represented by a training vector x, the set including a target document represented by a vector xt, a favored document represented by a vector xs, and K>1 unfavored documents represented by vectors xiu, each of the vectors including input vector elements, passing the vector representing each document set through the neural network to derive an output vectors yt, ys and yiu, each output vector including output vector elements, the neural network including adjustable parameters which dictate an amount of influence imposed on each input vector element to derive each output vector element, adjusting the parameters of the neural network to reduce a loss, which is an average over all of the output vectors yiu of [D(yt,ys)−D(yt, yiu)].
Opening claim text (preview).
The invention claimed is: 1. A method of training a neural network to create an embedding space including a catalog of documents, the method comprising: providing a plurality of training sets of K+2 training documents to a computer system, K being an integer greater than 1, each training document being represented by a corresponding training vector x, each set of training documents including a target document represented by a vector x t , a favored document represented by a vector x s , and K unfavored documents represented respectively by vectors x i u , where i is an integer from 1 to K, and each of the vectors including a plurality of input vector elements; for each given one of the training sets, passing, by the computer system, the vector representing each document of the training set through a neural network to derive a corresponding output vector y t a corresponding output vector y s , and corresponding output vectors y i u , each of the output vectors including a plurality of output vector elements, the neural network including a set of adjustable parameters which dictate an amount of influence that is imposed on each input vector element of an input vector to derive each output vector element of the output vector; adjusting the parameters of the neural network so as to reduce a loss L, which is an average over all of the output vectors y i u of [D(y t ,y s )−D(y t ,y i u )], where D is a distance wherein the vectors, wherein the loss L is log(1+Σ i=1 K e D(y t ,y s )−D(y t ,y i u ) ); and for each given one of the training sets, passing the vector representing each document of the training set through the neural network having the adjusted parameters to derive the output vectors. 2. The method of claim 1 , wherein the parameters of the neural network include weights and the weights of the neural network are adjusted by back propagation as a function of the loss L. 3. The method of claim 1 , wherein for each given one of the training sets, K+2 identical neural networks are implemented, such that each document of a respective training set passes through a corresponding neural network of the K+2 identical neural networks. 4. The method of claim 3 , wherein the parameters of the neural networks include weights and the weights of each respective neural network, of the K+2 neural networks, are adjusted by back propagation as a function of the loss L and in dependence on the output vector output from the respective neural network. 5. The method of claim 1 , wherein for each given one of the training sets, K+2 neural networks are implemented, such that each document of a respective training set passes through a corresponding neural network of the K+2 identical neural networks. 6. The method of claim 5 , wherein the parameters of the neural networks include weights and the weights of each respective neural network, of the K+2 neural networks, are adjusted by back propagation as a function of the loss L and in dependence on the output vector output from the respective neural network. 7. The method of claim 1 , further comprising: repeatedly passing each given one of the training sets through the neural network to adjust the parameters until a value of the loss L is satisfactory and identify the neural network as a production model; obtaining raw data representing documents from a particular data domain for which the production model has been trained, the documents represented by the raw data being unlabeled with no information regarding a measure of dissimilarity between any the documents; and passing the raw data through the production model to create a production embedding of documents. 8. The method of claim 1 , wherein each training set of K+2 documents is obtained by: providing, to a user, the K+2 training documents including the target document; receiving, from the user, a selection of the favored document determined to most closely match the target document; and identifying the unfavored documents of the K+2 training documents as the unfavored documents. 9. The method of claim 1 , wherein each training set of K+2 documents is obtained by: providing, to a model replicating user behavior, the K+2 training documents including the target document; receiving, from the model, a selection of the favored document determined to most closely match the target document; and identifying the unfavored documents of the K+2 training documents as the unfavored documents. 10. A method of training a neural network to create an embedding space including a catalog of documents, the method comprising: obtaining a set of K+2 training documents, K being an integer greater than 1, the set of K+2 documents including a target document represented by a vector x t , a favored document represented by a vector x s and unfavored documents represented by vectors x i u , where i is an integer from 1 to K; passing each of the vector representations of the set of K+2 training documents through a neural network to derive corresponding output vectors, including vector y t derived from the vector x t , vector y s derived from the vector x s and vectors y i u respectively derived from vectors x i u ; and repeatedly adjusting parameters of the neural network through back propagation until a sum of differences calculated from (i) a distance between the vector y t and the vector y s and (ii) distances between the vector y t and each of the vectors y i u satisfies a predetermined criteria, wherein the sum of differences corresponds to a likelihood that the favored document will be selected over the unfavored documents and further wherein the calculated sum of differences is a loss L function calculated as log(1+Σ i=1 K e D(y t ,y s )−D(y t ,y i u ) ) and wherein the parameters of the neural network include weights and the weights of the neural network are adjusted by back propagation as a function of the loss L. 11. The method of claim 10 , wherein the obtaining of the set of training documents includes: providing, to a user, the K+2 training documents including the target document; receiving, from the user, a selection of the favored document determined to most closely match the target document; and identifying the unfavored documents of the K+2 training documents as the unfavored documents. 12. The method of claim 10 , wherein the obtaining of the set of training documents includes: providing, to a model replicating user behavior, the K+2 training documents including the target document; receiving, from the model, a selection of the favored document determined to most closely match the target document; and identifying the unfavored documents of the K+2 training documents as the unfavored documents. 13. A non-transitory computer readable storage medium impressed with computer program instructions to train a neural network to create an embedding space including a catalog of documents, the instructions, when executed on a processor, implement a method comprising: providing a plurality of training sets of K+2 training documents to a computer system, K being an integer greater than 1, each training document being represented by a corresponding training vector x, each set of training documents including a target document represented by a vector x t , a favored document represented by a vector x s , and K>1 unfavored documents represented respectively by vectors x i u , where i is an integer from 1 to K, and each of the vectors including a plurality of input vector elements; for each given one of the training sets, passing, by the computer system, the vector represent
Combinations of networks · CPC title
Auto-encoder networks; Encoder-decoder networks · CPC title
Supervised learning · CPC title
Convolutional networks [CNN, ConvNet] · CPC title
Backpropagation, e.g. using gradient descent · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.