Multilabel learning via supervised joint embedding of documents and labels

US10552501B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10552501-B2
Application numberUS-201715471455-A
CountryUS
Kind codeB2
Filing dateMar 28, 2017
Priority dateMar 28, 2017
Publication dateFeb 4, 2020
Grant dateFeb 4, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method implemented by at least one server computer is provided, including the following operations: receiving a plurality of training documents, each training document being defined by a sequence of words, each training document having one or more labels associated therewith; embedding the training documents, the words, and the labels in a vector space, wherein the embedding is configured to locate a given training document and its associated labels in proximity to each other in the vector space; embedding a new document in the vector space; performing a proximity search in the vector space to identify a set of nearest labels to the new document in the vector space; associating the nearest labels to the new document.

First claim

Opening claim text (preview).

What is claimed is: 1. A method implemented by at least one server computer, comprising: receiving a plurality of training documents, each training document being defined by a sequence of words, each training document having one or more labels associated therewith; embedding the plurality of training documents, the words, and the labels in a vector space, wherein the embedding is configured to locate a given training document and its associated labels in proximity to each other in the vector space; embedding a new document in the vector space; performing a proximity search in the vector space to identify a set of nearest labels to the new document in the vector space; and associating the set of nearest labels to the new document, wherein embedding the new document is configured to predict a target word in the new document using context words in the new document and identification of the new document, and wherein the embedding is configured to minimize a loss function that includes a component configured to approximate a conditional probability of the target word based on the context words and the identification of the new document. 2. The method of claim 1 , wherein the method solves a multi-label learning problem, such that a number of the set of nearest labels to the new document is not predefined prior to performing the proximity search. 3. The method of claim 1 , wherein the embedding is configured to simultaneously learn document vectors corresponding to the plurality of training documents and label vectors corresponding to the labels directly from the words of the plurality of training documents. 4. The method of claim 3 , wherein the embedding is configured to use the document vectors to learn the label vectors by solving a multiclass classification problem. 5. The method of claim 4 , wherein the embedding is configured to predict the labels of the given training document using the identification of the given training document. 6. The method of claim 1 , wherein the loss function includes a component configured to approximate a conditional probability of the labels of the given training document based on the identification of the given training document. 7. The method of claim 1 , wherein performing the proximity search includes performing a k-nearest neighbor search. 8. The method of claim 1 , wherein the new document is not one of the plurality of training documents and does not have labels already associated therewith, and wherein the embedding of the new document is independent of the embedding of the labels in the vector space. 9. The method of claim 1 , further comprising: receiving a request to access documents associated with one label of the set of nearest labels to the new document; and in response to the request, providing access to the new document in association with the one label of the set of nearest labels. 10. The method of claim 9 , wherein the documents define one or more of articles, product descriptions, and social media posts. 11. The method of claim 9 , wherein the request is defined from a search query, a social media access request, a product search, a category request, a topic request, or a community access request. 12. A computer readable medium, being non-transitory, having program instructions embodied thereon, the program instructions being configured, when executed by a computing device, to cause the computing device to perform the following operations: receive a plurality of training documents, each training document being defined by a sequence of words, each training document having one or more labels associated therewith; embed the plurality of training documents, the words, and the labels in a vector space, wherein the embedding of the plurality of training documents, the words, and the labels is configured to locate a given training document and its associated labels in proximity to each other in the vector space; embed a new document in the vector space; perform a proximity search in the vector space to identify a set of nearest labels to the new document in the vector space; and associate the set of nearest labels to the new document, wherein the new document that is embedded is configured to predict a target word in the new document using context words in the new document and identification of the new document, and the embedding is configured to minimize a loss function that includes a component configured to approximate a conditional probability of the target word based on the context words and the identification of the new document. 13. The computer readable medium of claim 12 , wherein the operations solve a multi-label learning problem, such that a number of the set of nearest labels to the new document is not predefined prior to performing the proximity search. 14. The computer readable medium of claim 12 , wherein the embedding is configured to simultaneously learn document vectors corresponding to the plurality of training documents and label vectors corresponding to the labels directly from the words of the plurality of training documents; and wherein the embedding is configured to use the document vectors to learn the label vectors by solving a multiclass classification problem. 15. A server computer, comprising: training logic, the training logic configured to, receive a plurality of training documents, each training document being defined by a sequence of words, each training document having one or more labels associated therewith, and embed the plurality of training documents, the words, and the labels in a vector space, wherein the embedding is configured to locate a given training document and its associated labels in proximity to each other in the vector space; and inference logic, the inference logic configured to, embed a new document in the vector space, perform a proximity search in the vector space to identify a set of nearest labels to the new document in the vector space, and associate the set of nearest labels to the new document, wherein the new document that is embedded is configured to predict a target word in the new document using context words in the new document and identification of the new document, and the embedding is configured to minimize a loss function that includes a component configured to approximate a conditional probability of the target word based on the context words and the identification of the new document. 16. The server computer of claim 15 , wherein operations solve a multi-label learning problem, such that a number of the set of nearest labels to the new document is not predefined prior to performing the proximity search. 17. The server computer of claim 15 , wherein the embedding is configured to simultaneously learn document vectors corresponding to the plurality of training documents and label vectors corresponding to the labels directly from the words of the plurality of training documents; and wherein the embedding is configured to use the document vectors to learn the label vectors by solving a multiclass classification problem.

Assignees

Inventors

Classifications

  • Document management systems · CPC title

  • Knowledge representation; Symbolic representation · CPC title

  • Search customisation based on user profiles and personalisation · CPC title

  • G06N20/00Primary

    Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10552501B2 cover?
A method implemented by at least one server computer is provided, including the following operations: receiving a plurality of training documents, each training document being defined by a sequence of words, each training document having one or more labels associated therewith; embedding the training documents, the words, and the labels in a vector space, wherein the embedding is configured to …
Who is the assignee on this patent?
Oath Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/9535. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 04 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).