Who is the assignee on this patent?

Microsoft Technology Licensing Llc

What technology area does this patent fall under?

Primary CPC classification G06F17/30705. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Jan 23 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Learning multimedia semantics from large-scale unstructured data

US9875301B2 · US · B2

Patent metadata
Field	Value
Publication number	US-9875301-B2
Application number	US-201414266228-A
Country	US
Kind code	B2
Filing date	Apr 30, 2014
Priority date	Apr 30, 2014
Publication date	Jan 23, 2018
Grant date	Jan 23, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for learning topic models from unstructured data and applying the learned topic models to recognize semantics for new data items are described herein. In at least one embodiment, a corpus of multimedia data items associated with a set of labels may be processed to generate a refined corpus of multimedia data items associated with the set of labels. Such processing may include arranging the multimedia data items in clusters based on similarities of extracted multimedia features and generating intra-cluster and inter-cluster features. The intra-cluster and the inter-cluster features may be used for removing multimedia data items from the corpus to generate the refined corpus. The refined corpus may be used for training topic models for identifying labels. The resulting models may be stored and subsequently used for identifying semantics of a multimedia data item input by a user.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method comprising: extracting, by at least one or more computing devices, visual features from images of a corpus of images; arranging, by the at least one or more computing devices, the images in clusters based at least in part on similarities of the visual features; calculating, by the at least one or more computing devices, at least two relevance features, including: first relevance features representing distribution characteristics of distances between pairs of images in a same cluster; and second relevance features representing distribution characteristics of distances between different clusters of images; and refining, by the at least one or more computing devices, the corpus by removing one or more images from the corpus based in part on the at least two relevance features to create a refined corpus. 2. A method as claim 1 recites wherein a first cluster of the different clusters is associated with a first label and a second cluster of the different clusters is associated with a second label. 3. A method as claim 1 recites, further comprising: processing the refined corpus by applying one or more learning algorithms to the refined corpus; and creating one or more models associated with a topic for identifying an image. 4. A method as claim 1 recites, wherein the visual features include at least one of edges, corners, or objects. 5. A method as claim 1 recites, further comprising: extracting textual features from textual data associated with the images; and arranging the images in the clusters based at least in part on similarities of the visual features and the textual features. 6. A method comprising: receiving, by at least one or more computing devices, a corpus of images associated with a set of labels; extracting, by the at least one or more computing devices, visual features from the images; arranging, by the at least one or more computing devices; the images into a plurality of clusters based at least in part on similarities of the visual features; determining, by the at least one or more computing devices, at least two relevance features associated with individual clusters of the plurality of clusters, wherein: first relevance features of the at least two relevance features are based on pairs of images in a first cluster of the plurality of clusters; the first cluster is associated with a first label of the set of labels; and second relevance features of the at least two relevance features are based on the first cluster and at least one second cluster associated with a second label of the set of labels; processing, by the at least one or more computing devices, the corpus of images to generate a refined corpus of images associated with the set of labels based in part on the at least two relevance features; and training, by the at least one or more computing devices, a set of models for identifying individual labels of the set of labels based at least in part on the extracted visual features. 7. A method as claim 6 recites wherein the processing further comprises removing images from the corpus based in part on the at least two relevance features. 8. A method as claim 7 recites wherein the first relevance features represent distribution characteristics of distances between the pairs of images in the first cluster. 9. A method as claim 7 recites wherein the second relevance features represent distribution characteristics of distances between the first cluster and a plurality of second clusters associated with the second label. 10. A method as claim 6 recites wherein the receiving the corpus of images comprises receiving individual images from at least one of one or more search engines, sharing sites, or websites. 11. A method as claim 6 recites, further comprising receiving textual queries corresponding to individual labels of the set of labels, wherein the individual labels represent a semantic meaning associated with individual images of the corpus of images. 12. A method as claim 11 recites, further comprising, prior to receiving the textual queries: receiving a topic query identifying a topic; sending the topic query to at least one of one or more search engines, sharing sites, knowledge databases, or websites; responsive to sending the topic query, receiving a set of labels associated with the topic; and identifying the textual queries from the set of labels. 13. A method as claim 6 recites, further comprising: receiving a new textual query identifying a new label; receiving a new corpus of images associated with the new label; extracting new visual features from the new corpus of images; training a new model for identifying the new label based at least in part on the new visual features; and storing the new model for identifying the new label with a set of previously stored models. 14. A system comprising: memory; one or more processors; and one or more modules stored in the memory and executable by the one or more processors, the one or more modules including: a labeling module configured to learn a topic model associated with one or more based at least in part on: extracting visual features from a corpus of images associated with the one or more labels; and processing the corpus of images based in part on at least two relevance features: first relevance features of the at least two relevance features representing distribution characteristics of distances between pairs of images in a same cluster; and second relevance features of the at least two relevance features representing distribution characteristics of distances between different clusters of images. 15. A system as claim 14 recites, wherein the one or more modules further include: an input module configured to receive an input including an image; and an output module configured to output one or more results based on applying the topic model to the image, the one or more results including at least one label of the one or more labels identifying the image. 16. A system s claim 15 recites wherein the input further includes a topic associated with the image. 17. A system as claim 15 recites wherein the output module is further configured to rank the one or more labels identifying the image based at least in part on a confidence score. 18. A system as claim 15 recites, further comprising an annotation module configured to: query one or more search engines, sharing sites, or websites, wherein individual queries include individual labels of the one or more labels identifying the image; receive annotation information associated with the one or more labels identifying the image; and present the annotation information associated with the one or more labels identifying the image to the output module. 19. A system as claim 18 recites, wherein the output module is further configured to output the annotation information with the one or more labels identifying the image. 20. A system as claim 15 recites, wherein: the image is associated with two or more regions of interest; the labeling module is further configured to apply the topic model to determine two or more labels associated with the two or more regions of interest; and the output module is further configured to output the two or more labels identifying the two or more regions of interest.

Assignees

Microsoft Technology Licensing Llc

Inventors

Classifications

G06F17/30705Primary
Physics · mapped topic
G06F17/30864
Physics · mapped topic
G06N99/005
Physics · mapped topic
G06F17/30675
Physics · mapped topic
G06N20/00Primary
Machine learning · CPC title

Patent family

Related publications grouped by family.

View patent family 53059491

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9875301B2 cover?: Systems and methods for learning topic models from unstructured data and applying the learned topic models to recognize semantics for new data items are described herein. In at least one embodiment, a corpus of multimedia data items associated with a set of labels may be processed to generate a refined corpus of multimedia data items associated with the set of labels. Such processing may includ…
Who is the assignee on this patent?: Microsoft Technology Licensing Llc
What technology area does this patent fall under?: Primary CPC classification G06F17/30705. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Jan 23 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).