System and method for large-scale multi-label learning using incomplete label assignments
US-2016140451-A1 · May 19, 2016 · US
US9875301B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9875301-B2 |
| Application number | US-201414266228-A |
| Country | US |
| Kind code | B2 |
| Filing date | Apr 30, 2014 |
| Priority date | Apr 30, 2014 |
| Publication date | Jan 23, 2018 |
| Grant date | Jan 23, 2018 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Systems and methods for learning topic models from unstructured data and applying the learned topic models to recognize semantics for new data items are described herein. In at least one embodiment, a corpus of multimedia data items associated with a set of labels may be processed to generate a refined corpus of multimedia data items associated with the set of labels. Such processing may include arranging the multimedia data items in clusters based on similarities of extracted multimedia features and generating intra-cluster and inter-cluster features. The intra-cluster and the inter-cluster features may be used for removing multimedia data items from the corpus to generate the refined corpus. The refined corpus may be used for training topic models for identifying labels. The resulting models may be stored and subsequently used for identifying semantics of a multimedia data item input by a user.
Opening claim text (preview).
The invention claimed is: 1. A method comprising: extracting, by at least one or more computing devices, visual features from images of a corpus of images; arranging, by the at least one or more computing devices, the images in clusters based at least in part on similarities of the visual features; calculating, by the at least one or more computing devices, at least two relevance features, including: first relevance features representing distribution characteristics of distances between pairs of images in a same cluster; and second relevance features representing distribution characteristics of distances between different clusters of images; and refining, by the at least one or more computing devices, the corpus by removing one or more images from the corpus based in part on the at least two relevance features to create a refined corpus. 2. A method as claim 1 recites wherein a first cluster of the different clusters is associated with a first label and a second cluster of the different clusters is associated with a second label. 3. A method as claim 1 recites, further comprising: processing the refined corpus by applying one or more learning algorithms to the refined corpus; and creating one or more models associated with a topic for identifying an image. 4. A method as claim 1 recites, wherein the visual features include at least one of edges, corners, or objects. 5. A method as claim 1 recites, further comprising: extracting textual features from textual data associated with the images; and arranging the images in the clusters based at least in part on similarities of the visual features and the textual features. 6. A method comprising: receiving, by at least one or more computing devices, a corpus of images associated with a set of labels; extracting, by the at least one or more computing devices, visual features from the images; arranging, by the at least one or more computing devices; the images into a plurality of clusters based at least in part on similarities of the visual features; determining, by the at least one or more computing devices, at least two relevance features associated with individual clusters of the plurality of clusters, wherein: first relevance features of the at least two relevance features are based on pairs of images in a first cluster of the plurality of clusters; the first cluster is associated with a first label of the set of labels; and second relevance features of the at least two relevance features are based on the first cluster and at least one second cluster associated with a second label of the set of labels; processing, by the at least one or more computing devices, the corpus of images to generate a refined corpus of images associated with the set of labels based in part on the at least two relevance features; and training, by the at least one or more computing devices, a set of models for identifying individual labels of the set of labels based at least in part on the extracted visual features. 7. A method as claim 6 recites wherein the processing further comprises removing images from the corpus based in part on the at least two relevance features. 8. A method as claim 7 recites wherein the first relevance features represent distribution characteristics of distances between the pairs of images in the first cluster. 9. A method as claim 7 recites wherein the second relevance features represent distribution characteristics of distances between the first cluster and a plurality of second clusters associated with the second label. 10. A method as claim 6 recites wherein the receiving the corpus of images comprises receiving individual images from at least one of one or more search engines, sharing sites, or websites. 11. A method as claim 6 recites, further comprising receiving textual queries corresponding to individual labels of the set of labels, wherein the individual labels represent a semantic meaning associated with individual images of the corpus of images. 12. A method as claim 11 recites, further comprising, prior to receiving the textual queries: receiving a topic query identifying a topic; sending the topic query to at least one of one or more search engines, sharing sites, knowledge databases, or websites; responsive to sending the topic query, receiving a set of labels associated with the topic; and identifying the textual queries from the set of labels. 13. A method as claim 6 recites, further comprising: receiving a new textual query identifying a new label; receiving a new corpus of images associated with the new label; extracting new visual features from the new corpus of images; training a new model for identifying the new label based at least in part on the new visual features; and storing the new model for identifying the new label with a set of previously stored models. 14. A system comprising: memory; one or more processors; and one or more modules stored in the memory and executable by the one or more processors, the one or more modules including: a labeling module configured to learn a topic model associated with one or more based at least in part on: extracting visual features from a corpus of images associated with the one or more labels; and processing the corpus of images based in part on at least two relevance features: first relevance features of the at least two relevance features representing distribution characteristics of distances between pairs of images in a same cluster; and second relevance features of the at least two relevance features representing distribution characteristics of distances between different clusters of images. 15. A system as claim 14 recites, wherein the one or more modules further include: an input module configured to receive an input including an image; and an output module configured to output one or more results based on applying the topic model to the image, the one or more results including at least one label of the one or more labels identifying the image. 16. A system s claim 15 recites wherein the input further includes a topic associated with the image. 17. A system as claim 15 recites wherein the output module is further configured to rank the one or more labels identifying the image based at least in part on a confidence score. 18. A system as claim 15 recites, further comprising an annotation module configured to: query one or more search engines, sharing sites, or websites, wherein individual queries include individual labels of the one or more labels identifying the image; receive annotation information associated with the one or more labels identifying the image; and present the annotation information associated with the one or more labels identifying the image to the output module. 19. A system as claim 18 recites, wherein the output module is further configured to output the annotation information with the one or more labels identifying the image. 20. A system as claim 15 recites, wherein: the image is associated with two or more regions of interest; the labeling module is further configured to apply the topic model to determine two or more labels associated with the two or more regions of interest; and the output module is further configured to output the two or more labels identifying the two or more regions of interest.
Physics · mapped topic
Physics · mapped topic
Physics · mapped topic
Physics · mapped topic
Machine learning · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.