Systems and methods for inferring gender by fusion of multimodal content

US9684852B2 · US · B2

Patent metadata
Field               Value
Publication number  US-9684852-B2
Application number  US-201615226789-A
Country             US
Kind code           B2
Filing date         Aug 2, 2016
Priority date       Jun 29, 2015
Publication date    Jun 20, 2017
Grant date          Jun 20, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and systems are provided. A system includes a set of visual and textual classifiers for recognizing semantic concepts in a set of images and assigning semantic scores for the images to predict a gender of a user, and performing gender prediction from visual content and textual content in the images to respectively generate visual-based gender predictions and textual-based gender predictions. The system further includes a multimodal information fusion device for combining, using multimodal information fusion, the visual-based gender predictions, the textual-based gender predictions, and the semantic scores to infer a gender of a user.
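The combination step described in the abstract can be illustrated with a minimal late-fusion sketch. The abstract does not specify the fusion rule, so the weighted average below is an assumption chosen for simplicity, and the names `ModalityOutput`, `fuse`, and `infer_label`, the weights, and the 0.5 threshold are all hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ModalityOutput:
    """One source of evidence (hypothetical representation).

    p_female: probability in [0, 1] that the source assigns to the label "female".
    weight:   how much the fusion step trusts this source (assumed, not from the patent).
    """
    p_female: float
    weight: float = 1.0


def fuse(outputs: List[ModalityOutput]) -> float:
    """Weighted average of the per-source probabilities (an assumed fusion rule)."""
    total_weight = sum(o.weight for o in outputs)
    if total_weight == 0:
        raise ValueError("at least one source must have a non-zero weight")
    return sum(o.p_female * o.weight for o in outputs) / total_weight


def infer_label(outputs: List[ModalityOutput], threshold: float = 0.5) -> str:
    """Turn the fused probability into a label using an assumed threshold."""
    return "female" if fuse(outputs) >= threshold else "male"


# Example: a visual-based prediction, a textual-based prediction, and a
# semantic score, each expressed as a probability for the same label.
sources = [
    ModalityOutput(p_female=0.8),
    ModalityOutput(p_female=0.6),
    ModalityOutput(p_female=0.4),
]
fused = fuse(sources)
label = infer_label(sources)
```

A weighted average keeps each source's contribution explicit, which makes it easy to swap in a different rule (for example a learned classifier over the three inputs) without changing the callers.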

First claim

Opening claim text (preview).

What is claimed is:

1. A system, comprising: a set of visual and textual classifiers for recognizing semantic concepts in a set of images and assigning semantic scores for the images to predict a gender of a user, and performing gender prediction from visual content and textual content in the images to respectively generate visual-based gender predictions and textual-based gender predictions; and a multimodal information fusion device for combining, using multimodal information fusion, the visual-based gender predictions, the textual-based gender predictions, and the semantic scores to infer a gender of a user.

2. The system of claim 1, wherein the semantic concepts comprise visual-based semantic concepts and textual-based semantic concepts.

3. The system of claim 1, wherein at least one of the visual classifiers and at least one of the textual classifiers generate semantic distributions from the semantic scores, and wherein some of the visual-based gender predictions and the textual-based gender predictions are determined from the semantic distributions.

4. The system of claim 1, wherein at least one of the visual classifiers and at least one of the textual classifiers generate semantic distributions from the semantic scores, wherein some of the visual-based gender predictions and the textual-based gender predictions are determined from aggregations of the semantic distributions.

5. The system of claim 1, wherein said multimodal information fusion device performs filtered fusion by selecting which multimodal gender information to use directly and which of the multimodal gender information to aggregate, to infer the gender of the user, from among the visual-based gender predictions, the textual-based gender predictions, and the semantic scores.

6. The system of claim 1, wherein said multimodal information fusion device further comprises a confidence checker for performing confidence checking on at least some of the semantic scores to influence an inference of the gender of the user.

7. The system of claim 6, wherein the confidence checking is performed separately by a first confidence check on the at least some of the semantic scores relating to the visual content and by a second confidence check on the at least some of the semantic scores relating to the textual content, and the inference of the gender of the user is influenced differently based on an agreement or a non-agreement between the first and the second confidence checks.

8. The system of claim 1, wherein the visual content comprises profile colors and a background scene in a profile picture of the user.

9. A system for inferring gender, comprising: a set of visual classifiers, each for recognizing visual content from a set of images associated with a user, assigning respective visual-based prediction confidence scores for each of the images based the recognized visual-based content, and at least one for generating a visual-based gender prediction of the user based on the respective visual-based prediction confidence scores for each of the images; a set of textual classifiers, each for recognizing textual content from the set of images, assigning respective textual-based prediction confidence scores for each of the images based the recognized textual content, and at least one for generating a visual-based gender prediction of the user based on the respective visual-based prediction confidence scores for each of the images; and a multimodal information fusion device for inferring a gender of a user selectively from the visual-based gender predictions, the textual-based gender predictions, the visual-based prediction confidence scores and the textual-based prediction confidence scores.

10. The system of claim 9, wherein the at least one of the visual classifiers determines respective semantic distributions across the set of images for different ones of the recognized visual content, each of the respective semantic distribution based on an aggregation of the respective visual-based prediction confidence scores for a same one of the recognized visual content for each of the images.

11. The system of claim 9, wherein the at least one of the visual classifiers generates the visual-based gender prediction based on the respective semantic distributions.
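Two ideas from the claims lend themselves to a short illustration: the semantic distribution of claim 10 (an aggregation of per-image confidence scores for the same concept) and the separate visual and textual confidence checks of claim 7. The sketch below is hedged throughout: the claims do not name the aggregation function or the effect of agreement, so the mean, the 0.7 threshold, the mode names, and the functions `semantic_distribution`, `confidence_check`, and `fusion_mode` are assumptions made for illustration only.

```python
from collections import defaultdict
from typing import Dict, List

Scores = Dict[str, float]  # concept name -> confidence score in [0, 1]


def semantic_distribution(per_image_scores: List[Scores]) -> Scores:
    """Aggregate per-image scores into one score per concept.

    Assumption: the aggregation is a mean over all images, and a concept
    that is absent from an image contributes 0 for that image.
    """
    n_images = len(per_image_scores)
    if n_images == 0:
        return {}
    totals: Dict[str, float] = defaultdict(float)
    for scores in per_image_scores:
        for concept, score in scores.items():
            totals[concept] += score
    return {concept: total / n_images for concept, total in totals.items()}


def confidence_check(distribution: Scores, threshold: float = 0.7) -> bool:
    """Pass if the strongest concept reaches the (assumed) threshold."""
    return max(distribution.values(), default=0.0) >= threshold


def fusion_mode(visual: Scores, textual: Scores, threshold: float = 0.7) -> str:
    """Pick a hypothetical fusion mode from the two independent checks."""
    visual_ok = confidence_check(visual, threshold)
    textual_ok = confidence_check(textual, threshold)
    if visual_ok and textual_ok:
        return "use_both"       # the checks agree: both modalities are confident
    if visual_ok:
        return "visual_only"    # the checks disagree: trust the confident one
    if textual_ok:
        return "textual_only"
    return "aggregate"          # the checks agree: neither is confident alone


# Example: two images scored by a visual classifier, one set of textual scores.
visual_dist = semantic_distribution([{"car": 0.9, "dog": 0.1}, {"car": 0.7}])
textual_dist = semantic_distribution([{"sports": 0.4}, {"sports": 0.2}])
mode = fusion_mode(visual_dist, textual_dist)
```

Keeping the two checks independent mirrors the wording of claim 7, where the outcome depends on whether the first and second checks agree; what each outcome does to the final inference is left open here.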

Assignees

IBM

Inventors

Classifications

  • Multiple classes · CPC title

  • G06F40/30 (Primary)

    Semantic analysis · CPC title

  • of results relating to different input data, e.g. multimodal recognition · CPC title

  • Probabilistic graphical models, e.g. probabilistic networks · CPC title

  • Inference or reasoning models · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9684852B2 cover?
A method and systems are provided. A system includes a set of visual and textual classifiers for recognizing semantic concepts in a set of images and assigning semantic scores for the images to predict a gender of a user, and performing gender prediction from visual content and textual content in the images to respectively generate visual-based gender predictions and textual-based gender predic…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F40/30. Mapped technology areas include Physics.
When was this patent published?
Publication date: Jun 20, 2017 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).