Automatic media naming using facial recognization and/or voice based identification of people within the named media content

US9535921B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9535921-B2
Application numberUS-201514791300-A
CountryUS
Kind codeB2
Filing dateJul 3, 2015
Priority dateMar 14, 2013
Publication dateJan 3, 2017
Grant dateJan 3, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computing device identifies a plurality of media files within a data store, each of the media files lacking user established file names. The computing device analyzing the plurality of media files to recognize humans in the media files based on facial recognition and/or voice recognition programs. Using results of the analyzing to generate a plurality of content identification keywords, which are scored and ranked. Establishing a filename prefix for the media files using scored and ranked content identification keywords. Automatically generating a unique file name for each of the media files, wherein each unique file name includes the established filename prefix.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for intelligent media naming comprising: a computing device, comprising hardware and software that the hardware executes, identifying a plurality of media files within a data store, each of the media files lacking user established file names; the computing device analyzing the plurality of media files for images to automatically recognize faces within image content of the media files using a facial recognition program; the computing device analyzing the plurality of media files for audio to automatically recognize voices within audio content of the media files using a speaker identification recognition or speaker identify verification program; the computing device based upon the recognized faces and voices generating a plurality of different content identification keywords for naming the media files, wherein the content identification keywords vary based on identities of humans with the recognized faces and voices; the computing device establishing a filename prefix for the media files using scored and ranked content identification keywords; for at least a subset of the media files with a common filename prefix, automatically determining a filename suffix for the subset of media files and automatically concatenating the filename prefix and a filename suffix to create a unique file name for each of the subset of the media files; automatically generating a unique file name for each of the media files using semantic metadata, wherein each unique file name includes the established filename prefix, and wherein for the subset of media files the unique file names includes the concatenation of the established filename prefix and the unique filename suffix. 2. The method of claim 1 , further comprising: the computing device computing frequency or predominance of appearance in the media files of humans using the recognized faces and voices, computing relationships between humans having recognized faces and voices, and utilizing results of the computing to score and rank the content identification keywords in order of significance. 3. The method of claim 1 , wherein each of the media files is a user taken video, wherein the computing device is a digital camera, a video recorder, or mobile phone, said computing device comprising a camera and a memory, wherein the camera captures environmental input used to create the media files, and wherein the memory stores the media files. 4. The method of claim 1 , wherein the computing device is a set of one or more servers linked to a network, wherein the one or more servers permit users to upload and view the media files, wherein the set of one or more servers provide an option for automatically renaming uploaded media files per the method. 5. A method for intelligent media naming comprising: a computing device, comprising hardware and software that the hardware executes, identifying a plurality of media files within a data store, each of the media files lacking user established file names; the computing device analyzing the plurality of media files to automatically recognize faces and voices within content of the media files; the computing device based upon the recognized faces generating a plurality of different content identification keywords for naming the media files, wherein the content identification keywords vary based on identifies of humans with the recognized faces; the computing device establishing a filename prefix for the media files using scored and ranked content identification keywords; for at least a subset of the media files with a common filename prefix, automatically determining a filename suffix for the subset of media files and automatically concatenating the filename prefix and a filename suffix to create a unique file name for each of the subset of the media files; and automatically generating a unique file name for each of the media files using semantic metadata, wherein each unique file name includes the established filename prefix, and wherein for the subset of media files the unique file names includes the concatenation of the established filename prefix and the unique filename suffix. 6. The method of claim 5 , further comprising: the computing device computing frequency or predominance of appearance in the media files of the recognized faces, relationships between the recognized faces, and identity based prioritization values based on the recognized facts to score and rank the content identification keywords in order of significance. 7. The method of claim 5 , wherein each of the media files is a user taken photograph or user taken video, wherein the computing device is a digital camera, a video recorder, or mobile phone, said computing device comprising a camera and a memory, wherein the camera captures environmental input used to create the media files, and wherein the memory stores the media files. 8. The method of claim 5 , wherein the computing device is a set of one or more servers linked to a network, wherein the one or more servers permit users to upload and view the media files, wherein the set of one or more servers provide an option for automatically renaming uploaded media files utilizing semantic metadata. 9. The method of claim 5 , further comprising; automatically generating an icon for each of the media files, wherein the icons generated for the media files vary from each other depending on the automatically recognized faces contained within. 10. The method of claim 5 , wherein the media files are digital images. 11. The method of claim 5 , wherein the media files are digital home videos. 12. The method of claim 5 , wherein the plurality of media files comprise one or more file types from a group of file types comprising an image file type, a video file type, an audio file type, and a document file. 13. The method of claim 5 , wherein the analyzing determines an identity of a human facially recognized in the corresponding one of the subset of media files, wherein a file name that includes a name of the identified human is generated for the corresponding media file. 14. The method of claim 5 , further comprising: the computing device analyzing the plurality of media files for audio to automatically recognize voices within audio content of the media files using a speaker identification recognition or speaker identity verification program, wherein the scoring of the content identification keywords is adjusted based on identifies of recognized voices in corresponding ones of the media files. 15. A method for intelligent media naming comprising: a computing device, comprising hardware and software that the hardware executes, identifying a plurality of media files within a data store, each of the media files lacking user established file names; the computing device analyzing the plurality of media files for audio to automatically recognize voices within audio content of the media files using a speaker recognition or speaker identity verification program; the computing device based upon the recognized voices generating a plurality of different content identification keywords for naming the media files, wherein the content identification keywords vary based on identities of humans with the recognized voices; the computing device establishing a filename prefix for the media files using scored and ranked content identification keywords; for at least a subset of the media files with a common filename prefix, automatically determining a filename suffix for the subset of media files and automatically concatenating the filename prefix and a filename suffix to create a unique file name for each of the subset

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9535921B2 cover?
A computing device identifies a plurality of media files within a data store, each of the media files lacking user established file names. The computing device analyzing the plurality of media files to recognize humans in the media files based on facial recognition and/or voice recognition programs. Using results of the analyzing to generate a plurality of content identification keywords, which…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/166. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 03 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).