Method and apparatus for managing images using a voice tag

US9916864B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9916864-B2
Application numberUS-201514882879-A
CountryUS
Kind codeB2
Filing dateOct 14, 2015
Priority dateOct 14, 2014
Publication dateMar 13, 2018
Grant dateMar 13, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An electronic device is provided. The electronic device includes a voice input module which receives a voice from an outside to generate voice data, a memory which stores one or more images or videos, and a processor which is electrically connected to the voice input module and the memory. The memory includes instructions, when executed by the processor, causing the electronic device to link at least one of the voice data, the first metadata information based on the voice data, or second metadata information generated from the voice data and/or the first metadata information with the second image or video.

First claim

Opening claim text (preview).

What is claimed is: 1. An electronic device comprising: a voice input module; a memory; and a processor electrically connected to the voice input module and the memory, wherein the memory is configured to store one or more images or videos, and wherein the memory comprises instructions, the instructions, when executed by the processor, causing the electronic device to: generate voice data on a voice received through the voice input module with respect to a first image or video stored on the memory, link the voice data or first metadata information based on the voice data, with the first image or video, determine a relation between a second image or video stored on the memory, and the first image or video, and link at least one of (1) the voice data, (2) the first metadata information, or (3) second metadata information generated from the voice data and/or the first metadata information with the second image or video, based on at least a part of the relation determined between the second image or video stored on the memory and the first image or video. 2. The electronic device of claim 1 , wherein the electronic device links the first metadata information with the first image or video in the form of a tag, and wherein the electronic device is configured to link at least one of (1) the voice data, (2) the first metadata information, or (3) the second metadata information with the second image or video in the form of a tag. 3. The electronic device of claim 1 , wherein the first metadata information comprises speech-to-text information extracted from the voice data. 4. The electronic device of claim 1 , wherein the electronic device is configured to determine the relation using at least one of an image analysis, location information, time information, text information, or face recognition information associated with the first image or video and the second image or video. 5. An electronic device comprising: a voice input module configured to receive a voice from an outside to generate voice data; a communication module; a memory; and a processor electrically connected to the voice input module, the communication module, and the memory, wherein the memory is configured to store one or more images or videos, and wherein the memory comprises instructions, the instructions, when executed by the processor, causing the electronic device to: generate voice data on a voice received through the voice input module with respect to a first image or video stored on the memory, link the voice data or first metadata information based on the voice data, with the first image or video, transmit the first image or video and the linked voice data or the first metadata information to the outside of the electronic device through the communication module, transmit a request for requiring one or more images or videos associated with the linked voice data or the first metadata information to the outside of the electronic device, and receive one or more images or videos linked with (1) the voice data, (2) the first metadata information, or (3) second metadata information generated from the voice data and/or the first metadata information from the outside of the electronic device. 6. An electronic device comprising: a voice input module configured to obtain voice data on a specific image; and a processor configured to: analyze the voice data to determine at least one portion of metadata information of the specific image, register the voice data as a voice tag with the specific image; and register the voice data as the voice tag with at least one association image, which satisfies a specific reference with respect to the specific image or the determined metadata information, from among a plurality of images. 7. The electronic device of claim 6 , wherein a plurality of metadata information comprises at least one of information on a location or a time where the specific image is captured, information on a device capturing the specific image, or information on a shooting mode of the specific image. 8. The electronic device of claim 6 , further comprising: a shooting module, wherein if the specific image is captured by the shooting module, the processor is configured to activate the voice input module to guide obtaining of the voice data. 9. The electronic device of claim 6 , wherein the processor is configured to provide a user interface (UI) for guiding obtaining of the voice data if the specific image is selected. 10. The electronic device of claim 6 , wherein the processor is configured to register a text tag, which is obtained by converting the voice data into a text, together with the voice tag with respect to the at least one association image. 11. The electronic device of claim 6 , wherein the processor is configured to analyze the voice data using an object appearing at the specific image. 12. The electronic device of claim 7 , wherein the processor is configured to determine at least one portion of metadata information among information on the location, the time, the device capturing the specific image, and the shooting mode, based on a relation between an analysis result of the voice data and each of the plurality of information. 13. The electronic device of claim 12 , wherein the processor is configured to determine an image, which includes location information belonging within a specific range from a position of the specific image as metadata information, from among the plurality of images as the at least one association image. 14. The electronic device of claim 12 , wherein the processor is configured to determine an image, which includes time information belonging within a specific range from the time of the specific image as metadata information, from among the plurality of images as the at least one association image. 15. The electronic device of claim 12 , wherein the processor is configured to determine an image, which includes location information having a specific relation with the time of the specific image as metadata information, from among the plurality of images as the at least one association image. 16. The electronic device of claim 6 , wherein the processor is configured to determine an image, which has a similarity of a threshold value or more to the specific image, from among the plurality of images as the at least one association image. 17. The electronic device of claim 6 , wherein at least a part of the plurality of images is stored on an external device functionally connected with the electronic device, and wherein the electronic device further comprises: a communication module communicating with the external device. 18. A method for registering a voice tag, comprising: obtaining voice data on at least one image; determining at least one portion of metadata information for a specific image based on the voice data; registering the voice data as a voice tag with the specific image; determining at least one association image which satisfies a specific reference with respect to the specific image or the determined metadata information; and registering the voice data as the voice tag with the at least one association image. 19. The method of claim 18 , wherein the determining of the at least one association image comprises: determining association image candidates based on the specific image or a priority of the determined metadata information; determining whether a number of the association image candidates satisfies a specific range; and determining at least a part of the association image ca

Assignees

Inventors

Classifications

  • Physics · mapped topic

  • Speech classification or search · CPC title

  • Digital still camera · CPC title

  • H04N1/212Primary

    Motion video recording combined with still video recording (television signal recording H04N5/76) · CPC title

  • Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9916864B2 cover?
An electronic device is provided. The electronic device includes a voice input module which receives a voice from an outside to generate voice data, a memory which stores one or more images or videos, and a processor which is electrically connected to the voice input module and the memory. The memory includes instructions, when executed by the processor, causing the electronic device to link at…
Who is the assignee on this patent?
Samsung Electronics Co Ltd
What technology area does this patent fall under?
Primary CPC classification H04N1/212. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Mar 13 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).