Autocaptioning of images
US-9317531-B2 · Apr 19, 2016 · US
US10567726B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10567726-B2 |
| Application number | US-200913509796-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 10, 2009 |
| Priority date | Dec 10, 2009 |
| Publication date | Feb 18, 2020 |
| Grant date | Feb 18, 2020 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method, apparatus, system for image processing is provided that creates an object note for a picture. The method, apparatus or system includes: automatically obtaining picture description information from a picture source, wherein the picture description information at least partly have been formed automatically; obtaining annotation information from at least two object sources, wherein the two object sources are different from the picture sources, automatically fusing the said annotation information from the two object sources to form fused annotation information, and attaching said fused annotation information to the picture to create the object note for the picture.
Opening claim text (preview).
What is claimed is: 1. A method comprising: obtaining picture description information from a picture source, wherein the picture description information having been formed by adding at least one of a time, a location, or user information to the picture description information, obtaining annotation information from at least two object sources, wherein the at least two object sources are different from said picture source, fusing said annotation information from said at least two object sources to form fused annotation information, wherein fusing the annotation information includes: selecting content for annotation from the annotation information from the at least two object sources by parsing the annotation information into a text format and selecting relevant text, assigning a score value to each sentence from the selected relevant text based upon relevancy and redundancy in comparison with other sentences from the selected relevant text; filtering, using assigned score values for the selected content, to reduce irrelevant or redundant information by removing stigma words and repetitive information, wherein repetitive information comprises selected content that overlaps in subject matter, and enhancing the cohesion or coherence of the content by at least arranging the text with topically related themes to reduce non-fluency, and attaching said fused annotation information to a picture to create an object note for said picture. 2. A method according to claim 1 , further comprising: forming a hyper-object-link between said picture and at least one object source, wherein said hyper-object-link comprises a link to an object in said object source, and attaching said link to said object to said picture to create an object note for a picture. 3. A method according to claim 1 , further comprising: forming relevance information by automatically analyzing information from said two sources against information from said picture source, and obtaining said annotation information from said at least two sources based on said relevance information. 4. A method according to claim 3 , further comprising: forming said relevance information by determining a correlation between the picture and the at least two sources by determining their similarity using at least one of the group of time information, location information, event information and person information, and forming a weighted similarity indicator by using the at least one of the group of time information, location information, event information and person information. 5. A method according to claim 1 , wherein said at least two sources comprise at least two of the group of email messages, short messages, multimedia messages, instant messages, calendar entries, contact cards, blog entries, wiki entries and social network service entries. 6. A method according to claim 1 , further comprising: clustering pictures based on said annotation information from said at least two sources. 7. A method according to claim 1 , further comprising: receiving filter information or source selection information from the user for restricting the data from said at least two sources. 8. A method according to claim 1 , wherein forming the fused annotation information comprises: generating summarization of the content through natural language processing. 9. The method according to claim 1 , wherein filtering the selected content to reduce irrelevant or redundant information comprises: determining a ratio of overlap in subject matter of the selected content, and removing content from at least one of the at least two object sources, in an instance in which the ratio of overlap satisfies a specified degree. 10. An apparatus comprising at least one processor, and memory including computer program code, the memory and the computer program code configured to, with the at least one processor, cause the apparatus to perform at least the following: obtain picture description information from a picture source, wherein the picture description information having been formed by adding at least one of a time, a location, or user information to the picture description information, obtain annotation information from at least two object sources, wherein the at least two object sources are different from said picture source, fuse said annotation information from said at least two object sources to form fused annotation information, wherein fusing the annotation information includes: select content for annotation from the annotation information from the at least two object sources by parsing the annotation information into a text format and selecting relevant text, assign a score value to each sentence from the selected relevant text based upon relevancy and redundancy in comparison with other sentences from the selected relevant text; filter, using assigned score values for the selected content, to reduce irrelevant or redundant information by removing stigma words and repetitive information, wherein repetitive information comprises selected content that overlaps in subject matter, and enhance the cohesion or coherence of the content by at least arranging the text with topically related themes to reduce non-fluency, and attach said fused annotation information to a picture to create an object note for said picture. 11. An apparatus according to claim 10 , further comprising computer program code configured to, with the at least one processor, cause the apparatus to perform at least the following: form a hyper-object-link between said picture and at least one object source, wherein said hyper-object-link comprises a link to an object in said object source, and attach said link to said object to said picture to create an object note for a picture. 12. An apparatus according to claim 10 , further comprising computer program code configured to, with the at least one processor, cause the apparatus to perform at least the following: form relevance information by automatically analyzing information from said two sources against information from said picture source, and obtain said annotation information from said at least two sources based on said relevance information. 13. An apparatus according to claim 12 , further comprising computer program code configured to, with the at least one processor, cause the apparatus to perform at least the following: form said relevance information by determining a correlation between the picture and the at least two sources by determining their similarity using at least one of the group of time information, location information, event information and person information, and form a weighted similarity indicator by using the at least one of the group of time information, location information, event information and person information. 14. An apparatus according to claim 10 , wherein said at least two sources comprise at least two of the group of email messages, short messages, multimedia messages, instant messages, calendar entries, contact cards, blog entries, wiki entries and social network service entries. 15. An apparatus according to claim 10 , further comprising computer program code configured to, with the at least one processor, cause the apparatus to perform at least the following: cluster pictures based on said annotation information from said at least two sources. 16. An apparatus according to claim 10 , further comprising computer program code configured to, with the at least one processor, cause the apparatus to perform at least the following: receive filter information or source selection information from th
of a date · CPC title
used signal is digitally coded · CPC title
Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually · CPC title
Position information, e.g. geographical position at time of capture, GPS data · CPC title
of identification information or the like, e.g. ID code, index, title, part of an image, reduced-size image · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.