Augmented reality interface for video tagging and sharing

US9280849B2 · US · B2

Patent metadata
Field               Value
Publication number  US-9280849-B2
Application number  US-201113291851-A
Country             US
Kind code           B2
Filing date         Nov 8, 2011
Priority date       Nov 8, 2010
Publication date    Mar 8, 2016
Grant date          Mar 8, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system, method, and computer program product for automatically combining computer-generated imagery with real-world imagery in a portable electronic device by retrieving, manipulating, and sharing relevant stored videos, preferably in real time. A video is captured with a hand-held device and stored. Metadata including the camera's physical location and orientation is appended to a data stream, along with user input. The server analyzes the data stream and further annotates the metadata, producing a searchable library of videos and metadata. Later, when a camera user generates a new data stream, the linked server analyzes it, identifies relevant material from the library, retrieves the material and tagged information, adjusts it for proper orientation, then renders and superimposes it onto the current camera view so the user views an augmented reality.
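The retrieval step described in the abstract, identifying library material relevant to the current camera view, can be illustrated with a minimal sketch. It assumes each stored video carries a capture pose (latitude, longitude, compass heading) and picks the stored pose closest to the current one. The names `Pose`, `best_match`, the thresholds, and the scoring formula are hypothetical and chosen for illustration; the patent does not specify them.

```python
import math
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class Pose:
    lat: float      # degrees
    lon: float      # degrees
    heading: float  # compass degrees, 0 = north (assumed convention)


def haversine_m(a: Pose, b: Pose) -> float:
    """Great-circle distance in metres between two poses."""
    r = 6_371_000.0  # mean Earth radius in metres
    p1, p2 = math.radians(a.lat), math.radians(b.lat)
    dphi = p2 - p1
    dlmb = math.radians(b.lon - a.lon)
    h = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * r * math.asin(math.sqrt(h))


def heading_diff(a: float, b: float) -> float:
    """Smallest absolute angle in degrees between two compass headings."""
    d = abs(a - b) % 360.0
    return min(d, 360.0 - d)


def best_match(current: Pose, library: Dict[str, Pose],
               max_dist_m: float = 50.0,
               max_heading_deg: float = 45.0) -> Optional[str]:
    """Return the id of the stored video whose capture pose is closest
    to the current pose, or None if nothing is within the thresholds.
    The thresholds and the equal weighting are illustrative assumptions."""
    best_id, best_score = None, float("inf")
    for video_id, pose in library.items():
        dist = haversine_m(current, pose)
        turn = heading_diff(current.heading, pose.heading)
        if dist > max_dist_m or turn > max_heading_deg:
            continue
        score = dist / max_dist_m + turn / max_heading_deg
        if score < best_score:
            best_id, best_score = video_id, score
    return best_id
```

A real system would presumably also use the server-side annotations and image features the abstract mentions; this sketch covers only the location and orientation part of the metadata.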

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented method for providing an augmented reality interface, comprising: retrieving, from a server, and caching data associated with a plurality of locations within a two-dimension grid based on a GPS coordinate of a portable electronic device, the cached data comprises a plurality of image snippets preloaded from the server based on a location information of the portable electronic device, each of the plurality of image snippets being a smaller image file of a corresponding stored image; overlaying one or more of the plurality of image snippets corresponding to a location and a direction at which a corresponding image was originally captured based on the location information and an orientation information of the portable electronic device without manipulating the plurality of image snippets based on corner features of the image of the real-world scene; identifying at least one retrieved image with metadata having selected features; manipulating a retrieved image corresponding to an displayed image snippet based on matching corner feature information of the retrieved images generated and provided by the server and corner features of a currently observed real-world scene; and combining the manipulated image with the currently observed real-world scene viewed with the portable electronic device, wherein the metadata includes annotations by at least one of the server and a user who acquired the image; and wherein the cache data is purged based on a distance between the portable electronic device from the two-dimension grid.

2. The method of claim 1, wherein the annotations include details of at least one of a person, an object, and a location being photographed.

3. The method of claim 1, wherein the annotations help users share at least one of experiences and recommended locations.

4. The method of claim 1, wherein the retrieved image is at least one of a still photograph, at least one video frame, analog, digital, recorded, live, and communicated in a data stream.

5. The method of claim 1, wherein the metadata describes the physical location and orientation of the camera during the acquiring, and is provided by at least one of a GPS system, a gyroscope, and an accelerometer.

6. The method of claim 1, wherein at least one of the currently observed scene, images, and metadata are stored on at least one of a server and the portable electronic device.

7. The method of claim 1, wherein the selected features include the stored physical location and orientation best matching one of (a) a current physical location and orientation of the portable electronic device and (b) a predicted physical location and orientation of the portable electronic device.

8. The method of claim 1, wherein the server searches for the selected features.

9. The method of claim 1, wherein the retrieved image is in a second data stream.

10. The method of claim 1, wherein the portable electronic device is at least one of a smartphone, a hand-held device, the camera, a second camera, a PDA, and a tablet computer.

11. The method of claim 1, wherein the manipulating includes adjusting image orientation.

12. The method of claim 1, wherein the combining includes superimposing the manipulated image on the currently observed scene.

13. The method of claim 1, wherein the combining includes merging a first data stream comprising the image of the currently observed real-world scene with a second data stream comprising the retrieved image.

14. The method of claim 1, wherein the combining includes displaying the manipulated image with the portable electronic device in one of a display and a viewfinder.

15. The method of claim 1, wherein the method operates continuously and substantially in real time.

16. The method of claim 1, wherein the method operates as the currently observed scene changes as the portable electronic device is moved, such motion including at least one of translating, tilting, panning, and zooming.

17. A system for providing an augmented reality interface, comprising: a processor; and a memory containing instructions that, when executed by the processor cause the processor to: retrieve, from a server, and cache data associated with a plurality of locations within a two-dimension grid based on a GPS coordinate of a portable electronic device, the cached data comprises a plurality of video snippets pre-loaded from the server based on a location information of the portable electronic device, each of the plurality of video snippets being a smaller video file of a corresponding stored video; overlay one or more of the plurality of video snippets corresponding to a location and a direction at which a corresponding video was originally captured based on the location information and an orientation information of the portable electronic device without manipulating the plurality of video snippets based on corner features of the video of the real-world scene; identify at least one retrieved video with metadata having selected features; manipulate a retrieved video corresponding to an displayed video snippet based on matching corner feature information of the retrieved videos generated and provided by the server and corner features of a currently observed real-world scene; and combine the manipulated video with the currently observed real-world scene viewed with the portable electronic device, wherein the metadata includes annotations by at least one of the server and a user who acquired the video; and wherein the cache data is purged based on a distance between the portable electronic device from the two-dimension grid.

18. A computer program product for providing an augmented reality interface, comprising a non-transitory computer readable medium embodying computer-executable program instructions thereon that, when executed, cause a computing device to: retrieve, from a server, and a plurality of videos and cache data associated with a plurality of locations within a two-dimension grid based on a GPS coordinate of a portable electronic device, the cached data comprises a plurality of video snippets from the server based on a location information of the portable electronic device, each of the plurality of video snippets being a smaller video file of a corresponding stored video; overlay one or more of the plurality of video snippets corresponding to a location and a direction at which a corresponding video was originally captured based on the location information and an orientation information of the portable electronic device without manipulating the plurality of video snippets based on corner features of the video of the real-world scene; identify at least one retrieved video with metadata having selected features; manipulate a retrieved video corresponding to an displayed video snippet based on matching corner feature information of the retrieved videos generated and provided by the server and corner features of a currently observed real-world scene; and combine the manipulated video with the currently observed real-world scene viewed with the portable electronic device, wherein the metadata includes annotations by at least one of the server and a user who acquired the video; and wherein the cache data is purged based on a distance between the portable electronic device from the two-dimension grid.

19. A system for providing an augmented reality interface, comprising: means for retrieving, from a server, and a least one stored video caching data associated with a plurality of locations within a two-dime
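The caching and purging steps recited in claim 1 (preloading snippets for locations in a two-dimensional grid around the device's GPS coordinate, then purging cached data based on the device's distance from the grid) could be sketched roughly as follows. This is an illustrative reading, not the patented implementation: the cell size `CELL_DEG`, the class `SnippetCache`, the `fetch` callback standing in for the server, and both radii are assumptions introduced here.

```python
import math
from typing import Callable, Dict, List, Tuple

Cell = Tuple[int, int]

CELL_DEG = 0.001  # hypothetical grid cell size in degrees


def cell_of(lat: float, lon: float) -> Cell:
    """Quantize a GPS coordinate to a grid cell index."""
    return (math.floor(lat / CELL_DEG), math.floor(lon / CELL_DEG))


class SnippetCache:
    """Keeps snippets for grid cells near the device, drops far ones."""

    def __init__(self, fetch: Callable[[Cell], List[str]],
                 preload_radius: int = 1, purge_radius: int = 3) -> None:
        self._fetch = fetch                   # stand-in for the server request
        self.preload_radius = preload_radius  # cells preloaded around the device
        self.purge_radius = purge_radius      # cells beyond this are purged
        self.cells: Dict[Cell, List[str]] = {}

    def update(self, lat: float, lon: float) -> None:
        """Call whenever the device reports a new GPS fix."""
        cx, cy = cell_of(lat, lon)
        r = self.preload_radius
        # Preload any cell in the surrounding square that is not cached yet.
        for dx in range(-r, r + 1):
            for dy in range(-r, r + 1):
                cell = (cx + dx, cy + dy)
                if cell not in self.cells:
                    self.cells[cell] = self._fetch(cell)
        # Purge cells whose distance (in cells) from the device is too large.
        for cell in list(self.cells):
            if max(abs(cell[0] - cx), abs(cell[1] - cy)) > self.purge_radius:
                del self.cells[cell]
```

Keeping the purge radius larger than the preload radius is a design choice made in this sketch so that small back-and-forth movements do not repeatedly evict and refetch the same cells.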

Classifications

  • Control arrangements or circuits for visual indicators common to cathode-ray tube indicators and other visual indicators (image data processing or generation, in general G06T) · CPC title

  • G06T19/006 · Primary

    Mixed reality (object pose determination, tracking or camera calibration for mixed reality G06T7/00) · CPC title

  • Overlay of images, i.e. displayed pixel being the result of switching between the corresponding input pixels · CPC title

  • wherein one of the images is motion video · CPC title

  • Aspects of interface with display user · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9280849B2 cover?
A system, method, and computer program product for automatically combining computer-generated imagery with real-world imagery in a portable electronic device by retrieving, manipulating, and sharing relevant stored videos, preferably in real time. A video is captured with a hand-held device and stored. Metadata including the camera's physical location and orientation is appended to a data strea…
Who is the assignee on this patent?
Adhikari Suranjit, Dunn Ted, Hsiao Eric, and 1 more
What technology area does this patent fall under?
Primary CPC classification G06T19/006. Mapped technology areas include Physics.
When was this patent published?
Publication date Mar 8, 2016 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).