Generating a virtual world to assess real-world video analysis performance

US10019652B2 · US · B2

Patent metadata
Field | Value
Publication number | US-10019652-B2
Application number | US-201615051005-A
Country | US
Kind code | B2
Filing date | Feb 23, 2016
Priority date | Feb 23, 2016
Publication date | Jul 10, 2018
Grant date | Jul 10, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system and method are suited for assessing video performance analysis. A computer graphics engine clones real-world data in a virtual world by decomposing the real-world data into visual components and objects in one or more object categories and populates the virtual world with virtual visual components and virtual objects. A scripting component controls the virtual visual components and the virtual objects in the virtual world based on the set of real-world data. A synthetic clone of the video sequence is generated based on the script controlling the virtual visual components and the virtual objects. The real-world data is compared with the synthetic clone of the video sequence and a transferability of conclusions from the virtual world to the real-world is assessed based on this comparison.
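The final step of the abstract, comparing real-world data with its synthetic clone to assess transferability, can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the abstract does not name a metric or a threshold, so `SequenceScore`, `transfer_gap`, `is_transferable`, and the `tolerance` value are hypothetical, and the gap is simply the mean absolute difference between an algorithm's score on each real sequence and on its clone.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class SequenceScore:
    """Score of one algorithm on a real sequence and on its synthetic clone.

    Hypothetical container; the patent text does not define this structure.
    """
    name: str
    real: float   # score measured on the real-world video
    clone: float  # score measured on the synthetic clone of that video


def transfer_gap(scores):
    """Mean absolute real-vs-clone score difference (smaller means closer)."""
    return mean(abs(s.real - s.clone) for s in scores)


def is_transferable(scores, tolerance=0.05):
    """Assumed decision rule: the gap must not exceed an arbitrary tolerance."""
    return transfer_gap(scores) <= tolerance


# Illustrative numbers only, not results from the patent.
example = [
    SequenceScore("seq_a", real=0.80, clone=0.78),
    SequenceScore("seq_b", real=0.60, clone=0.66),
]
gap = transfer_gap(example)
```

A small gap suggests that conclusions drawn in the virtual world are more likely to carry over to real video; the tolerance is a design choice that would depend on the task being evaluated.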

First claim

Opening claim text (preview).

What is claimed is:

1. A method of assessing video performance analysis comprising: with a camera, acquiring a first set of real-world data including a video sequence, the video sequence including visual components and objects; with at least one sensor, automatically generating a second set of real-world data including a set of physical measurements for the objects and visual components of the video sequence; with a computer graphics engine, cloning the first and second sets of real-world data in a virtual world, comprising: decomposing the visual components and objects in the video sequence of the first set of real-world data into at least one object category, populating the virtual world with virtual visual components and virtual objects, generating a script for controlling the virtual visual components and the virtual objects in the virtual world based on the acquired visual components and objects in the first set of real-world data and the automatically generated set of physical measurements for the objects and visual components in the second set of real-world data, and generating a synthetic clone of the video sequence based on the script for controlling the virtual visual components and the virtual objects; generating a set of ground truth annotations for the virtual objects in the synthetic clone; comparing the real-world data with the synthetic clone of the video sequence; and assessing a transferability of conclusions from the virtual world to the real-world based on the comparison of the real-world data with the synthetic clone of the video sequence.
2. The method of claim 1, wherein the acquiring the first and second sets of real-world data further comprises annotating the objects in an object category with the set of physical measurements from the at least one sensor, the at least one sensor including at least one of a global positioning system, an inertia measurement unit, an ultraviolet camera, and a 3D laser scanner, and wherein the camera includes at least one of a monochrome camera, a color camera, and an infrared camera.
3. The method of claim 2, wherein the annotating with the set of physical measurements includes annotating the objects with at least one of a position, size, inertia, and orientation of each of the objects in the at least one object category.
4. The method of claim 1, further comprising changing a condition of one of the virtual visual components or the virtual objects.
5. The method of claim 4, further comprising generating a modified synthetic video based on the changed condition and generating a set of ground truth annotations for the virtual objects in the modified synthetic video.
6. The method of claim 4, wherein the changing a condition includes changing a position, orientation, trajectory, size, color, or shape of at least one of the virtual objects.
7. The method of claim 4, wherein the changing a condition of one of the virtual visual components includes changing a lighting or a weather condition.
8. The method of claim 4, wherein the changing a condition includes manually adding, modifying or removing at least one of the virtual objects.
9. The method of claim 1, further comprising performing a specific task with an algorithm on the first and second sets of real-world data and performing the specific task with the algorithm on the synthetic clone of the video sequence.
10. The method of claim 9, further comprising evaluating the performance of the algorithm against the set of ground truth annotations for the virtual objects in the synthetic clone.
11. The method of claim 9, wherein the algorithm is for multi-object tracking.
12. The method of claim 5, further comprising performing a specific task with an algorithm on the modified synthetic video.
13. The method of claim 12, further comprising evaluating the performance of the algorithm against the set of ground truth annotations for the virtual objects the modified synthetic video.
14. The method of claim 12, wherein the algorithm is for multi-object tracking.
15. The method of claim 1, wherein the at least one object category is selected from a vehicle category, an animal category, a structure category, a signage category, and an environmental category.
16. A computer program product comprising a non-transitory recording medium storing instructions, which when executed on a computer, causes the computer to perform the method of claim 1.
17. A system comprising memory which stores instructions for performing the method of claim 1 and a processor in communication with the memory which executes the instructions.
18. A system for assessing video performance analysis comprising: a computer graphics engine component, which: clones first and second sets of real-world data in a virtual world, the first set of real-world data being acquired by a camera and including a video sequence having visual components and objects, the second set of real-world data being automatically generated by at least one sensor and including sensor data having a set of physical measurements for the objects and visual components of the video sequence, decomposes the visual components and objects in the video sequence of the first set of real world data into at least one object category, and populates the virtual world with virtual visual components and virtual objects; a scripting component which: generates a script to control the virtual visual components and virtual objects in the virtual world based on the acquired visual components and objects in the first set of real-world data and the automatically generated sensor data for the objects and visual components in the second set of real-world data, and generates a synthetic video sequence clone; a modification component for changing a condition of one of the virtual visual components and the virtual objects; an annotation component for generating a set of ground truth annotations for the virtual objects; a performance component for performing a specific task with an algorithm and assessing a transferability of conclusions based on a performance of the algorithm; and a processor which implements the computer graphics engine component, the scripting component, the modification component, the annotation component, and the performance component.
19. The system of claim 18 further comprising a seeding component for annotating the first and second sets of real-world data, enabling the graphics engine component to initialize the virtual world.
20. A method of assessing video performance analysis comprising: acquiring a first set of real-world data with a camera, the first set of real-world data including a video sequence, the video sequence including visual components and objects; automatically generating a second set of real-world data with at least one sensor, the second set of real-world data including sensor data, the sensor data including a set of physical measurements for the objects and visual components of the video sequence; with a computer graphics engine, cloning the first and second sets of real-world data in a virtual world, comprising: decomposing the visual components and objects in the video sequence of the first set of real world data into at least one object category, and populating the virtual world with virtual visual components and virtual objects; generating a script for controlling the virtual visual components and the virtual objects in the virtual world based on the acquired visual components and objects in the first set of real-world da
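Several of the claimed steps can be pictured together in a small sketch: ground truth annotations read directly from a scripted virtual scene (claim 1), a changed weather condition (claims 4 and 7), and scoring a multi-object tracker against the annotations (claims 10 and 11). This is an illustration under stated assumptions, not the patented implementation: `VirtualObject`, `Scene`, `ground_truth`, `change_weather`, and `count_annotations` are hypothetical names, and MOTA (the CLEAR MOT accuracy score) is a swapped-in metric that the claims do not name.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class VirtualObject:
    """Hypothetical scripted object: one (x, y) position per frame."""
    track_id: int
    category: str      # e.g. "vehicle", one of the categories in claim 15
    positions: tuple   # assumed to come from sensor measurements


@dataclass(frozen=True)
class Scene:
    """Hypothetical virtual world state driven by the script."""
    objects: tuple
    weather: str = "clear"


def ground_truth(scene):
    """Build per-frame annotations straight from the scripted state."""
    frames = {}
    for obj in scene.objects:
        for t, pos in enumerate(obj.positions):
            frames.setdefault(t, []).append((obj.track_id, obj.category, pos))
    return frames


def change_weather(scene, weather):
    """Return a modified copy of the scene; the original is left untouched."""
    return replace(scene, weather=weather)


def count_annotations(frames):
    """Total number of annotated object instances across all frames."""
    return sum(len(items) for items in frames.values())


def mota(misses, false_positives, id_switches, gt_count):
    """CLEAR MOT accuracy: 1 - (FN + FP + IDSW) / GT (assumed metric)."""
    if gt_count <= 0:
        raise ValueError("gt_count must be positive")
    return 1.0 - (misses + false_positives + id_switches) / gt_count
```

Because the annotations are derived from the script rather than from pixels, changing only the weather leaves them unchanged, so the same tracker can be scored under clear and foggy renderings against identical ground truth.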

Assignees

Xerox Corp

Inventors

Classifications

  • G06T7/20 (Primary)

    Analysis of motion (motion estimation for coding, decoding, compressing or decompressing digital video signals H04N19/43, H04N19/51) · CPC title

  • Validation; Performance evaluation; Active pattern learning techniques · CPC title

  • Manipulating three-dimensional [3D] models or images for computer graphics · CPC title

  • Video; Image sequence · CPC title

  • Traffic on road, railway or crossing · CPC title

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10019652B2 cover?
A system and method are suited for assessing video performance analysis. A computer graphics engine clones real-world data in a virtual world by decomposing the real-world data into visual components and objects in one or more object categories and populates the virtual world with virtual visual components and virtual objects. A scripting component controls the virtual visual components and the…
Who is the assignee on this patent?
Xerox Corp
What technology area does this patent fall under?
Primary CPC classification G06T7/20. Mapped technology areas include Physics.
When was this patent published?
Publication date: Jul 10, 2018 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).