What technology area does this patent fall under?

Primary CPC classification G06T17/00. Mapped technology areas include Physics.

When was this patent published?

Publication date Thu Sep 16 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Providing a 2-dimensional dataset from 2-dimensional and 3-dimensional computer vision techniques

US2021287433A1 · US · A1

Patent metadata
Field	Value
Publication number	US-2021287433-A1
Application number	US-202016814051-A
Country	US
Kind code	A1
Filing date	Mar 10, 2020
Priority date	Mar 10, 2020
Publication date	Sep 16, 2021
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Implementations generally provide a 2-dimensional dataset from 2-dimensional and 3-dimensional computer vision techniques. In some implementations, a method includes obtaining a plurality of 2-dimensional (2D) videos of a subject performing at least one action. The method further includes generating a 3-dimensional (3D) model based on the plurality of 2D videos. The method further includes generating a 3D scene based on the 3D model. The method further includes generating a 2D dataset based on the 3D scene.

First claim

Opening claim text (preview).

What is claimed is: 1 . A system comprising: one or more processors; and logic encoded in one or more non-transitory computer-readable storage media for execution by the one or more processors and when executed operable to cause the one or more processors to perform operations comprising: obtaining a plurality of 2-dimensional (2D) videos of a subject performing at least one action; generating a 3-dimensional (3D) model based on the plurality of 2D videos; generating a 3D scene based on the 3D model; and generating a 2D dataset based on the 3D scene. 2 . The system of claim 1 , wherein the plurality of 2D videos is synchronized. 3 . The system of claim 1 , wherein the plurality of 2D videos is obtained from a plurality of physical cameras that are positioned at arbitrary locations in a physical environment. 4 . The system of claim 1 , wherein the logic when executed is further operable to cause the one or more processors to perform operations comprising obtaining one or more annotations associated with the plurality of 2D videos. 5 . The system of claim 1 , wherein the logic when executed is further operable to cause the one or more processors to perform operations comprising: determining one or more model data modifications to 3D model data; and applying the one or more model data modifications to the 3D model data. 6 . The system of claim 1 , wherein the logic when executed is further operable to cause the one or more processors to perform operations comprising: determining one or more scene settings; generating one or more virtual cameras; and adding the one or more virtual cameras to the 3D scene set based on the scene settings. 7 . The system of claim 1 , wherein the logic when executed is further operable to cause the one or more processors to perform operations comprising: obtaining one or more annotations associated with the plurality of 2D videos; and applying the one or more annotations to the 2D dataset. 8 . A non-transitory computer-readable storage medium with program instructions stored thereon, the program instructions when executed by one or more processors are operable to cause the one or more processors to perform operations comprising: obtaining a plurality of 2-dimensional (2D) videos of a subject performing at least one action; generating a 3-dimensional (3D) model based on the plurality of 2D videos; generating a 3D scene based on the 3D model; and generating a 2D dataset based on the 3D scene. 9 . The computer-readable storage medium of claim 8 , wherein the plurality of 2D videos is synchronized. 10 . The computer-readable storage medium of claim 8 , wherein the plurality of 2D videos is obtained from a plurality of physical cameras that are positioned at arbitrary locations in a physical environment. 11 . The computer-readable storage medium of claim 8 , wherein the instructions when executed are further operable to cause the one or more processors to perform operations comprising obtaining one or more annotations associated with the plurality of 2D videos. 12 . The computer-readable storage medium of claim 8 , wherein the instructions when executed are further operable to cause the one or more processors to perform operations comprising: determining one or more model data modifications to 3D model data; and applying the one or more model data modifications to the 3D model data. 13 . The computer-readable storage medium of claim 8 , wherein the instructions when executed are further operable to cause the one or more processors to perform operations comprising: determining one or more scene settings; generating one or more virtual cameras; and adding the one or more virtual cameras to the 3D scene set based on the scene settings. 14 . The computer-readable storage medium of claim 8 , wherein the instructions when executed are further operable to cause the one or more processors to perform operations comprising: obtaining one or more annotations associated with the plurality of 2D videos; and applying the one or more annotations to the 2D dataset. 15 . A computer-implemented method comprising: obtaining a plurality of 2-dimensional (2D) videos of a subject performing at least one action; generating a 3-dimensional (3D) model based on the plurality of 2D videos; generating a 3D scene based on the 3D model; and generating a 2D dataset based on the 3D scene. 16 . The method of claim 15 , wherein the plurality of 2D videos is synchronized. 17 . The method of claim 15 , wherein the plurality of 2D videos is obtained from a plurality of physical cameras that are positioned at arbitrary locations in a physical environment. 18 . The method of claim 15 , further comprising obtaining one or more annotations associated with the plurality of 2D videos. 19 . The method of claim 15 , further comprising: determining one or more model data modifications to 3D model data; and applying the one or more model data modifications to the 3D model data. 20 . The method of claim 15 , further comprising: determining one or more scene settings; generating one or more virtual cameras; and adding the one or more virtual cameras to the 3D scene set based on the scene settings.

Assignees

Sony Corp

Inventors

Classifications

G06V20/647
by matching two-dimensional images to three-dimensional objects · CPC title
G06V20/52
Surveillance or monitoring of activities, e.g. for recognising suspicious objects (recognising microscopic objects G06V20/69) · CPC title
G06V20/49
Segmenting video sequences, i.e. computational techniques such as parsing or cutting the sequence, low-level clustering or determining units such as shots or scenes · CPC title
G06V10/774
Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title
G06T2207/30196
Human being; Person · CPC title

Patent family

Related publications grouped by family.

View patent family 77664852

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2021287433A1 cover?: Implementations generally provide a 2-dimensional dataset from 2-dimensional and 3-dimensional computer vision techniques. In some implementations, a method includes obtaining a plurality of 2-dimensional (2D) videos of a subject performing at least one action. The method further includes generating a 3-dimensional (3D) model based on the plurality of 2D videos. The method further includes gene…
Who is the assignee on this patent?: Sony Corp
What technology area does this patent fall under?: Primary CPC classification G06T17/00. Mapped technology areas include Physics.
When was this patent published?: Publication date Thu Sep 16 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).