Image processing method and image processing apparatus
US-12169910-B2 · Dec 17, 2024 · US
US2026057639A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2026057639-A1 |
| Application number | US-202519289499-A |
| Country | US |
| Kind code | A1 |
| Filing date | Aug 4, 2025 |
| Priority date | Aug 22, 2024 |
| Publication date | Feb 26, 2026 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An information processing device includes a memory configured to store instructions; and one or more processors configured to execute the instructions to: predict a trajectory of an object included in at least one of a plurality of target images with reference to a plurality of target images; extract a feature of the trajectory and calculate a reliability of the trajectory based on the feature; generate a virtual label in which each of the plurality of target images, a position of the object included in the target image, the trajectory, and the reliability are associated with each other; and learn a state transition model that predicts a state of an object included in a plurality of images by using the virtual label.
Opening claim text (preview).
1 . An information processing device comprising: a memory configured to store instructions; and one or more processors configured to execute the instructions to: predict a trajectory of an object included in at least one of a plurality of target images with reference to the plurality of target images; extract a feature of the trajectory; calculate a reliability of the trajectory based on the feature; generate a virtual label in which each of the plurality of target images, a position of the object included in the target image, the trajectory, and the reliability are associated with each other; and learn a state transition model that predicts a state of an object included in a plurality of images by using the virtual label. 2 . The information processing device according to claim 1 , wherein the one or more processors are further configured to execute the instructions to: integrate a plurality of trajectories; predict one or a plurality of trajectories of an object included in at least one of a plurality of target images with reference to the plurality of target images; extract a feature of each of the one or a plurality of trajectories; calculate a reliability of each of the one or a plurality of trajectories based on the feature; integrate trajectories having similar features and high reliabilities among the one or a plurality of trajectories; and generate a virtual label in which each of the plurality of target images, a position of the object included in the target image, the integrated trajectory, and the reliability are associated with each other. 3 . The information processing device according to claim 1 , wherein the one or more processors are further configured to execute the instructions to: predict a trajectory of an object included in at least one of a plurality of target images by inputting the plurality of target images to a prediction model that predicts a trajectory of an object included in a plurality of images using the plurality of images as inputs. 4 . The information processing device according to claim 3 , wherein the one or more processors are further configured to execute the instructions to: learn the prediction model using the virtual label. 5 . The information processing device according to claim 3 , wherein the one or more processors are further configured to execute the instructions to: perform knowledge distillation of a state transition model lighter than the prediction model by using the virtual label. 6 . The information processing device according to claim 2 , wherein the one or more processors are further configured to execute the instructions to: acquire information regarding the object detected by a sensor; and refer to the information to integrate trajectories. 7 . The information processing device according to claim 1 , wherein the plurality of target images are images captured by a plurality of cameras that capture the object from different positions, and the one or more processors are further configured to execute the instructions to: calibrate postures and camera parameters of the plurality of cameras using the virtual label. 8 . The information processing device according to claim 7 , wherein the plurality of cameras include a camera that captures the object from above. 9 . An information processing method comprising: predicting a trajectory of an object included in at least one of a plurality of target images with reference to the plurality of target images; extracting a feature of the trajectory; calculating a reliability of the trajectory based on the feature; generating a virtual label in which each of the plurality of target images, a position of the object included in the target image, the trajectory, and the reliability are associated with each other; and learning a state transition model that predicts a state of an object included in a plurality of images by using the virtual label. 10 . A non-transitory computer-readable recording medium storing a program for causing a computer to execute the steps of: predicting a trajectory of an object included in at least one of a plurality of target images with reference to the plurality of target images; extracting a feature of the trajectory; calculating a reliability of the trajectory based on the feature; generating a virtual label in which each of the plurality of target images, a position of the object included in the target image, the trajectory, and the reliability are associated with each other; and learning a state transition model that predicts a state of an object included in a plurality of images by using the virtual label.
Related publications grouped by family.
Answers are generated from the same data shown on this page.