Generation of inference model by machine learning using pile images

US2026073249A1 · US · A1

Patent metadata
Publication number: US-2026073249-A1
Application number: US-202519322713-A
Country: US
Kind code: A1
Filing date: Sep 9, 2025
Priority date: Sep 12, 2024
Publication date: Mar 12, 2026
Grant date: (none listed)

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system includes circuitry configured to: generate a plurality of workpiece images each of which shows a workpiece viewed from a different viewpoint; generate, based on the plurality of workpiece images, one or more virtual pile images showing a plurality of piled workpieces; and generate, by machine learning using the one or more virtual pile images, an inference model configured to infer workpiece information regarding one or more of the workpieces shown in the pile image.
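The middle step of the abstract, building a virtual pile image from per-viewpoint workpiece images, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the patent: the function names `fake_view` and `compose_pile`, the RGBA representation, the random placement, and the rule that later pastes occlude earlier ones. The sketch also returns an instance-id map, which is one plausible form of label for the subsequent machine-learning step.

```python
import numpy as np


def fake_view(size: int, angle: float) -> np.ndarray:
    """Hypothetical stand-in for one rendered viewpoint: an RGBA bar rotated by `angle`."""
    ys, xs = np.mgrid[0:size, 0:size]
    cx = cy = (size - 1) / 2.0
    # Pixel coordinates along and across the bar axis.
    along = (xs - cx) * np.cos(angle) + (ys - cy) * np.sin(angle)
    across = -(xs - cx) * np.sin(angle) + (ys - cy) * np.cos(angle)
    mask = (np.abs(along) < size * 0.45) & (np.abs(across) < size * 0.12)
    view = np.zeros((size, size, 4), dtype=np.uint8)
    view[mask, :3] = 200  # flat gray "workpiece" colour
    view[mask, 3] = 255   # alpha is opaque where the workpiece is visible
    return view


def compose_pile(views, n_items, canvas_hw, rng):
    """Paste randomly chosen views onto a canvas; later items occlude earlier ones.

    Returns the RGB pile image and an instance-id map (0 = background).
    This is an assumed compositing scheme, not the patent's method.
    """
    h, w = canvas_hw
    image = np.zeros((h, w, 3), dtype=np.uint8)
    instance_ids = np.zeros((h, w), dtype=np.int32)
    for item_id in range(1, n_items + 1):
        view = views[rng.integers(len(views))]
        vh, vw = view.shape[:2]
        top = int(rng.integers(0, h - vh + 1))
        left = int(rng.integers(0, w - vw + 1))
        mask = view[..., 3] > 0
        # Slices are views into the canvas, so these writes modify it in place.
        image[top:top + vh, left:left + vw][mask] = view[..., :3][mask]
        instance_ids[top:top + vh, left:left + vw][mask] = item_id
    return image, instance_ids


rng = np.random.default_rng(0)
views = [fake_view(32, a) for a in np.linspace(0.0, np.pi, 8, endpoint=False)]
pile_image, pile_labels = compose_pile(views, n_items=10, canvas_hw=(128, 128), rng=rng)
```

In this sketch the label map is produced at the same moment as the image, so occlusions are reflected in the labels without any manual annotation. A realistic pipeline would render views from a 3D model and simulate physically plausible piling rather than pasting at random positions.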

First claim

Opening claim text (preview).

What is claimed is:

1. A system comprising circuitry configured to: generate a plurality of workpiece images each of which shows a workpiece viewed from a different viewpoint; generate, based on the plurality of workpiece images, one or more virtual pile images showing a plurality of piled workpieces; and generate, by machine learning using the one or more virtual pile images, an inference model configured to infer workpiece information regarding one or more of the workpieces shown in the pile image.

2. The system according to claim 1, wherein the circuitry is configured to: generate the plurality of workpiece images for each of a plurality of classes of workpieces; generate the one or more virtual pile images showing the plurality of classes of workpieces that are piled; and generate, based on the one or more virtual pile images, the inference model so as to infer the workpiece information for each of the plurality of classes of workpieces.

3. The system according to claim 1, wherein the circuitry is configured to generate the plurality of workpiece images based on a 3D model of the workpiece.

4. The system according to claim 3, wherein the circuitry is configured to: acquire a plurality of captured images obtained by capturing an actual workpiece from different viewpoints; and generate the 3D model of the workpiece from the plurality of captured images of the workpiece.

5. The system according to claim 4, wherein the circuitry is configured to generate the 3D model of the workpiece by image synthesis using a neural radiance field based on the plurality of captured images of the workpiece.

6. The system according to claim 3, wherein the circuitry is configured to: approximate the 3D model of the workpiece with a plurality of particles connected by elastic parameters to deform the 3D model; and generate the plurality of workpiece images based on the deformed 3D model.

7. The system according to claim 3, wherein the circuitry is configured to: attach a ground truth label of the workpiece information of the workpiece to the 3D model of the workpiece; associate the ground truth label attached to the 3D model of the workpiece with each of the plurality of workpiece images of the workpiece; generate the one or more virtual pile images based on the plurality of workpiece images with which the ground truth label is associated; and generate the inference model by the machine learning using the one or more virtual pile images with which a plurality of the ground truth labels are associated.

8. The system according to claim 7, wherein the circuitry is configured to: select, in response to a user operation, one or more types of information items from a plurality of types of information items prepared in advance for the workpiece information; connect one or more heads corresponding to the selected one or more types of information items to an output layer of a network constituting the inference model; and generate the inference model including the network to which the one or more heads are connected.

9. The system according to claim 8, wherein the circuitry is configured to: select, as the one or more types of information items, an information item regarding at least one of a relative position and a relative region that are determined relative to the workpiece; and connect the head corresponding to the information item regarding at least one of the relative position and the relative region to the output layer.

10. The system according to claim 9, wherein the relative position is a working position where a task is executed on the workpiece, and wherein the head corresponding to the information item regarding the relative position is a head configured to recognize the working position.

11. The system according to claim 9, wherein the relative position is a skeleton of the workpiece, and wherein the head corresponding to the information item regarding the relative position is a head configured to recognize the skeleton.

12. The system according to claim 9, wherein the relative region is associated with one or more part-classes set for the workpiece, and wherein the head corresponding to the information item regarding the relative region is a head configured to recognize the one or more part-classes.

13. The system according to claim 8, wherein the circuitry is configured to: for each of a plurality of classes of workpieces, connect the one or more heads corresponding to the workpiece and the selected one or more types of information items to the output layer, and configure the inference model so as to output a class indicating a type of the workpiece from the output layer, and to switch the one or more heads for inferring the workpiece information according to the output class.

14. The system according to claim 1, wherein the circuitry is configured to configure the inference model so as not to present the workpiece information of a workpiece whose recognition score obtained by object detection does not satisfy a predetermined criterion.

15. The system according to claim 1, wherein the circuitry is configured to: input a real pile image showing a plurality of real workpieces into the generated inference model to infer the workpiece information regarding one or more of the real workpieces; and execute a task on at least one of the one or more real workpieces based on the inferred workpiece information.

16. The system according to claim 15, wherein the circuitry is configured to cause a machine to execute the task.

17. The system according to claim 16, wherein the machine is a robot, and wherein the task circuitry is configured to: generate a path of the robot for executing the task; and cause the robot to execute the task based on the generated path.

18. The system according to claim 5, wherein the circuitry is configured to generate, based on the 3D model of the workpiece, the plurality of workpiece images each of which shows the workpiece viewed from a viewpoint different from all of the viewpoints of the plurality of captured images of the workpiece.

19. A processor-executable method comprising: generating a plurality of workpiece images each of which shows a workpiece viewed from a different viewpoint; generating, based on the plurality of workpiece images, one or more virtual pile images showing a plurality of piled workpieces; and generating, by machine learning using the one or more virtual pile images, an inference model configured to infer workpiece information regarding one or more of the workpieces shown in the pile image.

20. A non-transitory computer-readable storage medium storing processor-executable instructions to: generate a plurality of workpiece images each of which shows a workpiece viewed from a different viewpoint; generate, based on the plurality of workpiece images, one or more virtual pile images showing a plurality of piled workpieces; and generate, by machine learning using the one or more virtual pile images, an inference model configured to infer workpiece information regarding one or more of the workpieces shown in the pile image.
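Claims 8, 13, and 14 together describe user-selected heads, per-class head switching, and suppression of low-scoring detections. The following is a rough sketch of how those three ideas might fit together, written in plain Python with functions standing in for network heads. The names `AVAILABLE_HEADS`, `Detection`, `InferenceModel`, and `build_model` are hypothetical, and the "predetermined criterion" of claim 14 is assumed here to be a simple minimum score threshold, which the claims do not specify.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# A head is modelled as a function of per-detection features (an assumption;
# in a real network it would be a branch attached to the output layer).
Head = Callable[[dict], object]

# Hypothetical catalogue of information items "prepared in advance" (claim 8).
AVAILABLE_HEADS: Dict[str, Head] = {
    "working_position": lambda feat: feat["center"],
    "skeleton": lambda feat: feat["keypoints"],
    "part_classes": lambda feat: feat["parts"],
}


@dataclass
class Detection:
    class_name: str   # class output by the detector
    score: float      # recognition score from object detection
    features: dict    # stand-in for the feature vector fed to the heads


@dataclass
class InferenceModel:
    heads_by_class: Dict[str, Dict[str, Head]]
    min_score: float = 0.5  # assumed form of the "predetermined criterion"

    def infer(self, detections: List[Detection]) -> List[dict]:
        results = []
        for det in detections:
            # Claim 14: do not present information for low-scoring workpieces.
            if det.score < self.min_score:
                continue
            # Claim 13: switch heads according to the output class.
            heads = self.heads_by_class.get(det.class_name, {})
            info = {name: head(det.features) for name, head in heads.items()}
            results.append({"class": det.class_name, "score": det.score, "info": info})
        return results


def build_model(selected_by_class: Dict[str, List[str]], min_score: float = 0.5) -> InferenceModel:
    """Claim 8: connect only the heads for the information items the user selected."""
    heads_by_class = {
        cls: {item: AVAILABLE_HEADS[item] for item in items}
        for cls, items in selected_by_class.items()
    }
    return InferenceModel(heads_by_class, min_score)


model = build_model(
    {"bolt": ["working_position"], "bracket": ["working_position", "part_classes"]},
    min_score=0.6,
)
detections = [
    Detection("bolt", 0.9, {"center": (10, 20), "keypoints": [], "parts": []}),
    Detection("bracket", 0.4, {"center": (1, 1), "keypoints": [], "parts": ["tab"]}),
    Detection("bracket", 0.8, {"center": (5, 5), "keypoints": [], "parts": ["flange"]}),
]
results = model.infer(detections)
```

With these inputs the second detection is dropped because its score is below the threshold, and the two remaining detections are routed to different head sets according to their class.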

Assignees

Yaskawa Electric Corp

Inventors

Classifications

  • Three-dimensional [3D] modelling for computer graphics · CPC title

  • Two-dimensional [2D] image generation · CPC title

  • G06N5/022 · Primary

    Knowledge engineering; Knowledge acquisition · CPC title

  • G06T15/20 · Primary

    Perspective computation · CPC title

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2026073249A1 cover?
A system includes circuitry configured to: generate a plurality of workpiece images each of which shows a workpiece viewed from a different viewpoint; generate, based on the plurality of workpiece images, one or more virtual pile images showing a plurality of piled workpieces; and generate, by machine learning using the one or more virtual pile images, an inference model configured to infer wor…
Who is the assignee on this patent?
Yaskawa Electric Corp
What technology area does this patent fall under?
Primary CPC classification G06N5/022. Mapped technology areas include Physics.
When was this patent published?
Publication date: Mar 12, 2026 (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).