Inference computing apparatus, model training apparatus, inference computing system

US12236362B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12236362-B2
Application numberUS-201917044276-A
CountryUS
Kind codeB2
Filing dateDec 20, 2019
Priority dateDec 20, 2019
Publication dateFeb 25, 2025
Grant dateFeb 25, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An inference computing apparatus comprises at least one processor and a memory with program instructions stored therein, the program instructions can be executed by the at least one processor to cause the inference computing apparatus to perform the following operations: receiving a first inference model from a model training apparatus, the first inference model being obtained through a model training by the model training apparatus based on a first training sample library, the first training sample library comprising training samples from historical data generated in a manufacturing stage; performing an inference computing on data to be processed generated in the manufacturing stage based on the first inference model to obtain the inference result which is sent to a user-side device; and evaluating performance of the first inference model to determine whether the first inference model needs to be updated, and if yes, updating the first inference model.

First claim

Opening claim text (preview).

What is claimed is: 1. An inference computing apparatus, comprising at least one processor and a memory, wherein the memory has stored therein program instructions that, when executed by the at least one processor, cause the inference computing apparatus to perform following operations: receiving a first inference model from a model training apparatus, the first inference model being obtained by the model training apparatus through a first model training based on a first training sample library, the first training sample library including first training samples from historical data generated in a manufacturing stage, and the model training apparatus including a cloud device; performing an inference computing on data to be processed generated in the manufacturing stage based on the first inference model to obtain an inference result, and sending the inference result to a user-side device, the inference computing apparatus being closer to the user-side device than the model training apparatus; evaluating a performance of the first inference model to determine whether the first inference model needs to be updated; and updating the first inference model when the first inference model needs to be updated, wherein updating the first inference model includes: performing a second model training based on a second training sample library to obtain a second inference model, or sending a model update request to the model training apparatus to obtain the second inference model, the second training sample library including: second training samples from the historical data, third training samples being from the inference result and subjected to re-judging, or the second training samples from the historical data and the third training samples being from the inference result and subjected to re-judging; and updating the first inference model with the second inference model in a case where the second inference model meets an update condition, wherein the update condition includes that: a test is performed on the second inference model and the second inference model passes the test, wherein the test includes: evaluating a performance of the second inference model based on test samples; and when the performance of the second inference model meets evaluation requirements, determining that the second inference model passes the test; and a gray-scale deployment is performed on the second inference model, the performance of the second inference model is evaluated during the gray-scale deployment, and the performance of the second inference model meets the evaluation requirements, wherein the gray-scale deployment is that the inference computing apparatus performs a simulation processing using the second inference model within a preset period of time; wherein before performing the second model training to obtain the second inference model, or before sending the model update request to the model training apparatus to obtain the second inference model, the inference computing apparatus further performs following operations: determining whether training parameters required for the second model training to be performed are within a preset range of the training parameters; and when the training parameters are within the preset range, performing the second model training; when the training parameters are not within the preset range, sending the model update request to the model training apparatus, wherein the training parameters include at least one of data size, training duration and computing power required for the second model training, and the preset range of the training parameters is a range of the training parameters which corresponds to the case where a training capacity of the inference computing apparatus meets the requirements of the second model training to be performed. 2. The inference computing apparatus according to claim 1 , wherein the historical data includes a product image marked with defect information having a correct result; the data to be processed includes an original product image generated in the manufacturing stage; and the inference result includes defect information of the original product image. 3. The inference computing apparatus according to claim 2 , wherein after obtaining the inference result, the inference computing apparatus further performs following operations: generating a decision instruction according to the inference result; wherein the defect information of the original product image includes: the identified original product image having a defect, and defect location and defect type; and the decision instruction includes: performing a corresponding defect processing on a product corresponding to the original product image having the defect according to the defect information of the original product image. 4. The inference computing apparatus according to claim 1 , wherein the update condition further includes that: configuration information of the second inference model is verified, and the configuration information matches with the inference computing to be performed. 5. The inference computing apparatus according to claim 1 , wherein evaluation parameters used for evaluating the performance of the first inference model include at least one of accuracy rate, precision rate, recall rate and F Score of the first inference model during an inference computing process; a case where the first inference model needs to be updated according to the performance evaluation includes: the performance of the first inference model fluctuates or decreases, wherein whether the performance of the first inference model fluctuates or decreases is determined according to a variation of the evaluation parameters in a continuous period of time. 6. The inference computing apparatus according to claim 1 , wherein updating the first inference model according to the performance evaluation includes: receiving a third inference model, the third inference model being obtained by the model training apparatus through a third model training based on an updated first training sample library in a case where the model training apparatus has not received the model update request, and the updated first training sample library including the third training samples being from the inference result and subjected to re-judging; comparing the performance of the first inference model with a performance of the third inference model, and when the performance of the third inference model is better than the performance of the first inference model, or when an abnormality occurs in the first inference model, updating the first inference model with the third inference model. 7. The inference computing apparatus according to claim 1 , wherein in a case where the first inference model is updated with the second inference model, the inference computing apparatus further performs following operations: performing the performance evaluation on the second inference model, and comparing the performance of the first inference model with the performance of the second inference model; when the performance of the second inference model is worse than the performance of the first inference model, rolling back the second inference model to the first inference model; and performing the second model training again, or requesting the model training apparatus to perform the second model training again. 8. A model training apparatus, comprising at least one processor and a memory, wherein the memory has stored therein program instructions that, when executed by the at least one processor, cause the model training apparatus to perform following operations: performing a first model training based on a first training sample library to obtain a first inference model, the first training sample librar

Assignees

Inventors

Classifications

  • Generative networks · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • Supervised learning · CPC title

  • Learning methods · CPC title

  • G06T7/0004Primary

    Industrial image inspection · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12236362B2 cover?
An inference computing apparatus comprises at least one processor and a memory with program instructions stored therein, the program instructions can be executed by the at least one processor to cause the inference computing apparatus to perform the following operations: receiving a first inference model from a model training apparatus, the first inference model being obtained through a model t…
Who is the assignee on this patent?
Boe Technology Group Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06T7/0004. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 25 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).