Machine learning model training using an analog processor

US12373687B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12373687-B2
Application numberUS-202117537156-A
CountryUS
Kind codeB2
Filing dateNov 29, 2021
Priority dateNov 30, 2020
Publication dateJul 29, 2025
Grant dateJul 29, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Described herein are techniques of training a machine learning model and performing inference using an analog processor. Some embodiments mitigate the loss in performance of a machine learning model resulting from a lower precision of an analog processor by using an adaptive block floating-point representation of numbers for the analog processor. Some embodiments mitigate the loss in performance of a machine learning model due to noise that is present when using an analog processor. The techniques involve training the machine learning model such that it is robust to noise.

First claim

Opening claim text (preview).

What is claimed is: 1. A system comprising: circuitry comprising an analog processor; wherein the circuitry is configured to train a machine learning model, the training comprising performing one or more matrix operations to learn parameters of the machine learning model using the analog processor, wherein performing a matrix operation of the one or more matrix operations to learn the parameters of the machine learning model using the analog processor comprises: determining a scaling factor for a first portion of a first matrix involved in the matrix operation; scaling the first portion of the first matrix using the scaling factor for the first portion of the first matrix to obtain a scaled first portion of the first matrix; programming the analog processor using the scaled first portion of the first matrix; performing, by the analog processor programmed using the scaled first portion of the first matrix, the matrix operation to generate a first output; and determining a result of the matrix operation using the first output generated by the analog processor. 2. The system of claim 1 , wherein performing the matrix operation further comprises: determining a scaling factor for a first portion of a second matrix involved in the matrix operation; scaling the first portion of the second matrix using the scaling factor for the first portion of the second matrix to obtain a scaled first portion of the second matrix; programming the analog processor using the scaled first portion of the second matrix; and performing, by the analog processor programmed using the scaled first portion of the second matrix, the matrix operation to generate the first output. 3. The system of claim 1 , wherein performing the one or more matrix operations to learn the parameters of the machine learning model using the analog processor comprises: determining a scaling factor for a second portion of the first matrix; scaling the second portion of the first matrix using the scaling factor for the second portion of the first matrix to obtain a scaled second portion of the first matrix; programming the analog processor using the scaled second portion of the first matrix; performing, by the analog processor programmed using the scaled second portion of the first matrix, the matrix operation to generate a second output; and determining the result of the matrix operation using the second output generated by the analog processor. 4. The system of claim 3 , wherein the first scaling factor is different from the second scaling factor. 5. The system of claim 1 , wherein the first portion of the first matrix is a first vector of the first matrix. 6. The system of claim 1 , wherein determining the result of the matrix operation using the first output generated by the analog processor comprises: determining an output scaling factor for the first output generated by the analog processor using the first scaling factor; scaling the first output using the output scaling factor to obtain a scaled first output; and determining the result of the matrix operation using the scaled first output. 7. The system of claim 1 , wherein: determining the first scaling factor for the first portion of the first matrix comprises determining a maximum absolute value of the first portion of the first matrix; and scaling the first portion of the first matrix using the first scaling factor comprises scaling values of the first portion of the first matrix using the maximum absolute value of the first portion of the first matrix. 8. The system of claim 1 , wherein: the analog processor is configured to operate using a fixed-point representation of values; and programming the analog processor using the scaling of the first portion of the first matrix comprises converting values of the scaled first portion of the first matrix into the fixed-point representation. 9. The system of claim 8 , wherein: the circuitry further comprises a digital controller configured to operate using a floating-point representation of values; and a dynamic range of the floating-point representation is greater than a dynamic range of the fixed-point representation. 10. The system of claim 1 , wherein performing the one or more matrix operations to learn parameters of the machine learning model using the analog processor comprises amplifying or attenuating at least one analog signal used to perform a matrix operation of the one or more matrix operations. 11. The system of claim 10 , wherein amplifying or attenuating the at least one analog signal used to perform the matrix operation comprises: distributing a zero pad among different portions of a matrix involved in the matrix operation; and programming of the analog processor using the matrix with the zero pad distributed among different portions of the matrix. 12. The system of claim 1 , wherein performing the one or more matrix operations comprises performing the one or more matrix operations between a matrix of parameters of the machine learning model and a matrix of inputs to the machine learning model. 13. The system of claim 1 , wherein performing the one or more matrix operations to learn the parameters of the machine learning model using the analog processor comprises performing the one or more matrix operations to determine outputs of the machine learning model for a set of inputs. 14. The system of claim 1 , wherein performing the one or more matrix operations to learn the parameters of the machine learning model using the analog processor further comprises: performing the one or more matrix operations using the analog processor to determine a gradient of a loss function; and updating parameters of the machine learning model using the gradient of the loss function. 15. The system of claim 1 , wherein the training comprises performing a plurality of iterations, wherein performing each of at least some of the plurality of iterations comprises: determining updated parameters of the machine learning model; and setting parameters of the machine learning model to an average of the updated parameters and parameters set at one or more previous iterations of the plurality of iterations. 16. The system of claim 1 , wherein the matrix operation is a matrix multiplication. 17. The system of claim 1 , wherein the analog processor is a photonic processor, wherein performing the one or more matrix operations to learn the parameters of the machine learning model using the analog processor comprises processing light using the photonic processor. 18. The system of claim 1 , wherein the circuitry further comprises a digital controller. 19. The system of claim 1 , wherein the machine learning model is a neural network. 20. A system comprising: circuitry comprising an analog processor; wherein the circuitry is configured to train a machine learning model, the training comprising performing one or more matrix operations to learn parameters of the machine learning model using the analog processor, wherein performing the one or more matrix operations to learn parameters of the machine learning model using the analog processor comprises amplifying or attenuating at least one analog signal used to perform a matrix operation of the one or more matrix operations, wherein amplifying or attenuating the at least one analog signal used to perform the matrix operation comprises: programming the analog processor using multiple copies of a matrix involved in the matrix operation. 21. The system of claim 20 , wherein perf

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12373687B2 cover?
Described herein are techniques of training a machine learning model and performing inference using an analog processor. Some embodiments mitigate the loss in performance of a machine learning model resulting from a lower precision of an analog processor by using an adaptive block floating-point representation of numbers for the analog processor. Some embodiments mitigate the loss in performanc…
Who is the assignee on this patent?
Lightmatter Inc
What technology area does this patent fall under?
Primary CPC classification G06N3/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 29 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).