Dynamic precision for neural network compute operations

US12346798B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12346798-B2
Application numberUS-202318351124-A
CountryUS
Kind codeB2
Filing dateJul 12, 2023
Priority dateApr 24, 2017
Publication dateJul 1, 2025
Grant dateJul 1, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In an example, an apparatus comprises a compute engine comprising a high precision component and a low precision component; and logic, at least partially including hardware logic, to receive instructions in the compute engine; select at least one of the high precision component or the low precision component to execute the instructions; and apply a gate to at least one of the high precision component or the low precision component to execute the instructions. Other embodiments are also disclosed and claimed.

First claim

Opening claim text (preview).

The invention claimed is: 1. An apparatus comprising: processor circuitry coupled to a memory, the processor circuitry to: track precision level data relating to one or more neural network operations; and expose the precision level data in one or more model specific registers, wherein the one or more model specific registers are used to tune one or more neural network applications executing on the processor circuitry. 2. The apparatus of claim 1 , wherein tracking further includes tracking one or more weights associated with the one or more neural network operations, wherein tracking is performed while a neural network application is being executed on the processor circuitry. 3. The apparatus of claim 1 , wherein the processor circuitry is further to automatically schedule, based on the precision level data, runtimes for workloads associated with the processor circuitry. 4. The apparatus of claim 1 , wherein the processor circuitry comprises one or more of graphics processor circuitry or application processor circuitry. 5. A method comprising: tracking, by a processor of a computing device, precision level data relating to one or more neural network operations; and exposing the precision level data in one or more model specific registers, wherein the one or more model specific registers are used to tune one or more neural network applications executing on the processor. 6. The method of claim 5 , wherein tracking further includes tracking one or more weights associated with the one or more neural network operations, wherein tracking is performed while a neural network application is being executed on the processor. 7. The method of claim 5 , further comprising automatically scheduling, based on the precision level data, runtimes for workloads associated with the processor. 8. The method of claim 5 , wherein the processor comprises a graphics processor coupled to a memory and an application processor. 9. At least one non-transitory computer-readable medium having stored thereon instructions which, when executed, cause a computing device to perform operations comprising: tracking precision level data relating to one or more neural network operations; and exposing the precision level data in one or more model specific registers, wherein the one or more model specific registers are used to tune one or more neural network applications executing on the one or more processors. 10. The non-transitory computer-readable medium of claim 9 , wherein tracking further includes tracking one or more weights associated with the one or more neural network operations, wherein tracking is performed while a neural network application is being executed on one or more processors of the computing device. 11. The non-transitory computer-readable medium of claim 9 , wherein the operations further comprise automatically scheduling, based on the precision level data, runtimes for workloads associated with one or more processors. 12. The non-transitory computer-readable medium of claim 9 , wherein the one or more processors comprise one or more graphics processors or one or more application processors.

Assignees

Inventors

Classifications

  • Learning methods · CPC title

  • Supervised learning · CPC title

  • Weakly supervised learning, e.g. semi-supervised or self-supervised learning · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12346798B2 cover?
In an example, an apparatus comprises a compute engine comprising a high precision component and a low precision component; and logic, at least partially including hardware logic, to receive instructions in the compute engine; select at least one of the high precision component or the low precision component to execute the instructions; and apply a gate to at least one of the high precision com…
Who is the assignee on this patent?
Intel Corp
What technology area does this patent fall under?
Primary CPC classification G06T1/20. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 01 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).