Enhanced computer vision application programming interface

US12530732B2 · US · B2

Patent metadata
Publication number: US-12530732-B2
Application number: US-202318178059-A
Country: US
Kind code: B2
Filing date: Mar 3, 2023
Priority date: Apr 26, 2022
Publication date: Jan 20, 2026
Grant date: Jan 20, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An image processing system includes one or more processors operative to receive a graph application programming interface (API) call to add a complex node to a graph. The graph includes at least the complex node connected to other nodes by edges that are directed and acyclic. The one or more processors are further operative to process, by a graph compiler at compile time, the complex node by iteratively expanding the complex node into multiple nodes with each node corresponding to one operation in an image processing pipeline. The system further includes one or more target devices to execute executable code compiled from each node to perform operations of the image processing pipeline. The system further includes memory to store the graph compiler and the executable code.
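The iterative expansion described in the abstract can be pictured with a short sketch. This is an illustration only, not the patented compiler: the `Node` class, its `body` field, and the `expand` function are hypothetical names, and the pipeline is simplified to a linear chain in which edges are implied by list order.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    name: str
    # Hypothetical convention: a non-empty `body` marks a complex node.
    # Each entry is a node it expands into, which may itself be complex.
    body: list["Node"] = field(default_factory=list)

    @property
    def is_complex(self) -> bool:
        return bool(self.body)


def expand(pipeline: list[Node]) -> list[Node]:
    """Replace complex nodes pass by pass until every node is one operation."""
    nodes = list(pipeline)
    while any(n.is_complex for n in nodes):
        expanded = []
        for n in nodes:
            # A complex node is swapped for its contents; simple nodes stay.
            expanded.extend(n.body if n.is_complex else [n])
        nodes = expanded
    return nodes


# A neural network model node containing a nested block (assumed example data).
model = Node("nn_model", body=[
    Node("conv"),
    Node("block", body=[Node("relu"), Node("pool")]),
])
pipeline = [Node("color_convert"), model, Node("resize")]

flat = [n.name for n in expand(pipeline)]
print(flat)  # ['color_convert', 'conv', 'relu', 'pool', 'resize']
```

Two passes are needed here because the nested `block` only becomes visible after `nn_model` has been expanded, which is why the loop repeats until no complex node remains.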

First claim

Opening claim text (preview).

What is claimed is:

1. A method for image processing, comprising: receiving a graph application programming interface (API) call that specifies a complex node representing a multi-layered neural network model used in an image processing pipeline; adding the complex node to a graph that contains other nodes representing computer vision operations, wherein the graph is directed and acyclic; attaching a buffer to the complex node to store code and parameters of the complex node; generating, by a graph compiler processing contents of the buffer at compile time, a sequence of nodes from the complex node to add to the graph, each node in the sequence corresponding to one operation layer of the neural network model; and executing executable code compiled from each node in the graph on one or more target devices to perform computer vision operations and neural network operations in the image processing pipeline.

2. The method of claim 1, further comprising: adding to the graph another complex node that represents a subgraph of nodes corresponding to a plurality of computer vision operations.

3. The method of claim 1, further comprising: adding to the graph another complex node that corresponds to customer-defined code for image processing.

4. The method of claim 1, wherein operations corresponding to the complex node are selected from a first library of functions, and operations corresponding to the other nodes in the graph are selected from a second library of functions, and wherein the first library and the second library are based on different programming models optimized for different types of operations.

5. The method of claim 4, wherein the first library of functions is provided by a deep learning framework, and the second library of functions is provided by a computer vision framework.

6. The method of claim 1, wherein the graph API call identifies a graph-based programming model used by operations corresponding to the complex node.

7. The method of claim 1, further comprising: receiving another graph API call that identifies another complex node as one of following: a proprietary computer vision (CV) node, a customer-defined node, a neural network model node, and a VX graph node based on OpenVX.

8. A system for image processing, comprising: one or more processors to: receive a graph application programming interface (API) call that specifies a complex node representing a multi-layered neural network model used in an image processing pipeline; add a complex node to a graph that contains other nodes representing computer vision operations, wherein the graph is directed and acyclic; attach a buffer to the complex node to store code and parameters of the complex node; and generate, by a graph compiler processing contents of the buffer at compile time, a sequence of nodes from the complex node to add to the graph, each node in the sequence corresponding to one operation layer of the neural network model; one or more target devices to execute executable code compiled from each node in the graph to perform computer vision operations and neural network operations in the image processing pipeline; and memory to store the graph compiler and the executable code.

9. The system of claim 8, wherein the one or more processors are further operative to add to the graph another complex node that represents a subgraph of nodes corresponding to a plurality of computer vision operations.

10. The system of claim 8, wherein the one or more processors are further operative to add to the graph another complex node that corresponds to customer-defined code for image processing.

11. The system of claim 8, wherein operations corresponding to the complex node are selected from a first library of functions, and operations corresponding to the other nodes in the graph are selected from a second library of functions, and wherein the first library and the second library are based on different programming models optimized for different types of operations.

12. The system of claim 11, wherein the first library of functions is optimized for deep learning, and the second library of functions is optimized for computer vision.

13. The system of claim 8, wherein the graph API call identifies a graph-based programming model used by operations corresponding to the complex node.

14. The system of claim 8, wherein the one or more processors are further operative to receive another graph API call that identifies another complex node as one of following: a proprietary computer vision (CV) node, a customer-defined node, a neural network model node, and a VX graph node based on OpenVX.
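As a rough illustration of the claimed steps (an API call adds a complex node, a buffer is attached, and the compiler turns the buffer into one node per layer inside a directed acyclic graph), the sketch below may help. Everything in it is an assumption made for illustration: the `Graph` class, the `add_complex_node` method, the `NodeKind` names, and the JSON layout of the buffer are hypothetical and are not taken from the patent or from OpenVX. Compilation to executable code and dispatch to target devices are left out; the sketch stops at the execution order.

```python
import json
from enum import Enum
from graphlib import TopologicalSorter


class NodeKind(Enum):
    # The four kinds listed in the dependent claims (names are hypothetical).
    PROPRIETARY_CV = "proprietary_cv"
    CUSTOMER_DEFINED = "customer_defined"
    NN_MODEL = "nn_model"
    VX_GRAPH = "vx_graph"


class Graph:
    def __init__(self):
        self.preds = {}    # node name -> set of predecessor node names
        self.buffers = {}  # complex node name -> (kind, attached buffer)

    def add_node(self, name, after=()):
        self.preds[name] = set(after)

    def add_complex_node(self, name, kind, buffer, after=()):
        """Stand-in for the graph API call: add the node and attach its buffer."""
        self.add_node(name, after)
        self.buffers[name] = (kind, buffer)

    def compile(self):
        """Expand neural network nodes, then return a valid execution order."""
        for name, (kind, buffer) in list(self.buffers.items()):
            if kind is not NodeKind.NN_MODEL:
                continue  # other kinds are outside the scope of this sketch
            layers = json.loads(buffer)["layers"]  # assumed buffer encoding
            if not layers:
                raise ValueError(f"{name}: buffer describes no layers")
            chain = [f"{name}/{i}:{layer}" for i, layer in enumerate(layers)]
            prev = self.preds.pop(name)
            for node in chain:  # one node per layer, linked in sequence
                self.preds[node] = set(prev)
                prev = {node}
            for deps in self.preds.values():  # rewire former successors
                if name in deps:
                    deps.remove(name)
                    deps.add(chain[-1])
            del self.buffers[name]
        # TopologicalSorter raises CycleError if the graph is not acyclic.
        return list(TopologicalSorter(self.preds).static_order())


g = Graph()
g.add_node("color_convert")
g.add_complex_node(
    "detector",
    NodeKind.NN_MODEL,
    json.dumps({"layers": ["conv", "relu", "softmax"]}).encode(),
    after=["color_convert"],
)
g.add_node("draw_boxes", after=["detector"])

order = g.compile()
print(order)
```

Keeping the model description in an attached buffer means the graph itself stays small until compile time, when the single `detector` node is replaced by its three layer nodes between `color_convert` and `draw_boxes`.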

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12530732B2 cover?
An image processing system includes one or more processors operative to receive a graph application programming interface (API) call to add a complex node to a graph. The graph includes at least the complex node connected to other nodes by edges that are directed and acyclic. The one or more processors are further operative to process, by a graph compiler at compile time, the complex node by it…
Who is the assignee on this patent?
Mediatek Inc
What technology area does this patent fall under?
Primary CPC classification G06T1/20. Mapped technology areas include Physics.
When was this patent published?
Publication date Jan 20, 2026 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 11 related publications on this page (citations in our corpus or others sharing the same primary CPC).