What technology area does this patent fall under?

Primary CPC classification G06V10/82. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Jan 21 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Learning ordinal representations for deep reinforcement learning based object localization

US12205357B2 · US · B2

Patent metadata
Field	Value
Publication number	US-12205357-B2
Application number	US-202217715901-A
Country	US
Kind code	B2
Filing date	Apr 7, 2022
Priority date	Apr 8, 2021
Publication date	Jan 21, 2025
Grant date	Jan 21, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A reinforcement learning based approach to the problem of query object localization, where an agent is trained to localize objects of interest specified by a small exemplary set. We learn a transferable reward signal formulated using the exemplary set by ordinal metric learning. It enables test-time policy adaptation to new environments where the reward signals are not readily available, and thus outperforms fine-tuning approaches that are limited to annotated images. In addition, the transferable reward allows repurposing of the trained agent for new tasks, such as annotation refinement, or selective localization from multiple common objects across a set of images. Experiments on corrupted MNIST dataset and CU-Birds dataset demonstrate the effectiveness of our approach.

First claim

Opening claim text (preview).

The invention claimed is: 1. A deep reinforcement learning (RL) method for object localization comprising: acquiring a seed dataset including a set of seed images each with ground truth bounding box annotation; pretrain ordinal embedding by randomly perturbing the ground truth bounding box at different levels denoted by parameter p, said ordinal embedding satisfying an ordinal constraint locally for each pair of perturbed data augmented from the same image, wherein the pretraining is performed through the effect of a backbone network, a region of interest (RoI) head, and a triplet loss; and using an embedding function, configuring RL agents to start from a whole image and recursively sample actions from a discrete action space such that rewards are produced, the rewards of a sample action determined from embedding distances and updating a policy network based on the rewards so determined; and outputting an annotation policy and embedding function. 2. The method of claim 1 wherein the seed image bounding box annotation is initially provided by a human action.

Assignees

Inventors

Classifications

G06V30/19167
Active pattern learning · CPC title
G06V10/82Primary
using neural networks · CPC title
G06V10/7788Primary
the supervisor being a human, e.g. interactive learning with a human teacher · CPC title

Patent family

Related publications grouped by family.

View patent family 83510851

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12205357B2 cover?: A reinforcement learning based approach to the problem of query object localization, where an agent is trained to localize objects of interest specified by a small exemplary set. We learn a transferable reward signal formulated using the exemplary set by ordinal metric learning. It enables test-time policy adaptation to new environments where the reward signals are not readily available, and th…
Who is the assignee on this patent?: Nec Lab America Inc, Nec Corp
What technology area does this patent fall under?: Primary CPC classification G06V10/82. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Jan 21 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Identifying similarity matrix for derived perceptions

Quality assessment of extracted features from high-dimensional machine learning datasets

Systems and methods for fast training of more robust models against adversarial attacks

System and method for real-time large image homography processing

Frequently asked questions