What technology area does this patent fall under?

Primary CPC classification G06F16/2462. Mapped technology areas include Physics.

When was this patent published?

Publication date Thu Nov 27 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Local context for context-based tabular classification

US2025363123A1 · US · A1

Patent metadata
Field	Value
Publication number	US-2025363123-A1
Application number	US-202519209875-A
Country	US
Kind code	A1
Filing date	May 16, 2025
Priority date	May 23, 2024
Publication date	Nov 27, 2025
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Context-based tabular data models use a context to evaluate a queried data point. Rather than a randomized or full context of domain data points, a local context of data points is selected that is customized for a particular data query. The system uses a pre-trained model, such as a TabPFN, that is trained on a classification for different types of data sets along with a “context” for applying the model with the nearest neighbors of that data point. The number of neighbors may vary and may be determined based on the distance of data points to the query point. The system also optimizes fine-tuning of tabular data models with neighborhood data so that local context can be used to select training batches of data using a common context. This allows local context fine-tuning without excess training costs of single-item training batches.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computing system for training a tabular data model with localized context, comprising: one or more processors configured to execute instructions; and a non-transitory computer-readable storage medium containing instructions executable by the one or more processors for: selecting a training data point from a set of training data points for a domain of tabular data; identifying a subset of data points in the set of training data that form a neighborhood around the training data point; selecting a context and a plurality of query points from the subset of data points that form the neighborhood around the training data point; and training parameters of a tabular data model with a training batch including the context and the plurality of query points. 2 . The computing system of claim 1 , wherein identifying the subset of data points comprises selecting nearest-neighbors of the identified training data point as the neighborhood. 3 . The computing system of claim 1 , wherein a number of the subset of data points varies based on the distance of data points to the training data point. 4 . The computing system of claim 1 , wherein the tabular data model is a transformer model. 5 . The computing system of claim 1 , wherein training parameters of the tabular data model comprises masking attention between the plurality of query points during application of the tabular data model. 6 . The computing system of claim 1 , wherein selecting the context and the plurality of query points comprises randomly assigning the subset of data points to the context or the plurality of query points. 7 . A method for training a tabular data model with localized content, comprising: selecting a training data point from a set of training data points for a domain of tabular data; identifying a subset of data points in the set of training data that form a neighborhood around the training data point; selecting a context and a plurality of query points from the subset of data points that form the neighborhood around the training data point; and training parameters of a tabular data model with a training batch including the context and the plurality of query points. 8 . The method of claim 7 , wherein identifying the subset of data points comprises selecting nearest-neighbors of the identified training data point as the neighborhood. 9 . The method of claim 7 , wherein a number of the subset of data points varies based on the distance of data points to the training data point. 10 . The method of claim 7 , wherein the tabular data model is a transformer model. 11 . The method of claim 7 , wherein training parameters of the tabular data model comprises masking attention between the plurality of query points during application of the tabular data model. 12 . The method of claim 7 , wherein selecting the context and the plurality of query points comprises randomly assigning the subset of data points to the context or the plurality of query points. 13 . A non-transitory computer-readable medium for training a tabular data model with localized content, the non-transitory computer-readable medium comprising instructions executable by a processor for: selecting a training data point from a set of training data points for a domain of tabular data; identifying a subset of data points in the set of training data that form a neighborhood around the training data point; selecting a context and a plurality of query points from the subset of data points that form the neighborhood around the training data point; and training parameters of a tabular data model with a training batch including the context and the plurality of query points. 14 . The non-transitory computer-readable medium of claim 13 , wherein identifying the subset of data points comprises selecting nearest-neighbors of the identified training data point as the neighborhood. 15 . The non-transitory computer-readable medium of claim 13 , wherein a number of the subset of data points varies based on the distance of data points to the training data point. 16 . The non-transitory computer-readable medium of claim 13 , wherein the tabular data model is a transformer model. 17 . The non-transitory computer-readable medium of claim 13 , wherein training parameters of the tabular data model comprises masking attention between the plurality of query points during application of the tabular data model. 18 . The non-transitory computer-readable medium of claim 13 , wherein selecting the context and the plurality of query points comprises randomly assigning the subset of data points to the context or the plurality of query points.

Assignees

Toronto Dominion Bank

Inventors

Classifications

G06F16/215
Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors · CPC title
G06F16/24575
using context · CPC title
G06F16/285
Clustering or classification · CPC title
G06F16/2462Primary
Approximate or statistical queries · CPC title

Patent family

Related publications grouped by family.

View patent family 97755365

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2025363123A1 cover?: Context-based tabular data models use a context to evaluate a queried data point. Rather than a randomized or full context of domain data points, a local context of data points is selected that is customized for a particular data query. The system uses a pre-trained model, such as a TabPFN, that is trained on a classification for different types of data sets along with a “context” for applying …
Who is the assignee on this patent?: Toronto Dominion Bank
What technology area does this patent fall under?: Primary CPC classification G06F16/2462. Mapped technology areas include Physics.
When was this patent published?: Publication date Thu Nov 27 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Systems and methods for generating contextual table embeddings for tabular data

System and method for language-independent contextual embedding

Hybrid nearest neighbor search tree with hashing table

Frequently asked questions