What technology area does this patent fall under?

Primary CPC classification G06F16/355. Mapped technology areas include Physics.

When was this patent published?

Publication date Thu Jan 13 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Embedding multi-modal time series and text data

US2022012274A1 · US · A1

Patent metadata
Field	Value
Publication number	US-2022012274-A1
Application number	US-202117370498-A
Country	US
Kind code	A1
Filing date	Jul 8, 2021
Priority date	Jul 13, 2020
Publication date	Jan 13, 2022
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods and systems of training and using a neural network model include training a time series embedding model and a text embedding model with unsupervised clustering to translate time series and text, respectively, to a shared latent space. The time series embedding model and the text embedding model are further trained using semi-supervised clustering that samples training data pairs of time series information and associated text for annotation.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method of training a neural network, comprising: training, using a hardware processor, a time series embedding model and a text embedding model with unsupervised clustering to translate time series and text, respectively, to a shared latent space; and training the time series embedding model and the text embedding model further using semi-supervised clustering that samples training data pairs of time series information and associated text for annotation. 2 . The method of claim 1 , wherein the training data pairs each include a time series segment and an associated text. 3 . The method of claim 2 , wherein the associated text describes circumstances relating to the time series segment. 4 . The method of claim 1 , wherein training using semi-supervised clustering includes annotating the sampled pairs to indicate constraints. 5 . The method of claim 4 , wherein the constraints are selected from the group consisting of a “must-link” constraint and a “cannot-link” constraint. 6 . A method of querying a time series database, comprising: transforming a query to an embedded vector in a multi-modal shared latent space that encodes time series information and textual information; identifying a feature vector in the multi-modal shared latent space, stored in a time series dataspace, that matches the embedded vector, and that is associated with a data type complementary to the query; and returning data associated with the identified feature vector, responsive to the query. 7 . The method of claim 6 , wherein the query includes a text information, and the feature vector is associated with time series information. 8 . The method of claim 7 , wherein the associated text describes circumstances relating to the time series segment. 9 . The method of claim 7 , wherein the query further includes a time series segment. 10 . The method of claim 6 , wherein the query includes time series information, and the feature vector is associated with text information. 11 . The method of claim 6 , wherein identifying the feature vector that matches the embedded vector includes identifying a nearest neighbor according to Euclidean distance within in the multi-modal shared latent space. 12 . A system for training a neural network, comprising: a hardware processor; and a memory that stores a computer program product, which, when executed by the hardware processor, causes the hardware processor to: train a time series embedding model and a text embedding model using unsupervised clustering to translate time series and text, respectively, to a multi-modal shared latent space; train the time series embedding model and the text embedding model further using semi-supervised clustering that samples training data pairs of time series information and associated text for annotation; transform a query to an embedded vector in the multi-modal shared latent space that encodes time series information and textual information; identifying a feature vector in the multi-modal shared latent space, stored in a time series dataspace, that matches the embedded vector, and that is associated with a data type complementary to the query; and returning data associated with the identified feature vector, responsive to the query. 13 . The system of claim 12 , wherein the training data pairs each include a time series segment and an associated text. 14 . The system of claim 13 , wherein the associated text of each pair describes circumstances relating to the respective time series segment. 15 . The system of claim 13 , wherein the computer program product further causes the hardware processor to annotate the sampled pairs to indicate constraints during semi-supervised clustering. 16 . The system of claim 15 , wherein the constraints are selected from the group consisting of a “must-link” constraint and a “cannot-link” constraint. 17 . The system of claim 12 , wherein the query includes a text information, and the feature vector is associated with time series information. 18 . The system of claim 16 , wherein the query further includes a time series segment. 19 . The system of claim 12 , wherein the query includes time series information, and the feature vector is associated with text information. 20 . The system of claim 12 , wherein the computer program product further causes the hardware processor to identify a nearest neighbor of the embedded vector according to Euclidean distance within in the multi-modal shared latent space.

Assignees

Nec Lab America Inc

Inventors

Classifications

G06V10/62
relating to a temporal dimension, e.g. time-based feature extraction; Pattern tracking · CPC title
G06V10/82
using neural networks · CPC title
G06V10/7715
Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods · CPC title
G06F18/23
Clustering techniques · CPC title
G06N3/045
Combinations of networks · CPC title

Patent family

Related publications grouped by family.

View patent family 79172658

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2022012274A1 cover?: Methods and systems of training and using a neural network model include training a time series embedding model and a text embedding model with unsupervised clustering to translate time series and text, respectively, to a shared latent space. The time series embedding model and the text embedding model are further trained using semi-supervised clustering that samples training data pairs of time…
Who is the assignee on this patent?: Nec Lab America Inc
What technology area does this patent fall under?: Primary CPC classification G06F16/355. Mapped technology areas include Physics.
When was this patent published?: Publication date Thu Jan 13 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Failure Prediction In Distributed Environments

Methods and Apparatus for Classification

Multi-domain machine translation system with training data clustering and dynamic domain adaptation

Efficient event searching

Acoustic event detection based on modelling of sequence of event subparts

Periodicity Analysis on Heterogeneous Logs

Frequently asked questions