Processing structured documents using convolutional neural networks
US-10387531-B1 · Aug 20, 2019 · US
US2021056127A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2021056127-A1 |
| Application number | US-202016996110-A |
| Country | US |
| Kind code | A1 |
| Filing date | Aug 18, 2020 |
| Priority date | Aug 21, 2019 |
| Publication date | Feb 25, 2021 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method for embedding learning and clustering for paired multi-modal data using deep canonical correlation analysis and active learning with pairwise queries is presented. The method includes collecting time-series data from a plurality of sensors, training, in an unsupervised manner, a cross-modal retrieval system by using the time-series data and relevant comment texts, depending on a modality of a query, retrieving the relevant comment texts from a time-series segment of the time-series data, the relevant comment texts used as human-readable explanations of a query segment, retrieving relevant time-series segments given a sentence or a set of keywords such that the relevant time-series segments match the sentence or set of keywords, and retrieving the relevant time-series segments given the time-series segment and the sentence or set of keywords such that a first subset of attributes match the set of keywords and a second subset of attributes resembles the time-series segment.
Opening claim text (preview).
What is claimed is: 1 . A computer-implemented method executed on a processor for embedding learning and clustering for paired multi-modal data using deep canonical correlation analysis (CCA) and active learning with pairwise queries, the method comprising: collecting time-series data from a plurality of sensors; training, in an unsupervised manner, a cross-modal retrieval system by using the time-series data and relevant comment texts; depending on a modality of a query: retrieving the relevant comment texts from a time-series segment of the time-series data, the relevant comment texts used as human-readable explanations of a query segment; retrieving relevant time-series segments given a sentence or a set of keywords such that the relevant time-series segments match the sentence or set of keywords; and retrieving the relevant time-series segments given the time-series segment and the sentence or set of keywords such that a first subset of attributes match the set of keywords and a second subset of attributes resembles the time-series segment. 2 . The method of claim 1 , wherein the time-series segment and the relevant comment texts are transformed into points in a common latent space. 3 . The method of claim 2 , wherein the cross-modal retrieval system finds nearest neighbors of the query in the common latent space. 4 . The method of claim 1 , wherein the cross-modal retrieval system uses multi-modal neural networks to encode the time-series data and the relevant comment texts into vector representations. 5 . The method of claim 4 , wherein the multi-modal neural networks are trained by a two-stage training algorithm employing examples from a user-provided database of time-series text pairs. 6 . The method of claim 5 , wherein the first stage of the training algorithm is a deep CCA-based pre-training. 7 . The method of claim 6 , wherein the second stage of the training algorithm is active clustering. 8 . The method of claim 7 , wherein the active clustering includes query pair selection based on Gaussian mixture modeling (GMM) and query-based selection using active spectral clustering. 9 . A non-transitory computer-readable storage medium comprising a computer-readable program for embedding learning and clustering for paired multi-modal data using deep canonical correlation analysis (CCA) and active learning with pairwise queries, wherein the computer-readable program when executed on a computer causes the computer to perform the steps of: collecting time-series data from a plurality of sensors; training, in an unsupervised manner, a cross-modal retrieval system by using the time-series data and relevant comment texts; depending on a modality of a query: retrieving the relevant comment texts from a time-series segment of the time-series data, the relevant comment texts used as human-readable explanations of a query segment; retrieving relevant time-series segments given a sentence or a set of keywords such that the relevant time-series segments match the sentence or set of keywords; and retrieving the relevant time-series segments given the time-series segment and the sentence or set of keywords such that a first subset of attributes match the set of keywords and a second subset of attributes resembles the time-series segment. 10 . The non-transitory computer-readable storage medium of claim 9 , wherein the time-series segment and the relevant comment texts are transformed into points in a common latent space. 11 . The non-transitory computer-readable storage medium of claim 10 , wherein the cross-modal retrieval system finds nearest neighbors of the query in the common latent space. 12 . The non-transitory computer-readable storage medium of claim 9 , wherein the cross-modal retrieval system uses multi-modal neural networks to encode the time-series data and the relevant comment texts into vector representations. 13 . The non-transitory computer-readable storage medium of claim 12 , wherein the multi-modal neural networks are trained by a two-stage training algorithm employing examples from a user-provided database of time-series text pairs. 14 . The non-transitory computer-readable storage medium of claim 13 , wherein the first stage of the training algorithm is a deep CCA-based pre-training. 15 . The non-transitory computer-readable storage medium of claim 14 , wherein the second stage of the training algorithm is active clustering. 16 . The non-transitory computer-readable storage medium of claim 15 , wherein the active clustering includes query pair selection based on Gaussian mixture modeling (GMM) and query-based selection using active spectral clustering. 17 . A system for embedding learning and clustering for paired multi-modal data using deep canonical correlation analysis (CCA) and active learning with pairwise queries, the system comprising: a memory; and one or more processors in communication with the memory configured to: collect time-series data from a plurality of sensors; train, in an unsupervised manner, a cross-modal retrieval system by using the time-series data and relevant comment texts; depending on a modality of a query: retrieve the relevant comment texts from a time-series segment of the time-series data, the relevant comment texts used as human-readable explanations of a query segment; retrieve relevant time-series segments given a sentence or a set of keywords such that the relevant time-series segments match the sentence or set of keywords; and retrieve the relevant time-series segments given the time-series segment and the sentence or set of keywords such that a first subset of attributes match the set of keywords and a second subset of attributes resembles the time-series segment. 18 . The system of claim 17 , wherein the time-series segment and the relevant comment texts are transformed into points in a common latent space. 19 . The system of claim 18 , wherein the cross-modal retrieval system finds nearest neighbors of the query in the common latent space. 20 . The system of claim 17 , wherein the cross-modal retrieval system uses multi-modal neural networks to encode the time-series data and the relevant comment texts into vector representations.
Probabilistic graphical models, e.g. probabilistic networks · CPC title
Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound · CPC title
Combinations of networks · CPC title
Non-supervised learning, e.g. competitive learning · CPC title
Convolutional networks [CNN, ConvNet] · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.