Method and apparatus for classifying data

US12566820B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12566820-B2
Application numberUS-201817296638-A
CountryUS
Kind codeB2
Filing dateDec 3, 2018
Priority dateDec 3, 2018
Publication dateMar 3, 2026
Grant dateMar 3, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and an apparatus are for classifying data. In an embodiment, the method includes: classifying at least two pieces of data, to obtain at least two data clusters; determining a bias degree of classification; re-classifying the at least two pieces of data by merging any several of the at least two data clusters; determining a bias degree of re-classification; and determining, by comparing the bias degree of first classification and the bias degree of re-classification, which classification is more accurate. By way of the method and apparatus, a related data cluster can be found from multiple data clusters for better data analysis.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method for classifying data, the method comprising: first classifying at least two pieces of data to obtain at least two first data clusters; determining a first bias degree of the first classifying based on a plurality of first difference degrees, each of the plurality of first difference degrees being determined based on a difference between first output data and a respective first data cluster among the at least two first data clusters, the first output data being obtained by inputting the respective first data cluster into a corresponding first neural network model, and the corresponding first neural network model being among a plurality of first neural network models; re-classifying the at least two pieces of data by merging at least two of the at least two first data clusters to obtain a second data cluster; determining a second bias degree of the re-classifying based on a plurality of second difference degrees, each of the plurality of second difference degrees being determined based on a difference between second output data and a respective subset among subsets of data in the second data cluster, and the second output data being obtained by inputting the respective subset into a second neural network model; and determining which of the first classifying and the re-classifying is more accurate based on the first bias degree and the second bias degree. 2 . The method of claim 1 , wherein each respective first neural network model among the plurality of first neural network models is obtained through training by using data in a corresponding first data cluster among the at least two first data clusters; an input layer and an output layer of each among the plurality of first neural network models have a same quantity of nodes; the second neural network model is obtained through training by using all data included in the second data clusters; an input layer and an output layer of the second neural network model have a same quantity of nodes; and the determining which of the first classifying and the re-classifying is more accurate includes determining related data clusters among the at least two first data clusters, and marking the related data clusters as related in response to the determining the related data clusters. 3 . The method of claim 2 , wherein the determining the related data clusters is based on the second bias degree being less than the first bias degree. 4 . The method of claim 2 , further comprising: merging data clusters marked as related among the at least two first data clusters into a third data cluster. 5 . The method of claim 2 , wherein each among the plurality of first neural network models and the second neural network model are neural network models of a same type. 6 . The method of claim 1 , wherein each respective first neural network model among the plurality of first neural network models is trained as an auto encoder for a corresponding first data cluster among the at least two first data clusters; and the second neural network model is trained as an auto encoder for the second data cluster. 7 . The method of claim 1 , wherein the first output data has a same number of attributes as each respective first data cluster among the at least two first data clusters; and the second output data has a same number of attributes as each respective subset among the subsets of data in the second data cluster. 8 . The method of claim 1 , wherein an input layer and an output layer of each among the plurality of first neural network models have a same quantity of nodes; and an input layer and an output layer of the second neural network model have a same quantity of nodes. 9 . A computation device, comprising: at least one memory storing an executable instruction; and at least one processor configured to execute the executable instruction to cause the computation device to first classifying at least two pieces of data, to obtain at least two first data clusters, determine a first bias degree of the first classification based on a plurality of first difference degrees, each of the plurality of first difference degrees being determined based on a difference between first output data and a respective first data cluster among the at least two first data clusters, the first output data being obtained by inputting the respective first data cluster into a corresponding first neural network model, and the corresponding first neural network model being among a plurality of first neural network models, re-classify the at least two pieces of data by merging at least two of the at least two first data clusters to obtain a second data cluster, determine a second bias degree of the re-classification based on a plurality of second difference degrees, each of the plurality of second difference degrees being determined based on a difference between second output data and a respective subset among subsets of data in the second data cluster, and the second output data being obtained by inputting the respective subset into a second neural network model, and determine which of the first classification and the re-classification is more accurate based on the first bias degree and the second bias degree. 10 . The computation device of claim 9 , wherein each respective first neural network model among the plurality of first neural network models is obtained through training by using data in a corresponding first data structure among the at least two first data clusters; an input layer and an output layer of each among the plurality of first neural network models have a same quantity of nodes; the second neural network model is obtained through training by using all data included in the second data clusters; an input layer and an output layer of the second neural network model have a same quantity of nodes; and the at least one processor is configured to execute the executable instruction to cause the computation device to determine which of the first classification and the re-classification is more accurate including determining related data clusters among the at least two first data clusters, and marking the related data clusters as related in response to the determining the related data clusters. 11 . The computation device of claim 10 , wherein the determining the related data clusters is based on; the second bias degree being less than the first bias degree. 12 . The computation device of claim 10 , wherein the at least one processor is configured to execute the executable instruction to cause the computation device to merge data clusters marked as related among the at least two first data clusters into a third data cluster. 13 . The computation device of claim 10 , wherein each among the plurality of first neural network models and the second neural network model are neural network models of a same type. 14 . The computation device of claim 9 , wherein each respective first neural network model among the plurality of first neural network models is trained as an auto encoder for a corresponding first data cluster among the at least two first data clusters; and the second neural network model is trained as an auto encoder for the second data cluster. 15 . The computation device of claim 9 , wherein the first output data has a same number of attributes as each respective first data cluster among the at least two first data clusters; and the second output data has a same number of attributes as each respective subset among the subsets of data in the second data cluster. 16 . A non

Assignees

Inventors

Classifications

  • Combinations of networks · CPC title

  • Fusion techniques · CPC title

  • Clustering techniques · CPC title

  • Learning methods · CPC title

  • Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12566820B2 cover?
A method and an apparatus are for classifying data. In an embodiment, the method includes: classifying at least two pieces of data, to obtain at least two data clusters; determining a bias degree of classification; re-classifying the at least two pieces of data by merging any several of the at least two data clusters; determining a bias degree of re-classification; and determining, by comparing…
Who is the assignee on this patent?
Siemens Mobility GmbH
What technology area does this patent fall under?
Primary CPC classification G06F18/241. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 03 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).