Method and apparatus for monitoring number density of aerosol particles
US-12146809-B2 · Nov 19, 2024 · US
US2025035602A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2025035602-A1 |
| Application number | US-202318487185-A |
| Country | US |
| Kind code | A1 |
| Filing date | Oct 16, 2023 |
| Priority date | Jul 25, 2023 |
| Publication date | Jan 30, 2025 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method for automatically identifying emission sources in a source apportionment process of pollutants is provided, which relates to the field of air pollution prevention and control. The method includes: integrating measured source spectrum data and factor spectrum data to generate a labeled data set and an unlabeled data set, respectively; preprocessing the labeled data set to generate a continuous labeled data set; constructing a tree classification model based on the continuous labeled data set; optimizing the tree classification model to determine the optimized tree classification model; coupling the optimized tree classification model and a pseudo-labeling algorithm to generate an integrated model based on the unlabeled data set to automatically identify factor profiles in the unlabeled data set; and determining types of the emission sources based on the factor profiles. The factor profiles can be automatically identified, so that types of emission sources can be quickly determined.
Opening claim text (preview).
What is claimed is: 1 . A method for automatically identifying emission sources in a source apportionment process of pollutant, comprising: integrating measured source profiles and factor profiles to generate a labeled data set and an unlabeled data set, respectively: wherein the measured source profiles are priori knowledge, which are derived from actually measured samples of the emission sources and are configured for revealing physical and chemical features of the emission sources; preprocessing the labeled data set to generate a continuous labeled data set; constructing a tree classification model based on the continuous labeled data set; optimizing the tree classification model to determine the optimized tree classification model; coupling the optimized tree classification model and a pseudo-labeling algorithm to generate an integrated model based on the unlabeled data set, so as to automatically identify the factor profiles in the unlabeled data set; and determining types of the emission sources based on the factor profiles. 2 . The method according to claim 1 , wherein preprocessing the labeled data set to generate the continuous labeled data set comprises: oversampling a measured source spectrum data in the labeled data set to generate oversampled measured source spectrum data; normalizing independent variables of the oversampled measured source profiles to generate normalized measured source profiles; and encoding dependent variables of the normalized measured source profiles to form the continuous labeled data set. 3 . The method according to claim 1 , wherein constructing the tree classification model based on the continuous labeled data set comprises: dividing the continuous labeled data set into a training data set and a testing data set; training a plurality of machine learning models by using the training data set to generate a plurality of trained machine learning models; testing each of the trained machine learning models by using the testing data set to generate evaluation indexes, wherein the evaluation indexes comprise accuracy, a precision rate and a recall rate; and screening one of the machine learning models as the tree classification model based on all of the evaluation indexes. 4 . The method according to claim 1 , wherein optimizing the tree classification model to determine the optimized tree classification model comprises: traversing a gradient change of key parameters of the optimized tree classification model to determine optimal key parameters, wherein the key parameters comprise a number of decision trees and a maximum number of features; and optimizing the tree classification model based on the optimal key parameters to determine the optimized tree classification model. 5 . The method according to claim 3 , wherein coupling the optimized tree classification model and a pseudo-labeling algorithm to generate an integrated model based on the unlabeled data set, so as to automatically identify the factor profiles in the unlabeled data set comprises: screening factor profiles with prediction probabilities greater than a predetermined probability from the unlabeled data set by using the integrated model; assigning pseudo labels to the screened factor profiles by using the pseudo-labeling algorithm; adding a data set of the factor profiles assigned with the pseudo labels to the training data set to form a new training data set; constructing a new tree classification model based on the new training data set to identify remaining factor profiles in the unlabeled data set.
concerning the measuring method or the display, e.g. intermittent measurement or digital display · CPC title
Machine learning, data mining or chemometrics · CPC title
Government or public services (business processes related to the transportation industry G06Q50/40) · CPC title
Validation; Performance evaluation; Active pattern learning techniques · CPC title
Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.