What technology area does this patent fall under?

Primary CPC classification G01N33/0062. Mapped technology areas include Physics.

When was this patent published?

Publication date Jan 30, 2025 (A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Method for automatically identifying emission sources in source apportionment process of pollutants

US2025035602A1 · US · A1

Patent metadata
Field	Value
Publication number	US-2025035602-A1
Application number	US-202318487185-A
Country	US
Kind code	A1
Filing date	Oct 16, 2023
Priority date	Jul 25, 2023
Publication date	Jan 30, 2025
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for automatically identifying emission sources in a source apportionment process of pollutants is provided, which relates to the field of air pollution prevention and control. The method includes: integrating measured source spectrum data and factor spectrum data to generate a labeled data set and an unlabeled data set, respectively; preprocessing the labeled data set to generate a continuous labeled data set; constructing a tree classification model based on the continuous labeled data set; optimizing the tree classification model to determine the optimized tree classification model; coupling the optimized tree classification model and a pseudo-labeling algorithm to generate an integrated model based on the unlabeled data set to automatically identify factor profiles in the unlabeled data set; and determining types of the emission sources based on the factor profiles. The factor profiles can be automatically identified, so that types of emission sources can be quickly determined.
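The coupling of a classifier with a pseudo-labeling step, as summarized in the abstract, can be sketched roughly as follows. This is an illustrative sketch, not the patent's implementation: a small nearest-centroid classifier stands in for the tree classification model, and the 0.9 threshold, the softmax scale, the source-type names, and all function and class names are assumptions made for the example. Profiles are assumed to be normalized so that distances stay small.

```python
import math
from collections import defaultdict


class CentroidClassifier:
    """Nearest-centroid stand-in for the tree classification model (an assumption)."""

    def fit(self, X, y):
        groups = defaultdict(list)
        for row, label in zip(X, y):
            groups[label].append(row)
        # One mean profile (centroid) per source type.
        self.centroids = {
            label: [sum(col) / len(rows) for col in zip(*rows)]
            for label, rows in groups.items()
        }
        return self

    def predict_proba(self, row):
        # Softmax over negative squared distances; the scale 50.0 is arbitrary.
        scores = {
            label: math.exp(-50.0 * sum((a - b) ** 2 for a, b in zip(row, c)))
            for label, c in self.centroids.items()
        }
        total = sum(scores.values())
        return {label: s / total for label, s in scores.items()}

    def predict(self, row):
        proba = self.predict_proba(row)
        best = max(proba, key=proba.get)
        return best, proba[best]


def pseudo_label(X_train, y_train, X_unlabeled, threshold=0.9):
    """Return (pseudo-labeled pairs, pairs labeled by the retrained model)."""
    model = CentroidClassifier().fit(X_train, y_train)
    confident, remaining = [], []
    for row in X_unlabeled:
        label, p = model.predict(row)
        if p > threshold:
            confident.append((row, label))  # screened: receives a pseudo label
        else:
            remaining.append(row)           # left for the retrained model
    # Fold the pseudo-labeled profiles into a new training set and retrain.
    new_X = X_train + [row for row, _ in confident]
    new_y = y_train + [label for _, label in confident]
    new_model = CentroidClassifier().fit(new_X, new_y)
    rest = [(row, new_model.predict(row)[0]) for row in remaining]
    return confident, rest


# Hypothetical three-species profiles for two made-up source types.
X_train = [[0.8, 0.1, 0.1], [0.7, 0.2, 0.1], [0.1, 0.1, 0.8], [0.2, 0.1, 0.7]]
y_train = ["coal", "coal", "traffic", "traffic"]
X_unlabeled = [[0.75, 0.15, 0.10], [0.45, 0.10, 0.45], [0.15, 0.10, 0.75]]

confident, rest = pseudo_label(X_train, y_train, X_unlabeled)
print([label for _, label in confident])
print([label for _, label in rest])
```

In this toy data the ambiguous middle profile falls below the threshold on the first pass and is only labeled by the retrained model, which is the behavior the pseudo-labeling step is meant to provide.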

First claim

Opening claim text (preview).

What is claimed is:
1. A method for automatically identifying emission sources in a source apportionment process of pollutant, comprising: integrating measured source profiles and factor profiles to generate a labeled data set and an unlabeled data set, respectively: wherein the measured source profiles are priori knowledge, which are derived from actually measured samples of the emission sources and are configured for revealing physical and chemical features of the emission sources; preprocessing the labeled data set to generate a continuous labeled data set; constructing a tree classification model based on the continuous labeled data set; optimizing the tree classification model to determine the optimized tree classification model; coupling the optimized tree classification model and a pseudo-labeling algorithm to generate an integrated model based on the unlabeled data set, so as to automatically identify the factor profiles in the unlabeled data set; and determining types of the emission sources based on the factor profiles.
2. The method according to claim 1, wherein preprocessing the labeled data set to generate the continuous labeled data set comprises: oversampling a measured source spectrum data in the labeled data set to generate oversampled measured source spectrum data; normalizing independent variables of the oversampled measured source profiles to generate normalized measured source profiles; and encoding dependent variables of the normalized measured source profiles to form the continuous labeled data set.
3. The method according to claim 1, wherein constructing the tree classification model based on the continuous labeled data set comprises: dividing the continuous labeled data set into a training data set and a testing data set; training a plurality of machine learning models by using the training data set to generate a plurality of trained machine learning models; testing each of the trained machine learning models by using the testing data set to generate evaluation indexes, wherein the evaluation indexes comprise accuracy, a precision rate and a recall rate; and screening one of the machine learning models as the tree classification model based on all of the evaluation indexes.
4. The method according to claim 1, wherein optimizing the tree classification model to determine the optimized tree classification model comprises: traversing a gradient change of key parameters of the optimized tree classification model to determine optimal key parameters, wherein the key parameters comprise a number of decision trees and a maximum number of features; and optimizing the tree classification model based on the optimal key parameters to determine the optimized tree classification model.
5. The method according to claim 3, wherein coupling the optimized tree classification model and a pseudo-labeling algorithm to generate an integrated model based on the unlabeled data set, so as to automatically identify the factor profiles in the unlabeled data set comprises: screening factor profiles with prediction probabilities greater than a predetermined probability from the unlabeled data set by using the integrated model; assigning pseudo labels to the screened factor profiles by using the pseudo-labeling algorithm; adding a data set of the factor profiles assigned with the pseudo labels to the training data set to form a new training data set; constructing a new tree classification model based on the new training data set to identify remaining factor profiles in the unlabeled data set.
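The preprocessing steps of claim 2 and the evaluation indexes of claim 3 could look roughly like the sketch below. The claims do not say which oversampling, normalization, or encoding method is used, so random duplication, sum-to-one scaling, and integer label codes are assumptions chosen for simplicity. All function names and the sample data are hypothetical.

```python
import random
from collections import Counter


def oversample(X, y, seed=0):
    """Random oversampling (assumed method): duplicate minority-class rows."""
    rng = random.Random(seed)
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = [list(row) for row in X], list(y)
    for label, n in counts.items():
        rows = [row for row, lab in zip(X, y) if lab == label]
        for _ in range(target - n):
            X_out.append(list(rng.choice(rows)))
            y_out.append(label)
    return X_out, y_out


def normalize(X):
    """Scale each profile so its values sum to 1 (assumes non-zero row sums)."""
    normalized = []
    for row in X:
        total = sum(row)
        normalized.append([v / total for v in row])
    return normalized


def encode(y):
    """Map source-type names to integer codes; also return the class list."""
    classes = sorted(set(y))
    index = {label: i for i, label in enumerate(classes)}
    return [index[label] for label in y], classes


def evaluation_indexes(y_true, y_pred, positive):
    """Accuracy over all classes, plus precision and recall for one class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    accuracy = sum(t == p for t, p in pairs) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return accuracy, precision, recall


# Hypothetical raw measurements: two "coal" profiles, one "traffic" profile.
X_raw = [[8.0, 1.0, 1.0], [7.0, 2.0, 1.0], [1.0, 1.0, 8.0]]
y_raw = ["coal", "coal", "traffic"]

X_bal, y_bal = oversample(X_raw, y_raw)
X_norm = normalize(X_bal)
codes, classes = encode(y_bal)
print(codes, classes)
print(evaluation_indexes([0, 0, 1, 1], [0, 1, 1, 1], positive=1))
```

Candidate models would then be compared on these three indexes using a held-out testing set, as claim 3 describes.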

Assignees

Univ Nankai

Inventors

Classifications

G01N33/0062 · Primary
concerning the measuring method or the display, e.g. intermittent measurement or digital display · CPC title
G16C20/70 · Primary
Machine learning, data mining or chemometrics · CPC title
G06Q50/26
Government or public services (business processes related to the transportation industry G06Q50/40) · CPC title
G06F18/217
Validation; Performance evaluation; Active pattern learning techniques · CPC title
G06F18/214
Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

Patent family

Related publications grouped by family.

View patent family 88389389
