Air quality inference using multiple data sources

US2016125307A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016125307-A1
Application numberUS-201314896344-A
CountryUS
Kind codeA1
Filing dateJun 5, 2013
Priority dateJun 5, 2013
Publication dateMay 5, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The use of data from multiple data source provides inferred air quality indices with respect to a particular pollutant for multiple areas without the addition of air quality monitor stations to those areas. Labeled air quality index data for a pollutant in a region may be obtained from one or more air quality monitor stations. Spatial features for the region may be extracted from spatially-related data for the region. The spatially-related data may include information on fixed infrastructures in the region. Likewise, temporal features for the region may be extracted from temporally-related data for the region that changes over time. A co-training based learning framework may be further applied to co-train a spatial classifier and a temporal classifier based at least on the labeled air quality index data, the spatial features for the region, and the temporal features for the region.

First claim

Opening claim text (preview).

What is claimed is: 1 . One or more computer-readable media storing computer-executable instructions that are executed to cause one or more processors to perform acts comprising: obtaining labeled air quality index data for a pollutant in a region from one or more air quality monitor stations; extracting spatial features for the region from spatially-related data for the region, the spatially-related data including information associated with fixed infrastructures in the region; extracting temporal features for the region from temporally-related data for the region, the temporally-related data including data for the region that changes over time; and applying a co-training based learning framework to co-train a spatial classifier and a temporal classifier based at least on the labeled air quality index data, the spatial features for the region, and the temporal features for the region. 2 . The one or more computer-readable media of claim 1 , wherein the acts further comprise: obtaining spatial features for an area in the region based on spatially-related data observed for the area; obtaining temporal features for the area in the region based on temporally-related data observed for the area; generating a spatial probability score for the pollutant in the area based at least on the spatial features using the spatial classifier; generating a temporal probability score for the pollutant in the area based at least on the temporal features using the temporal classifier; and calculating an air quality index level with respect to the pollutant in the area based at least on the spatial probability score and the temporal probability score. 3 . The one or more computer-readable media of claim 2 , wherein the calculating includes calculating the air quality index level based at least on a product of the spatial probability score and the temporal probability score. 4 . The one or more computer-readable media of claim 2 , wherein the spatially-related data observed for the area includes at least one of road network data or points-of-interest data, and wherein the temporally-related data observed for the area includes at least one of automobile traffic data, human movement data, or meteorological data. 5 . The one or more computer-readable media of claim 2 , wherein the area lacks an air quality monitor station that provides an air quality index level for the pollutant. 6 . The one or more computer-readable media of claim 1 , wherein the applying includes applying the co-training based learning framework to co-train the spatial classifier and the temporal classifier for inferring an air quality index level of the pollutant for an area in the region. 7 . The one or more computer-readable media of claim 1 , wherein the applying the co-training based learning framework includes: training the spatial classifier with the spatial features for the region; training the temporal classifier with temporal features for the region; and applying the spatial classifier and the temporal classifier to infer unlabeled areas iteratively by adding one or more most confidently classified examples into labeled areas in the region for each subsequent training iteration round until remaining unlabeled areas in the region are labeled or a predetermined numbers of iteration rounds have been performed. 8 . The one or more computer-readable media of claim 1 , wherein the spatially-related data for the region includes at least one of road network data or points-of-interest data, and wherein the temporally-related data observed for the region includes at least one of automobile traffic data, human movement data, or meteorological data. 9 . The one or more computer-readable media of claim 1 , wherein the spatial classifier is an artificial neural network (ANN) classifier, and wherein the temporal classifier is one of a linear-chain conditional random field (CRF) classifier, a hidden Markov model (HMM) classifier, or a maximum entropy Markov model classifier. 10 . A computer-implemented method, comprising: applying a co-training based learning framework to co-train a spatial classifier and a temporal classifier based at least on labeled air quality index data from one or more air quality monitor stations in a region, a set of spatial features associated with the region, and a set of temporal features associated with the region; obtaining an additional set of spatial features for an area in the region based on spatially-related data observed for the area; obtaining an additional set of temporal features for the area in the region based on temporally-related data observed for the area; generating a spatial probability score for a pollutant in the area based at least on the additional set of spatial features for the area using the spatial classifier; generating a temporal probability score the pollutant in the area based at least on the additional set of temporal features for the area using the temporal classifier; and calculating an air quality index level with respect to the pollutant in the area based at least on the spatial probability score and the temporal probability score. 11 . The computer-implemented method of claim 10 , further comprising: obtaining the labeled air quality index data for the pollutant in the region from one or more air quality monitor stations; extracting the spatial features associated with the region from spatially-related data for the region, the spatially-related data including information associated with fixed infrastructures in the region; and extracting the temporal features for the region from temporally-related data for the region, the temporally-related data including data for the region that changes over time. 12 . The computer-implemented method of claim 11 , wherein the spatially-related data for the region includes at least one of road network data or points-of-interest data, and wherein the temporally-related data observed for the region includes at least one of automobile traffic data, human movement data, or meteorological data. 13 . The computer-implemented method of claim 10 , wherein the applying the co-training based learning framework includes: training the spatial classifier with the set of spatial features associated with the region; training the temporal classifier with the set of temporal features associated with the region; and applying the spatial classifier and the temporal classifier to infer unlabeled areas iteratively by adding one or more most confidently classified examples into labeled areas in the region for each subsequent training iteration round until remaining unlabeled areas in the region are labeled or a predetermined numbers of iteration rounds have been performed. 14 . The computer-implemented method of claim 10 , wherein the spatially-related data observed for the area includes at least one of road network data or points-of-interest data, and wherein the temporally-related data observed for the area includes at least one of automobile traffic data, human movement data, or meteorological data. 15 . The computer-implemented method of claim 10 , further comprising: calculating a deviation between an air quality index level and a corresponding linear interpolation level for each of multiple pollutants at a plurality of areas for a set of periodic intervals; positioning a corresponding deviation for each pollutant at each of the plurality of areas and each of the periodic intervals in a multi-dimensional grid space; and applying a skyline detection algorithm to deviations in the multi-dimensional grid space to identify one or more areas for air qua

Assignees

Inventors

Classifications

  • G06N7/01Primary

    Probabilistic graphical models, e.g. probabilistic networks · CPC title

  • Weakly supervised learning, e.g. semi-supervised or self-supervised learning · CPC title

  • Feedforward networks · CPC title

  • G06N7/005Primary

    Physics · mapped topic

  • Learning methods · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016125307A1 cover?
The use of data from multiple data source provides inferred air quality indices with respect to a particular pollutant for multiple areas without the addition of air quality monitor stations to those areas. Labeled air quality index data for a pollutant in a region may be obtained from one or more air quality monitor stations. Spatial features for the region may be extracted from spatially-rela…
Who is the assignee on this patent?
Zheng Yu, Xie Xing, Ma Wei-Ying, and 3 more
What technology area does this patent fall under?
Primary CPC classification G06N7/01. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu May 05 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).