Reinforcement learning for h2s abatement

US2021284551A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2021284551-A1
Application numberUS-202117186849-A
CountryUS
Kind codeA1
Filing dateFeb 26, 2021
Priority dateFeb 27, 2020
Publication dateSep 16, 2021
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer implemented method and a system abates the presence of sulphide (H2S(g), H2S(aq) or HS-(aq)) in a wastewater flowing in a specific wastewater network from an upstream pumping pit to a downstream pumping pit or manhole. The computer implemented method and a system includes dosing into the wastewater at a position upstream of the downstream pit or manhole a chemical for abatement of sulphide, determining by use of a sensor the concentration of sulphide at a position downstream of the position at which chemical is dosed into the wastewater, such as located in the downstream manhole. The amount of chemical dosed is determined by use of a general agent and a specific agent.

First claim

Opening claim text (preview).

1 . A method for abating the presence of sulphide (H2S(g), H2S(aq) or HS-(aq)) in a wastewater flowing in a specific wastewater network ( 1 ) from an upstream pumping pit ( 9 ) to a downstream pumping pit or manhole ( 4 ) wherein the specific wastewater network being an existing or planned physical realization of a wastewater network, the method comprising providing the specific wastewater network comprising the upstream pit ( 9 ) and the downstream pit or manhole ( 4 ); providing a dosing unit ( 6 ) configured for dosing a chemical capable of sulphide abatement upstream of the downstream pit or manhole; providing a sensor ( 5 ) downstream of the dosing unit ( 6 ), said sensor being capable of measure concentration of sulphide and provide a readout indicative of the sulphide concentration in a region surrounding the sensor ( 5 ); providing a control unit ( 7 ) configured for providing the dosing unit ( 6 ) with a dosing signal and for receiving as input the readout from the sensor ( 5 ); dosing into the wastewater at a position upstream of the downstream pit or manhole ( 4 ) said chemical for abatement of sulphide, and determining by use of a sensor ( 5 ) the concentration of sulphide at a position downstream of the position at which chemical is dosed into the wastewater, such as located in the downstream manhole ( 4 ); wherein the amount dosed of said chemical for abatement of sulphide is determined by providing a general agent and training the general agent by reinforcement learning (A), to determine a dosing amount of chemical on the basis of a concentration of sulphide, the training of the general agent (A) is based on numerical simulations of a plurality of real wastewater networks, and/or: training the/a general agent (A) by reinforcement learning (B), on a numerical simulation of a/said specific wastewater network to determine a dosing amount of chemical on the basis of a concentration of sulphide; deploying the general agent (A or B) to determine amounts of said chemical for abatement of sulphide to be dosed into said specific wastewater network ( 1 ) and dosing the determined amounts of chemical into the specific wastewater network ( 1 ), wherein the deployment and dosing comprising training by reinforcement learning the general agent (A, B), to obtain a specific trained agent (C), wherein the reinforcement learning comprising: initially, determining by use of the general agent (A, B) an amount of said chemical for abatement of sulphide to be dosed on the basis of a determined concentration of sulphide in the specific wastewater network ( 1 ), dosing the determined amount into the wastewater, and after dosing of said chemical determining the concentration of sulphide in the specific wastewater system and further train the general agent (A, B) to obtain the specific trained agent (C); subsequently determining by use of the specific trained agent (C) an amount of said chemical for abatement of sulphide to be dosed on the basis of a determined concentration of sulphide in the specific wastewater network ( 1 ), dosing the determined amount into the wastewater, and after dosing said chemical determining the concentration of sulphide in the specific wastewater system and further train the specific trained agent (C). 2 . A method according to claim 1 , wherein the further training the specific agent comprising an exploratory element, where the specific agent executes a different amount of dosing of chemical to abate sulphide than what would have been determined by the specific agent, preferably the proportion of times to choose an exploratory action is determined by a probability of taking an exploratory step that, preferably starts high during initial learning and is then annealed towards a very low value after some fixed time or when some conditions are met. 3 . A method according to claim 1 , wherein training the general agent comprising an exploratory element, where the general agent executes a different amount of dosing of chemical to abate sulphide than what would have been determined by the general agent, preferably the proportion of times to choose an exploratory action is determined by a probability of taking an exploratory step that, preferably starts high during initial learning and is then annealed towards a very low value after some fixed time or when some conditions are met. 4 . A method according to claim 1 , wherein the reinforcement learning comprises a reinforcement learning reward, preferably based on the negative absolute difference between a pre-selected level of sulphide concentration and an actual determination of sulphide concentration or based on the negative sum of estimated cost of sulphide and the estimated cost of chemical. 5 . A method according to claim 1 , wherein the reinforcement learning is implemented as a reinforcement learning reward routine based on the negative absolute difference between a pre-selected level of sulphide concentration and an actual determination of sulphide concentration. 6 . A method according to claim 1 , wherein the numerical simulation of a wastewater network system(s) is/are based on the basis of a plurality of data sets from real dosing scenario(s) from wastewater network system(s). 7 . A method according to claim 1 , wherein the determined concentration of sulphide is a value timely averaged over a preselected time, such as over 5.0 minutes, such as over 10.0 minutes. 8 . A method according to claim 1 , wherein the determined concentration of sulphide is determined at preselected points in time, such as at regular intervals. 9 . A method according to claim 1 , wherein the general agent (B) is trained on the basis of simulations on at least some characteristics of the specific wastewater network ( 1 ), wherein the characteristics includes one or more of geometries of the networks system ( 1 ), expected timewise load exposure, expected quality, expected rain, specific information, such as houses connected, number of dimension, and policy from agents acting in other wastewater networks system(s). 10 . A method according to claim 1 , wherein the general agent (A, B) and the specific agent (C) use a policy, for determining the best action given the state of the system and its surroundings, trained on Q learning, deep Q learning, model-based algorithms, actor-critique algorithm, federated learning or transfer_learning the state of the system being e.g. the sulphide concentration history and flow history of the system, the time of the week, the rain in the area, the temperature of the waste water. 11 . A method according to claim 1 , wherein the pre-selected level of sulphide concentration is a concentration interval, such as 5.0±0.1 ppm, preferably 4.0±0.1 ppm, such as 3.0±0.1 ppm, and where in the reinforcement learning comprising providing the specific agent (C) with a negative reward if a determined concentration of sulphide is outside the concentration interval. 12 . A method according to claim 1 , wherein the pre-selected level of sulphide concentration is a concentration value, such as 5.0 ppm, preferably 4.0 ppm, such as 3.0 ppm, and where in the reinforcement learning comprising providing the specific agent (C) with a negative reward if a determined concentration of sulphide is larger or smaller than the concentration value. 13 . A method according to claim 1 , wherein the chemical dosed for abatement of sulphide is iron in one of its common oxidation states, such as Fe 2+ , Fe 3+ and/or Fe 6+ , or NO 3 . 14 . A method according to claim 1 , wherein the specific wastewater network further comprising an inlet ( 14 ) for receiving wastewater provided in the

Assignees

Inventors

Classifications

  • Recurrent networks, e.g. Hopfield networks · CPC title

  • Reinforcement learning · CPC title

  • Transfer learning · CPC title

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title

  • Distributed learning, e.g. federated learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2021284551A1 cover?
A computer implemented method and a system abates the presence of sulphide (H2S(g), H2S(aq) or HS-(aq)) in a wastewater flowing in a specific wastewater network from an upstream pumping pit to a downstream pumping pit or manhole. The computer implemented method and a system includes dosing into the wastewater at a position upstream of the downstream pit or manhole a chemical for abatement of su…
Who is the assignee on this patent?
Grundfos Holding As
What technology area does this patent fall under?
Primary CPC classification C02F1/008. Mapped technology areas include Chemistry & Metallurgy.
When was this patent published?
Publication date Thu Sep 16 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).