High Purity Distillation Process Control
US-2020108327-A1 · Apr 9, 2020 · US
US11508480B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11508480-B2 |
| Application number | US-201916554344-A |
| Country | US |
| Kind code | B2 |
| Filing date | Aug 28, 2019 |
| Priority date | Aug 28, 2019 |
| Publication date | Nov 22, 2022 |
| Grant date | Nov 22, 2022 |
Official abstract text for this publication.
A feature vector characterizing a system to be analyzed via online partially rewarded machine learning is obtained. Based on the feature vector, a decision is made, via the machine learning, using an online policy. The system is observed for environmental feedback. In at least a first instance, wherein the observing indicates that the environmental feedback is available, the environmental feedback is obtained. In at least a second instance, wherein the observing indicates that the environmental feedback is missing, the environmental feedback is imputed via an online imputation method. The online policy is updated based on results of the obtained environmental feedback and the online imputation method. A decision is output based on the updated online policy.
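The round described in the abstract (decide, observe, obtain or impute feedback, update) can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the class names `LinearPolicy` and `RunningMeanImputer`, the epsilon-greedy rule, the SGD update, and the running-mean imputer are all hypothetical choices made here for illustration. The abstract does not specify any of them.

```python
import random


class RunningMeanImputer:
    """Hypothetical online imputer: per-action running mean of observed feedback."""

    def __init__(self, n_actions, default=0.0):
        self.sums = [0.0] * n_actions
        self.counts = [0] * n_actions
        self.default = default  # returned before any feedback has been seen

    def update(self, action, reward):
        self.sums[action] += reward
        self.counts[action] += 1

    def impute(self, action):
        if self.counts[action] == 0:
            return self.default
        return self.sums[action] / self.counts[action]


class LinearPolicy:
    """Hypothetical online policy: one linear reward model per action."""

    def __init__(self, n_actions, n_features, lr=0.1, epsilon=0.1, seed=0):
        self.w = [[0.0] * n_features for _ in range(n_actions)]
        self.lr = lr
        self.epsilon = epsilon
        self.rng = random.Random(seed)

    def score(self, action, x):
        return sum(wi * xi for wi, xi in zip(self.w[action], x))

    def decide(self, x):
        # Explore with probability epsilon, otherwise pick the best-scoring action.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.w))
        return max(range(len(self.w)), key=lambda a: self.score(a, x))

    def update(self, x, action, reward):
        # One stochastic-gradient step on the squared prediction error.
        step = self.lr * (reward - self.score(action, x))
        self.w[action] = [wi + step * xi for wi, xi in zip(self.w[action], x)]


def run_round(policy, imputer, x, observe):
    """One round: decide, observe, obtain or impute feedback, update the policy."""
    action = policy.decide(x)
    feedback = observe(action)  # None stands for "feedback is missing"
    imputed = feedback is None
    if imputed:
        feedback = imputer.impute(action)
    else:
        imputer.update(action, feedback)
    policy.update(x, action, feedback)
    return action, feedback, imputed
```

A caller would invoke `run_round` once per incoming feature vector, passing an `observe` callback that returns `None` whenever the environment gives no feedback. Only observed feedback is fed to the imputer here, so imputed values never reinforce themselves; that is a design choice of this sketch.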
Opening claim text (preview).
What is claimed is:
1. A method comprising: obtaining a feature vector characterizing a system to be analyzed via online partially rewarded machine learning; based on said feature vector, making a decision, via said machine learning, using an online policy; observing said system for environmental feedback; in at least a first instance, wherein said observing indicates that said environmental feedback is available, obtaining said environmental feedback; in at least a second instance, wherein said observing indicates that said environmental feedback is missing, imputing said environmental feedback via an online imputation method; updating said online policy based on results of said obtained environmental feedback and said online imputation method; and outputting a decision based on said updated online policy.
2. The method of claim 1, wherein said system comprises a medical system conducting clinical trials.
3. The method of claim 1, wherein said system comprises a human-machine dialog system.
4. The method of claim 1, wherein said system comprises a medical diagnostic system.
5. The method of claim 1, wherein said imputing and said making of said decision comprise applying a rewarded online graph convolutional network by updating weights of said online policy via graph convolutional network back-propagation.
6. The method of claim 1, wherein said making of said decision comprises applying a linear upper confidence bound bandit and wherein said imputation comprises a bounded imputation.
7. The method of claim 1, wherein: making said decision includes retrieving a graph convolutional network embedding of said feature vector and providing same to a linear upper confidence bound bandit to make said decision; imputing said environmental feedback via said online imputation method comprises applying said graph convolutional network; and said updating comprises updating said linear upper confidence bound bandit with said environmental feedback and updating said graph convolutional network with said environmental feedback and said results of said online imputation method.
8. A non-transitory computer readable medium comprising computer executable instructions which when executed by a computer cause the computer to perform a method of: obtaining a feature vector characterizing a system to be analyzed via online partially rewarded machine learning; based on said feature vector, making a decision, via said machine learning, using an online policy; observing said system for environmental feedback; in at least a first instance, wherein said observing indicates that said environmental feedback is available, obtaining said environmental feedback; in at least a second instance, wherein said observing indicates that said environmental feedback is missing, imputing said environmental feedback via an online imputation method; updating said online policy based on results of said obtained environmental feedback and said online imputation method; and outputting a decision based on said updated online policy.
9. The non-transitory computer readable medium of claim 8, wherein said system comprises a medical system conducting clinical trials.
10. The non-transitory computer readable medium of claim 8, wherein said system comprises a human-machine dialog system.
11. The non-transitory computer readable medium of claim 8, wherein said system comprises a medical diagnostic system.
12. The non-transitory computer readable medium of claim 8, wherein said imputing and said making of said decision comprise applying a rewarded online graph convolutional network by updating weights of said online policy via graph convolutional network back-propagation.
13. The non-transitory computer readable medium of claim 8, wherein said making of said decision comprises applying a linear upper confidence bound bandit and wherein said imputation comprises a bounded imputation.
14. The non-transitory computer readable medium of claim 8, wherein: making said decision includes retrieving a graph convolutional network embedding of said feature vector and providing same to a linear upper confidence bound bandit to make said decision; imputing said environmental feedback via said online imputation method comprises applying said graph convolutional network; and said updating comprises updating said linear upper confidence bound bandit with said environmental feedback and updating said graph convolutional network with said environmental feedback and said results of said online imputation method.
15. An apparatus comprising: a memory; and at least one processor, coupled to said memory, and operative to: obtain a feature vector characterizing a system to be analyzed via online partially rewarded machine learning; based on said feature vector, make a decision, via said machine learning, using an online policy; observe said system for environmental feedback; in at least a first instance, wherein said observing indicates that said environmental feedback is available, obtain said environmental feedback; in at least a second instance, wherein said observing indicates that said environmental feedback is missing, impute said environmental feedback via an online imputation method; update said online policy based on results of said obtained environmental feedback and said online imputation method; and output a decision based on said updated online policy.
16. The apparatus of claim 15, wherein said system to be analyzed is selected from the group consisting of a medical system conducting clinical trials and a medical diagnostic system.
17. The apparatus of claim 15, wherein said system comprises a human-machine dialog system.
18. The apparatus of claim 15, wherein said imputing and said making of said decision comprise applying a rewarded online graph convolutional network by updating weights of said online policy via graph convolutional network back-propagation.
19. The apparatus of claim 15, wherein said making of said decision comprises applying a linear upper confidence bound bandit and wherein said imputation comprises a bounded imputation.
20. The apparatus of claim 15, wherein: making said decision includes retrieving a graph convolutional network embedding of said feature vector and providing same to a linear upper confidence bound bandit to make said decision; imputing said environmental feedback via said online imputation method comprises applying said graph convolutional network; and said updating comprises updating said linear upper confidence bound bandit with said environmental feedback and updating said graph convolutional network with said environmental feedback and said results of said online imputation method.
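Claims 6, 13, and 19 pair a linear upper confidence bound (LinUCB) bandit with a bounded imputation. The sketch below shows one way such a pairing might look. The LinUCB arithmetic (ridge estimate plus an exploration bonus) is the standard formulation, but the meaning given to "bounded imputation" here, clipping the arm's own reward estimate to an interval `[low, high]`, is an assumption of this sketch; the claim text does not define the bound. `LinUCBArm`, `bounded_impute`, and `step` are hypothetical names.

```python
import math


class LinUCBArm:
    """One LinUCB arm; stores A^{-1} directly and updates it via Sherman-Morrison."""

    def __init__(self, d):
        # A starts as the identity (ridge regularizer), so A^{-1} is the identity too.
        self.A_inv = [[1.0 if i == j else 0.0 for j in range(d)] for i in range(d)]
        self.b = [0.0] * d

    def _matvec(self, v):
        return [sum(row[j] * v[j] for j in range(len(v))) for row in self.A_inv]

    def estimate(self, x):
        # theta = A^{-1} b; the point estimate of the reward is theta . x
        theta = self._matvec(self.b)
        return sum(t * xi for t, xi in zip(theta, x))

    def ucb(self, x, alpha):
        # Exploration bonus: alpha * sqrt(x^T A^{-1} x)
        quad = sum(xi * vi for xi, vi in zip(x, self._matvec(x)))
        return self.estimate(x) + alpha * math.sqrt(max(quad, 0.0))

    def update(self, x, reward):
        # Sherman-Morrison for A <- A + x x^T (A^{-1} stays symmetric).
        u = self._matvec(x)
        denom = 1.0 + sum(xi * ui for xi, ui in zip(x, u))
        d = len(x)
        self.A_inv = [[self.A_inv[i][j] - u[i] * u[j] / denom for j in range(d)]
                      for i in range(d)]
        self.b = [bi + reward * xi for bi, xi in zip(self.b, x)]


def bounded_impute(arm, x, low=0.0, high=1.0):
    """Assumed form of bounded imputation: clip the arm's estimate to [low, high]."""
    return min(max(arm.estimate(x), low), high)


def step(arms, x, feedback_fn, alpha=1.0):
    """Pick the arm with the highest UCB, then update it with real or imputed reward."""
    chosen = max(range(len(arms)), key=lambda a: arms[a].ucb(x, alpha))
    reward = feedback_fn(chosen)  # None stands for missing feedback
    if reward is None:
        reward = bounded_impute(arms[chosen], x)
    arms[chosen].update(x, reward)
    return chosen, reward
```

Keeping `A_inv` up to date with a rank-one formula avoids a matrix inversion on every round, which matters in an online setting. The default bounds of 0 and 1 assume rewards normalized to that range.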
Probabilistic graphical models, e.g. probabilistic networks · CPC title
Combinations of networks · CPC title
based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO] · CPC title
Backpropagation, e.g. using gradient descent · CPC title
Correlation function computation {including computation of convolution operations (arithmetic circuits for sum of products per se, e.g. multiply-accumulators G06F7/5443; digital filters, e.g. FIR, IIR, adaptive filters H03H17/00)} · CPC title