Machine learning integration in robotic process automation
US-2024304017-A1 · Sep 12, 2024 · US
US10140556B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10140556-B2 |
| Application number | US-201514844713-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 3, 2015 |
| Priority date | Jul 17, 2015 |
| Publication date | Nov 27, 2018 |
| Grant date | Nov 27, 2018 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Disclosed is an Arabic optical character recognition method using Hidden Markov Models and decision trees, comprising: receiving an input image containing Arabic text, removing all diacritics from the input image by detecting a bounding box of each diacritic and comparing coordinates thereof to those of a bounding box of a text body, segmenting the input image into four layers, and conducting feature extraction on the segmented four layers, inputting results of feature extraction into a Hidden Markov Model thereby generating HMM models for representing each Arabic character, conducting iterative training of the HMM models until an overall likelihood criterion is satisfied, and inputting results of iterative training into a decision tree thereby predicting locations and the classes of the diacritics and producing final recognition results. The invention is capable of facilitating simple recognition of Arabic by utilizing writing feature thereof, and meanwhile featuring comparatively high recognition precision.
Opening claim text (preview).
What is claimed is: 1. A method for establishing a HMM-based encoding network using connectivity rules of Arabic optical characters in a recognition process, the method comprising: providing three non-emitting states in the HMM-based encoding network: a beginning non-emitting state disposed at the beginning of said network, an end non-emitting state disposed at the end of said network, and a middle non-emitting state disposed at the middle of said network; connecting said beginning non-emitting state to an isolated position character hidden Markov model, and said isolated position character hidden Markov model to said end non-emitting state; connecting said beginning non-emitting state to a beginning position character hidden Markov model, and said beginning position character hidden Markov model to said middle non-emitting state; connecting said middle non-emitting state to an end position character hidden Markov model, and said end position character hidden Markov model to said end non-emitting state; connecting in parallel said middle non-emitting state to a middle position character hidden Markov model, and connecting said middle position character hidden Markov model back to said middle non-emitting state thereby forming a loop between said middle non-emitting state and said middle position character hidden Markov model; connecting in parallel said middle non-emitting state to an elongation hidden Markov model, and connecting said elongation hidden Markov model back to said middle non-emitting state thereby forming a loop between said middle non-emitting state and said elongation hidden Markov model; and connecting said end non-emitting state to said beginning non-emitting state. 2. The method of claim 1 , further comprising: providing an input for said HMM-based encoding network, the input comprising feature vectors; and using said HMM-based encoding network and a Viterbi decoder to output the most optimal character sequence. 3. The method of claim 2 , further comprising combining an output of said HMM-based encoding network and Viterbi decoder with a diacritics feature using a decision tree, thereby producing a final recognition result.
Graphical models, e.g. Bayesian networks or Markov models · CPC title
Classification techniques · CPC title
Tree-organised classifiers · CPC title
Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text · CPC title
Markov models or related models, e.g. semi-Markov models; Markov random fields; Networks embedding Markov models · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.