Arabic optical character recognition method using hidden markov models and decision trees

US10140556B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10140556-B2
Application numberUS-201514844713-A
CountryUS
Kind codeB2
Filing dateSep 3, 2015
Priority dateJul 17, 2015
Publication dateNov 27, 2018
Grant dateNov 27, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed is an Arabic optical character recognition method using Hidden Markov Models and decision trees, comprising: receiving an input image containing Arabic text, removing all diacritics from the input image by detecting a bounding box of each diacritic and comparing coordinates thereof to those of a bounding box of a text body, segmenting the input image into four layers, and conducting feature extraction on the segmented four layers, inputting results of feature extraction into a Hidden Markov Model thereby generating HMM models for representing each Arabic character, conducting iterative training of the HMM models until an overall likelihood criterion is satisfied, and inputting results of iterative training into a decision tree thereby predicting locations and the classes of the diacritics and producing final recognition results. The invention is capable of facilitating simple recognition of Arabic by utilizing writing feature thereof, and meanwhile featuring comparatively high recognition precision.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for establishing a HMM-based encoding network using connectivity rules of Arabic optical characters in a recognition process, the method comprising: providing three non-emitting states in the HMM-based encoding network: a beginning non-emitting state disposed at the beginning of said network, an end non-emitting state disposed at the end of said network, and a middle non-emitting state disposed at the middle of said network; connecting said beginning non-emitting state to an isolated position character hidden Markov model, and said isolated position character hidden Markov model to said end non-emitting state; connecting said beginning non-emitting state to a beginning position character hidden Markov model, and said beginning position character hidden Markov model to said middle non-emitting state; connecting said middle non-emitting state to an end position character hidden Markov model, and said end position character hidden Markov model to said end non-emitting state; connecting in parallel said middle non-emitting state to a middle position character hidden Markov model, and connecting said middle position character hidden Markov model back to said middle non-emitting state thereby forming a loop between said middle non-emitting state and said middle position character hidden Markov model; connecting in parallel said middle non-emitting state to an elongation hidden Markov model, and connecting said elongation hidden Markov model back to said middle non-emitting state thereby forming a loop between said middle non-emitting state and said elongation hidden Markov model; and connecting said end non-emitting state to said beginning non-emitting state. 2. The method of claim 1 , further comprising: providing an input for said HMM-based encoding network, the input comprising feature vectors; and using said HMM-based encoding network and a Viterbi decoder to output the most optimal character sequence. 3. The method of claim 2 , further comprising combining an output of said HMM-based encoding network and Viterbi decoder with a diacritics feature using a decision tree, thereby producing a final recognition result.

Assignees

Inventors

Classifications

  • Graphical models, e.g. Bayesian networks or Markov models · CPC title

  • Classification techniques · CPC title

  • Tree-organised classifiers · CPC title

  • G06V30/414Primary

    Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text · CPC title

  • Markov models or related models, e.g. semi-Markov models; Markov random fields; Networks embedding Markov models · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10140556B2 cover?
Disclosed is an Arabic optical character recognition method using Hidden Markov Models and decision trees, comprising: receiving an input image containing Arabic text, removing all diacritics from the input image by detecting a bounding box of each diacritic and comparing coordinates thereof to those of a bounding box of a text body, segmenting the input image into four layers, and conducting f…
Who is the assignee on this patent?
Univ Huazhong Science Tech
What technology area does this patent fall under?
Primary CPC classification G06V30/414. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 27 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).