Systems and methods for recognizing text of interest

US2024233422A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2024233422-A1
Application numberUS-202418616005-A
CountryUS
Kind codeA1
Filing dateMar 25, 2024
Priority dateJul 20, 2021
Publication dateJul 11, 2024
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In some embodiments, apparatuses and methods are provided herein useful to determine text on an object. In some embodiments, there is provided a system to determine text of interest on an object of interest including a control circuit configured to execute a machine learning model trained to identify the text of interest, group into a cluster each character in the text of interest located substantially in the same location in the text of interest, determine a score value of each particular character in the cluster, identify the particular character that has a determined score value corresponding to at least a threshold score value relative to all characters in the cluster, assign the particular character having the determined score value corresponding to at least the threshold score value as a recognized character in the cluster, and output data comprising each recognized character associated with each cluster in the text of interest.

First claim

Opening claim text (preview).

What is claimed is: 1 . A pattern agnostic optical character recognition (OCR) system for determining text of interest on an object of interest, the pattern agnostic OCR system comprising: a control circuit configured to execute a machine learning model trained to: identify the text of interest on the object of interest in a captured digital image; group into a cluster each character in the text of interest located substantially in the same location in the text of interest; determine a score value of each particular character in the cluster based on at least one of sum of a number of occurrences of the particular character in the cluster relative to a total sum of occurrences of all characters in the cluster and an average confidence value associated with the particular character; identify the particular character that has a determined score value corresponding to at least a threshold score value relative to all characters in the cluster; assign the particular character having the determined score value corresponding to at least the threshold score value as a recognized character in the cluster; and output data comprising each recognized character associated with each cluster in the text of interest. 2 . The pattern agnostic OCR system of claim 1 , wherein the object comprises a cargo trailer for use in distribution of goods, the text of interest identifying the cargo trailer. 3 . The pattern agnostic OCR system of claim 1 , wherein the text of interest comprises a corresponding identification associated with the object. 4 . The pattern agnostic OCR system of claim 1 , wherein the character comprises one of a letter, a symbol of an alphabet, or a number, wherein the sum of the number of occurrences of the particular character in the cluster is weighted, and wherein the captured digital image was captured by at least one camera. 5 . The pattern agnostic OCR system of claim 1 , wherein each character in the text of interest is vertically aligned relative to other characters in the text of interest. 6 . The pattern agnostic OCR system of claim 5 , wherein the text of interest comprises one or more characters to be identified, wherein each character in the text of interest corresponds to a node point, and wherein the cluster comprises at least five node points to be grouped into the cluster. 7 . The pattern agnostic OCR system of claim 1 , wherein the text of interest comprises one or more characters to be identified, and wherein each character in the text of interest corresponds to a node point, and wherein each character in the text of interest is horizontally aligned relative to other characters in the text of interest. 8 . The pattern agnostic OCR system of claim 7 , wherein the cluster comprises at least four node points to be grouped into the cluster. 9 . The pattern agnostic OCR system of claim 1 , wherein the text of interest comprises one or more characters to be identified, and wherein each character in the text of interest corresponds to a node point, wherein the control circuit is further configured to determine that a plurality of node points are located substantially in the same location in the text of interest of the captured digital image, and wherein the plurality of node points are grouped into the cluster when each node point of the plurality of node points is not more than seven unit distance from other node points in the plurality of node points. 10 . The pattern agnostic OCR system of claim 9 , wherein the control circuit is further configured to: determine that a plurality of clusters are located substantially in the same location in the text of interest of the captured digital image; and merge two or more clusters of the plurality of clusters based on a unit distance between each cluster of the two or more clusters being less than a threshold merging value. 11 . The pattern agnostic OCR system of claim 10 , wherein the threshold merging value is based on a ratio between a predetermined constant value associated with a total number of clusters associated with the text of interest and an average cluster distance between each cluster of the two or more clusters. 12 . The pattern agnostic OCR system of claim 1 , further comprising a database configured to store a plurality of threshold merging values, each threshold merging value is associated with a possible total number of clusters in the text of interest and usable when a plurality of node points are located substantially in the same location in the text of interest. 13 . The pattern agnostic OCR system of claim 1 , wherein the threshold score value corresponds to a highest score value relative to determined score values of all characters in the cluster. 14 . A method for pattern agnostic optical character recognition (OCR) for determining text of interest on an object of interest, the method comprising: identifying, by a control circuit, text of interest on the object of interest in a captured digital image; grouping, by the control circuit, into a cluster each character in the text of interest located substantially in the same location in the text of interest; determining, by the control circuit, a score value of each particular character in the cluster based on a sum of a number of occurrences of the particular character in the cluster relative to a total sum of occurrences of all characters in the cluster and an average confidence value associated with the particular character; identifying, by the control circuit, the particular character that has a determined score value corresponding to at least a threshold score value relative to all characters in the cluster; assigning, by the control circuit, the particular character having the determined score value corresponding to at least the threshold score value as a recognized character in the cluster; outputting, by the control circuit, each recognized character associated with each cluster in the text of interest. 15 . The method of claim 14 , wherein the object comprises a cargo trailer for use in distribution of goods, the text of interest identifying the cargo trailer. 16 . The method of claim 14 , wherein the text of interest comprises a corresponding identification associated with the object. 17 . The method of claim 14 , wherein the character comprises a letter, a symbol of an alphabet, and a number, wherein the sum of the number of occurrences of the particular character in the cluster is weighted, and wherein the captured digital image was captured by at least one camera. 18 . The method of claim 14 , wherein each character in the text of interest is vertically aligned relative to other characters in the text of interest. 19 . The method of claim 18 , wherein the text of interest comprises one or more characters to be identified, wherein each character in the text of interest corresponds to a node point, and wherein the cluster comprises at least five node points to be grouped into the cluster. 20 . The method of claim 14 , wherein the text of interest comprises one or more characters to be identified, and wherein each character in the text of interest corresponds to a node point, and wherein each character in the text of interest is horizontally aligned relative to other characters in the text of interest. 21 . The method of claim 20 , wherein the cluster comprises at least four node points to be grouped into the cluster. 22 . The method of claim 14 , wherein the text of interest comprises one or m

Assignees

Inventors

Classifications

  • Clustering techniques · CPC title

  • Text, e.g. of license plates, overlay texts or captions on TV images · CPC title

  • Creating or editing images; Combining images with text · CPC title

  • using neural networks · CPC title

  • Detecting or categorising vehicles · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2024233422A1 cover?
In some embodiments, apparatuses and methods are provided herein useful to determine text on an object. In some embodiments, there is provided a system to determine text of interest on an object of interest including a control circuit configured to execute a machine learning model trained to identify the text of interest, group into a cluster each character in the text of interest located subst…
Who is the assignee on this patent?
Walmart Apollo Llc
What technology area does this patent fall under?
Primary CPC classification G06V30/18. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Jul 11 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).