Sequence labeling task extraction from inked content
US-2024378915-A1 · Nov 14, 2024 · US
US9424249B1 · US · B1
| Field | Value |
|---|---|
| Publication number | US-9424249-B1 |
| Application number | US-201213622080-A |
| Country | US |
| Kind code | B1 |
| Filing date | Sep 18, 2012 |
| Priority date | Sep 18, 2012 |
| Publication date | Aug 23, 2016 |
| Grant date | Aug 23, 2016 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Disclosed are various embodiments for a text module that receives, in at least one computing device, an encoded text block, the encoded text block comprising user generated text. A set of signals is identified in the encoded text block, each signal specifying a respective text unit, each text unit corresponding to a respective series of characters in the user generated text. The text module may render the user generated text and each series of characters in the user generated text. A text selection of a subset of one of the series of characters is initially prevented. The text module receives a selection of the text unit corresponding to the one of the series of characters, the selection of the text unit triggering a text selection of one of the series of characters.
Opening claim text (preview).
Therefore, the following is claimed: 1. A non-transitory computer-readable medium comprising a plurality of computer instructions executable by a computing device, wherein, when executed, the plurality of computer instructions cause the computing device to at least: obtain a text block; identify a series of characters within the text block according to at least one rule, the series of characters being a subset of the text block; bind the series of characters to generate a text unit; assign a label to the text unit based at least upon content in the text unit, wherein the label specifies that the text unit is a particular class of text unit; encode the text block to generate an encoded text block, wherein the encoded text block specifies the label for the text unit and comprises a first signal that instructs an application to: cause an entirety of the series of characters in the text unit to be selected in response to a first selection of a subset of the series of characters; and cause a text format of the text unit to be visually contrasted from a remainder of the text in the text block; decode the encoded text block to generate a decoded text block, the decoded text block comprising the series of characters bound as the text unit; and encode, in response to a second selection of the subset of the series of characters in the decoded text block, the decoded text block to generate an additional encoded text block, wherein the additional encoded text block comprises metadata indicating an unbinding of the series of characters, and wherein the additional encoded text block comprises a second signal that instructs the application to: cause the label to be removed; and cause the entirety of the series of characters to be treated as being unbound. 2. The non-transitory computer-readable medium of claim 1 , wherein the first signal further instructs an application to cause the entirety of the series of characters in the text unit to be selected in response to a selection of the text unit. 3. The non-transitory computer-readable medium of claim 1 , wherein the at least one rule is directed to identifying at least one of a geographic address, a personal address, a geographic location, a phone number, a commercial item identifier, or a tracking number. 4. The non-transitory computer-readable medium of claim 1 , wherein the label specifies that the series of characters in the text unit indicates a location. 5. The non-transitory computer-readable medium of claim 1 , wherein the label specifies that the series of characters in the text unit indicates a tracking number. 6. The non-transitory computer-readable medium of claim 1 , wherein the label specifies that the series of characters in the text unit indicates an item identifier. 7. The non-transitory computer-readable medium of claim 1 , wherein the label comprises a sub-label that identifies at least one bibliographic component within the text block. 8. A system, comprising: at least one computing device; and a plurality of computer instructions executable by the at least one computing device, wherein the plurality of computer instructions, when executed, cause the at least one computing device to at least: designate a series of characters in a text block as being a text unit; bind the series of characters together in response to the series of characters being designated as the text unit; assign a label to the text unit based at least upon content in the text unit, wherein the label specifies that the text unit is a particular class of text unit; encode the text block to generate an encoded text block for an application, the encoded text block specifying the label for the text unit and comprising a first signal that instructs an application to: cause an entirety of the series of characters in the text unit to be selected in response to a first selection of a subset of the series of characters; and cause a text format of the text unit to be visually contrasted from a remainder of the text block; decode the encoded text block to generate a decoded text block, the decoded text block comprising the series of characters bound as the text unit; and encode, in response to a second selection of the subset of the series of characters in the decoded text block, the decoded text block to generate an additional encoded text block, wherein the additional encoded text block comprises metadata indicating an unbinding of the series of characters as the text unit and comprises a second signal that instructs the application to: cause the label to be removed; and cause the entirety of the series of characters to be treated as being unbound. 9. The system of claim 8 , wherein the series of characters is a first series of characters, wherein the text unit is a first text unit, and wherein the plurality of computer instructions further cause the at least one computing device to at least designate a second series of characters in the text block as a second text unit, wherein the first series of characters overlaps with the second series of characters. 10. The system of claim 8 , wherein the plurality of computer instructions further cause the at least one computing device to at least designate the series of characters in the text block as the text unit according to at least one rule. 11. The system of claim 10 , wherein the at least one rule specifies designating the series of characters as the text unit in response to a presence of at least one sentinel character in the text block. 12. The system of claim 10 , wherein the at least one rule specifies designating the series of characters as a text unit when at least a portion of the series of characters comprises at least one symbol or at least one number. 13. The system of claim 10 , wherein the at least one rule specifies designating the series of characters as a text unit when the series of characters forms at least one of a word, a sentence, or a paragraph. 14. The system of claim 8 , wherein the plurality of computer instructions further cause the at least one computing device to at least designate the series of characters in the text block as the text unit in response to a user-specified text unit designation. 15. The system of claim 8 , wherein the plurality of computer instructions further cause the at least one computing device to at least format a presentation of the text unit to distinguish the text unit from at least a portion of the text block. 16. A method, comprising; receiving, in at least one computing device, an encoded text block, the encoded text block comprising text; identifying, in the at least one computing device, a first signal in the encoded text block, the first signal specifying a text unit corresponding to a series of characters in the text; rendering, in the at least one computing device, the text in the encoded text block; assigning, in the at least one computing device, a label to the text unit based at least upon content in the text unit; selecting, in the at least one computing device, an entirety of the series of characters in response to a first selection of a subset of the series of characters in the text unit; causing a text format of the text unit to be visually contrasted from a remainder of the text block; decoding the encoded text block to generate a decoded text block, the decoded text block comprising the series of characters bound as the text unit; and encoding, in response to a second selection of the subset of the series of characters in the decoded text block, the decoded text block to generate an additional encoded text block, the additional enc
using a plurality of keys or algorithms · CPC title
Semantic analysis · CPC title
Tagging; Marking up (details of markup languages G06F40/143); Designating a block; Setting of attributes (style sheets, e.g. eXtensible Stylesheet Language Transformation [XSLT], G06F40/154) · CPC title
Fragmentation of text files, e.g. creating reusable text-blocks; Linking to fragments, e.g. using XInclude; Namespaces · CPC title
Character encoding · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.