Segmenting and interpreting a document, and relocating document fragments to corresponding sections

US10176164B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10176164-B2
Application numberUS-201715627169-A
CountryUS
Kind codeB2
Filing dateJun 19, 2017
Priority dateFeb 9, 2017
Publication dateJan 8, 2019
Grant dateJan 8, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method comprising receiving an item via a computer input device, comparing the item to a plurality of lexicons using a processor, and determining, using the processor and based upon the comparison, that the item is part of a fragment matching at least one of the plurality of lexicons. The method also comprises storing, using the processor, the fragment in a section of a document determined based on the at least one of the plurality of lexicons, and receiving at least one other item via the computer input device after the fragment is stored in the section.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: receiving a document including an item via a computer input device; comparing the item to a plurality of lexicons using a processor; determining, using the processor and based upon the comparison, that the item is part of a fragment matching at least one of the plurality of lexicons; storing, using the processor, the fragment in a section of the document determined based on the at least one of the plurality of lexicons; receiving at least one other item via the computer input device after the fragment is stored in the section; segmenting the document into multiple sections, wherein each of the multiple sections corresponds to a respective section type of multiple section types; segmenting items in a first section of the multiple sections of the document into multiple fragments, wherein the first section corresponds to a first section type; determining a section type of each of the multiple fragments in the first section; determining whether the multiple fragments include fragments that correspond to different section types and that are interspersed among each other in even proportions; and based on the multiple fragments in the first section including fragments that correspond to different section types and that are interspersed among each other in even proportions: determining that the fragments that correspond to different section types and that are interspersed among each other in even proportions do not belong in the first section; generating a new section corresponding to a section type that is different than the multiple section types; and re-locating the fragments that correspond to different section types and that are interspersed among each other in even proportions to the new section. 2. The method of claim 1 , further comprising determining that the at least one other item is part of another fragment matching a lexicon in the plurality of lexicons. 3. The method of claim 2 , further comprising storing the another fragment in at least one section of the document. 4. The method of claim 1 , further comprising storing the document in a hardware storage device. 5. The method of claim 1 , wherein the item includes at least one of a word, a phrase, a sentence, and a non-alphabetical symbol. 6. The method of claim 1 , wherein determining whether the multiple fragments include the fragments that correspond to the different section types and that are interspersed among each other in even proportions includes determining whether the fragments that correspond to the different section types and that are interspersed among each other in even proportions are disposed in an alternating fashion. 7. A system, comprising: an input device configured to receive a document including an item; and a processor communicably coupled to the input device and configured to: compare the item to a plurality of lexicons; determine, based upon the comparison, that the item is part of a fragment matching at least one of the plurality of lexicons; store the fragment in a section of the document determined based on the at least one of the plurality of lexicons; receive at least one other item via the computer input device after the fragment is stored in the section; segment the document into multiple sections, wherein each of the multiple sections corresponds to a respective section type of multiple section types; segment items in a first section of the multiple sections of the document into multiple fragments, wherein the first section corresponds to a first section type; determine a section type of each of the multiple fragments in the first section; determine whether the multiple fragments include fragments that correspond to different section types and that are interspersed among each other in even proportions; and based on the multiple fragments in the first section including fragments that correspond to different section types and that are interspersed among each other in even proportions: determine that the fragments that correspond to different section types and that are interspersed among each other in even proportions do not belong in the first section; generate a new section corresponding to a section type that is different than the multiple section types; and re-locate the fragments that correspond to different section types and that are interspersed among each other in even proportions to the new section. 8. The system of claim 7 , wherein the processor is further configured to determine that the at least one other item is part of another fragment matching a lexicon in the plurality of lexicons. 9. The system of claim 8 , wherein the processor is further configured to store the another fragment in at least one section of the document. 10. The system of claim 7 , wherein the processor is further configured to store the document in a hardware storage device. 11. The system of claim 7 , wherein the item includes at least one of a word, a phrase, a sentence, and a non-alphabetical symbol. 12. A computer program product comprising a non-transitory computer-readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to: receive a document including an item via a computer input device; compare the item to a plurality of lexicons using a processor; determine, using the processor and based upon the comparison, that the item is part of a fragment matching at least one of the plurality of lexicons; store, using the processor, the fragment in a section of the document determined based on the at least one of the plurality of lexicons; receive at least one other item via the computer input device after the fragment is stored in the section; segment the document into multiple sections, wherein each of the multiple sections corresponds to a respective section type of multiple section types; segment items in a first section of the multiple sections of the document into multiple fragments, wherein the first section corresponds to a first section type; determine a section type of each of the multiple fragments in the first section; determine whether the multiple fragments include fragments that correspond to different section types and that are interspersed among each other in even proportions; and based on the multiple fragments in the first section including fragments that correspond to different section types and that are interspersed among each other in even proportions: determine that the fragments that correspond to different section types and that are interspersed among each other in even proportions do not belong in the first section; generate a new section corresponding to a section type that corresponds to a section type that is different than the multiple section types; and re-locate the fragments that correspond to different section types and that are interspersed among each other in even proportions to the new section. 13. The computer program product of claim 12 , wherein the program instructions are further executable to cause the processor to determine that the at least one other item is part of another fragment matching a lexicon in the plurality of lexicons. 14. The computer program product of claim 13 , wherein the program instructions are further executable to cause the processor to store the another fragment in at least one section of the document. 15. The computer program product of claim 12 , wherein the program instructions are further executable to cause the processor to store the document in a hardware storage device. 1

Assignees

Inventors

Classifications

  • G16H15/00Primary

    ICT specially adapted for medical reports, e.g. generation or transmission thereof · CPC title

  • for patient-specific data, e.g. for electronic patient records · CPC title

  • Recognition of textual entities · CPC title

  • Phrasal analysis, e.g. finite state techniques or chunking · CPC title

  • Interaction with lists of selectable items, e.g. menus · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10176164B2 cover?
A method comprising receiving an item via a computer input device, comparing the item to a plurality of lexicons using a processor, and determining, using the processor and based upon the comparison, that the item is part of a fragment matching at least one of the plurality of lexicons. The method also comprises storing, using the processor, the fragment in a section of a document determined ba…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G16H15/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 08 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).