Segmenting and interpreting a document, and relocating document fragments to corresponding sections

US10169325B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10169325-B2
Application numberUS-201715428480-A
CountryUS
Kind codeB2
Filing dateFeb 9, 2017
Priority dateFeb 9, 2017
Publication dateJan 1, 2019
Grant dateJan 1, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system, comprising an input device configured to receive a first item and a second item, and a processor communicably coupled to the input device and configured to determine that the first item is a fragment matching a lexicon, and place the fragment in a section of a document, the section selected based on the matching lexicon, wherein the processor is configured to perform the determination and the placement after it receives the first item but before it receives the second item.

First claim

Opening claim text (preview).

What is claimed is: 1. A system, comprising: an input device configured to receive a first item and a second item; and a processor communicably coupled to the input device and configured to: determine that the first item is a fragment matching a lexicon; place the fragment in a first section of a document, the first section selected based on the matching lexicon; determine a section type for each fragment of multiple fragments in the first section; determine a first quantity of first fragments of the multiple fragments and a second quantity of second fragments of the multiple fragments, wherein the first fragments correspond to a first section type of the first section and the second fragments correspond to a second section type of a second section of the document; determine that the ratio-first quantity of the first fragments exceeds the second quantity of the second fragments by a predetermined quantity; and based on exceeding the predetermined quantity, re-locate the second fragments to the second section in the document or reclassify the second fragments to correspond to the first section type. 2. The system of claim 1 , wherein the processor is configured to compare the first item to a plurality of lexicons before it determines that the first item is the fragment matching the lexicon. 3. The system of claim 1 , wherein the processor is configured to store the document on a hardware storage device. 4. The system of claim 1 , wherein the processor is configured to request permission to place the fragment in the section. 5. The system of claim 1 , wherein the processor is configured to indicate that the section and another section correspond to the fragment. 6. The system of claim 5 , wherein the processor is configured to receive a user selection of the section, and wherein the processor is configured to place the fragment in the section in response to the receipt of the user selection. 7. The system of claim 1 , wherein the processor is configured to request and receive a user edit to the fragment, and wherein the processor is configured to select the section based at least in part on the user edit. 8. A computer program product comprising a non-transitory computer-readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to: receive a stream of words; dynamically match each of the words to one or more lexicons; dynamically categorize each of the words into one or more sections of a document based on the matching one or more lexicons; store the document to a hardware storage device; determine a section type for each fragment of multiple fragments in a first section of the one or more sections; determine a first quantity of first fragments of the multiple fragments and a second quantity of second fragments of the multiple fragments, wherein the first fragments correspond to a first section type of the first section and the second fragments correspond to a second section type of a second section of the one or more sections of the document; determine that the first quantity of the first fragments exceeds the second quantity of the second fragments by a predetermined quantity: and based on exceeding the predetermined quantity, re-locate the second fragments to the second section in the document or reclassify the second fragments to correspond to the first section type. 9. The computer program product of claim 8 , wherein the document comprises an electronic health record. 10. The computer program product of claim 8 , wherein each of the one or more lexicons corresponds to a different section. 11. The computer program product of claim 8 , wherein the program instructions are executable by a processor to cause the processor to receive at least one other word in the stream after the processor identifies the dynamic match and after the processor performs the dynamic categorization. 12. The computer program product of claim 8 , wherein the program instructions are executable by a processor to cause the processor to receive the stream of words using at least one of a microphone, a keyboard, and a touch screen. 13. A computer program product comprising a non-transitory computer-readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to: receive a first item and a second item; determine that the first item is a fragment matching a lexicon; place the fragment in a first section of a document, the first section selected based on the matching lexicon; segment the document into multiple sections, wherein each of the multiple sections corresponds to a respective section type of multiple section types; segment items in a first section of multiple sections of the document into multiple fragments, wherein the first section corresponds to a first section type; determine a section type of each of the multiple fragments in the first section; determine whether the multiple fragments include fragments that correspond to different section types and that are interspersed among each other in even proportions; and based on the multiple fragments in the first section including fragments that correspond to different section types and that are interspersed among each other in even proportions: determine that the fragments that correspond to different section types and that are interspersed among each other in even proportions do not belong in the first section; generate a new section corresponding to a section type that corresponds to a section type that is different than the multiple section types; and re-locate the fragments that correspond to different section types and that are interspersed among each other in even proportions to the new section. 14. The computer program product of claim 13 , wherein the program instructions are executable by the processor to cause the processor to determine that the multiple fragments include fragments that correspond to different section types and that are interspersed among each other in even proportions when the fragments that correspond to different section types and that are interspersed among each other in even proportions are in an alternating fashion.

Assignees

Inventors

Classifications

  • Clustering; Classification · CPC title

  • Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars · CPC title

  • Indexing; Data structures therefor; Storage structures · CPC title

  • for patient-specific data, e.g. for electronic patient records · CPC title

  • Lexical analysis, e.g. tokenisation or collocates · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10169325B2 cover?
A system, comprising an input device configured to receive a first item and a second item, and a processor communicably coupled to the input device and configured to determine that the first item is a fragment matching a lexicon, and place the fragment in a section of a document, the section selected based on the matching lexicon, wherein the processor is configured to perform the determination…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G16H15/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 01 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).