Identifying spans using visual recognition

US11120215B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11120215-B2
Application numberUS-201916393145-A
CountryUS
Kind codeB2
Filing dateApr 24, 2019
Priority dateApr 24, 2019
Publication dateSep 14, 2021
Grant dateSep 14, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Aspects of the present disclosure relate to identifying spans within unstructured electronic text. Natural language content is received. A part of speech and slot name of each word within the natural language content is identified. A parse tree representation is then generated based on the natural language content, wherein visual characteristics of each node of a plurality of nodes within the parse tree representation depend on the part of speech and slot name of each word. A bounding box identifying a span category is then generated around a set of nodes on the parse tree representation by a machine learning model.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: receiving natural language content; identifying, by analyzing the natural language content using a natural language processor, a part of speech and slot name of respective words within the natural language content; generating a parse tree representation based on the natural language content, wherein visual characteristics of respective nodes of a plurality of nodes within the parse tree representation depend on the part of speech and slot name of the respective words; generating, by a machine learning model, a bounding box identifying a span category around a set of nodes on the parse tree representation, wherein the machine learning model is trained using supervised learning input data of pre-classified parse tree representations; mapping the set of nodes encompassed by the bounding box to a set of words in the natural language content; and outputting the set of words and the span category to a graphical user interface (GUI). 2. The method of claim 1 , wherein the machine learning model utilizes a region-based convolutional neural network. 3. The method of claim 1 , wherein each unique part of speech is associated with a unique shape, and wherein each unique slot name is associated with a unique texture. 4. The method of claim 1 , wherein the span category is factual or hypothetical. 5. The method of claim 1 , wherein the span category is selected from a group consisting of: planned event, negation, family history, and factual event. 6. A system comprising: a computer-readable storage medium storing instructions, which, when executed by a processor, is configured to cause the processor to perform a method comprising: receiving natural language content; identifying, by analyzing the natural language content using a natural language processor, a part of speech and slot name of respective words within the natural language content; generating a parse tree representation based on the natural language content, wherein visual characteristics of respective nodes of a plurality of nodes within the parse tree representation depend on the part of speech and slot name of the respective words; generating, by a machine learning model, a bounding box identifying a span category around a set of nodes on the parse tree representation, wherein the machine learning model is trained using supervised learning input data of pre-classified parse tree representations; mapping the set of nodes encompassed by the bounding box to a set of words in the natural language content; and outputting the set of words and the span category to a graphical user interface (GUI). 7. The system of claim 6 , wherein the machine learning model utilizes a region-based convolutional neural network. 8. The system of claim 6 , wherein each unique part of speech is associated with a unique shape, and wherein each unique slot name is associated with a unique texture. 9. The system of claim 6 , wherein the span category is factual or hypothetical. 10. The system of claim 6 , wherein the span category is selected from a group consisting of: planned event, negation, family history, and factual event. 11. A computer program product comprising a computer readable storage medium having program instructions embodied therewith, wherein the computer readable storage medium is not a transitory signal per se, the program instructions executable by a processor to cause the processor to perform a method comprising: receiving natural language content; identifying, by analyzing the natural language content using a natural language processor, a part of speech and slot name of respective words within the natural language content; generating a parse tree representation based on the natural language content, wherein visual characteristics of respective nodes of a plurality of nodes within the parse tree representation depend on the part of speech and slot name of the respective words; generating, by a machine learning model, a bounding box identifying a span category around a set of nodes on the parse tree representation, wherein the machine learning model is trained using supervised learning input data of pre-classified parse tree representations; mapping the set of nodes encompassed by the bounding box to a set of words in the natural language content; and outputting the set of words and the span category to a graphical user interface (GUI). 12. The computer program product of claim 11 , wherein the machine learning model utilizes a region-based convolutional neural network. 13. The computer program product of claim 11 , wherein each unique part of speech is associated with a unique shape, and wherein each unique slot name is associated with a unique texture. 14. The computer program product of claim 11 , wherein the span category is factual or hypothetical.

Assignees

Inventors

Classifications

  • Combinations of networks · CPC title

  • Supervised learning · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars · CPC title

  • Lexical analysis, e.g. tokenisation or collocates · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11120215B2 cover?
Aspects of the present disclosure relate to identifying spans within unstructured electronic text. Natural language content is received. A part of speech and slot name of each word within the natural language content is identified. A parse tree representation is then generated based on the natural language content, wherein visual characteristics of each node of a plurality of nodes within the p…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F40/279. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 14 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).