Readability awareness in natural language processing systems

US9916380B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9916380-B2
Application numberUS-201615159986-A
CountryUS
Kind codeB2
Filing dateMay 20, 2016
Priority dateJan 5, 2016
Publication dateMar 13, 2018
Grant dateMar 13, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Electronic natural language processing in a natural language processing (NLP) system, such as a Question-Answering (QA) system. A receives electronic text input, in question form, and determines a readability level indicator in the question. The readability level indicator includes at least a grammatical error, a slang term, and a misspelling type. The computer determines a readability level for the electronic text input based on the readability level indicator, and retrieves candidate answers based on the readability level.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for electronic natural language processing in an electronic natural language processing (NLP) system, comprising: receiving an electronic text input; determining a readability level indicator of the electronic text input, wherein the readability level indicator comprises at least one of a grammatical error, a slang term, and a misspelling type in the electronic text input; determining a readability level of the electronic text input based on the readability level indicator: generating a plurality of candidate answers for a question; and selecting a set of candidate answers from among the plurality of candidate answers based on matching readability levels of the set of candidate answers to the readability level of the question. 2. The method of claim 1 , further comprising: receiving the electronic text input from an electronic input source in response to an input from a user. 3. The method of claim 1 , further comprising: identifying the electronic text input as the question. 4. The method of claim 1 , wherein identifying at least one of a slang term and a misspelling in the electronic text input comprises identifying, in the electronic text input, one or more of: an abbreviation of a word corresponding to an acronym associated with the word in a collection of text messaging acronyms; and a misspelling of the word, wherein the misspelling corresponds to a phonetic reading of the word. 5. The method of claim 1 , further comprising: parsing the electronic text input using a full parsing process. 6. The method of claim 1 , further comprising: comparing the readability indicators of the electronic text input with readability indicators of one or more questions in a corpus of questions, wherein determining the readability level for the electronic text input is based on readability levels of the one or more questions. 7. The method of claim 6 , further comprising: training a natural language processing model based on the results of the comparison.

Assignees

Inventors

Classifications

  • Summarisation for human users · CPC title

  • using natural language analysis · CPC title

  • G06F40/232Primary

    Orthographic correction, e.g. spell checking or vowelisation · CPC title

  • Grammatical analysis; Style critique · CPC title

  • Parsing · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9916380B2 cover?
Electronic natural language processing in a natural language processing (NLP) system, such as a Question-Answering (QA) system. A receives electronic text input, in question form, and determines a readability level indicator in the question. The readability level indicator includes at least a grammatical error, a slang term, and a misspelling type. The computer determines a readability level fo…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/3344. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 13 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).