Implicit dialog approach for creating conversational access to web content

US10824658B2 · US · B2

Patent metadata
Field               Value
Publication number  US-10824658-B2
Application number  US-201816053718-A
Country             US
Kind code           B2
Filing date         Aug 2, 2018
Priority date       Aug 2, 2018
Publication date    Nov 3, 2020
Grant date          Nov 3, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method, apparatus and computer program product for creating a dialog system for web content is described. Knowledge is extracted from a target web application for the dialog system. The knowledge includes an organizational structure of the target web application and domain knowledge pertinent to the target web application. A deep learning process associates the domain knowledge with the organization structure of the target application. A plurality of knowledge sources of different respective types are created from the domain knowledge and the organizational structure. Each of the knowledge sources is used for providing answers to user queries to the dialog system. As part of the invention, a semantic matcher is provided to select among the answers provided by the plurality of knowledge sources for a best answer to a user query.
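
As an illustration only, the final step of the abstract (several knowledge sources each return an answer, and a semantic matcher picks the best one) can be sketched as follows. The patent describes a deep-learning matcher; this sketch swaps in a plain token-overlap (Jaccard) score as a stand-in, and the names `Answer`, `jaccard`, and `select_best_answer`, along with the sample answers, are hypothetical and not taken from the patent.

```python
import re
from dataclasses import dataclass


@dataclass
class Answer:
    source: str  # which knowledge source produced the answer (hypothetical labels)
    text: str    # the candidate answer text


def tokens(text: str) -> set:
    """Lowercase the text and split it into a set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a: str, b: str) -> float:
    """Token-overlap similarity; a simple stand-in for a learned semantic score."""
    ta, tb = tokens(a), tokens(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def select_best_answer(query: str, candidates: list) -> Answer:
    """Return the candidate whose text scores highest against the query."""
    if not candidates:
        raise ValueError("at least one candidate answer is required")
    return max(candidates, key=lambda ans: jaccard(query, ans.text))


# One candidate per knowledge source type named in the claims (sample data is made up).
candidates = [
    Answer("faq", "To reset your password, open Settings and choose Reset password."),
    Answer("knowledge_graph", "Settings contains Billing."),
    Answer("relational_db", "Plan: Basic, price 10"),
]
best = select_best_answer("How do I reset my password?", candidates)
print(best.source)
```

A production matcher would presumably replace `jaccard` with a model trained on the domain knowledge, as claim 2 suggests; the selection logic around it would stay the same.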

First claim

Opening claim text (preview).

The invention claimed is:

1. A method for creating a dialog system for web content comprising: extracting knowledge from a target web application for the dialog system, the knowledge including an organizational structure of the target web application and domain knowledge pertinent to the target web application; using a deep learning process, associating the domain knowledge with the organization structure of the target application; using the associated domain knowledge, creating a plurality of knowledge sources of different respective data storage types from the domain knowledge and the organizational structure, wherein ones of the plurality of knowledge sources are selected from the group of a relational database, a knowledge graph and a non-relational database, each of the knowledge sources for providing answers to user queries to the dialog system; and providing a semantic matcher to select among the answers provided by the plurality of knowledge sources for a best answer to a user query.

2. The method as recited in claim 1, wherein the semantic matcher includes a deep learning process and the method further comprises training the semantic matcher using the domain knowledge.

3. The method as recited in claim 1, further comprising: pointing a crawler to the target web application; identifying a set of web pages used with the target web application; extracting a set of natural language sentence and path pairs from the set of web pages; and creating a set of triples from the set of natural language and path pairs, each triple containing a pair of sentence elements and a relationship between the sentence elements.

4. The method as recited in claim 3, further comprising indexing the set of triples and the set of natural language and path pairs using the organizational structure.

5. The method as recited in claim 1, further comprising providing a query generator for creating respective queries formatted for each of the plurality of knowledge sources in response to a single user input.

6. The method as recited in claim 1, wherein the plurality of knowledge sources comprise a frequently asked question database, a knowledge graph and a relational database, and the method further comprises: using a dialog question and answer construction module to create the frequently asked question database; using a triple construction module to create the knowledge graph; and using a table construction module to create a table in the relational database.

7. The method as recited in claim 1, further comprising providing a domain trained natural language processing service which includes the semantic matcher for determining entities and user intent in a natural language user input using the domain knowledge.

8. The method as recited in claim 1, wherein the semantic matcher selects among a plurality of answers respectively provided by different ones of the plurality of knowledge sources for the best answer to a user query according to the domain knowledge.

9. The method as recited in claim 1, wherein the associating uses at least some unstructured domain knowledge from a source outside the organization structure.

10. The method as recited in claim 1, wherein the plurality of knowledge sources includes a relational database, a knowledge graph and a non-relational base.

11. Apparatus, comprising: a processor; computer memory holding computer program instructions executed by the processor for creating a dialog system for web content, the computer program instructions comprising: program code, operative to extract knowledge from a target web application for the dialog system, the knowledge including an organizational structure of the target web application and domain knowledge pertinent to the target web application; program code, operative to use a deep learning process, associating the domain knowledge with the organization structure of the target application; program code, operative to use the associated domain knowledge to create a plurality of knowledge sources of different respective data storage types from the domain knowledge and the organizational structure, wherein ones of the plurality of knowledge sources are selected from the group of a relational database, a knowledge graph and a non-relational database, each of the knowledge sources for providing answers to user queries to the dialog system; and program code, operative to provide a semantic matcher to select among the answers provided by the plurality of knowledge sources for a best answer to a user query.

12. The apparatus as recited in claim 11, wherein the semantic matcher includes a deep learning process and the method further comprises training the semantic matcher using the domain knowledge.

13. The apparatus as recited in claim 11, further comprising: program code, operative to point a crawler to the target web application; program code, operative to identify a set of web pages used with the target web application; program code, operative to extract a set of natural language sentence and path pairs from the set of web pages; and program code, operative to create a set of triples from the set of natural language and path pairs, each triple containing a pair of sentence elements and a relationship between the sentence elements.

14. The apparatus as recited in claim 13, further comprising program code, operative to index the set of triples and the set of natural language and path pairs using the organizational structure.

15. The apparatus as recited in claim 11, further comprising program code, operative to provide a query generator for creating respective queries formatted for each of the plurality of knowledge sources in response to a single user input.

16. The apparatus as recited in claim 13, wherein the plurality of knowledge sources comprise a frequently asked question database, a knowledge graph and a relational database, and the computer program instructions further comprises: program code, operative to use a dialog question and answer construction module to create the frequently asked question database; program code, operative to use a triple construction module to create the knowledge graph; and program code, operative to use a table construction module to create a table in the relational database.

17. A computer program product in a non-transitory computer readable medium for use in a data processing system, the computer program product holding computer program instructions executed by the data processing system for creating a dialog system for web content, the computer program instructions comprising: program code, operative to extract knowledge from a target web application for the dialog system, the knowledge including an organizational structure of the target web application and domain knowledge pertinent to the target web application; program code, operative to use a deep learning process, associating the domain knowledge with the organization structure of the target application; program code, operative to use the associated domain knowledge to create a plurality of knowledge sources of different respective data storage types from the domain knowledge and the organizational structure, wherein ones of the plurality of knowledge sources are selected from the group of a relational database, a knowledge graph and a non-relational database, each of the knowledge sources for providing answers to user queries to the dialog system; and program code, operative to provide a semantic matcher to select among the answers provided by the plurality of knowledge sources for a best answer to a u
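
For readers who find the claim language abstract, the extraction steps recited in claims 3 and 4 (sentence and path pairs, triples, and indexing by the site's structure) might look roughly like the sketch below. This is an assumption-laden toy: the claims do not say how triples are derived, so a fixed list of relation phrases is used here as a placeholder, and `Triple`, `RELATIONS`, `sentence_path_pairs`, `to_triple`, `index_by_path`, and the sample pages are all hypothetical.

```python
from collections import defaultdict
from typing import NamedTuple, Optional


class Triple(NamedTuple):
    subject: str   # first sentence element
    relation: str  # relationship between the two elements
    obj: str       # second sentence element


# Hypothetical relation phrases; a real system would learn or parse these.
RELATIONS = ("is part of", "contains", "requires")


def sentence_path_pairs(pages: dict) -> list:
    """Flatten {path: [sentences]} into (sentence, path) pairs."""
    return [(s, path) for path, sentences in pages.items() for s in sentences]


def to_triple(sentence: str) -> Optional[Triple]:
    """Split a sentence around the first known relation phrase, if any."""
    text = sentence.strip().rstrip(".")
    for relation in RELATIONS:
        left, sep, right = text.partition(f" {relation} ")
        if sep and left and right:
            return Triple(left.strip(), relation, right.strip())
    return None  # no recognizable relation in this sentence


def index_by_path(pairs: list) -> dict:
    """Group the extracted triples under the page path they came from."""
    index = defaultdict(list)
    for sentence, path in pairs:
        triple = to_triple(sentence)
        if triple is not None:
            index[path].append(triple)
    return dict(index)


# Made-up crawl output: page path -> sentences found on that page.
pages = {
    "/account/settings": [
        "Two-factor login requires a verified phone number.",
        "Welcome back!",
    ],
    "/billing": ["The invoice page is part of Billing."],
}
pairs = sentence_path_pairs(pages)
index = index_by_path(pairs)
print(index["/billing"])
```

Using the page path as the index key is one simple reading of "indexing ... using the organizational structure"; the patent may intend a richer structure such as a site hierarchy.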

Assignees

Inventors

Classifications

  • Recurrent networks, e.g. Hopfield networks

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]

  • Reinforcement learning

  • Supervised learning

  • Weakly supervised learning, e.g. semi-supervised or self-supervised learning

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10824658B2 cover?
A method, apparatus and computer program product for creating a dialog system for web content is described. Knowledge is extracted from a target web application for the dialog system. The knowledge includes an organizational structure of the target web application and domain knowledge pertinent to the target web application. A deep learning process associates the domain knowledge with the organ…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/3329. Mapped technology areas include Physics.
When was this patent published?
Publication date Nov 3, 2020 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).