Graph and vector usage for automated QA system

US12346356B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12346356-B2
Application numberUS-202318500761-A
CountryUS
Kind codeB2
Filing dateNov 2, 2023
Priority dateNov 2, 2023
Publication dateJul 1, 2025
Grant dateJul 1, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for conversational query resolution includes receiving a query input from a user. The query input is decomposed into a plurality of tasks. A knowledge graph is queried to identify one or more relevant entities based on at least one of the plurality of tasks. A vector database is searched to identify one or more text chunks that correspond to the one or more relevant entities. Content relevant to at least one of the plurality of tasks is identified from the one or more text chunks. An answer is generated based on the identified content.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: receiving a query input from a user; decomposing the query input into a plurality of tasks; querying a knowledge graph to identify one or more relevant entities based on at least one of the plurality of tasks, wherein each of the one or more entities comprises text information related to a respective portion of a document; searching a vector database to identify one or more text chunks, wherein each of the one or more text chunks comprises contextual information in natural language for a respective identified relevant entity and is linked to the respective identified relevant entity in the knowledge graph using a chunk identifier; identifying content relevant to at least one of the plurality of tasks by parsing the contextual information in natural language within the one or more text chunks; and generating an answer for the query based on the identified content. 2. The computer-implemented method of claim 1 , wherein each of the one or more text chunks comprises text extracted from a corpus of unstructured documents. 3. The computer-implemented method of claim 1 , wherein each respective text chunk is vectorized into a respective vector representation using an embedding model. 4. The computer-implemented method of claim 1 , wherein the content relevant to the plurality of tasks from the one or more text chunks is identified using a vector-based similarity technique. 5. The computer-implemented method of claim 4 , wherein the vector-based similarity technique used to identify the content comprises comparing at least one of (i) cosine similarity or (ii) distance similarity between vector representations of the one or more text chunks and each of the plurality of tasks. 6. The computer-implemented method of claim 1 , further comprising: receiving feedback from the user regarding an accuracy of the answer; and updating the knowledge graph based on the feedback. 7. The computer-implemented method of claim 6 , wherein updating the knowledge graph and the vector database based on the feedback comprises adding one or more new entities to the knowledge graph, wherein each of the one or more new entities comprises data derived from the answer. 8. The computer-implemented method of claim 1 , wherein generating the answer based on the identified content comprises: aggregating the identified content based on logical relationships between each of the plurality of tasks; and generating the answer in natural language based on the aggregated content using natural language processing algorithms. 9. A system comprising: one or more memories collectively storing computer-executable instructions; and one or more processors, wherein the one or more processors are configured to, individually or collectively, perform an operation comprising: receiving a query input from a user; decomposing the query input into a plurality of tasks; querying a knowledge graph to identify one or more relevant entities based on at least one of the plurality of tasks, wherein each of the one or more entities comprises text information related to a respective portion of a document; searching a vector database to identify one or more text chunks, wherein each of the one or more text chunks comprises contextual information in natural language for a respective identified relevant entity and is linked to the respective identified relevant entity in the knowledge graph using a chunk identifier; identifying content relevant to at least one of the plurality of tasks by parsing the contextual information in natural language within the one or more text chunks; and generating an answer for the query based on the identified content. 10. The system of claim 9 , wherein each of the one or more text chunks comprises text extracted from a corpus of unstructured documents. 11. The system of claim 9 , wherein each respective text chunk is vectorized into a respective vector representation using an embedding model. 12. The system of claim 9 , wherein the content relevant to the plurality of tasks from the one or more text chunks is identified using a vector-based similarity technique. 13. The system of claim 12 , wherein the vector-based similarity technique used to identify the content comprises comparing at least one of (i) cosine similarity or (ii) distance similarity between vector representations of the one or more text chunks and each of the plurality of tasks. 14. The system of claim 9 , wherein the computer-executable instructions, when executed by the one or more processors, further cause the system to: receive feedback from the user regarding an accuracy of the answer; and update the knowledge graph based on the feedback. 15. The system of claim 14 , wherein the computer-executable instructions, when executed by the one or more processors, further cause the system to add one or more new entities to the knowledge graph based on the feedback, wherein each of the one or more new entities comprises data derived from the answer. 16. The system of claim 9 , wherein, to generate the answer based on the identified content, the computer-executable instructions, when executed by the one or more processors, further cause the system to: aggregate the identified content based on logical relationships between each of the plurality of tasks; and generate the answer in natural language based on the aggregated content using natural language processing algorithms. 17. One or more computer-readable storage media containing, in any combination, computer program code that, when executed by operation of a computer system, performs operations comprising: receiving a query input from a user; decomposing the query input into a plurality of tasks; querying a knowledge graph to identify one or more relevant entities based on at least one of the plurality of tasks, wherein each of the one or more entities comprises text information related to a respective portion of a document; searching a vector database to identify one or more text chunks, wherein each of the one or more text chunks comprises contextual information in natural language for a respective identified relevant entity and is linked to the respective identified relevant entity in the knowledge graph using a chunk identifier; identifying content relevant to at least one of the plurality of tasks by parsing the contextual information in natural language within the one or more text chunks; and generating an answer for the query based on the identified content.

Assignees

Inventors

Classifications

  • Graphs; Linked lists (G06F16/9027 takes precedence) · CPC title

  • Natural language query formulation · CPC title

  • using natural language analysis · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12346356B2 cover?
A method for conversational query resolution includes receiving a query input from a user. The query input is decomposed into a plurality of tasks. A knowledge graph is queried to identify one or more relevant entities based on at least one of the plurality of tasks. A vector database is searched to identify one or more text chunks that correspond to the one or more relevant entities. Content r…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/3344. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 01 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 11 related publications on this page (citations in our corpus or others sharing the same primary CPC).