Augmented question and answer (Q and A) with large language models

US12511487B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12511487-B2
Application numberUS-202318226303-A
CountryUS
Kind codeB2
Filing dateJul 26, 2023
Priority dateJul 26, 2023
Publication dateDec 30, 2025
Grant dateDec 30, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Large language models (LLMs) are versatile in responding to user questions on a wide variety of topics. However, LLMs suffer from several drawbacks, such as hallucinations, incomplete information, and inability to cite original sources of information. Disclosed herein are systems and methods for using an LLM in a restricted manner to respond to queries regarding document corpora, e.g., documents related to a set of products, such that the impact of these drawbacks is minimized. Information retrieval is coupled with LLMs to build a question and answer (Q&A) system on the text corpora. Complex retrieved information, incorporating human feedback, and recommendations in the Q&A system are provided.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method, comprising: receiving a query from a user device; mapping the query to a latent semantic embedding space; modeling a number of document segments of a number of documents, wherein each of the number of document segments comprises document content, and wherein a distance between any two of the number of document segments in the latent semantic embedding space is proportional to a degree of similarity between the document content thereof; generating a prompt comprising the query and a set of nearest document segments in the latent semantic embedding space matching the query; submitting the prompt to a large language model (LLM) and receiving a response therefrom, wherein the response comprises an answer to the query and indicia of a document segment of the number of document segments comprising document content matching the answer; providing the response to the user device; wherein the document content of at least one document of the number of documents comprises table data, the table data comprising a number of cells having a cell value, the cell value being further associated with at least one of a column heading, a row heading, or a table heading of the table data; and segmenting the table data into at least one of the number of document segments comprises: linearizing the table data to comprise a linearized cell value comprising the cell value, the column, and the heading; and segmenting at topic breakpoints between topics of the linearized table data. 2 . The method of claim 1 , further comprising formatting the response wherein the indicia of the document segment of the number of document segments comprising the document content matching the answer, comprises indicia of one of the number of documents comprising the document content matching the answer. 3 . The method of claim 1 , further comprising: accessing a document corpus having the number of documents; segmenting each document of the number of documents into the number of document segments, each document segment comprising the document content; and plotting each of the number of document segments into the latent semantic embedding space. 4 . The method of claim 3 , wherein segmenting each document of the number of documents into the number of document segments comprises, for at least one document of the number of documents, segmenting at formatting breakpoints of the document content. 5 . The method of claim 3 , wherein segmenting each document of the number of documents into the number of document segments comprises: segmenting at topic breakpoints within the document content. 6 . The method of claim 5 , wherein determining the topic breakpoints within the document content comprises: segmenting the document content into a first number of segments; computing a number of tokens for each of the first number of segments, wherein each token of the number of tokens represents a single word of the document content; accessing a context size of the LLM; estimating a high threshold number of tokens from the context size of the LLM; tokenizing the document content of a segment into a number of tokens; and upon determining the number of tokens is greater than the high threshold number of tokens, resegmenting the document content into a second number of segments that is greater than the first number of segments. 7 . The method of claim 5 , wherein determining the topic breakpoints within the document content comprises: segmenting the document content into a first number of segments; computing a number of tokens for each of the first number of segments, wherein each token of the number of tokens represents a single word of the document content; accessing a context size of the LLM; estimating a low threshold number of tokens from the context size of the LLM; tokenizing the document content of a segment into a number of tokens; and upon determining the number of tokens is less than the low threshold number of tokens, resegmenting the document content into a second number of segments that is less than the first number of segments. 8 . The method of claim 3 , wherein: the document content of at least one document of the number of documents comprises visual data; and the method further comprises extracting at least one textual description from metadata of the at least one document and segmenting the number of document segments of the at least one textual description. 9 . The method of claim 3 , wherein: the document content of at least one document of the number of documents comprises visual data, the visual data further comprising a number of video frames; and the method further comprises extracting at least one textual description from metadata of the at least one of the number of video frames and segmenting the number of document segments of the at least one textual description. 10 . The method of claim 9 , wherein the metadata comprises digital images of text. 11 . A system, comprising: a server, comprising at least one microprocessor coupled to a computer memory storing machine-readable instructions therein; the instructions causing the server to perform: receiving a query from a user device; mapping the query to a latent semantic embedding space, modeling a number of document segments of a number of documents, wherein each of the number of document segments comprises document content, and wherein a distance between any two of the number of document segments in the latent semantic embedding space is proportional to a degree of similarity between the document content thereof; generating a prompt comprising the query and a set of nearest document segments in the latent semantic embedding space matching the query; submitting the prompt to a large language model (LLM) and receiving a response therefrom, wherein the response comprises an answer to the query and indicia of a document segment of the number of document segments comprising document content matching the answer; providing the response to the user device; wherein the document content of at least one document of the number of documents comprises table data, the table data comprising a number of cells having a cell value, the cell value being further associated with at least one of a column heading, a row heading, or a table heading of the table data; and segmenting the table data into at least one of the number of document segments, comprises: linearizing the table data to comprise a linearized cell value comprising the cell value, the column, and the heading; and segmenting at topic breakpoints between topics of the linearized table data. 12 . The system of claim 11 , further comprising formatting the response, wherein the indicia of the document segment of the number of document segments comprising the document content matching the answer, comprises indicia of one of the number of documents comprising the document content matching the answer. 13 . The system of claim 11 , further comprising: accessing a document corpus having the number of documents; segmenting each document of the number of documents into the number of document segments, each document segment comprising the document content; and plotting each of the number of document segments into the latent semantic embedding space. 14 . The system of claim 13 , wherein segmenting each document of the number of documents into the number of document segments comprises, for at least one document of the number of documents, segmenting at formatting breakpoints of the document content. 15 . The system of claim 13 , wherein segmenting

Assignees

Inventors

Classifications

  • G06F40/30Primary

    Semantic analysis · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12511487B2 cover?
Large language models (LLMs) are versatile in responding to user questions on a wide variety of topics. However, LLMs suffer from several drawbacks, such as hallucinations, incomplete information, and inability to cite original sources of information. Disclosed herein are systems and methods for using an LLM in a restricted manner to respond to queries regarding document corpora, e.g., document…
Who is the assignee on this patent?
Micro Focus Llc
What technology area does this patent fall under?
Primary CPC classification G06F40/30. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 30 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).