Enhanced search result generation using multi-document summarization

US12561375B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12561375-B2
Application numberUS-202418443838-A
CountryUS
Kind codeB2
Filing dateFeb 16, 2024
Priority dateFeb 17, 2023
Publication dateFeb 24, 2026
Grant dateFeb 24, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Enhanced search results are generated using multi-document summarization. A multi-document summarization system receives a search query from a user and retrieves a plurality of search result documents based on the search query. The summarization system generates a summary of each of the plurality of search result documents using distinct per-document summarization machine learning models, where the distinct per-document summarization machine learning models are trained on a training dataset. The summarization system synthesizes the summary of each of the plurality of search result documents into a single-consolidated answer responsive to the received search query. The multi-document summarization system formats the single-consolidated answer to include citations to the plurality of search result documents.

First claim

Opening claim text (preview).

What is claimed is: 1 . A system comprising: one or more hardware processors of a machine; and at least one memory storing instructions that, when executed by the one or more hardware processors, cause the system to perform operations comprising: receiving a search query; retrieving, by at least one hardware processor, a plurality of search result documents based on the search query; generating a summary of each of the plurality of search result documents using distinct per-document summarization machine learning models, the distinct per-document summarization machine learning models being fine-tuned using one or more summarization-specific datasets to perform both extractive summarization and using labels generated by comprehensive models that are larger than the distinct per-document summarization machine learning models; synthesizing the generated summary of each of the plurality of search result documents into a single-consolidated answer responsive to the received search query, the synthesizing comprising performing cross-document attributed summarization of the generated summary of each of the plurality of search result documents while maintaining explicit linkages to source materials and while recognizing and excluding duplicative content from the generated summary of each of the plurality of search result documents, the performing cross-document attributed summarization of the generated summary of each of the plurality of search result documents comprising: selecting a representative sentence from the generated summary of each of the plurality of search result documents for each common particular content identified across the plurality of search result documents; and combining multiple selected representative sentences for each of the common particular content into the single-consolidated answer response using a unified generative artificial intelligence component that synthesizes information from disparate sources while preserving original context; formatting the single-consolidated answer to include citations to the plurality of search result documents; and presenting, in an interactive interface, the single-consolidated answer without requiring navigation to the plurality of search result documents. 2 . The system of claim 1 , the operations comprising: identifying a set of documents from the plurality of search result documents pertinent to a subject of the search query; employing an additional machine learning model to discern particular content within the identified set of documents; and identifying common particular content across the plurality of search result documents. 3 . The system of claim 1 , wherein summarization of multiple documents is performed using a hierarchical approach, first summarizing individual sections within each document and second summarizing collective sections to form the single-consolidated answer. 4 . The system of claim 1 , the operations comprising: identifying, in an automatic manner, a citation for source documents for extracted information; and embedding a hyperlink to the citation within the single-consolidated answer at a location corresponding to the extracted information. 5 . The system of claim 1 , the operations comprising: performing real-time updates to the search query based on detection of new information relevant to the search query, wherein the real-time updates are incorporated into the single-consolidated answer without user intervention. 6 . The system of claim 1 , the operations comprising: supporting multi-turn disambiguation by presenting a set of follow-up questions to a user based on the single-consolidated answer; receiving user responses to the set of follow-up questions; and refining the single-consolidated answer based on the user responses to the set of follow-up questions. 7 . The system of claim 1 , the operations comprising: presenting, via a web browser on a user device, the single-consolidated answer; and receiving user input for query refinement, wherein the single-consolidated answer is dynamically updated based on the user input for the query refinement. 8 . The system of claim 1 , the operations comprising: using one or more asymmetric compression techniques that reduce computational resources used by the distinct per-document summarization machine learning models and that enable the distinct per-document summarization machine learning models to handle concurrent user queries. 9 . A method comprising: receiving a search query; retrieving, by at least one hardware processor, a plurality of search result documents based on the search query; generating a summary of each of the plurality of search result documents using distinct per-document summarization machine learning models, the distinct per-document summarization machine learning models being fine-tuned using one or more summarization-specific datasets to perform both extractive summarization and using labels generated by comprehensive models that are larger than the distinct per-document summarization machine learning models; synthesizing the generated summary of each of the plurality of search result documents into a single-consolidated answer responsive to the received search query, the synthesizing comprising performing cross-document attributed summarization of the generated summary of each of the plurality of search result documents while maintaining explicit linkages to source materials and while recognizing and excluding duplicative content from the generated summary of each of the plurality of search result documents, the performing cross-document attributed summarization of the generated summary of each of the plurality of search result documents comprising: selecting a representative sentence from the generated summary of each of the plurality of search result documents for each common particular content identified across the plurality of search result documents; and combining multiple selected representative sentences for each of the common particular content into the single-consolidated answer response using a unified generative artificial intelligence component that synthesizes information from disparate sources while preserving original context; formatting the single-consolidated answer to include citations to the plurality of search result documents; and presenting, in an interactive interface, the single-consolidated answer without requiring navigation to the plurality of search result documents. 10 . The method of claim 9 , further comprising: identifying a set of documents from the plurality of search result documents pertinent to a subject of the search query; employing an additional machine learning model to discern particular content within the identified set of documents; and identifying common particular content across the plurality of search result documents. 11 . The method of claim 9 , wherein summarization of multiple documents is performed using a hierarchical approach, first summarizing individual sections within each document and second summarizing collective sections to form the single-consolidated answer. 12 . The method of claim 9 , further comprising: identifying, in an automatic manner, a citation for source documents for extracted information; and embedding a hyperlink to the citation within the single-consolidated answer at a location corresponding to the extracted information. 13 . The method of claim 9 , further comprising: performing real-time updates to the search query based on detection of new information relevant to the search query, wherein the real-time updates are incorporated into the single-consolid

Assignees

Inventors

Classifications

  • Presentation of query results · CPC title

  • using context · CPC title

  • Presentation of query results · CPC title

  • Details of hyperlinks; Management of linked annotations · CPC title

  • Document management systems · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12561375B2 cover?
Enhanced search results are generated using multi-document summarization. A multi-document summarization system receives a search query from a user and retrieves a plurality of search result documents based on the search query. The summarization system generates a summary of each of the plurality of search result documents using distinct per-document summarization machine learning models, where…
Who is the assignee on this patent?
Snowflake Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/24575. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 24 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).