Enhanced searching using fine-tuned machine learning models

US12314318B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12314318-B2
Application numberUS-202418444078-A
CountryUS
Kind codeB2
Filing dateFeb 16, 2024
Priority dateFeb 17, 2023
Publication dateMay 27, 2025
Grant dateMay 27, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An advanced search system leverages a pre-trained large language model to enhance user query responses. The system, equipped with hardware processors, a search query via an interface and accesses a pre-trained large language model designed to respond to the search query. The system fine-tunes the model to generate a task-specific generative model. The system employs the task-specific generative model to generate a search result to the search query and analyzes the search result based on a performance metric associated with the task-specific generative model. The system refines the task-specific generative model based on the analyzing of the search result.

First claim

Opening claim text (preview).

What is claimed is: 1. A system comprising: one or more hardware processors of a machine; and at least one memory storing instructions that, when executed by the one or more hardware processors, cause the system to perform operations comprising: receiving, by the one or more hardware processors, a search query via an interface; accessing a pre-trained large language model designed to respond to the search query; and performing a plurality of iterations, using the pre-trained large language model, to generate a task-specific generative model, each iteration of the plurality of iterations comprising: performing domain-specific pre-training on an index to fine tune the pre-trained large language model; employing the task-specific generative model to generate a search result to the search query; analyzing the search result based on a performance metric associated with the task-specific generative model; and refining the task-specific generative model based on the analyzing of the search result. 2. The system of claim 1 , wherein performing the plurality of iterations further comprises: generating one or more outputs from the task-specific generative model by applying the plurality of iterations on a new search query, the one or more outputs including the search result; and providing the one or more outputs to a user via the interface, wherein the interface is a browser-based interface. 3. The system of claim 2 , wherein performing the plurality of iterations further comprises: receiving user feedback based on the one or more outputs; and utilizing the user feedback to improve accuracy and fluency of the search result generated by the task-specific generative model. 4. The system of claim 1 , wherein performing the plurality of iterations further comprises: evaluating a quality of the task-specific generative model upon conclusion of each iteration, wherein the quality includes a percentage of correct search results; and stopping the plurality of iterations after the quality of the task-specific generative model is satisfactory to a user. 5. The system of claim 1 , the operations further comprising: reducing a size of the task-specific generative model using asymmetric compression techniques including selective pruning of task-specific generative model parameters without identified loss of model performance. 6. The system of claim 1 , wherein performing the domain-specific pre-training on the index to fine tune the pre-trained large language model further comprises: tailoring the pre-trained large language model using proprietary data, the proprietary data including a curated dataset representative of a plurality of types of queries and content associated with a search system. 7. The system of claim 1 , the operations further comprising: applying a reward modeling process to the task-specific generative model to align the search result with human preferences; and improving a quality of the search result based on the reward modeling process, wherein the reward modeling process includes collecting human annotations to define a reward function that approximates human judgments of fluency and relevance associated with the search result. 8. A method comprising: receiving, by one or more hardware processors, a search query via an interface; accessing a pre-trained large language model designed to respond to the search query; and performing a plurality of iterations, using the pre-trained large language model, to generate a task-specific generative model, each iteration comprising: performing domain-specific pre-training on an index to fine tune the pre-trained large language model; employing the task-specific generative model to generate a search result to the search query; analyzing the search result based on a performance metric associated with the task-specific generative model; and refining the task-specific generative model based on the analyzing of the search result. 9. The method of claim 8 , wherein performing the plurality of iterations further comprises: generating one or more outputs from the task-specific generative model by applying the plurality of iterations on a new search query, the one or more outputs including the search result; and providing the one or more outputs to the user. 10. The method of claim 9 , wherein performing the plurality of iterations further comprises: receiving, from the user, user feedback based on the one or more outputs; and utilizing the user feedback to literately improve accuracy and fluency of the search result generated by the task-specific generative model. 11. The method of claim 8 , wherein performing the plurality of iterations further comprises: evaluating a quality of the task-specific generative model at an end of each iteration, wherein the quality includes a percentage of correct search results; and stopping the plurality of iterations after the quality of the task-specific generative model is satisfactory to the user. 12. The method of claim 8 , further comprising: reducing a size of the task-specific generative model using asymmetric compression techniques including selective pruning of task-specific generative model parameters without identified loss of model performance. 13. The method of claim 8 , wherein performing the domain-specific pre-training on the index to fine tune the pre-trained large language model further comprises: tailoring the pre-trained large language model using proprietary data, the proprietary data including a curated dataset representative of a plurality of types of queries and content associated with a search system. 14. The method of claim 8 , further comprising: applying a reward modeling process to the task-specific generative model to align the search result with human preferences; and improving a quality of the search result based on the reward modeling process, wherein the reward modeling process includes collecting human annotations to define a reward function that approximates human judgments of fluency and relevance associated with the search result. 15. A machine-storage medium embodying instructions that, when executed by a machine, cause the machine to perform operations comprising: receiving, by one or more hardware processors, a search query via an interface; accessing a pre-trained large language model designed to respond to the search query; and performing a plurality of iterations, using the pre-trained large language model, to generate a task-specific generative model, each iteration comprising: performing domain-specific pre-training on an index to fine tune the pre-trained large language model; employing the task-specific generative model to generate a search result to the search query; analyzing the search result based on a performance metric associated with the task-specific generative model; and refining the task-specific generative model based on the analyzing of the search result. 16. The machine-storage medium of claim 15 , wherein performing the plurality of iterations further comprises: generating one or more outputs from the task-specific generative model by applying the plurality of iterations on a new search query, the one or more outputs including the search result; and providing the one or more outputs to a user device. 17. The machine-storage medium of claim 16 , wherein performing the plurality of iterations further comprises: receiving user feedback based on the one or more outputs; and utilizing the user feedback to literately improve accuracy and fluency of the search result generated by the

Assignees

Inventors

Classifications

  • Presentation of query results · CPC title

  • using context · CPC title

  • Presentation of query results · CPC title

  • Details of hyperlinks; Management of linked annotations · CPC title

  • Document management systems · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12314318B2 cover?
An advanced search system leverages a pre-trained large language model to enhance user query responses. The system, equipped with hardware processors, a search query via an interface and accesses a pre-trained large language model designed to respond to the search query. The system fine-tunes the model to generate a task-specific generative model. The system employs the task-specific generative…
Who is the assignee on this patent?
Snowflake Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/24575. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 27 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).