Who is the assignee on this patent?

Microsoft Technology Licensing Llc

What technology area does this patent fall under?

Primary CPC classification G06F21/554. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Jul 29 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Generating security language queries

US12373554B2 · US · B2

Patent metadata
Field	Value
Publication number	US-12373554-B2
Application number	US-202217900394-A
Country	US
Kind code	B2
Filing date	Aug 31, 2022
Priority date	Aug 31, 2022
Publication date	Jul 29, 2025
Grant date	Jul 29, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer-implemented method of generating a security language query from a user input query includes receiving, at a computer system, an input security hunting user query indicating a user intention; selecting, using a trained machine learning model and based on the input security hunting query, an example user security hunting query and corresponding example security language query; generating, using the trained machine learning model, query metadata from the input security hunting query; generating a prompt, the prompt comprising: the input security hunting user query; the selected example user security hunting query and the corresponding example security language query; and the generated query metadata; inputting the prompt to a large language model; receiving a security language query from the large language model corresponding to the input security hunting query reflective of the user intention.

First claim

Opening claim text (preview).

The invention claimed is: 1. A computer-implemented method comprising: receiving a training data set comprising a plurality of user security hunting queries, corresponding ground truth security language queries, and corresponding query metadata; generating a plurality of probe prompts from the training data set, wherein generating each probe prompt comprises: selecting one of the plurality of user security hunting queries as a subject of the probe prompt, and randomly selecting a subset of the plurality of user security hunting queries and corresponding ground truth security language queries as examples for inclusion into the probe prompt; inputting the probe prompts to a large language model; receiving, as output from the large language model, security language queries corresponding to the subjects of the probe prompts; comparing the security language queries for the probe prompts to the corresponding ground truth security language queries; calculating probe scores for the probe prompts based on the comparison; and training a machine learning model based on the calculated probe scores and the training data set, the machine learning model being trained to: receive an input user security hunting user query, generate output scores reflective of a utility of the user security hunting queries and corresponding ground truth security language queries of the training data set to the input user security hunting user query, and select a user security hunting query and corresponding ground truth security language query of the training data set as an example for inclusion into a prompt for input to the large language model based on the output scores, and generate query metadata for inclusion into the prompt for input to the large language model from the input security hunting user query. 2. The method of claim 1 , wherein the trained model comprises: a base large language model; a pooling layer; a ranking head; and a classification head, wherein the ranking head is trained to select the user security hunting query and corresponding ground truth security language query, and the classification head is trained to generate the query metadata. 3. The method of claim 1 , comprising generating the training data set by: receiving an initial training data set comprising a plurality of user security hunting queries, corresponding ground truth security language queries, and corresponding query metadata; receiving a security query set comprising security language queries and corresponding textual descriptions of the security language queries; generating a prompt including: a user security hunting query and corresponding ground truth security language query drawn from the initial training data set, and a security language query and corresponding textual description from the security query set; inputting the prompt to a natural language generation large language model; receiving, as output from the natural language generation large language model, a user security hunting query corresponding to the security language query from the security query set, including the received user security hunting query and the security language query from the security query set in the training data set. 4. The method of claim 1 , comprising: backtranslating the user security hunting queries to generate backtranslated security hunting queries; and including the backtranslated security hunting queries in the training data set. 5. The method of claim 1 , comprising: fitting a linear model to the probe scores; generating, from the linear model, rankings indicative of usefulness of the examples included in the probe prompts to the subjects of the probe prompts; wherein training the machine learning model to generate the output scores to select a user security hunting query and corresponding ground truth security language query of the training data set is based on the ranking. 6. The method of claim 5 , wherein training the machine learning model based on the ranking comprises: selecting a pair of examples from the ranking; inputting the pair of examples to the machine learning model; training the machine learning model to minimize a pairwise cross-entropy loss calculated from the pair of examples. 7. The method of claim 1 , comprising: generating a plurality of hyperparameter probe prompts, wherein generating each hyperparameter probe prompt comprises: selecting one of the plurality of user security hunting queries as a subject of the hyperparameter probe prompt, and randomly selecting values for a hyperparameter of the machine learning model; inputting the generated hyperparameter probe prompts to the large language model; receiving, as output from the large language model, security language queries corresponding to the subjects of the hyperparameter probe prompt; comparing the received security language queries for the hyperparameter probe prompts to the corresponding ground truth security language queries; calculating hyperparameter probe scores for the hyperparameter probe prompts based on the comparison; and training the machine learning model to select a value for the hyperparameter based on the calculated hyperparameter probe scores. 8. A system comprising a processor and a storage, the storage storing computer-readable instructions which, when executed by the processor, cause the system to perform operations comprising: receiving a training data set comprising a plurality of user security hunting queries, corresponding ground truth security language queries, and corresponding query metadata; generating a plurality of probe prompts from the training data set, wherein generating each probe prompt comprises: selecting one of the plurality of user security hunting queries as a subject of the probe prompt, and randomly selecting a subset of the plurality of user security hunting queries and corresponding ground truth security language queries as examples for inclusion into the probe prompt; inputting the probe prompts to a large language model; receiving, as output from the large language model, security language queries corresponding to the subjects of the probe prompts; comparing the security language queries for the probe prompts to the corresponding ground truth security language queries; calculating probe scores for the probe prompts based on the comparison; and training a machine learning model based on the calculated probe scores and the training data set, the machine learning model being trained to: receive an input user security hunting user query, generate output scores reflective of a utility of the user security hunting queries and corresponding ground truth security language queries of the training data set to the input user security hunting user query, and select a user security hunting query and corresponding ground truth security language query of the training data set as an example for inclusion into a prompt for input to the large language model based on the output scores, and generate query metadata for inclusion into the prompt for input to the large language model from the input security hunting user query. 9. The system of claim 8 , wherein the trained model comprises: a base large language model; a pooling layer; a ranking head; and a classification head, wherein the ranking head is trained to select the user security hunting query and corresponding ground truth security language query, and the classification head is trained to generate the query metadata. 10. The system of claim 8 , the operations further comprising generating the training data set by: receiving an initial training data set comprising a plurality of user security hunting que

Assignees

Microsoft Technology Licensing Llc

Inventors

Classifications

G06F40/47
Machine-assisted translation, e.g. using translation memory · CPC title
G06F16/2423
Interactive query statement specification based on a database schema · CPC title
G06F21/566
Dynamic detection, i.e. detection performed at run-time, e.g. emulation, suspicious activities · CPC title
G06F21/554Primary
involving event detection and direct action · CPC title
G06F21/552Primary
involving long-term monitoring or reporting · CPC title

Patent family

Related publications grouped by family.

View patent family 87760348

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12373554B2 cover?: A computer-implemented method of generating a security language query from a user input query includes receiving, at a computer system, an input security hunting user query indicating a user intention; selecting, using a trained machine learning model and based on the input security hunting query, an example user security hunting query and corresponding example security language query; generati…
Who is the assignee on this patent?: Microsoft Technology Licensing Llc
What technology area does this patent fall under?: Primary CPC classification G06F21/554. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Jul 29 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).