Log representation learning for automated system maintenance

US2025094271A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2025094271-A1
Application numberUS-202418829545-A
CountryUS
Kind codeA1
Filing dateSep 10, 2024
Priority dateSep 20, 2023
Publication dateMar 20, 2025
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for log representation learning for automated system maintenance. An optimized parser can transform collected system logs into log templates. A tokenizer can tokenize the log templates partitioned into time windows to obtain log template tokens. The log template tokens can train a language model (LM) with deep learning to obtain a trained LM. The trained LM can detect anomalies from system logs to obtain detected anomalies. A corrective action can be performed on a monitored entity based on the detected anomalies.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method for log representation learning for automated system maintenance, comprising: transforming collected system logs into log templates using an optimized parser; tokenizing the log templates partitioned into time windows to obtain log template tokens; training a language model (LM) with deep learning using the log template tokens to obtain a trained LM; detecting anomalies from system logs using the trained LM to obtain detected anomalies; and performing a corrective action to a monitored entity based on the detected anomalies. 2 . The computer-implemented method of claim 1 , wherein performing a corrective action further comprises updating a medical diagnosis of a patient based on the detected anomalies from system logs, that includes healthcare data of the patient, collected from a healthcare data system. 3 . The computer-implemented method of claim 1 , wherein transforming collected system logs further comprises optimizing a parser to eliminate noise and extraneous information from system logs. 4 . The computer-implemented method of claim 1 , wherein tokenizing the log templates further comprises partitioning system logs into multiple time windows with a fixed window size to capture unique log sequences within a specific time range. 5 . The computer-implemented method of claim 1 , wherein training a large language model further comprises fine-tuning the trained LM using incoming system logs to optimize performance and adaptability. 6 . The computer-implemented method of claim 5 , wherein training a large language model further comprises computing a global loss function that maps log sequences to representation vectors that have an average minimum distances to a center in a latent space in an embedding layer using a recurrent neural network. 7 . The computer-implemented method of claim 6 , wherein training a large language model further comprises computing a local loss function that obtains a sequence of hidden representations that encode sequential dependences in local regions in the latent space. 8 . The computer-implemented method of claim 7 , wherein training a large language model further comprises fusing the global loss function and the local loss function to obtain a fused loss function to train the LM using the fused loss function. 9 . The computer-implemented method of claim 1 , wherein training a large language model further comprises transforming system logs into an embedding layer to preserve relationships between system logs. 10 . A system, comprising: a memory device; and one or more processor devices operatively coupled with the memory device to: transform collected system logs into log templates using an optimized parser; tokenize the log templates partitioned into time windows to obtain log template tokens; train a language model (LM) with deep learning using the log template tokens to obtain a trained LM; detect anomalies from system logs using the trained LM to obtain detected anomalies; and perform a corrective action to a monitored entity based on the detected anomalies. 11 . The system of claim 10 , wherein to perform a corrective action further comprises to update a medical diagnosis of a patient based on the detected anomalies from system logs, that includes healthcare data of the patient, collected from a healthcare data system. 12 . The system of claim 10 , wherein to transform collected system logs further comprises optimizing a parser to eliminate noise and extraneous information from system logs. 13 . The system of claim 10 , wherein to tokenize the log templates further comprises to partition system logs into multiple time windows with a fixed window size to capture unique log sequences within a specific time range. 14 . The system of claim 10 , wherein to train a large language model further comprises to fine-tune the trained LM using incoming system logs to optimize performance and adaptability. 15 . The system of claim 10 , wherein training a large language model further comprises to compute a global loss function that maps log sequences to representation vectors that have an average minimum distances to a center in a latent space in an embedding layer using a recurrent neural network. 16 . The system of claim 15 , wherein training a large language model further comprises computing a local loss function that obtains a sequence of hidden representations that encode sequential dependences in local regions in the latent space. 17 . The system of claim 16 , wherein to train a large language model further comprises to fuse the global loss function and the local loss function to obtain a fused loss function to train the LM using the fused loss function. 18 . The computer-implemented method of claim 1 , wherein training a large language model further comprises transforming system logs into an embedding layer to preserve relationships between system logs. 19 . A non-transitory computer program product comprising a computer-readable storage medium including program code for log representation learning for automated system maintenance, wherein the program code when executed on a computer causes the computer to: transform collected system logs into log templates using an optimized parser; tokenize the log templates partitioned into time windows to obtain log template tokens; train a language model (LM) with deep learning using the log template tokens to obtain a trained LM; detect anomalies from system logs using the trained LM to obtain detected anomalies; and perform a corrective action to a monitored entity based on the detected anomalies. 20 . The non-transitory computer program product of claim 19 , wherein to perform a corrective action further comprises updating a medical diagnosis of a patient based on the detected anomalies from system logs, that includes healthcare data of the patient, collected from a healthcare data system.

Assignees

Inventors

Classifications

  • Timestamp · CPC title

  • Threshold · CPC title

  • where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems (multiprogramming arrangements G06F9/46; allocation of resources G06F9/50) · CPC title

  • Data logging (G06F11/14, G06F11/2205 take precedence) · CPC title

  • Remedial or corrective actions (recovery from an exception in an instruction pipeline G06F9/3861; by retry G06F11/1402; for recovering from a failure of a protocol instance or entity H04L69/40) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2025094271A1 cover?
Systems and methods for log representation learning for automated system maintenance. An optimized parser can transform collected system logs into log templates. A tokenizer can tokenize the log templates partitioned into time windows to obtain log template tokens. The log template tokens can train a language model (LM) with deep learning to obtain a trained LM. The trained LM can detect anomal…
Who is the assignee on this patent?
Nec Lab America Inc
What technology area does this patent fall under?
Primary CPC classification G06F11/0793. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Mar 20 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).