Machine learning-based program analysis using synthetically generated labeled data
US-11593675-B1 · Feb 28, 2023 · US
US2024370352A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2024370352-A1 |
| Application number | US-202418628773-A |
| Country | US |
| Kind code | A1 |
| Filing date | Apr 7, 2024 |
| Priority date | May 15, 2020 |
| Publication date | Nov 7, 2024 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An automated program repair tool utilizes a neural transformer model with attention to predict the contents of a bug repair in the context of source code having a bug of an identified bug type. The neural transformer model is trained on a large unsupervised corpus of source code using a span-masking denoising optimization objective, and fine-tuned on a large supervised dataset of triplets containing a bug-type annotation, software bug, and repair. The bug-type annotation is derived from an interprocedural static code analyzer. A bug type edit centroid is computed for each bug type and used in the inference decoding phase to generate the bug repair.
Opening claim text (preview).
What is claimed: 1 . A system comprising: a processor; and a memory that stores a program configured to be executed by the processor; wherein the program comprises instructions to perform actions that: obtain a source code snippet with a software bug associated with a bug type; obtain a bug-type edit centroid for the bug type; access a neural transformer model with attention having at least one edit encoder block, at least one context encoder block, and at least one decoder block; and perform a beam search to generate a repaired source code snippet for the source code snippet with the software bug, wherein the beam search invokes the neural transformer model with attention given the bug-type edit centroid and the source code snippet with the software bug to predict each token of the repaired source code snippet autoregressively, wherein the at least one edit encoder block generates an edit embedding for the bug-type edit centroid, wherein the at least one context encoder generates a context embedding for the source code snippet with the software bug, wherein the at least one decoder block generates an output probability distribution given the edit embedding and the context embedding, wherein the output probability distribution associates a probability of a token following a preceding sequence of tokens.
Transfer learning · CPC title
Weakly supervised learning, e.g. semi-supervised or self-supervised learning · CPC title
Supervised learning · CPC title
Auto-encoder networks; Encoder-decoder networks · CPC title
Machine learning · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.