Duplicate and similar bug report detection and retrieval using neural networks

US10705795B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10705795-B2
Application numberUS-201715845669-A
CountryUS
Kind codeB2
Filing dateDec 18, 2017
Priority dateDec 19, 2016
Publication dateJul 7, 2020
Grant dateJul 7, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A device may receive information associated with first and second bug reports to be classified as duplicate or non-duplicate bug reports. The device may identify first and second descriptions associated with the first and second bug reports, respectively. The first and second descriptions may be different descriptions having a shared description type. The device may identify a neural network for encoding the first and second descriptions, based on the shared description type. The device may encode the first description into a first vector using the neural network, and may encode the second description into a second vector using the neural network. The device may classify the first and second bug reports as duplicate or non-duplicate bug reports based on the first vector and the second vector. The device may perform an action based on classifying the first and second bug reports as duplicate or non-duplicate bug reports.

First claim

Opening claim text (preview).

What is claimed is: 1. A device, comprising: one or more hardware processors to: receive information associated with a first bug report and a second bug report to be classified as duplicate bug reports or non-duplicate bug reports; identify a first description, associated with the first bug report, and a second description associated with the second bug report, the first description and the second description being different descriptions having a shared description type; identify a neural network, of a plurality of different types of neural networks, for encoding the first description and the second description, based on the shared description type; encode the first description into a first vector using the neural network; encode the second description into a second vector using the neural network; classify the first bug report and the second bug report as duplicate bug reports or non-duplicate bug reports based on the first vector and the second vector; and perform an action based on classifying the first bug report and the second bug report as duplicate bug reports or non-duplicate bug reports. 2. The device of claim 1 , where the one or more hardware processors, when performing the action, are to: retrieve one or more bug reports that share a threshold degree of similarity with the first bug report or the second bug report; and output information associated with the one or more bug reports. 3. The device of claim 1 , where the shared description type is an unstructured data type; and where the one or more hardware processors, when identifying the neural network, are to: identify the neural network based on the shared description type being the unstructured data type. 4. The device of claim 1 , where the shared description type is a short description type indicating a description length less than or equal to a threshold length; and where the one or more hardware processors, when identifying the neural network, are to: identify a bi-directional long short-term memory neural network based on the shared description type being the short description type. 5. The device of claim 1 , where the shared description type is a long description type indicating a description length greater than or equal to a threshold length; and where the one or more hardware processors, when identifying the neural network, are to: identify a convolutional neural network based on the shared description type being the long description type. 6. The device of claim 1 , where the shared description type is a structured description type; and where the one or more hardware processors, when identifying the neural network, are to: identify a single-layer neural network based on the shared description type being the structured description type. 7. The device of claim 1 , where the one or more hardware processors, when classifying the first bug report and the second bug report, are to: classify the first bug report and the second bug report as duplicate bug reports; and where the one or more hardware processors, when performing the action, are to: determine that the first bug report has been resolved; and mark the second bug report as resolved based on determining that the first bug report has been resolved and classifying the first bug report and the second bug report as duplicate bug reports. 8. A method, comprising: receiving, by a device, information associated with a first entity and a second entity to be classified as duplicate entities or non-duplicate entities; identifying, by the device, a first set of descriptions, associated with the first entity, and a second set of descriptions associated with the second entity, each description, included in the first set of descriptions, sharing a description type with a corresponding description included in the second set of descriptions; encoding, by the device, the first set of description into a first set of vectors using a corresponding set of neural networks, each neural network, of the corresponding set of neural networks, operating on a different description type; encoding, by the device, the second set of descriptions into a second set of vectors using the corresponding set of neural networks; classifying, by the device, the first entity and the second entity as duplicate entities or non-duplicate entities based on the first set of vectors and the second set of vectors; and performing, by the device, an action based on classifying the first entity and the second entity as duplicate entities or non-duplicate entities. 9. The method of claim 8 , where a first description, included in the first set of descriptions, and a second description, included in the second set of descriptions, share a short description type indicating a description length less than or equal to a threshold length; where encoding the first set of descriptions comprises: encoding the first description using a bi-directional long short-term memory neural network based on the first description and the second description sharing the short description type; and where encoding the second set of descriptions comprises: encoding the second description using the bi-directional long short-term memory neural network based on the first description and the second description sharing the short description type. 10. The method of claim 8 , where a first description, included in the first set of descriptions, and a second description, included in the second set of descriptions, share a long description type indicating a description length greater than or equal to a threshold length; where encoding the first set of descriptions comprises: encoding the first description using a convolutional neural network based on the first description and the second description sharing the long description type; and where encoding the second set of descriptions comprises: encoding the second description using the convolutional neural network based on the first description and the second description sharing the long description type. 11. The method of claim 8 , where a first description, included in the first set of descriptions, and a second description, included in the second set of descriptions, share a structured description type; where encoding the first set of descriptions comprises: encoding the first description using a single-layer neural network based on the first description and the second description sharing the structured description type; and where encoding the second set of descriptions comprises: encoding the second description using the single-layer neural network based on the first description and the second description sharing the structured description type. 12. The method of claim 8 , where the first set of descriptions includes a first description that shares a first description type with a second description included in the second set of descriptions; where the first set of descriptions includes a third description that shares a second description type with a fourth description included in the second set of descriptions; and where encoding the first set of descriptions and the second set of descriptions comprises: encoding the first description and the second description using a first neural network that corresponds to the first description type; and encoding the third description and the fourth description using a second neural network that corresponds to the second description type. 13. The method of claim 8 , where performing the action comprises: outputting an indication of whether the first entity and the second entity are duplicate entities or non-duplicate entities.

Assignees

Inventors

Classifications

  • G06Q10/06Primary

    Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling · CPC title

  • Recurrent networks, e.g. Hopfield networks · CPC title

  • Combinations of networks · CPC title

  • Transfer learning · CPC title

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10705795B2 cover?
A device may receive information associated with first and second bug reports to be classified as duplicate or non-duplicate bug reports. The device may identify first and second descriptions associated with the first and second bug reports, respectively. The first and second descriptions may be different descriptions having a shared description type. The device may identify a neural network fo…
Who is the assignee on this patent?
Accenture Global Solutions Ltd
What technology area does this patent fall under?
Primary CPC classification G06Q10/06. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 07 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).