Adaptive self-adversarial negative sampling for graph neural network training

US12505350B2 · US · B2

Patent metadata
Field: Value
Publication number: US-12505350-B2
Application number: US-202117161944-A
Country: US
Kind code: B2
Filing date: Jan 29, 2021
Priority date: Jan 29, 2021
Publication date: Dec 23, 2025
Grant date: Dec 23, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A graph neural network (GNN) training method, system, and computer program product in a graph, include generating, by the computing device, one or more hypothetical edges between two or more nodes of a plurality of nodes of a graph neural network, testing, by the computing device, to determine whether the one or more generated hypothetical edges should be connected by using negative sampling, and permanently connecting, by the computing device, the one or more tested hypothetical edges if the negative sampling indicates the connectivity.

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented graph neural network (GNN) training method, comprising: training a model in a first training iteration of the GNN; evaluating, using the trained model, a gradient caused by a negative sample of a plurality of negative samples; evaluating, using the trained model, a gradient caused by a positive sample, wherein the positive sample corresponds to the negative sample; classifying, using the trained model, the negative sample as a hard negative sample based on: the negative sample causing a high gradient to the trained model for training in a next training iteration, and the gradient caused by the negative sample being not higher than the gradient caused by the corresponding positive sample, wherein the corresponding positive sample is used during the classifying to offset a hardness of the negative sample to reduce false negative cases; selecting the negative sample among the plurality of negative samples, via an Adaptive Self-Adversarial (ASA) negative sampling algorithm based on: self-adversarial negative sampling, the first training iteration of the GNN, and the negative sample being classified as the hard negative sample, wherein a set of decay functions for the ASA automatically increases a threshold of the hardness for the next training iteration relative to the first training iteration; and training the GNN in the next training iteration using the selected negative sample.

2. The computer-implemented GNN training method of claim 1, further comprising uniformly sampling a pool of candidates from possible negative links in the GNN, for reducing a selection space.

3. The computer-implemented GNN training method of claim 2, wherein a graph schema is defined to reduce a negative sample space which filters out a link in the GNN which is incompatible with the graph schema.

4. The computer-implemented GNN training method of claim 1, further comprising utilizing a training strategy for the ASA during an entire GNN training life-cycle.

5. The computer-implemented GNN training method of claim 1, wherein the computer-implemented GNN training method is embodied in a cloud-computing environment.

6. A computer program product for graph neural network (GNN) training, the computer program product comprising a computer-readable storage medium having program instructions embodied therewith, the program instructions executable by a computer to cause the computer to perform: training a model in a first training iteration of the GNN; evaluating, using the trained model, a gradient caused by a negative sample of a plurality of negative samples; evaluating, using the trained model, a gradient caused by a positive sample, wherein the positive sample corresponds to the negative sample; classifying, using the trained model, the negative sample as a hard negative sample based on: the negative sample causing a high gradient to the trained model for training in a next training iteration, and the gradient caused by the negative sample being not higher than the gradient caused by the corresponding positive sample, wherein the corresponding positive sample is used during the classifying to offset a hardness of the negative sample to reduce false negative cases; selecting the negative sample among the plurality of negative samples, via an Adaptive Self-Adversarial (ASA) negative sampling algorithm based on: self-adversarial negative sampling, the first training iteration of the GNN, and the negative sample being classified as the hard negative sample, wherein a set of decay functions for the ASA automatically increases a threshold of the hardness for the next training iteration relative to the first training iteration; and training the GNN in the next training iteration using the selected negative sample.

7. The computer program product of claim 6, further comprising uniformly sampling a pool of candidates from possible negative links in the GNN, for reducing a selection space.

8. The computer program product of claim 7, wherein a graph schema is defined to reduce a negative sample space which filters out a link in the GNN which is incompatible with the graph schema.

9. The computer program product of claim 6, further comprising utilizing a training strategy for the ASA during an entire GNN training life-cycle.

10. The computer program product of claim 6, wherein the computer program product is embodied in a cloud-computing environment.

11. A graph neural network (GNN) training system, comprising: a processor; and a memory, the memory storing instructions to cause the processor to: train a model in a first training iteration of the GNN; evaluate, using the trained model, a gradient caused by a negative sample of a plurality of negative samples; evaluate, using the trained model, a gradient caused by a positive sample, wherein the positive sample corresponds to the negative sample; classify, using the trained model, the negative sample as a hard negative sample based on: the negative sample that causes a high gradient to the trained model to train in a next training iteration, and the gradient caused by the negative sample being not higher than the gradient caused by the corresponding positive sample, wherein the corresponding positive sample is used during the classifying to offset a hardness of the negative sample to reduce false negative cases; select the negative sample among the plurality of negative samples, via an Adaptive Self-Adversarial (ASA) negative sampling algorithm based on: self-adversarial negative sampling, the first training iteration of the GNN, and the negative sample being classified as the hard negative sample, wherein a set of decay functions for the ASA automatically increases a threshold of the hardness for the next training iteration relative to the first training iteration; and train the GNN in the next training iteration using the selected negative sample.

12. The GNN training system of claim 11, wherein the memory further stores instructions to cause the processor to: uniformly sample a pool of candidates from possible negative links in the GNN, for reducing a selection space.

13. The GNN training system of claim 12, wherein a graph schema is defined to reduce a negative sample space which filters out a link in the GNN which is incompatible with the graph schema.

14. The GNN training system of claim 11, wherein the memory further stores instructions to cause the processor to utilize a training strategy for the ASA during an entire GNN training life-cycle.

15. The GNN training system of claim 11, wherein the GNN training system is embodied in a cloud-computing environment.

16. A computer-implemented graph neural network (GNN) training method, comprising: training a model in a first training iteration of the GNN; evaluating, using the trained model, a gradient caused by a negative sample of a plurality of negative samples; evaluating, using the trained model, a gradient caused by a positive sample, wherein the positive sample corresponds to the negative sample; classifying, using the trained model, the negative sample as a hard negative sample based on: the negative sample causing a high gradient to the trained model for training in a next training iteration, and the gradient caused by the negative sample being not higher than the gradient caused by the corresponding positive sample, wherein the corresponding positive sample is used during the classifying to offset a hardness of the negative sample to reduce false negative cases; se
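
The selection steps recited in claims 1 to 3 can be illustrated with a rough Python sketch. It is not the patented implementation, and nearly every concrete choice is an assumption made for illustration: the per-sample "gradient" is approximated by the magnitude of the logistic-loss derivative with respect to a link score, the threshold schedule `hardness_threshold` is one hypothetical decay-based form, the random fallback when no candidate qualifies is invented, and all function and parameter names are made up here.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def grad_negative(score):
    # Assumed proxy: |dL/ds| of the logistic loss -log(1 - sigmoid(s)).
    return sigmoid(score)


def grad_positive(score):
    # Assumed proxy: |dL/ds| of the logistic loss -log(sigmoid(s)).
    return 1.0 - sigmoid(score)


def hardness_threshold(iteration, start=0.1, end=0.6, rate=0.5):
    # Hypothetical schedule: the gap to `end` decays exponentially,
    # so the threshold rises from `start` toward `end` over iterations.
    return end - (end - start) * math.exp(-rate * iteration)


def candidate_pool(head, nodes, node_type, schema, known_links, size, rng):
    # Keep only unobserved links whose (head type, tail type) pair is
    # allowed by the schema, then draw a uniform sample of tails.
    valid = [
        tail for tail in nodes
        if tail != head
        and (head, tail) not in known_links
        and (node_type[head], node_type[tail]) in schema
    ]
    return rng.sample(valid, min(size, len(valid)))


def select_hard_negative(pos_score, candidates, score_fn, threshold, rng):
    # A candidate counts as "hard" when its gradient proxy reaches the
    # threshold but does not exceed that of the corresponding positive.
    g_pos = grad_positive(pos_score)
    scored = [(grad_negative(score_fn(c)), c) for c in candidates]
    hard = [(g, c) for g, c in scored if threshold <= g <= g_pos]
    if not hard:
        # Assumed fallback: pick any candidate uniformly at random.
        return rng.choice(candidates)
    return max(hard, key=lambda pair: pair[0])[1]


if __name__ == "__main__":
    rng = random.Random(0)
    node_type = {"u1": "user", "u2": "user", "i1": "item", "i2": "item", "i3": "item"}
    schema = {("user", "item")}          # only user -> item links are valid
    known = {("u1", "i1")}               # observed (positive) links
    scores = {"i2": -0.5, "i3": 2.0}     # toy model scores for (u1, tail)
    pool = candidate_pool("u1", list(node_type), node_type, schema, known, 10, rng)
    for step in range(3):
        thr = hardness_threshold(step)
        print(step, round(thr, 3), select_hard_negative(0.2, pool, scores.get, thr, rng))
```

Capping the negative's gradient proxy at the positive's value mirrors the claim language about offsetting hardness to reduce false negatives: a candidate that the model scores more confidently than the true link is treated as suspicious rather than hard.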

Assignees

Inventors

Classifications

  • Architecture, e.g. interconnection topology · CPC title

  • Supervised learning · CPC title

  • Knowledge-based neural networks; Logical representations of neural networks · CPC title

  • G06N3/088 · Primary

    Non-supervised learning, e.g. competitive learning · CPC title

  • G06N3/084 · Primary

    Backpropagation, e.g. using gradient descent · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12505350B2 cover?
A graph neural network (GNN) training method, system, and computer program product in a graph, include generating, by the computing device, one or more hypothetical edges between two or more nodes of a plurality of nodes of a graph neural network, testing, by the computing device, to determine whether the one or more generated hypothetical edges should be connected by using negative…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06N3/088. Mapped technology areas include Physics.
When was this patent published?
Publication date Dec 23, 2025 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).