Using Hardware Gather-Scatter Capabilities to Optimize MPI All-to-All
US-2018052803-A1 · Feb 22, 2018 · US
US12093208B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12093208-B2 |
| Application number | US-202217862222-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jul 11, 2022 |
| Priority date | May 27, 2022 |
| Publication date | Sep 17, 2024 |
| Grant date | Sep 17, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Technologies for enabling remote direct memory access (RDMA) transport of serialized objects in streaming pipelines are described. One method of a first computing device that stores a serialized object in a first memory can generate a remote descriptor associated with the serialized object. The remote descriptor uniquely identifies the location of the serialized object and a reference count token. The first computing device sends the remote descriptor to a second computing device in the data center over a network fabric. The second computing device uses the remote descriptor to obtain the contiguous block from the first memory for storage at a second memory associated with the second computing device. The value of the reference count token can be updated by receiving a message from the second computing device, and the remote descriptor can be released responsive to the value of the reference count token satisfying a threshold value.
Opening claim text (preview).
What is claimed is: 1. A method of operating a first computing device, the method comprising: generating a remote descriptor associated with a serialized object, wherein the remote descriptor is an object that uniquely identifies a first contiguous block of the serialized object stored in a first memory associated with the first computing device; and sending the remote descriptor to a second computing device, wherein contents of the first contiguous block are obtained from the first memory for storage at a second memory associated with the second computing device when the remote descriptor is processed by the second computing device, wherein: the object uniquely identifies a starting address of the first contiguous block in the first memory, a size of the first contiguous block, a physical machine identifier corresponding to the first memory, a remote direct memory access (RDMA) access key, and a value of a reference count token representing one or more shares of ownership of the serialized object; the second computing device is to use the remote descriptor to obtain contents of the first contiguous block from the first memory for storage at the second memory; and the method further comprises: receiving a message from the second computing device to release the remote descriptor; updating the value of the reference count token responsive to receiving the message; and releasing the serialized object from the first memory responsive to the value of the reference count token satisfying a threshold value. 2. The method of claim 1 , wherein a size of the remote descriptor is less than the size of the first contiguous block. 3. The method of claim 1 , further comprising: receiving a second remote descriptor associated with a second serialized object, wherein the second remote descriptor is a second object that uniquely identifies a second starting address of a second contiguous block in a third memory associated with a third computing device, a second size of the second contiguous block, a second physical machine identifier corresponding to the third memory, a second RDMA access key, and a value of a second reference count token representing one or more shares of ownership of the second serialized object; and performing, using the second remote descriptor, an RDMA GET operation to obtain contents of the second contiguous block from the third memory for transfer to the first memory. 4. The method of claim 3 , wherein receiving the second remote descriptor comprises receiving the second remote descriptor from a fourth computing device. 5. The method of claim 3 , further comprising: sending a second message to the third computing device to release the second remote descriptor, wherein the value of the second reference count token is updated in response to the second message, wherein the second serialized object is released responsive to the value of the second reference count token satisfying a threshold value. 6. The method of claim 1 , further comprising: receiving a second remote descriptor associated with a second serialized object, wherein the second remote descriptor is a second object that uniquely identifies a second starting address of a second contiguous block in a third memory associated with a third computing device, a second size of the second contiguous block, a second physical machine identifier corresponding to the third memory, a second RDMA access key, and a value of a second reference count token representing one or more shares of ownership of the second serialized object; and sending the second remote descriptor to the second computing device or a fourth computing device, wherein sending the second remote descriptor to the second or fourth computing device transfers full ownership of the second serialized object to the second or fourth computing device. 7. The method of claim 1 , further comprising: receiving a second remote descriptor associated with a second serialized object, wherein the second remote descriptor is a second object that uniquely identifies a second starting address of a second contiguous block in a third memory associated with a third computing device, a second size of the second contiguous block, a second physical machine identifier corresponding to the third memory, a second RDMA access key, and a value of a second reference count token representing one or more shares of ownership of the second serialized object; generating a plurality of remote descriptors from the second remote descriptor, wherein a value of the second reference count token is split among the plurality of remote descriptors; and sending each of the plurality of remote descriptors to one of a plurality of computing devices, wherein the plurality of computing devices comprises the second computing device. 8. A computing system comprising: a first computing device; and a first memory coupled to the first computing device, wherein the first computing device is to: generate a remote descriptor associated with a serialized object, wherein the remote descriptor is an object that uniquely identifies a first contiguous block of the serialized object stored in a first memory associated with the first computing device; and send the remote descriptor to a second computing device, wherein contents of the first contiguous block are obtained from the first memory for storage at a second memory associated with the second computing device when the remote descriptor is processed by the second computing device, wherein: the first computing device and the second computing device are part of a data center; the object uniquely identifies a starting address of the first contiguous block in the first memory, a size of the first contiguous block, a physical machine identifier corresponding to the first memory, a remote direct memory access (RDMA) access key, and a value of a reference count token representing one or more shares of ownership of the serialized object; the second computing device, using the remote descriptor, is to obtain contents of the first contiguous block from the first memory for storage at the second memory; the first computing device is further to: receive a message from the second computing device to release the remote descriptor; update the value of the reference count token responsive to receiving the message; and release the serialized object from the first memory responsive to the value of the reference count token satisfying a threshold value. 9. The computing system of claim 8 , wherein a size of the remote descriptor is less than the size of the first contiguous block. 10. The computing system of claim 8 , wherein the first computing device is further to: receive a second remote descriptor associated with a second serialized object, wherein the second remote descriptor is a second object that uniquely identifies a second starting address of a second contiguous block in a third memory associated with a third computing device in the data center, a second size of the second contiguous block, a second physical machine identifier corresponding to the third memory, a second RDMA access key, and a value of a second reference count token representing one or more shares of ownership of the second serialized object; and perform, using the second remote descriptor, an RDMA GET operation to obtain contents of the second contiguous block from the third memory for transfer to the first memory. 11. The computing system of claim 10 , wherein the first computing device is further to receive the second remote descriptor from a fourth computing device. 12. The computing system of claim 10 , wherein the first computing device is further to: send a second m
for remote control or remote monitoring of applications · CPC title
to perform operations on memory · CPC title
for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS] · CPC title
Distributed shared memory [DSM], e.g. remote direct memory access [RDMA] · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.