Data versioning in a multi-datacenter topology

US11494394B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11494394-B2
Application numberUS-201916730574-A
CountryUS
Kind codeB2
Filing dateDec 30, 2019
Priority dateDec 30, 2019
Publication dateNov 8, 2022
Grant dateNov 8, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques are disclosed relating to data versioning in a multi-datacenter topology. In various embodiments, a computer system at a first datacenter may execute a query at a first point in time against a particular dataset that is maintained at the first datacenter. The particular dataset includes a first data record with a corresponding version number, where the version number is associated with a value of the first data record at the first point in time. In response to the first query, the computer system may receive a first results dataset that includes the first data record with its corresponding version number. The computer system may then store information indicative of the results dataset that is usable to retrieve, at a subsequent time from a historical version of the particular dataset at a second datacenter, the first data record with the same value as at the first point in time.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: executing, by a computer system at a first point in time, a first query against a particular dataset at a first production datacenter, wherein the particular dataset includes a plurality of data records that are each associated with a corresponding version number, and wherein, for a given one of the plurality of data records, a corresponding version number of the given data record is associated with a value of the given data record at a given point in time; in response to executing the first query, receiving, by the computer system, a first results dataset that includes a first data record of the plurality of data records with a first version number that is associated with a value of the first data record at the first point in time; and storing information indicative of the first results dataset in a data store, wherein the information specifies the first version number associated with the first data record and is usable to retrieve, at a second point in time subsequent to the first point in time from a historical version of the particular dataset stored in a historical database separate from the data store, the first data record with the same value as at the first point in time. 2. The method of claim 1 , further comprising: generating the corresponding version number for each of the plurality of data records. 3. The method of claim 2 , wherein the generating the corresponding version numbers includes generating a universally unique identifier (UUID) for each of the plurality of data records. 4. The method of claim 2 , further comprising: receiving, at the first production datacenter prior to the first point in time, a batch of updated data records associated with the particular dataset, wherein the batch of updated data records includes updated values for each of the plurality of data records, and wherein the generating the corresponding version numbers includes assigning a single version number to all of the plurality of data records. 5. The method of claim 1 , further comprising: maintaining, at a simulation datacenter, the historical version of the particular dataset, wherein the historical version of the particular dataset includes one or more historical values for each of the plurality of data records at different points in time. 6. The method of claim 5 , further comprising: retrieving, by a second computer system at the simulation datacenter, the information indicative of the first results dataset from the data store, wherein the information specifies the first version number associated with the first data record; and retrieving, by the second computer system at the second point in time, the first data record from the historical version of the particular dataset, wherein the first data record retrieved at the second point in time has the same value as at the first point in time. 7. The method of claim 1 , wherein the particular dataset is a risk analytics dataset usable to perform risk detection operations. 8. A method, comprising: retrieving, by a first computer system at a simulation datacenter, information indicative of a first results dataset, wherein the information includes: a first version number associated with a first data record from the first results dataset, wherein the first results dataset was retrieved from a particular dataset at a first point in time, and wherein the first version number is associated with a value of the first data record at the first point in time; maintaining, at the simulation datacenter in a data store, a historical version of the particular dataset, wherein the historical version of the particular dataset includes one or more historical values for the first data record at different points in time; and based on the first version number, retrieving, by the first computer system at a second point in time that is subsequent to the first point in time, the first data record from the historical version of the particular dataset stored in a historical database separate from the data store, wherein the first data record retrieved at the second point in time has the same value as at the first point in time. 9. The method of claim 8 , further comprising: receiving, at the simulation datacenter prior to the first point in time, an updated value for the first data record; generating the first version number associated with the updated value for the first data record; and updating, by the first computer system at the simulation datacenter, the first data record in the historical version of the particular dataset to include the updated value and the first version number. 10. The method of claim 9 , wherein the first version number associated with the updated value for the first data record is generated by the first computer system at the simulation datacenter. 11. The method of claim 9 , wherein the updated value for the first data record is included in a batch of updated data records associated with the particular dataset, wherein the batch of updated data records includes updated values for each of a plurality of data records, and wherein the generating the first version number includes assigning the first version number as an updated version number for all of the plurality of data records. 12. The method of claim 8 , further comprising: receiving, at a first production datacenter prior to the first point in time, an updated value for the first data record; generating, by a second computer system at the first production datacenter, the first version number associated with the updated value for the first data record, wherein the first version number for the first data record generated at the first production datacenter is the same as the first version number for the first data record generated at the simulation datacenter; and updating, by the second computer system at the first production datacenter, the first data record, stored in a copy of the particular dataset maintained at the first production datacenter, to include the updated value and the first version number. 13. The method of claim 12 , further comprising: executing, by the second computer system at the first point in time, a first query against the copy of the particular dataset maintained at the first production datacenter to retrieve the first results dataset. 14. The method of claim 13 , further comprising: subsequent to executing the first query, storing, by the second computer system at the first production datacenter, the information indicative of the first results dataset in a data store accessible to the first computer system at the simulation datacenter. 15. A system, comprising: at least one processor; a memory having instructions stored thereon that are executable by the at least one processor to cause the system to execute a first datacenter and a second datacenter: the first datacenter that includes: a first database storing a copy of a particular dataset; and a first plurality of computer systems, wherein at least one of the first plurality of computer systems stores program instructions that are executable by one or more of the first plurality of computer systems to: receive an updated version of the particular dataset that includes updated values for a plurality of data records, wherein the updated version of the particular dataset is associated with a first point in time; generate a version number for each of the plurality of data records, wherein, for a given one of the plurality of data records, a given corresponding version number is associated with a value of the given data record at the first point in time; and up

Assignees

Inventors

Classifications

  • G06F16/219Primary

    Managing data history or versioning (querying versioned data G06F16/2474; querying temporal data G06F16/2477) · CPC title

  • Sequence data queries, e.g. querying versioned data · CPC title

  • Presentation of query results · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11494394B2 cover?
Techniques are disclosed relating to data versioning in a multi-datacenter topology. In various embodiments, a computer system at a first datacenter may execute a query at a first point in time against a particular dataset that is maintained at the first datacenter. The particular dataset includes a first data record with a corresponding version number, where the version number is associated wi…
Who is the assignee on this patent?
Paypal Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/219. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 08 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).