Data analytic systems

US12105682B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12105682-B2
Application numberUS-202318488826-A
CountryUS
Kind codeB2
Filing dateOct 17, 2023
Priority dateOct 17, 2017
Publication dateOct 1, 2024
Grant dateOct 1, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method comprises receiving, at a build service of a build server, an external dataset and an adaptor application module, the external dataset being in a specific format, the adaptor application module providing information relevant to a build pipeline maintained by the build service for building an output dataset based on the external dataset, the information including changes to the external dataset since a previous build of the output dataset is performed and a data schema used in the previous build, the build pipeline involving data only in one or more formats other than the specific format; incorporating the external dataset into the build pipeline without the external dataset being reformatted in accordance with requirements of the build service; receiving a request from the adaptor application module for specific information relating to a most recent data build run by the build service; providing a response to the adaptor application module.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method, comprising: receiving, at a build service of a build server, an external dataset and an adaptor application module, the external dataset being in a specific format, the adaptor application module providing information relevant to a build pipeline maintained by the build service for building an output dataset based, at least in part, on the external dataset, the information including changes to the external dataset since a previous build of the output dataset is performed and a data schema used in the previous build, the build pipeline involving data only in one or more formats other than the specific format; incorporating the external dataset into the build pipeline without the external dataset being reformatted in accordance with requirements of the build service; receiving a request from the adaptor application module for specific information relating to a most recent data build run by the build service; providing a response to the adaptor application module. 2. The computer-implemented method of claim 1 , further comprising querying the adaptor application module for data related to the external dataset, the information being provided in response to the querying. 3. The computer-implemented method of claim 1 , the information including a title or other metadata of the external dataset. 4. The computer-implemented method of claim 1 , the information indicating a version of data used in the previous build or a version of transformation code used to run the previous build, runtime parameters for the previous build. 5. The computer-implemented method of claim 1 , further comprising: receiving a build request at the build service to perform a first data build on the external dataset; determining that the first data build is scheduled to be at least partially contemporaneous with a second data build on a second dataset; determining, based on the information, that the first data build and the second data build are compatible in that an order of executing the first data build and the second data build does not affect outputs of the first data build and the second data build; causing the first data build and the second data build to be executed at least partially concurrently. 6. The computer-implemented method of claim 1 , further comprising: receiving a build request at the build service to perform a first data build on the external dataset; determining that the first data build is scheduled to be at least partially contemporaneous with a second data build on a second dataset; determining, based on the information, that the first data build and the second data build are incompatible in that an order of executing the first data build and the second data build affects outputs of the first data build and the second data build. 7. The computer-implemented method of claim 6 , the first data build and the second data build reading and/or writing to common data objects with different input data versions, code versions, or build parameters. 8. The computer-implemented method of claim 6 , further comprising causing the first data build and the second data build to be executed consecutively. 9. The computer-implemented method of claim 6 , further comprising: determining priorities associated with the first data build and the second data build; causing executing the first data build and the second data build based on the priorities. 10. The computer-implemented method of claim 1 , further comprising: receiving a build request at the build service to perform a first data build on the external dataset; determining that the first data build is not scheduled to be at least partially contemporaneous with a second data build on a second dataset; causing executing the first data build. 11. One or more non-transitory, computer-readable storage media storing one or more sequences of instructions which, when executed cause one or more processors to perform a method, the method comprising: receiving, at a build service of a build server, an external dataset and an adaptor application module, the external dataset being in a specific format, the adaptor application module providing information relevant to a build pipeline maintained by the build service for building an output dataset based, at least in part, on the external dataset, the information including changes to the external dataset since a previous build of the output dataset is performed and a data schema used in the previous build, the build pipeline involving data only in one or more formats other than the specific format; incorporating the external dataset into the build pipeline without the external dataset being reformatted in accordance with requirements of the build service; receiving a request from the adaptor application module for specific information relating to a most recent data build run by the build service; providing a response to the adaptor application module. 12. The one or more non-transitory, computer-readable storage media of claim 11 , the method further comprising querying the adaptor application module for data related to the external dataset, the information being provided in response to the querying. 13. The one or more non-transitory, computer-readable storage media of claim 11 , the information including a title or other metadata of the external dataset. 14. The one or more non-transitory, computer-readable storage media of claim 11 , the information indicating a version of data used in the previous build or a version of transformation code used to run the previous build, runtime parameters for the previous build. 15. The one or more non-transitory, computer-readable storage media of claim 11 , the method further comprising: receiving a build request at the build service to perform a first data build on the external dataset; determining that the first data build is scheduled to be at least partially contemporaneous with a second data build on a second dataset; determining, based on the information, that the first data build and the second data build are compatible in that an order of executing the first data build and the second data build does not affect outputs of the first data build and the second data build; causing the first data build and the second data build to be executed at least partially concurrently. 16. The one or more non-transitory, computer-readable storage media of claim 11 , the method further comprising: receiving a build request at the build service to perform a first data build on the external dataset; determining that the first data build is scheduled to be at least partially contemporaneous with a second data build on a second dataset; determining, based on the information, that the first data build and the second data build are incompatible in that an order of executing the first data build and the second data build affects outputs of the first data build and the second data build. 17. The one or more non-transitory, computer-readable storage media of claim 16 , the first data build and the second data build reading and/or writing to common data objects with different input data versions, code versions, or build parameters. 18. The one or more non-transitory, computer-readable storage media of claim 16 , further comprising causing the first data build and the second data build to be executed consecutively. 19. The one or more non-transitory, computer-readable storage media of claim 16 , further comprising: determining priorities associated wit

Assignees

Inventors

Classifications

  • of parallel queries · CPC title

  • Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses · CPC title

  • Query execution · CPC title

  • Ensuring data consistency and integrity · CPC title

  • G06F16/211Primary

    Schema design and management · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12105682B2 cover?
A method comprises receiving, at a build service of a build server, an external dataset and an adaptor application module, the external dataset being in a specific format, the adaptor application module providing information relevant to a build pipeline maintained by the build service for building an output dataset based on the external dataset, the information including changes to the external…
Who is the assignee on this patent?
Palantir Technologies Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/24532. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 01 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).