System and method for recursive metadata layers on big data sets

US2017017678A1 · US · A1

Patent metadata
Field                Value
Publication number   US-2017017678-A1
Application number   US-201514799293-A
Country              US
Kind code            A1
Filing date          Jul 14, 2015
Priority date        Jul 14, 2015
Publication date     Jan 19, 2017
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The process includes receiving a data set comprising a plurality of rows and a plurality of columns, and applying a first rule based decisioning to the data set to generate a first layer of metadata that comprises at least one of a key, a type indicator, a categorical indicator, and/or a continuous indicator. The first layer of metadata may be descriptive of the data set. The processor may further apply a second rule based decisioning to the first layer to generate a second layer that includes at least one of the key, the type indicator, the categorical indicator, or the continuous indicator. The second layer may be descriptive of the first layer. The process may also include generating an output file from at least one of the first layer or the second layer.
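
The layering described in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method disclosed in the patent: the function names (value_type, describe), the specific rules (uniqueness marks a key, a small distinct count marks a categorical column) and the max_categories threshold are all hypothetical. The sketch relies on the fact that a layer of metadata is itself a table of rows and columns, so the same kind of rule-based pass can be applied to it again.

```python
import json
from typing import Any

Row = dict[str, Any]


def value_type(values: list[Any]) -> str:
    """Hypothetical type rule: classify a column by the Python type of its values."""
    if all(isinstance(v, bool) for v in values):
        return "boolean"
    if all(isinstance(v, (int, float)) and not isinstance(v, bool) for v in values):
        return "numeric"
    return "string"


def describe(rows: list[Row], max_categories: int = 10) -> list[Row]:
    """Apply simple rules to a table and return one metadata row per column."""
    if not rows:
        return []
    layer = []
    for column in rows[0]:
        values = [row[column] for row in rows]
        distinct = len(set(values))
        kind = value_type(values)
        # Assumed rules: unique values mark a key; few distinct values mark a
        # categorical column; remaining numeric columns are continuous.
        is_key = distinct == len(values)
        is_categorical = not is_key and distinct <= max_categories
        is_continuous = kind == "numeric" and not is_key and not is_categorical
        layer.append({
            "column": column,
            "type": kind,
            "key": is_key,
            "categorical": is_categorical,
            "continuous": is_continuous,
        })
    return layer


# Made-up sample data set: a plurality of rows and columns.
rows = [
    {"id": 1, "state": "NY", "balance": 120.5},
    {"id": 2, "state": "NY", "balance": 80.0},
    {"id": 3, "state": "CA", "balance": 43.2},
    {"id": 4, "state": "CA", "balance": 80.0},
]

layer1 = describe(rows, max_categories=2)    # metadata describing the data set
layer2 = describe(layer1, max_categories=2)  # metadata describing layer1

# Output generated from the layers (serialized here as JSON text).
output_text = json.dumps({"layer_1": layer1, "layer_2": layer2}, indent=2)
```

In this sketch the second pass reuses the first pass's rules. The abstract only requires "a second rule based decisioning", so a real system could apply a different rule set at each layer.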

First claim

Opening claim text (preview).

What is claimed is:

1. A method comprising: receiving, by a processor, a data set comprising a plurality of rows and a plurality of columns; applying, by the processor, a first rule based decisioning to the data set to generate a first layer of metadata, wherein the first layer of metadata comprises at least one of a key, a type indicator, a categorical indicator, or a continuous indicator, wherein the first layer of metadata is descriptive of the data set; applying, by the processor, a second rule based decisioning to the first layer of metadata to generate a second layer, wherein the second layer comprises at least one of the key, the type indicator, the categorical indicator, or the continuous indicator, wherein the second layer is descriptive of the first layer of metadata; and generating, by the processor, an output file from at least one of the first layer of metadata or the second layer.

2. The method of claim 1, further comprising running, by the processor, a regular expression on the first layer of metadata.

3. The method of claim 1, further comprising computing, by the processor, percentile calculations for a column of the plurality of columns.

4. The method of claim 1, further comprising formatting, by the processor, the first layer of metadata and the second layer for recursive decisioning.

5. The method of claim 1, wherein the data set is stored on a distributed storage.

6. The method of claim 5, further comprising communicating, by the processor, with the distributed storage across a network.

7. The method of claim 5, wherein the processor is in a node of the distributed storage.

8. A computer-based system, comprising: a processor, a tangible, non-transitory memory configured to communicate with the processor, the tangible, non-transitory memory having instructions stored thereon that, in response to execution by the processor, cause the processor to perform operations comprising: receiving, by the processor, a data set comprising a plurality of rows and a plurality of columns; applying, by the processor, a first rule based decisioning to the data set to generate a first layer of metadata, wherein the first layer of metadata comprises at least one of a key, a type indicator, a categorical indicator, or a continuous indicator, wherein the first layer of metadata is descriptive of the data set; applying, by the processor, a second rule based decisioning to the first layer of metadata to generate a second layer, wherein the second layer comprises at least one of the key, the type indicator, the categorical indicator, or the continuous indicator, wherein the second layer is descriptive of the first layer of metadata; and generating, by the processor, an output file from at least one of the first layer of metadata or the second layer.

9. The computer-based system of claim 8, further comprising running, by the processor, a regular expression on the first layer of metadata.

10. The computer-based system of claim 8, further comprising computing, by the processor, percentile calculations for a column of the plurality of columns.

11. The computer-based system of claim 8, further comprising formatting, by the processor, the first layer of metadata and the second layer for recursive decisioning.

12. The computer-based system of claim 8, wherein the data set is stored on a distributed storage.

13. The computer-based system of claim 12, further comprising communicating, by the processor, with the distributed storage across a network.

14. The computer-based system of claim 12, wherein the processor is in a node of the distributed storage.

15. An article of manufacture including a non-transitory, tangible computer readable storage medium having instructions stored thereon that, in response to execution by a computer-based system, cause the computer-based system to perform operations comprising: receiving, by a processor, a data set comprising a plurality of rows and a plurality of columns; applying, by the processor, a first rule based decisioning to the data set to generate a first layer of metadata, wherein the first layer of metadata comprises at least one of a key, a type indicator, a categorical indicator, or a continuous indicator, wherein the first layer of metadata is descriptive of the data set; applying, by the processor, a second rule based decisioning to the first layer of metadata to generate a second layer, wherein the second layer comprises at least one of the key, the type indicator, the categorical indicator, or the continuous indicator, wherein the second layer is descriptive of the first layer of metadata; and generating, by the processor, an output file from at least one of the first layer of metadata or the second layer.

16. The computer-based system of claim 8, further comprising running, by the processor, a regular expression on the first layer of metadata.

17. The computer-based system of claim 8, further comprising computing, by the processor, percentile calculations for a column of the plurality of columns.

18. The computer-based system of claim 8, further comprising formatting, by the processor, the first layer of metadata and the second layer for recursive decisioning.

19. The computer-based system of claim 8, wherein the data set is stored on a distributed storage system.

20. The computer-based system of claim 12, further comprising communicating, by the processor, with the distributed storage across a network.
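
Claims 2 and 3 add two auxiliary steps: running a regular expression on the first layer of metadata, and computing percentiles for a column. The sketch below shows one way these could look in Python. The claims do not say what the regular expression is for or which percentiles are computed, so the helper names (match_columns, percentiles), the identifier-style pattern, the sample metadata, and the choice of the 25th, 50th and 75th percentiles are all assumptions made for illustration.

```python
import re
import statistics


def match_columns(layer: list[dict], pattern: str) -> list[str]:
    """Return the column names in a metadata layer that match a regular expression."""
    regex = re.compile(pattern)
    return [meta["column"] for meta in layer if regex.search(meta["column"])]


def percentiles(values: list[float], points=(25, 50, 75)) -> dict[int, float]:
    """Compute the requested percentiles for one column (needs at least two values)."""
    # quantiles(n=100) returns the 99 cut points between percentiles 1 and 99.
    cuts = statistics.quantiles(values, n=100, method="inclusive")
    return {p: cuts[p - 1] for p in points}


# Hypothetical first layer of metadata: one row per column of the data set.
first_layer = [
    {"column": "account_id", "type": "numeric"},
    {"column": "state", "type": "string"},
    {"column": "balance", "type": "numeric"},
]

# Assumed use of the regex: flag columns whose names look like identifiers.
id_like = match_columns(first_layer, r"(^|_)id$")

# Percentiles for a made-up numeric column.
balance_percentiles = percentiles([43.2, 80.0, 80.0, 120.5])
```

Here id_like contains only "account_id", and the 50th percentile of the sample column is 80.0.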

Assignees

Inventors

Classifications

  • Physics · mapped topic

  • G06F16/21 · Primary

    Design, administration or maintenance of databases · CPC title

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2017017678A1 cover?
The process includes receiving a data set comprising a plurality of rows and a plurality of columns, and applying a first rule based decisioning to the data set to generate a first layer of metadata that comprises at least one of a key, a type indicator, a categorical indicator, and/or a continuous indicator. The first layer of metadata may be descriptive of the data set. The processor may furt…
Who is the assignee on this patent?
American Express Travel Related Services Co Inc
What technology area does this patent fall under?
Primary CPC classification G06F17/30371. Mapped technology areas include Physics.
When was this patent published?
Publication date: Jan 19, 2017 (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).