Data locality in data integration applications

US9607062B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-9607062-B1
Application numberUS-201514945476-A
CountryUS
Kind codeB1
Filing dateNov 19, 2015
Priority dateNov 19, 2015
Publication dateMar 28, 2017
Grant dateMar 28, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer-implemented method includes identifying a logical node. The logical node is associated with one or more source stages. The computer-implemented method further includes identifying one or more file block components. The one or more file block components include a retrieval target for at least one of the one or more source stages. The computer-implemented method further includes identifying one or more physical nodes and determining, for each of the one or more physical nodes, a degree value. A corresponding computer program product and computer system are also disclosed.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: identifying a logical node, said logical node being associated with one or more source stages; identifying one or more file block components, said one or more file block components comprising a retrieval target for at least one of said one or more source stages; identifying one or more physical nodes; determining, for each of said one or more physical nodes, a degree value; identifying one or more qualified physical nodes from said one or more physical nodes having said degree value of one or more; creating a preferred physical node table, said preferred physical node table comprising, for each of said one or more qualified physical nodes, an identifying indication and an indication of said degree value; and sorting said preferred physical node table based on said degree value associated with each of said one or more qualified physical nodes. 2. A computer-implemented method comprising: identifying a logical node, said logical node being associated with one or more source stages; identifying one or more file block components, said one or more file block components comprising a retrieval target for at least one of said one or more source stages; identifying one or more physical nodes; determining, for each of said one or more physical nodes, a degree value; determining a candidate preferred physical node based on each said degree value; determining whether said candidate preferred physical node is available for allocation to said logical node; and responsive to said candidate preferred physical node being available for allocation to said logical node, allocating said candidate preferred physical node to said logical node. 3. The computer-implemented method of claim 2 , further comprising: responsive to said candidate preferred physical node not being available for allocation to said logical node: marking said candidate preferred physical node as unavailable for allocation to said logical node; and determining an alternative candidate preferred physical node based on each said degree value. 4. A computer program product comprising one or more computer readable storage media and program instructions stored on said one or more computer readable storage media, said program instructions comprising instructions to: identify a logical node, said logical node being associated with one or more source stages; identify one or more file block components, said one or more file block components comprising a retrieval target for at least one of said one or more source stages; identify one or more physical nodes; determine, for each of said one or more physical nodes, a degree value; identify one or more qualified physical nodes from said one or more physical nodes having said degree value of one or more; create a preferred physical node table, said preferred physical node table comprising, for each of said one or more qualified physical nodes, an identifying indication and an indication of said degree value; and sort said preferred physical node table based on said degree value associated with each of said one or more qualified physical nodes. 5. The computer program product of claim 4 , wherein said program instructions further comprise instructions to: determine a candidate preferred physical node based on each said degree value. 6. The computer program product of claim 5 , wherein said program instructions further comprise instructions to: determine whether said candidate preferred physical node is available for allocation to said logical node; and responsive to said candidate preferred physical node being available for allocation to said logical node, allocate said candidate preferred physical node to said logical node. 7. The computer program product of claim 6 , wherein said program instructions further comprise instructions to: responsive to said candidate preferred physical node not being available for allocation to said logical node: mark said candidate preferred physical node as unavailable for allocation to said logical node; and determine an alternative candidate preferred physical node based on each said degree value.

Assignees

Inventors

Classifications

  • Distributed file systems · CPC title

  • for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS] · CPC title

  • G06F16/254Primary

    Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses · CPC title

  • Query processing · CPC title

  • Provision of network file services by network file servers, e.g. by using NFS, CIFS (network file access protocols H04L67/1097) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9607062B1 cover?
A computer-implemented method includes identifying a logical node. The logical node is associated with one or more source stages. The computer-implemented method further includes identifying one or more file block components. The one or more file block components include a retrieval target for at least one of the one or more source stages. The computer-implemented method further includes identi…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/254. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 28 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).