Systems and methods for extracting information from a text string generated in a distributed computing operation

US11250040B2 · US · B2

Patent metadata
Field: Value
Publication number: US-11250040-B2
Application number: US-201715788773-A
Country: US
Kind code: B2
Filing date: Oct 19, 2017
Priority date: Oct 19, 2017
Publication date: Feb 15, 2022
Grant date: Feb 15, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods are disclosed that provide for extracting information from a text string. In certain embodiments, a system is disclosed to receive a textual string representative of a distributed computing operation. The system is configured to isolate an identifier included in the textual string based on information in the textual string and/or other information associated with the distributed computing operation. The system is also configured to provide the identifier to a computer device over a network.
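
To make the abstract concrete, here is a minimal Python sketch of isolating identifiers from such a string. Everything specific in it is an assumption made for illustration and is not taken from the patent: the `|`-separated record layout, the `node=` key, the sample string, and the helper names `normalize_identifier` and `isolate_identifiers`. It only illustrates the general idea of discarding extraneous characters and normalizing identifiers to one format, both of which the claims mention.

```python
import re
from typing import List

# Hypothetical layout: each node along the path appends one "key=value;..."
# record, and records are separated by "|". The patent does not specify this.
SAMPLE = "node= pos-0042 ;op=auth|node=GW_7;op=route|node=*bank.core.01*;op=settle"


def normalize_identifier(raw: str) -> str:
    """Drop extraneous characters and join the alphanumeric runs with '-'.

    The upper-case, dash-separated target format is an assumption.
    """
    parts = re.findall(r"[A-Za-z0-9]+", raw)
    return "-".join(part.upper() for part in parts)


def isolate_identifiers(text: str) -> List[str]:
    """Return one normalized identifier per record, in path order."""
    identifiers = []
    for record in text.split("|"):
        match = re.search(r"node=([^;]*)", record)
        if match:
            identifiers.append(normalize_identifier(match.group(1)))
    return identifiers


if __name__ == "__main__":
    ids = isolate_identifiers(SAMPLE)
    print(ids)        # ['POS-0042', 'GW-7', 'BANK-CORE-01']
    print(ids[1:-1])  # nodes between the first and last: ['GW-7']
```

In this sketch the nodes between the first and last record are treated as intermediaries; how the patented system actually identifies intermediary nodes is not described on this page.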

First claim

Opening claim text (preview).

What is claimed is:

1. A system for extracting information in a distributed computing operation, comprising: one or more memory devices storing instructions; and one or more hardware processors configured to execute the instructions to perform operations comprising: receiving a request to extract information from a textual string, the textual string comprising operation data generated by a plurality of computing device nodes along a path representing a distributed computing operation, wherein at least one of the plurality of computing device nodes receives information from a previous computing device node along the path, performs a component operation of the distributed computing operation, appends an item of operation data to the textual string based on a result of the performed component operation, and provides the textual string to a subsequent computing device node along the path; searching for an identifier included in the textual string, the identifier being associated with one of the computing device nodes, the searching comprising determining whether the at least one computing device node utilizes a dynamic description of the performed component operation and resolving a formatting inconsistency by normalizing a plurality of computing device identifiers to a standard format; identifying, based on the identifiers, at least one intermediary computing device node of the plurality of computing device nodes; determining a first attribute from the textual string, the first attribute being associated with the identifier and being representative of a physical location where the distributed computing operation took place; comparing the first attribute and the identifier to a database of candidate information to find a plurality of matching candidates in the database, the matching candidates being determined by generating first confidence scores for the matching candidates based on weights assigned to characteristics present in the textual string; and providing, in response to the request, the identifier, the first attribute, information indicating the at least one intermediary computing device node, and information indicating the matching candidates and the first confidence scores.

2. The system of claim 1, wherein searching for the identifier includes discarding extraneous characters.

3. The system of claim 1, wherein the one or more hardware processors are configured to perform the operations further comprising: determining a second attribute from the textual string, the second attribute suspected to be associated with the identifier; comparing the second attribute and the identifier to the database of candidate information to find a match based on a second confidence score as to whether the candidate information is associated with the identifier; determining that the second confidence score indicates that the association is inconclusive; and based on the determination that the association is inconclusive, discarding the second attribute from the textual string.

4. A method for extracting information from a textual string, comprising the following operations performed by one or more hardware processors: receiving a request to extract information from the textual string, the textual string comprising operation data generated by a plurality of computing device nodes along a path representing a distributed computing operation, wherein at least one of the plurality of computing device nodes receives information from a previous computing device node along the path, performs a component operation of the distributed computing operation, appends an item of operation data to the textual string based on a result of the performed component operation, and provides the textual string to a subsequent computing device node along the path; searching for an identifier included in the textual string, the identifier being associated with one of the computing device nodes, the searching comprising determining whether the at least one computing device node utilizes a dynamic description of the performed component operation and resolving a formatting inconsistency by normalizing a plurality of computing device identifiers to a standard format; identifying, based on the identifiers, at least one intermediary computing device node of the plurality of computing device nodes; determining a first attribute from the textual string, the first attribute being associated with the identifier and being representative of a physical location where the distributed computing operation took place; comparing the first attribute and the identifier to a database of candidate information to find a plurality of matching candidates in the database, the matching candidates being determined by generating first confidence scores for the matching candidates based on weights assigned to characteristics present in the textual string; and providing, in response to the request, the identifier, the first attribute, information indicating the at least one intermediary computing device node, and information indicating the matching candidates and the first confidence scores.

5. The method of claim 4, wherein searching for the identifier includes discarding extraneous characters.

6. The method of claim 4, wherein the one or more hardware processors are configured to perform the operations further comprising: determining a second attribute from the textual string, the second attribute suspected to be associated with the identifier; comparing the second attribute and the identifier to the database of candidate information to find a match based on a second confidence score as to whether the candidate information is associated with the distributed computing operation; determining that the second confidence score indicates that the association is inconclusive; and based on the determination that the association is inconclusive, discarding the second attribute from the textual string.

7. A non-transitory computer readable medium for extracting information from a textual string, the non-transitory computer readable medium comprising instructions, which when executed by a processor, cause a computer system to perform operations comprising: receiving a request to extract information from the textual string, the textual string comprising operation data generated by a plurality of computing device nodes along a path representing a distributed computing operation, wherein at least one of the plurality of computing device nodes receives information from a previous computing device node along the path, performs a component operation of the distributed computing operation, appends an item of operation data to the textual string based on a result of the performed component operation, and provides the textual string to a subsequent computing device node along the path; searching for an identifier included in the textual string, the identifier being associated with one of the computing device nodes, the searching comprising determining whether the at least one computing device node utilizes a dynamic description of the performed component operation and resolving a formatting inconsistency by normalizing a plurality of computing device identifiers to a standard format; identifying, based on the identifiers, at least one intermediary computing device node of the plurality of computing device nodes; determining a first attribute from the textual string, the first attribute being associated with the identifier and being representative of a physical location where the distributed computing operation took place; comparing the first attribute and the identifier to a database of candidate information to find a plurality of matching candidates in the database, the matching candidates being determined by generating a first confidence s…
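
The comparison step in the independent claims scores candidates using weights assigned to characteristics present in the string, and claims 3 and 6 discard a second attribute when its score is inconclusive. The following is a minimal Python sketch of that idea under stated assumptions: the characteristic names, the integer weights, the threshold of 70, the `Candidate` type, and the in-memory `CANDIDATES` list standing in for the database are all hypothetical, since the claim text shown here does not specify them.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Hypothetical weights (out of 100) per characteristic, and a hypothetical
# threshold below which an association is treated as inconclusive.
WEIGHTS = {"identifier": 60, "location": 40}
CONCLUSIVE_AT = 70


@dataclass(frozen=True)
class Candidate:
    name: str
    identifier: str
    location: str


# Stand-in for the "database of candidate information".
CANDIDATES = [
    Candidate("Store A", "POS-0042", "Richmond VA"),
    Candidate("Store B", "POS-0042", "Austin TX"),
    Candidate("Store C", "POS-9999", "Richmond VA"),
]


def confidence(candidate: Candidate, identifier: str, location: str) -> int:
    """Sum the weights of the characteristics that match this candidate."""
    score = 0
    if candidate.identifier == identifier:
        score += WEIGHTS["identifier"]
    if candidate.location.lower() == location.lower():
        score += WEIGHTS["location"]
    return score


def score_candidates(
    candidates: List[Candidate], identifier: str, location: str
) -> List[Tuple[Candidate, int]]:
    """Return (candidate, score) pairs with a nonzero score, best first."""
    scored = [(c, confidence(c, identifier, location)) for c in candidates]
    matches = [pair for pair in scored if pair[1] > 0]
    return sorted(matches, key=lambda pair: pair[1], reverse=True)


def attribute_is_conclusive(
    candidates: List[Candidate], identifier: str, attribute: str
) -> bool:
    """True if the best score for this attribute reaches the threshold."""
    ranked = score_candidates(candidates, identifier, attribute)
    return bool(ranked) and ranked[0][1] >= CONCLUSIVE_AT
```

With these sample values, `score_candidates(CANDIDATES, "POS-0042", "richmond va")` ranks Store A (100), Store B (60), and Store C (40), while `attribute_is_conclusive(CANDIDATES, "POS-0042", "Miami FL")` is false because the best score is only 60; a caller following claims 3 and 6 would then discard that attribute.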

Assignees

Capital One Services, LLC

Inventors

Classifications

  • Query translation · CPC title

  • G06F16/334 · Primary

    Query execution (filtering based on additional data G06F16/335) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11250040B2 cover?
Systems and methods are disclosed that provide for extracting information from a text string. In certain embodiments, a system is disclosed to receive a textual string representative of a distributed computing operation. The system is configured to isolate an identifier included in the textual string based on information in the textual string and/or other information associated with the distrib…
Who is the assignee on this patent?
Capital One Services, LLC
What technology area does this patent fall under?
Primary CPC classification G06F16/3332. Mapped technology areas include Physics.
When was this patent published?
Publication date Feb 15, 2022 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).