Ranking data analytics results using composite validation

US10210461B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10210461-B2
Application numberUS-201414222143-A
CountryUS
Kind codeB2
Filing dateMar 21, 2014
Priority dateMar 21, 2014
Publication dateFeb 19, 2019
Grant dateFeb 19, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for performing assisted knowledge discovery includes receiving a dataset. Each of a plurality of analytical techniques is applied to the received data set to generate a plurality of corresponding analytical results. A composite validation metric is applied to each of the plurality of analytical results. The composite validation metric is a single scoring/ranking function that is created from a plurality of different scoring/ranking functions. The plurality of analytical results is presented to a user arranged in accordance with the results of the applying the composite validation metric to each of the plurality of analytical results. A selection from the user from among the plurality of analytical results is recorded. The user's selection is used to modify the composite validation metric such that the analytical techniques responsible for generating the selected analytical result is scored/ranked more highly.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for performing assisted knowledge discovery, comprising: receiving a dataset; applying each of a plurality of different analytical techniques to the received data set to generate a plurality of corresponding analytical results; receiving a composite validation metric that is a single scoring or ranking function that is created from a plurality of different scoring or ranking functions, each of which is assigned a weighing that determines its relative influence on the composite validation metric; modifying the received composite validation metric by removing therefrom each of the plurality of different scoring or ranking functions that is assigned a weighing that is less than a predetermined value; applying the modified composite valuation metric to each of the plurality of analytical results; displaying the plurality of analytical results to a user arranged in accordance with the results of the applying the modified composite validation metric to each of the plurality of analytical results; recording a selection from the user from among the plurality of analytical results; and using the user's selection to refine the composite validation metric by changing one or more of the assigned weighing of the plurality of different scoring or ranking functions such that the analytical techniques responsible for generating the selected analytical result is scored or ranked more highly. 2. The method of claim 1 , wherein the plurality of analytical techniques is a plurality of different clustering techniques and the plurality of corresponding analytical results is a plurality of different clusterings of the same received dataset. 3. The method of claim 1 , wherein the plurality of analytical techniques includes frequent pattern mining techniques, anomaly detection techniques, or factor analysis techniques. 4. The method of claim 1 , wherein the composite validation metric includes elements from each of the plurality of different scoring or ranking functions along with a set of parameters that defines a relative weighing of each element within the composite validation metric. 5. The method of claim 1 , wherein the presenting of the results includes listing the results according to rank order as determined by the composite validation metric. 6. The method of claim 1 , wherein the presenting of the results includes listing the results alongside a score determined by the composite validation metric. 7. The method of claim 1 , wherein using the user's selection to modify the composite validation metric includes employing one or more learning algorithms. 8. The method of claim 1 , additionally comprising: receiving a second data set; applying each of the plurality of analytical techniques to the received second data set to generate a second plurality of corresponding analytical results; applying the refined composite validation metric to each of the second plurality of analytical results; and displaying the second plurality of analytical results to the user arranged in accordance with the results of the applying the refined composite validation metric to each of the second plurality of analytical results. 9. The method of claim 1 , wherein displaying the plurality of analytical results to the user includes displaying a subset of highest scoring or ranking results. 10. A method for performing assisted knowledge discovery, comprising: receiving a dataset; applying each of a plurality of different clustering techniques to the received data set to generate a plurality of corresponding clustering results; receiving a composite validation metric that is a single scoring or ranking function that is created by combining a plurality of different scoring or ranking functions, each of which is assigned a weighing that determines its relative influence on the composite validation metric; modifying the received composite validation metric by removing therefrom each of the plurality of different scoring or ranking functions that is assigned a weighing that is less than a predetermined value; applying the modified composite valuation metric to each of the plurality of analytical results to place the results in an order of importance; presenting the plurality of clustering results to a user arranged in the order determined by applying the modified composite validation metric; receiving a selection from the user from among the plurality of clustering results; and using the user's selection to refine the composition of the composite validation metric by changing one or more of the assigned weighing of the plurality of different scoring or ranking functions. 11. The method of claim 10 , wherein using the user's selection to refine the composition of the composite validation metric includes employing one or more learning algorithms to adapt the composite validation metric such that the clustering techniques responsible for generating the selected analytical result is scored or ranked more highly.

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10210461B2 cover?
A method for performing assisted knowledge discovery includes receiving a dataset. Each of a plurality of analytical techniques is applied to the received data set to generate a plurality of corresponding analytical results. A composite validation metric is applied to each of the plurality of analytical results. The composite validation metric is a single scoring/ranking function that is create…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06N99/005. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 19 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).