What technology area does this patent fall under?

Primary CPC classification G06F16/3349. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Aug 10 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Archiving of topmost ranked answers of a cognitive search

US11086914B2 · US · B2

Patent metadata
Field	Value
Publication number	US-11086914-B2
Application number	US-201816154232-A
Country	US
Kind code	B2
Filing date	Oct 8, 2018
Priority date	Oct 8, 2018
Publication date	Aug 10, 2021
Grant date	Aug 10, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for archiving of documents of a query against a cognitive system can be provided. The cognitive system comprises at least a cognitive engine, several stored documents, and a learned model. The method comprises determining a plurality of evidence fragments, a related first list of documents and related metadata. The method also comprises removing a document from the stored documents, redetermining as second result a second list of documents, comparing the first and second list of documents, and upon determining identical documents in the compared first and second list of documents up to a confidence cliff, removing another document. Furthermore, the method comprises repeating the steps of removing, redetermining, and comparing until the first list of documents and the second list of documents differ above the confidence cliff and storing metadata of the documents of the first list, the plurality of evidence fragments, and the first query.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for archiving of topmost ranked documents, said method comprising: receiving a first query into a cognitive system, wherein said cognitive system comprises a cognitive engine, a plurality of stored documents, and a related learned model; determining a first result of said first query against said cognitive system based on said related learned model, wherein said first result comprises a plurality of evidence fragments and each evidence fragment is correlated to a document of the plurality of stored documents; ranking said plurality of evidence fragments; determining for said first query a first list of documents of said plurality of stored documents; determining metadata of each document in said first list of documents; removing a first number of documents from said plurality of stored documents, wherein said first number of documents are elements of said first list of documents, and wherein said first number of documents do not relate to topmost ranked evidence fragments; redetermining said first result comprising determining a second list of documents of said plurality of stored documents without said first number of documents; determining a confidence cliff; comparing said first list of documents with said second list of documents; determining said first list of documents and said second list of documents contain identical documents up to said confidence cliff; in response to determining identical documents in said first list of documents and said second list of documents up to said confidence cliff, removing a second number of documents from said plurality of stored documents, wherein said second number of documents is an element of said first list of documents and said second list of documents, and wherein said second number of documents does not relate to said topmost ranked evidence fragments; and storing said metadata of respective documents of said first list of documents, said plurality of evidence fragments, and said first query. 2. The method according to claim 1 , further comprising: while said first list of documents and said second list of documents are equal up to said confidence cliff and respective documents in each list remain in said plurality of stored documents, repeating said step of removing a subsequent document, said step of redetermining a subsequent result, and said step of comparing said first list of documents and said second list of documents. 3. The method according to claim 1 , wherein said related learned model is agnostic to a missing document in said plurality of stored documents. 4. The method according to claim 1 , wherein said metadata comprises at least one piece of information selected from the group consisting of: a document name, a document author, a document source, a document publishing date, document bibliographic data, an International Standard Book Number, and a web-link. 5. The method according to claim 1 , further comprising storing information about said related learned model and version information of said cognitive engine used in said first query. 6. The method according to claim 1 , wherein said storing of said plurality of evidence fragments further comprises storing only said topmost evidence fragments of said plurality of evidence fragments. 7. The method according to claim 1 , wherein said confidence cliff is a predefined number of documents. 8. The method according to claim 1 , wherein said confidence cliff relates to a document in said first list of documents with a predefined confidence level. 9. The method according to claim 1 , wherein said confidence cliff is determined by: determining a confidence level polynomial using absolute values of confidence levels of documents of said first list of documents; and setting said confidence cliff to said document after a first local maximum of said confidence level polynomial. 10. The method according to claim 1 , wherein a list of answers relating to said first query is determined by said cognitive engine together with a scoring of said list of answers. 11. A system for archiving of topmost ranked documents comprising: a processor; and a computer-readable storage medium communicatively coupled to the processor and storing program instructions which, when executed by the processor, are configured to cause the processor to perform a method comprising: receiving a first query into a cognitive system, wherein said cognitive system comprises a cognitive engine, a plurality of stored documents, and a related learned model; determining a first result of said first query against said cognitive system based on said related learned model, wherein said first result comprises a plurality of evidence fragments and each evidence fragment is correlated to a document of the plurality of stored documents; ranking said plurality of evidence fragments; determining for said first query a first list of documents of said plurality of stored documents; determining metadata of each document in said first list of documents; removing a first number of documents from said plurality of stored documents, wherein said first number of documents are elements of said first list of documents, and wherein said first number of documents do not relate to topmost ranked evidence fragments; redetermining said first result comprising determining a second list of documents of said plurality of stored documents without said first number of documents; determining a confidence cliff; comparing said first list of documents with said second list of documents; determining said first list of documents and said second list of documents contain identical documents up to said confidence cliff; in response to determining identical documents in said first list of documents and said second list of documents up to said confidence cliff, removing a second number of documents from said plurality of stored documents, wherein said second number of documents is an element of said first list of documents and said second list of documents, and wherein said second number of documents does not relate to said topmost ranked evidence fragments; and storing said metadata of respective documents of said first list of documents, said plurality of evidence fragments, and said first query. 12. The system according to claim 11 , wherein said related learned model is agnostic to a missing document in said plurality of stored documents. 13. The system according to claim 11 , wherein said metadata comprises at least one piece of information selected from the group consisting of: a document name, a document author, a document source, a document publishing date, document bibliographic data, an International Standard Book Number, and a web-link. 14. The system according to claim 11 , the program instructions are further configured to cause the processor to perform a method further comprising storing information about said related learned model and version information of said cognitive engine used in said first query. 15. The system according to claim 11 , wherein the program instructions are further configured to cause the processor to perform a method further comprising storing only said topmost evidence fragments of said plurality of evidence fragments. 16. The system according to claim 11 , wherein said confidence cliff is a predefined number of documents. 17. The system according to claim 11 , wherein said confidence cliff relates to a document in said first list of documents with a predefined confidence level. 18. The system according to c

Assignees

Inventors

Classifications

G06F16/38
Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually · CPC title
G06F16/3349Primary
Reuse of stored results of previous queries · CPC title
G06F16/113
Details of archiving (lifecycle management in storage systems G06F3/0649; point-in-time backing up or restoration of persistent data G06F11/1446) · CPC title
G06F16/93
Document management systems · CPC title
G06F16/9535Primary
Search customisation based on user profiles and personalisation · CPC title

Patent family

Related publications grouped by family.

View patent family 70052255

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11086914B2 cover?: A method for archiving of documents of a query against a cognitive system can be provided. The cognitive system comprises at least a cognitive engine, several stored documents, and a learned model. The method comprises determining a plurality of evidence fragments, a related first list of documents and related metadata. The method also comprises removing a document from the stored documents, re…
Who is the assignee on this patent?: IBM
What technology area does this patent fall under?: Primary CPC classification G06F16/3349. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Aug 10 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).