Preserving temporal relevance of content within a corpus

US10877730B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10877730-B2
Application numberUS-201615271540-A
CountryUS
Kind codeB2
Filing dateSep 21, 2016
Priority dateSep 21, 2016
Publication dateDec 29, 2020
Grant dateDec 29, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method, system and computer-usable medium are disclosed for preserving temporal relevance of content within a corpus. A corpus is processed to generate temporally-relevant metadata and mined content, which in turn are processed to generate first temporal relevancy metrics. The cache history of a web browser is likewise processed to generate second temporal relevancy metrics, which in turn is processed with the first temporal relevancy metrics to generate first temporal relevancy scores. New documents are ingested into the corpus and existing documents are revised. Temporally-relevant metadata and mined content associated with the updated corpus are then processed to generate third temporal relevancy metrics. The second and third temporal relevancy metrics are then processed to generate second temporal relevancy scores, which is then used to provide a temporally-relevant response to a query.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for preserving temporal relevance of content within a corpus, comprising: processing a corpus of documents to generate temporally-relevant metadata and mined content associated with documents in the corpus of documents; identifying explicit temporal features and implicit temporal features from the mined content; processing the temporally-relevant metadata and the mined content to generate first temporal relevancy metrics, wherein the processing comprises processing the explicit temporal features and the implicit temporal features; processing a cache history of a web browser to generate second temporal relevancy metrics, wherein the processing comprises analyzing content of the cache history of the web browser to provide a reference history for the documents in terms of reference frequency and likelihood of being referenced and generating the second temporal relevancy metrics based on the reference frequency and the likelihood of being referenced in the reference history; processing the first and the second temporal relevancy metrics to generate first temporal relevancy ranking scores for the documents in the corpus of documents, wherein the first temporal relevancy ranking scores provides a measure of temporal relevance for each of the documents in the corpus of documents updating the corpus of documents by ingesting new documents into the corpus of documents and revising existing documents in the corpus of documents; processing temporally-relevant metadata and mined content associated with the new documents and revisions to the existing documents to generate third temporal relevancy metrics; and processing the second and the third temporal relevancy metrics to generate second temporal relevancy ranking scores for the documents in the corpus of documents, wherein the processing comprises generating temporal relevancy ranking scores for the ingested new documents and revising the first temporal relevancy ranking scores associated with the existing documents to reflect their respective temporal relevance. 2. The method of claim 1 , further comprising: using the second temporal relevancy scores to provide a temporally-relevant response to a query, the response having a temporal feature corresponding to a temporal feature in the query. 3. The method of claim 1 , wherein the mined content associated with the documents in the corpus of documents comprises: user context and preferences; and content management processes. 4. The method of claim 1 , wherein: the processing of the second and the third temporal relevancy metrics comprises applying differential weighting to individual terms within the corpus of documents according to temporal features associated with the individual terms. 5. A system comprising: a processor; a data bus coupled to the processor; and a computer-usable medium embodying computer program code, the computer-usable medium being coupled to the data bus, the computer program code used for preserving temporal relevance of content within a corpus and comprising instructions executable by the processor and configured for: processing a corpus of documents to generate temporally-relevant metadata and mined content associated with documents in the corpus of documents; identifying explicit temporal features and implicit temporal features from the mined content; processing the temporally-relevant metadata and the mined content to generate first temporal relevancy metrics, wherein the processing comprises processing the explicit temporal features and the implicit temporal features; processing a cache history of a web browser to generate second temporal relevancy metrics, wherein the processing comprises analyzing content of the cache history of the web browser to provide a reference history for the documents in terms of reference frequency and likelihood of being referenced and generating the second temporal relevancy metrics based on the reference frequency and the likelihood of being referenced in the reference history; processing the first and the second temporal relevancy metrics to generate first temporal relevancy ranking scores for the documents in the corpus of documents, wherein the first temporal relevancy ranking scores provide a measure of temporal relevance for each of the documents in the corpus of documents updating the corpus of documents by ingesting new documents into the corpus of documents and revising existing documents in the corpus of documents; processing temporally-relevant metadata and mined content associated with the new documents and revisions to the existing documents to generate third temporal relevancy metrics; and processing the second and the third temporal relevancy metrics to generate second temporal relevancy ranking scores for the documents in the corpus of documents, wherein the processing comprises generating temporal relevancy ranking scores for the ingested new documents and revising the first temporal relevancy ranking scores associated with the existing documents to reflect their respective temporal relevance. 6. The system of claim 5 , further comprising: using the second temporal relevancy scores to provide a temporally-relevant response to a query, the response having a temporal feature corresponding to a temporal feature in the query. 7. The system of claim 5 , wherein the mined content associated with the documents in the corpus of documents comprises: user context and preferences; and content management processes. 8. The system of claim 5 , wherein: the processing of the second and the third temporal relevancy metrics comprises applying differential weighting to individual terms within the corpus of documents according to temporal features associated with the individual terms. 9. A non-transitory, computer-readable storage medium embodying computer program code, the computer program code comprising computer executable instructions configured for: processing a corpus of documents to generate temporally-relevant metadata and mined content associated with documents in the corpus of documents; identifying explicit temporal features and implicit temporal features from the mined content; processing the temporally-relevant metadata and the mined content to generate first temporal relevancy metrics wherein the processing comprises processing the explicit temporal features and the implicit temporal features; processing a cache history of a web browser to generate second temporal relevancy metrics, wherein the processing comprises analyzing content of the cache history of the web browser to provide a reference history for the documents, in terms of reference frequency and likelihood of being referenced and generating the second temporal relevancy metrics based on the reference frequency and the likelihood of being referenced in the reference history; processing the first and the second temporal relevancy metrics to generate first temporal relevancy ranking scores for the documents in the corpus of documents, wherein the first temporal relevancy ranking scores provide a measure of temporal relevance for each of the documents in the corpus of documents updating the corpus of documents by ingesting new documents into the corpus of documents and revising existing documents in the corpus of documents; processing temporally-relevant metadata and mined content associated with the new documents and revisions to the existing documents to generate third temporal relevancy metrics; and processing the second and the third temporal relevancy metrics to generate second temporal relevancy ranking scores for the documents in the corpus of documents, wherein the processing comprises generating temporal relevancy ranki

Assignees

Inventors

Classifications

  • Search customisation based on user profiles and personalisation · CPC title

  • using time information · CPC title

  • Data mining · CPC title

  • G06F7/24Primary

    Sorting, i.e. extracting data from one or more carriers, rearranging the data in numerical or other ordered sequence, and rerecording the sorted data on the original carrier or on a different carrier or set of carriers {sorting methods in general}(G06F7/36 takes precedence) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10877730B2 cover?
A method, system and computer-usable medium are disclosed for preserving temporal relevance of content within a corpus. A corpus is processed to generate temporally-relevant metadata and mined content, which in turn are processed to generate first temporal relevancy metrics. The cache history of a web browser is likewise processed to generate second temporal relevancy metrics, which in turn is …
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/9535. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 29 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).