Systems, devices, and methods for content indexing and querying in a search system

US2020372011A1 · US · A1

Patent metadata
Field: Value
Publication number: US-2020372011-A1
Application number: US-202016991452-A
Country: US
Kind code: A1
Filing date: Aug 12, 2020
Priority date: Oct 19, 2017
Publication date: Nov 26, 2020
Grant date: not listed

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for indexing content in a personal search system are disclosed. In one embodiment, the method comprises receiving a content item from a data source; extracting, from the content item, at least one document appearing in the content item, the at least one document including a first timestamp; storing the document in a content index; retrieving an event associated with the document from an event index, the event including a second timestamp and one or more attributes; generating an updated document based on the one or more attributes, the updated document including the second timestamp and the one or more attributes; and replacing the document stored in the content index with the updated document if the second timestamp is later than the first timestamp.
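
The update rule in the abstract can be illustrated with a minimal Python sketch. Everything named here (`Document`, `Event`, `index_document`, the dictionary-backed indexes, integer timestamps) is a hypothetical stand-in and not something the patent specifies. Merging the event attributes into the document's existing attributes is also an assumption; the abstract only says the updated document includes the second timestamp and the event attributes.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class Document:
    doc_id: str
    timestamp: int  # the "first timestamp" carried by the extracted document
    attributes: Dict[str, str] = field(default_factory=dict)


@dataclass
class Event:
    doc_id: str
    timestamp: int  # the "second timestamp" carried by the event
    attributes: Dict[str, str] = field(default_factory=dict)


def index_document(
    content_index: Dict[str, Document],
    event_index: Dict[str, Event],
    doc: Document,
) -> Document:
    """Store doc, then replace it only if a newer event exists for it."""
    # Store the extracted document in the content index.
    content_index[doc.doc_id] = doc

    # Retrieve the event associated with the document, if any.
    event = event_index.get(doc.doc_id)
    if event is None:
        return doc

    # Generate the updated document from the event's timestamp and attributes.
    # Assumption: existing attributes are kept and event attributes win on conflict.
    updated = Document(
        doc_id=doc.doc_id,
        timestamp=event.timestamp,
        attributes={**doc.attributes, **event.attributes},
    )

    # Replace the stored document only if the event is later than the document.
    if event.timestamp > doc.timestamp:
        content_index[doc.doc_id] = updated
    return content_index[doc.doc_id]
```

With this rule, an event older than the stored document leaves the document untouched, so replaying stale events cannot roll an indexed document back.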

First claim

Opening claim text (preview).

1-20. (canceled)
21. A method comprising: initiating indexing of historic content and real-time content at a first time point, the historic content comprising content items stored prior to the first time point and the real-time content comprising content items received after the first time point; executing backlog jobs in parallel with real-time processing jobs, the backlog jobs indexing the historic content in a content index and the real-time processing jobs indexing the real-time content in the content index; receiving a query from a client device, the query received during execution of the backlog and real-time processing jobs; returning a partially responsive result to the client device, the partially responsive result based on a current state of the backlog jobs.
22. The method of claim 21, wherein executing the backlog jobs comprises processing batches of the historic content beginning at an initial time point and terminating at the first time point, the initial time point occurring before the first time point.
23. The method of claim 21, wherein executing the backlog jobs comprises processing batches of the historic content beginning at the first time point and terminating at an initial time point, the initial time point occurring before the first time point.
24. The method of claim 21, wherein indexing the real-time content comprises performing an atomic update of an index for the real-time content.
25. The method of claim 21, further comprising: receiving a second query from the client device after the backlog jobs have completed; and returning a full result to the client device, the full result comprising all responsive items in the historic content and real-time content.
26. The method of claim 21, wherein returning the partially responsive result comprises returning all responsive results from the real-time content and at least one responsive result from the historic content.
27. The method of claim 21, further comprising updating a historic content item in the content index based on the real-time processing jobs.
28. A non-transitory computer-readable storage medium for tangibly storing computer program instructions capable of being executed by a computer processor, the computer program instructions defining the steps of: initiating indexing of historic content and real-time content at a first time point, the historic content comprising content items stored prior to the first time point and the real-time content comprising content items received after the first time point; executing backlog jobs in parallel with real-time processing jobs, the backlog jobs indexing the historic content in a content index and the real-time processing jobs indexing the real-time content in the content index; receiving a query from a client device, the query received during execution of the backlog and real-time processing jobs; returning a partially responsive result to the client device, the partially responsive result based on a current state of the backlog jobs.
29. The computer-readable storage of claim 28, wherein executing the backlog jobs comprises processing batches of the historic content beginning at an initial time point and terminating at the first time point, the initial time point occurring before the first time point.
30. The computer-readable storage of claim 28, wherein executing the backlog jobs comprises processing batches of the historic content beginning at the first time point and terminating at an initial time point, the initial time point occurring before the first time point.
31. The computer-readable storage of claim 28, wherein indexing the real-time content comprises performing an atomic update of an index for the real-time content.
32. The computer-readable storage of claim 28, further comprising: receiving a second query from the client device after the backlog jobs have completed; and returning a full result to the client device, the full result comprising all responsive items in the historic content and real-time content.
33. The computer-readable storage of claim 28, wherein returning the partially responsive result comprises returning all responsive results from the real-time content and at least one responsive result from the historic content.
34. The computer-readable storage of claim 28, further comprising updating a historic content item in the content index based on the real-time processing jobs.
35. A device comprising: a processor; and a storage medium for tangibly storing thereon program logic for execution by the processor, the stored program logic comprising: logic, executed by the processor, for initiating indexing of historic content and real-time content at a first time point, the historic content comprising content items stored prior to the first time point and the real-time content comprising content items received after the first time point; logic, executed by the processor, for executing backlog jobs in parallel with real-time processing jobs, the backlog jobs indexing the historic content in a content index and the real-time processing jobs indexing the real-time content in the content index; logic, executed by the processor, for receiving a query from a client device, the query received during execution of the backlog and real-time processing jobs; logic, executed by the processor, for returning a partially responsive result to the client device, the partially responsive result based on a current state of the backlog jobs.
36. The device of claim 35, wherein executing the backlog jobs comprises processing batches of the historic content beginning at an initial time point and terminating at the first time point, the initial time point occurring before the first time point.
37. The device of claim 35, wherein executing the backlog jobs comprises processing batches of the historic content beginning at the first time point and terminating at an initial time point, the initial time point occurring before the first time point.
38. The device of claim 35, wherein indexing the real-time content comprises performing an atomic update of an index for the real-time content.
39. The device of claim 35, the stored program logic further comprising: logic, executed by the processor, for receiving a second query from the client device after the backlog jobs have completed; and logic, executed by the processor, for returning a full result to the client device, the full result comprising all responsive items in the historic content and real-time content.
40. The device of claim 35, wherein returning the partially responsive result comprises returning all responsive results from the real-time content and at least one responsive result from the historic content.
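
As a rough illustration of claims 21 to 26, the Python sketch below indexes historic items in batches while a real-time job writes to the same index, and answers a query from whatever has been indexed so far. All names (`ContentIndex`, `backlog_batches`, `realtime_job`, `demo`), the tuple item shape, the substring search, and the sample data are hypothetical and not taken from the patent. The `newest_first` flag is one possible reading of the two batch orders in claims 22 and 23. The demo steps the first backlog batch by hand so that the intermediate state is deterministic; a production system would presumably run both jobs as independent workers.

```python
import threading
from typing import Dict, Iterable, Iterator, List, Tuple

Item = Tuple[str, int, str]  # (item id, timestamp, text); a hypothetical shape


class ContentIndex:
    """Lock-protected in-memory index shared by backlog and real-time jobs."""

    def __init__(self) -> None:
        self._lock = threading.Lock()
        self._items: Dict[str, Tuple[int, str]] = {}

    def put(self, item: Item) -> None:
        item_id, timestamp, text = item
        with self._lock:
            self._items[item_id] = (timestamp, text)

    def search(self, term: str) -> List[str]:
        # Returns ids of every item indexed so far whose text contains term.
        with self._lock:
            return sorted(i for i, (_, text) in self._items.items() if term in text)


def backlog_batches(
    index: ContentIndex, historic: List[Item], batch_size: int, newest_first: bool
) -> Iterator[int]:
    """Index historic items batch by batch, yielding the count indexed so far."""
    ordered = sorted(historic, key=lambda item: item[1], reverse=newest_first)
    for start in range(0, len(ordered), batch_size):
        batch = ordered[start:start + batch_size]
        for item in batch:
            index.put(item)
        yield start + len(batch)


def realtime_job(index: ContentIndex, incoming: Iterable[Item]) -> None:
    for item in incoming:
        index.put(item)


def drain(batches: Iterator[int]) -> None:
    for _ in batches:
        pass


def demo() -> Tuple[List[str], List[str]]:
    first_time_point = 100
    items = [
        ("h1", 10, "lunch invite"),
        ("h2", 50, "lunch menu"),
        ("h3", 90, "budget"),
        ("r1", 110, "lunch moved"),
    ]
    historic = [it for it in items if it[1] < first_time_point]
    incoming = [it for it in items if it[1] >= first_time_point]

    index = ContentIndex()
    backlog = backlog_batches(index, historic, batch_size=2, newest_first=True)

    realtime = threading.Thread(target=realtime_job, args=(index, incoming))
    realtime.start()
    next(backlog)  # first backlog batch (h3, h2) indexed; h1 still pending
    realtime.join()

    # Query arrives while the backlog is unfinished: partially responsive result.
    partial = index.search("lunch")

    worker = threading.Thread(target=drain, args=(backlog,))
    worker.start()
    worker.join()

    # Query after the backlog completes: full result.
    full = index.search("lunch")
    return partial, full
```

In this sketch the partial result contains the real-time match plus the historic matches indexed so far, and the later query also picks up the oldest historic item once the backlog has finished. Processing newest batches first means recent history becomes searchable earliest, which is one plausible motivation for the ordering in claim 23.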

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2020372011A1 cover?
Systems and methods for indexing content in a personal search system are disclosed. In one embodiment, the method comprises receiving a content item from a data source; extracting, from the content item, at least one document appearing in the content item, the at least one document including a first timestamp; storing the document in a content index; retrieving an event associated with the docu…
Who is the assignee on this patent?
Oath Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/2365. Mapped technology areas include Physics.
When was this patent published?
Publication date Nov 26, 2020 (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).