Locating relevant content items across multiple disparate content sources

US9817898B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9817898-B2
Application numberUS-201113295108-A
CountryUS
Kind codeB2
Filing dateNov 14, 2011
Priority dateNov 14, 2011
Publication dateNov 14, 2017
Grant dateNov 14, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Technologies are described herein for locating relevant content items across multiple disparate content sources. Query parameters are received from a user interface for defining a query for searching a number of content sources located on multiple, disparate content servers. A native search is executed on each of the content servers based on the received query parameters, and query statistics and other data regarding content items in the content sources matching the query parameters are received. The query statistics are aggregated across the content servers and presented in the user interface. The presentation of the query statistics may be broken out by each content source, by each query phrase segmented from the query, and the like. In addition, a preview of a number of content items matching the query parameters is presented based on the data received.

First claim

Opening claim text (preview).

What is claimed is: 1. A system for locating content items in a plurality of content sources across different content servers, the system comprising: one or more processors; a memory coupled to the one or more processors; and an e-discovery client application residing in the memory and comprising computer-executable instructions that, when executed by the one or more processors, cause the system to: present a user interface for defining a query for searching the plurality of content sources; receive query parameters, wherein the query parameters are specified by a user; receive a query scope regarding the query, the query scope comprising content sources located on at least two content servers of different types, wherein each content server has a search interface for the content sources; for each content source included in the query scope, execute a search using the corresponding search interface and initiate a segmentation process, wherein the segmentation process comprises iterative searches based on groupings of the query parameters; receive query statistics from searches executed by each of the at least two content servers based on the query parameters; aggregate the query statistics from the at least two content servers and present the aggregated query statistics in the user interface, wherein the query statistics are shown regarding each of the plurality of content sources; retrieve data regarding content items matching the query from the at least two content servers; and present a preview of the content items matching the query in the user interface from the retrieved data. 2. The system of claim 1 , wherein the query statistics are presented regarding each of a plurality of query phrases segmented from a free-text query comprising the query parameters. 3. The system of claim 1 , wherein the e-discovery client application comprises further computer-executable instructions that cause the system to: present a filter category along with one or more values for the filter category in the user interface based on the retrieved data; receive a selection of one of the one or more values for the filter category; modify the query parameters for a corresponding content server to include a filter category/value pair based on the selection; and re-execute the search of the corresponding content server based on the modified query parameters. 4. The system of claim 3 , wherein the filter category is specific to a type of content items being previewed in the user interface. 5. The system of claim 1 , wherein a first of the at least two content servers comprises an email server and a second of the at least two content servers comprises a content site server. 6. A computer-implemented method for locating content items in a plurality of content sources across different content servers, the method comprising: presenting a user interface for defining a query for searching the plurality of content sources; receiving query parameters, wherein the query parameters are specified by a user; receiving a query scope regarding a query, the query scope comprising content sources located on at least two content servers of different types, wherein each content server has a search interface for the content sources; for each content source included in the query scope, executing a search using the corresponding search interface; initiating a segmentation process, wherein the segmentation process comprises iterative searches based on groupings of the query parameters; receiving query statistics from searches executed by each of the at least two content servers based on the query parameters; aggregating the query statistics from the at least two content servers; presenting the aggregated query statistics in the user interface, wherein the query statistics are shown regarding each of the plurality of content sources; retrieving data regarding content items matching the query from the at least two content servers; and presenting a preview of the content items matching the query in the user interface from the retrieved data. 7. The computer-implemented method of claim 6 , wherein the query statistics regarding each of the plurality of content sources are grouped together by content source type. 8. The computer-implemented method of claim 6 , wherein the query statistics are presented regarding each of a plurality of query phrases segmented from a free-text query comprising the query parameters. 9. The computer-implemented method of claim 8 , wherein the plurality of query phrases are segmented from the free-text-query at each explicit or implied OR junction. 10. The computer-implemented method of claim 6 , wherein the preview of the one or more content items is presented for content items from content sources of a same type. 11. The computer-implemented method of claim 6 , further comprising: receiving from the user a modification of the query parameters; receiving data regarding the content items located through the native searches re-executed on each of the at least two content servers based on the modified query parameters; and upon receiving the data, updating the query statistics presented to the user. 12. The computer-implemented method of claim 6 , further comprising: presenting a filter category along with one or more values for the filter category to the user based on the received data; receiving a selection of one of the one or more values for the filter category from the user; modifying the query parameters for a corresponding content server to include a filter category/value pair based on the selection; receiving data regarding the content items located through the native search of the corresponding content server based on the modified query parameters; and upon receiving the data, updating the query statistics presented to the user. 13. The computer-implemented method of claim 6 , wherein a first of the at least two content servers comprises an email server and a second of the at least two content servers comprises a content site server. 14. A computer-readable storage device encoded with computer-executable instructions that, when executed by a computer, cause the computer to: present a user interface for defining a query for searching the plurality of content sources; receive query parameters, wherein the query parameters are specified by a user; receive a query scope regarding the query, the query scope comprising content sources located on at least two content servers of different types, wherein each content server has a search interface for the content sources; for each content source included in the query scope, execute a search using the corresponding search interface and initiate a segmentation process, wherein the segmentation process comprises iterative searches based on groupings of the query parameters; receive query statistics from searches executed by each of the at least two content servers based on the query parameters; aggregate the query statistics from the at least two content servers and present the aggregated query statistics in the user interface, wherein the query statistics are shown regarding each of the plurality of content sources; retrieve data regarding content items matching the query from the at least two content servers; and present a preview of the content items matching the query in the user interface from the retrieved data. 15. The computer-readable storage device of claim 14 , wherein the preview of the one or more content items is presented for content items from content sources of a same type. 16. The computer-readable stor

Assignees

Inventors

Classifications

  • G06F16/951Primary

    Indexing; Web crawling techniques · CPC title

  • Physics · mapped topic

  • G06F16/953Primary

    Querying, e.g. by the use of web search engines · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9817898B2 cover?
Technologies are described herein for locating relevant content items across multiple disparate content sources. Query parameters are received from a user interface for defining a query for searching a number of content sources located on multiple, disparate content servers. A native search is executed on each of the content servers based on the received query parameters, and query statistics a…
Who is the assignee on this patent?
Stevenson Bradley, Harmetz Adam David, Christensen Quentin Gary, and 16 more
What technology area does this patent fall under?
Primary CPC classification G06F16/951. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 14 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).