Updating a search index using reported browser history data

US10482073B2 · US · B2

Patent metadata
Publication number: US-10482073-B2
Application number: US-201615217433-A
Country: US
Kind code: B2
Filing date: Jul 22, 2016
Priority date: Dec 9, 2010
Publication date: Nov 19, 2019
Grant date: Nov 19, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, systems, and computer-readable media are provided for updating a search index with new uniform resource locators (URLs) and spiking URLs with increased user interest. History data, provided from browser applications residing on users' computers that indicate URLs accessed by the users, is parsed to identify new/previously unknown URLs. The history data also indicates URLs in which there is increased interest based on a number of recent hits as compared to an average number of hits determined over time. Author postings of new URLs to social networking sites and a quality rating of the authors may also be used to identify and filter new URLs. Search indexes are updated with the new and spiking URL data. As such, lag time between posting of new URLs and spiking of URL interest and inclusion of this data in a search index is greatly decreased.
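
The two detection steps described in the abstract (finding previously unknown URLs and finding URLs with a spike in interest) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function names, the in-memory set and dictionaries, and the `spike_ratio` threshold are all hypothetical, since the abstract does not specify data structures or a threshold.

```python
from collections import Counter


def find_new_urls(reported_urls, known_urls):
    """Return reported URLs that are absent from known_urls, in first-seen order."""
    seen = set()
    new_urls = []
    for url in reported_urls:
        if url not in known_urls and url not in seen:
            seen.add(url)
            new_urls.append(url)
    return new_urls


def find_spiking_urls(recent_hits, average_hits, spike_ratio=3.0):
    """Return URLs whose recent hit count exceeds spike_ratio times their average.

    spike_ratio is an assumed threshold; the abstract only says that recent
    hits are compared with an average number of hits determined over time.
    """
    spiking = []
    for url, hits in recent_hits.items():
        baseline = average_hits.get(url)
        if baseline is None or baseline <= 0:
            continue  # no usable average: such a URL is handled as new, not spiking
        if hits > spike_ratio * baseline:
            spiking.append(url)
    return sorted(spiking)


# Hypothetical history data reported by several browsers in one interval.
known = {"https://a.example/1", "https://c.example/3"}
reported = [
    "https://a.example/1",
    "https://b.example/new",
    "https://a.example/1",
    "https://c.example/3",
    "https://b.example/new",
    "https://a.example/1",
    "https://a.example/1",
]
average = {"https://a.example/1": 1.0, "https://c.example/3": 2.0}

new_urls = find_new_urls(reported, known)
spiking_urls = find_spiking_urls(Counter(reported), average)
```

In this sketch both result lists would then be handed to whatever component updates the search index; that step is outside the scope of the example.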

First claim

Opening claim text (preview).

We claim:

1. One or more computer-readable media having computer-executable instructions embodied thereon that, when executed, perform a method for updating a search index with newly published documents, the method comprising: providing an application to a user computing device having a processor and a memory, the application configured to generate and store on the user computing device a history file that includes an indication of each of one or more uniform resource identifiers (URIs) accessed by a browser application executing on the user computing device and metadata indicating a time of day when the URIs were accessed by the user computing device, wherein the application and the browser application are the same or different; receiving the history file at a server having a processor and a memory from the user computer device based on a predetermined schedule; identifying a new URI by the server from the one or more URIs indicated in the history file, the new URI being associated with a document that was previously unknown to the server; and updating a search index with the new URI, the search index being useable by a search engine to identify search results for a search query.

2. The computer-readable media of claim 1, further comprising: receiving an authorization from a user to allow the application to track the indications of the one or more URIs accessed by the user computing device.

3. The computer-readable media of claim 1, wherein the history file is received by the server periodically and the history file includes an indication of each of one or more URIs accessed by the browser application since a previous history file was received.

4. The computer-readable media of claim 1, wherein the search index is updated with the new URI within thirty minutes of identifying the new URI.

5. The computer-readable media of claim 1, further comprising: calculating a score for the new URI based at least partially on a number of instances of the new URI in history files received from the application executing on a plurality of user computing devices.

6. The computer-readable media of claim 5, wherein the score indicates user interest in the URI.

7. The computer-readable media of claim 5, further comprising: providing a list of newly posted URIs posted by authors on one or more social networking websites; and generating a second score for each of the newly posted URIs based at least partially on a quality of a posting author and a number of authors posting the newly posted URI, wherein the quality of the posting author is based at least partially on the posting author's time of posting the newly posted URI and the posting author's history of posting newly posted URIs.

8. The computer-readable media of claim 7, further comprising: identifying a match between the new URI and one of the newly posted URIs in the list of newly posted URIs; and combining the score for the new URI with score of the one of the newly posted URIs, now a combined score.

9. The computer-readable media of claim 8, wherein the combined score is employed to determine whether to update the search index with the new URI.

10. A system for updating a search index with newly published documents comprising: a first computing device, having a processor coupled to a memory, configured to receive an application which is configured to generate a history file that includes an indication of each of one or more uniform resource identifiers (URIs) accessed by a browser application executing on the first computing device and metadata indicating a time of day when the URIs were accessed, wherein the application and the browser application are the same or different, wherein the history file includes one or more URIs, and wherein the first computing device is a user computing device; a second computing device, having a processor coupled to a memory, configured to receive the history file; the second computing device configured to determine a new URI from the one or more URIs in the received history file that is previously unknown to the second computing device; and the search index configured to update automatically without user interaction with the new URI, the search index being useable by a search engine to identify search results for a search query.

11. The system of claim 10, further comprising the first computing device or the second computing device configured to receive an authorization from a user to allow the application to track the indications of the one or more URIs accessed by the first computing device.

12. The system of claim 10, wherein the history file is received by the second computing device periodically and the history file includes an indication of each of one or more URIs accessed by the browser application since a previous history file was received.

13. The system of claim 10, wherein the search index is updated with the new URI within thirty minutes of identifying the new URI.

14. The system of claim 10, further comprising the second computing device configured to calculate a score for the new URI based at least partially on a number of instances of the new URI in history files received from the application executing on a plurality of first computing devices.

15. The system of claim 14, wherein the search index is updated based on the score.

16. A system for updating a search index with newly published documents comprising: a plurality of groups of one or more user computing devices, each user device having a processor coupled to a memory and configured to receive an application, the application configured to generate a history file that includes an indication of each of one or more uniform resource identifiers (URIs) accessed by a browser application executing on each of the user computing device and metadata indicating a time of day when the URIs were accessed, wherein the application and the browser application are the same or different, wherein the history files include one or more URIs; a server, having a processor coupled to a memory, configured to sequentially receive the history files from each group of the plurality of groups of one or more user computing devices based on a predetermined schedule for each group of one or more user computing devices, wherein the history files from each group of the plurality of groups of one or more user computing devices forms a group of history files; the server configured to determine upon receipt of each group of history files a new URI that is previously unknown to the server from the one or more URIs in one of the history files in the received group of history files; and the search index configured to update automatically without user interaction with the new URI, the search index being useable by a search engine to identify search results for a search query.

17. The system of claim 16, wherein the search index is updated based on a score indicating that the URI is of value or interest to users.

18. The system of claim 16, wherein the history files from each group of the plurality of groups of one or more user computing devices are received in at a different designated time for each group.

19. The system of claim 16, wherein a group of history files is received every thirty minutes.
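
The scoring described in claims 5 through 9 can be illustrated with a short sketch. Everything here is an assumption made for illustration: the claims say only that a first score depends on how often the new URI appears in history files, that a second score depends on author quality and the number of posting authors, and that the two are combined. Counting history files, summing author quality ratings, adding the two scores, and the `threshold` value are hypothetical choices, and author quality is taken as a precomputed number rather than derived from posting time and posting history.

```python
def history_score(uri, history_files):
    """Count the history files that contain the URI (one reading of claim 5)."""
    return sum(1 for urls in history_files if uri in urls)


def social_score(author_qualities):
    """Sum author quality ratings, so more authors and better authors both raise the score."""
    return sum(author_qualities)


def combined_score(uri, history_files, social_posts):
    """Add the social score to the history score when the URI was also posted (claim 8)."""
    score = float(history_score(uri, history_files))
    if uri in social_posts:
        score += social_score(social_posts[uri])
    return score


def should_index(uri, history_files, social_posts, threshold=3.0):
    """Decide whether to update the index; threshold is an assumed cutoff (claim 9)."""
    return combined_score(uri, history_files, social_posts) >= threshold


# Hypothetical input: one set of URIs per history file, plus the quality
# ratings of the authors who posted each URI on social networking sites.
history_files = [
    {"https://x.example/post", "https://y.example/"},
    {"https://x.example/post"},
    {"https://z.example/"},
]
social_posts = {"https://x.example/post": [1.0, 0.5]}

x_score = combined_score("https://x.example/post", history_files, social_posts)
z_indexed = should_index("https://z.example/", history_files, social_posts)
```

An additive combination keeps the example easy to follow; a weighted or normalized combination would fit the claim language equally well.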

Assignees

Inventors

Classifications

  • using information identifiers, e.g. uniform resource locators [URL] · CPC title

  • G06F16/22 (Primary)

    Indexing; Data structures therefor; Storage structures · CPC title

  • using ranking · CPC title

  • Indexing; Data structures therefor; Storage structures · CPC title

  • Indexing; Web crawling techniques · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10482073B2 cover?
Methods, systems, and computer-readable media are provided for updating a search index with new uniform resource locators (URLs) and spiking URLs with increased user interest. History data, provided from browser applications residing on users' computers that indicate URLs accessed by the users, is parsed to identify new/previously unknown URLs. The history data also indicates URLs in which ther…
Who is the assignee on this patent?
Microsoft Technology Licensing, LLC
What technology area does this patent fall under?
Primary CPC classification G06F16/22. Mapped technology areas include Physics.
When was this patent published?
Publication date: Nov 19, 2019 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).