Method and system for recommending cloud websites based on terminal access statistics

US10216848B2 · US · B2

Patent metadata
Field: Value
Publication number: US-10216848-B2
Application number: US-201314411463-A
Country: US
Kind code: B2
Filing date: May 21, 2013
Priority date: Jun 27, 2012
Publication date: Feb 26, 2019
Grant date: Feb 26, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present invention discloses a method and system for recommending cloud websites based on terminal access statistics, wherein, the method mainly comprises: storing websites accessed by a terminal, and sorting the websites according to the accessed frequency by the terminal; capturing corresponding website descriptive information according to a website sorting result; storing the websites and corresponding descriptive information to a cloud storage medium; and when receiving a query request, querying the stored websites and corresponding website descriptive information according to the query keywords in the query request, returning a query result, and returning one or more websites whose frequency ranks on the top in the query result as recommended items. The present invention calculates the popularity of websites and sorts the websites based on the statistics about access behaviors, thereby significantly improving the quality and relevance of the recommended result.
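The flow summarized in the abstract (rank by access frequency, then answer keyword queries with the most frequently accessed matches first) can be illustrated with a minimal sketch. This is not the patented implementation: the function names `rank_by_frequency` and `recommend`, the substring keyword matching, the `top_n` parameter, and the sample URLs and descriptions are all assumptions made for illustration.

```python
from collections import Counter


def rank_by_frequency(access_log):
    """Return (url, count) pairs sorted from most to least accessed."""
    counts = Counter(access_log)
    # Sort by count descending; break ties by URL so the order is deterministic.
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))


def recommend(query, ranked, descriptions, top_n=2):
    """Return up to top_n URLs whose description contains every query keyword.

    `ranked` is already in high-to-low frequency order, so the first matches
    are the most frequently accessed ones. Substring matching is an assumed
    stand-in for the index lookup the patent describes.
    """
    keywords = query.lower().split()
    hits = [
        url
        for url, _count in ranked
        if all(k in descriptions.get(url, "").lower() for k in keywords)
    ]
    return hits[:top_n]


# Hypothetical access records and captured descriptions.
access_log = [
    "news.example", "video.example", "news.example",
    "shop.example", "news.example", "video.example",
]
descriptions = {
    "news.example": "daily news and video clips",
    "video.example": "online video streaming",
    "shop.example": "online shop",
}

ranked = rank_by_frequency(access_log)
print(recommend("video", ranked, descriptions))
# prints ['news.example', 'video.example']
```

Because the ranking is computed once and the query only filters it, the most frequently accessed matching sites surface first without re-sorting per request.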

First claim

Opening claim text (preview).

The invention claimed is:

1. A system for recommending cloud websites, which comprises: a processor; and a memory communicatively coupled to the processor and storing instructions that upon execution by the processor cause the system to: store website addresses accessed by a terminal, and sort the website addresses according to frequency accessed by the terminal, wherein the instructions that upon execution by the processor cause the system to sort the website addresses according to the frequency accessed by the terminal further comprises instructions that upon execution by the processor cause the system to: sort the website addresses in a from-high-to-low sequence based on the frequency accessed by the terminal, generate an original seed file with a designated format comprising the sorted website addresses, the original seed file containing a plurality of data records, each of which corresponds to one website address, and divide the original seed file into a plurality of sub-files according to a predefined rule, wherein the predefined rule comprises using a MD5 algorithm to calculate a MD5 value for each data record among the plurality of data records and determining a sub-file among the plurality of sub-files into which each data record among the plurality of data records is written; capture website descriptive information for each of the plurality of sub-files, the website descriptive information corresponding to website addresses recorded in the each of the plurality of sub-files; store the website descriptive information for the each of the plurality of sub-files and corresponding website addresses to a cloud storage medium; when receiving a query request, query the website descriptive information and corresponding website addresses according to one or more query keywords in the query request, and return a query result.

2. The system according to claim 1, wherein the memory further storing instructions that upon execution by the processor cause the system to: extract website addresses from a database storing the website addresses accessed by the terminal and pre-process the extracted website addresses, the pre-processing comprising: filtering illegal website addresses or deleting repeated website addresses.

3. The system according to claim 1, wherein, the predefined rule further comprises normalizing the MD5 value to an integer value; using the integer value to calculate a remainder of a number of sub-files to be divided into, the obtained remainder being used as a serial number of the sub-file; and writing the data record into the sub-file corresponding to the serial number.

4. The system according to claim 3, wherein, the instructions that upon execution by the processor cause the system to capture website descriptive information further comprises instructions that upon execution by the processor cause the system to: generate website descriptive information files comprising the captured website descriptive information; and store the website descriptive information files.

5. The system according to claim 4, wherein, the instructions that upon execution by the processor cause the system to store the website descriptive information and corresponding website addresses to a cloud storage medium further comprises instructions that upon execution by the processor cause the system to: temporarily store each website descriptive information file, and gather a plurality of the website descriptive information files to store in the cloud storage medium.

6. The system according to claim 5, wherein, the instructions that upon execution by the processor cause the system to query the website descriptive information and corresponding website addresses further comprises instructions that upon execution by the processor cause the system to: build an index according to the website descriptive information and corresponding website addresses, and create an index file; call the index file to query according to the one or more query keywords in the query request, output the query result, and return one or more websites whose frequency ranks on the top in the query result as the recommended items.

7. The system according to claim 6, wherein, the instructions that upon execution by the processor cause the system to query the website descriptive information and corresponding website addresses further comprises instructions that upon execution by the processor cause the system to: filter the query result, wherein the method for filtering results is selected from one or more of the following: Black and White List filtering, erotic content filtering, Trojan virus type filtering and phishing type filtering.

8. The system according to claim 6, wherein, the instructions that upon execution by the processor cause the system to query the website descriptive information and corresponding website addresses further comprises instructions that upon execution by the processor cause the system to: perform load balancing on the received query request after receiving the query request.

9. A method for recommending cloud websites, comprising: storing website addresses accessed by a terminal, and sorting the website addresses according to frequency accessed by the terminal, wherein the sorting the website addresses further comprises: sorting the website addresses in a from-high-to-low sequence based on the frequency accessed by the terminal, generating an original seed file with a designated format comprising the sorted website addresses, the original seed file containing a plurality of data records, each of which corresponds to one website address, and dividing the original seed file into a plurality of sub-files according to a predefined rule, wherein the predefined rule comprises using a MD5 algorithm to calculate a MD5 value for each data record among the plurality of data records and determining a sub-file among the plurality of sub-files into which each data record among the plurality of data records is written; capturing website descriptive information for each of the plurality of sub-files, the website descriptive information corresponding to website addresses recorded in the each of the plurality of sub-files; storing the website descriptive information for the each of the plurality of sub-files and corresponding website addresses to a cloud storage medium; and when receiving a query request, querying the website descriptive information and corresponding website addresses according to one or more query keywords in the query request, and returning a query result.

10. The method according to claim 9, wherein, before the sorting the website addresses, the method further comprises: extracting website addresses from a database storing the website addresses accessed by the terminal and pre-processing the extracted website addresses, the pre-processing comprising: filtering illegal website addresses or deleting repeated website addresses.

11. The method according to claim 9, wherein, the predefined rule further comprises: normalizing the MD5 value to an integer value; using the integer value to calculate a remainder of a number of sub-files to be divided into, the obtained remainder being used as a serial number of the sub-file; and writing the data record into the sub-file corresponding to the serial number.

12. The method according to claim 9, wherein, the capturing further comprises: generating website descriptive information files comprising the captured website descriptive information and storing the files.

13. The method according to claim 9, wherein, after the capturing the web site descriptive information, the method further co
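The sub-file assignment rule in claims 1, 3, 9 and 11 (compute an MD5 value per data record, normalize it to an integer, and use the remainder modulo the number of sub-files as the sub-file serial number) can be sketched as follows. The claims do not say how the MD5 value is normalized; reading the 128-bit hex digest as one integer is an assumption here, as are the names `subfile_index` and `split_seed` and the tab-separated record format.

```python
import hashlib


def subfile_index(record: str, num_subfiles: int) -> int:
    """Map one seed-file record to a sub-file serial number."""
    digest = hashlib.md5(record.encode("utf-8")).hexdigest()
    # Assumed normalization: interpret the hex digest as a single integer.
    as_int = int(digest, 16)
    # The remainder serves as the serial number of the target sub-file.
    return as_int % num_subfiles


def split_seed(records, num_subfiles):
    """Distribute records into num_subfiles lists, keeping input order within each."""
    subfiles = [[] for _ in range(num_subfiles)]
    for record in records:
        subfiles[subfile_index(record, num_subfiles)].append(record)
    return subfiles


# Hypothetical seed records: "address<TAB>access count", sorted high to low.
seed = ["news.example\t3", "video.example\t2", "shop.example\t1"]
for serial, part in enumerate(split_seed(seed, 2)):
    print(serial, part)
```

Since the serial number depends only on the record content, the same record always lands in the same sub-file, and records within a sub-file keep their high-to-low frequency order from the seed file.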

Assignees

Beijing Qihoo Technology Co, Qizhi Software Beijing Co Ltd

Inventors

Classifications

  • Physics · mapped topic

  • Recommending goods or services · CPC title

  • Market modelling; Market analysis; Collecting market data · CPC title

  • Physics · mapped topic

  • based on web technology, e.g. hypertext transfer protocol [HTTP] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10216848B2 cover?
The present invention discloses a method and system for recommending cloud websites based on terminal access statistics, wherein, the method mainly comprises: storing websites accessed by a terminal, and sorting the websites according to the accessed frequency by the terminal; capturing corresponding website descriptive information according to a website sorting result; storing the websites and…
Who is the assignee on this patent?
Beijing Qihoo Technology Co, Qizhi Software Beijing Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06F17/30867. Mapped technology areas include Physics.
When was this patent published?
Publication date Feb 26, 2019 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).