Search Result Ranking Based on Post Classifiers on Online Social Networks
US-2018268065-A1 · Sep 20, 2018 · US
US2018276208A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2018276208-A1 |
| Application number | US-201715470300-A |
| Country | US |
| Kind code | A1 |
| Filing date | Mar 27, 2017 |
| Priority date | Mar 27, 2017 |
| Publication date | Sep 27, 2018 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Online search retrieval is improved by automatic generation of key phrases. When a search engine crawls an electronic document, key words and phrases greatly help organize the electronic document to one or more topics. A quotient matrix defines a ratio of a key phrase to a total number of words in the electronic document. A correlation coefficient may also determine which key phrase correlates to the electronic document. A title key phrase may then be generated in response to the correlation coefficient having a positive value. When the search engine crawls the electronic document, the title key phrase may be provided as metadata.
Opening claim text (preview).
What is claimed is: 1 . A method for crawling an electronic document, comprising: receiving, by an information handling system, the electronic document; determining, by the information handling system, a positive correlation coefficient between the electronic document and key phrases associated with a product; generating, by the information handling system, a title associated with the electronic document in response to the positive correlation coefficient; adding, by the information handling system, entries to an electronic database that electronically associate the title and the electronic document to the key phrases associated with the product; and providing, by the information handling system, the key phrases to a search engine when the crawling of the electronic document is performed. 2 . The method of claim 1 , further comprising determining a quotient matrix based on words in the electronic document. 3 . The method of claim 1 , further comprising discarding a generic word from the title associated with the electronic document. 4 . The method of claim 1 , further comprising determining a quotient matrix associated with the electronic document. 5 . The method of claim 1 , further comprising determining a quotient matrix associated with the electronic document, the quotient matrix defined as a quotient between the title and a total number of words in the electronic document. 6 . The method of claim 1 , further comprising discarding a negative correlation coefficient between the electronic document and key phrases associated with a product. 7 . The method of claim 1 , further comprising discarding the key phrases in response to no correlation coefficient between the electronic document and the key phrases associated with the product. 8 . An apparatus, comprising: a hardware processor; and a memory device accessible to the hardware processor, the memory device storing instructions, the instructions when executed causing the hardware processor to perform operations, the operations including: receiving an electronic document for crawling by a search engine; determining a positive correlation coefficient between the electronic document and key phrases associated with a product; generating a title associated with the electronic document in response to the positive correlation coefficient; adding entries to an electronic database that electronically associate the title and the electronic document to the key phrases associated with a product; and providing the key phrases to a search engine when the crawling of the electronic document is performed. 9 . The system of claim 8 , wherein the operations further comprise determining a quotient matrix based on words in the electronic document. 10 . The system of claim 8 , wherein the operations further comprise discarding a generic word from the title associated with the electronic document. 11 . The system of claim 8 , wherein the operations further comprise determining a quotient matrix associated with the electronic document. 12 . The system of claim 8 , wherein the operations further comprise determining a quotient matrix associated with the electronic document, the quotient matrix defined as a quotient between the title and a total number of words in the electronic document. 13 . The system of claim 8 , wherein the operations further comprise discarding a negative correlation coefficient between the electronic document and key phrases associated with a product. 14 . The system of claim 8 , wherein the operations further comprise discarding the key phrases in response to no correlation between the electronic document and the key phrases associated with the product. 15 . A memory device storing instructions that when executed cause a hardware processor to perform operations, the operations comprising: receiving an electronic document for crawling by a search engine; determining correlation coefficients between textual words in the electronic document and key phrases associated with a product; determining the key phrases associated with positive values of the correlation coefficients; hashing the key phrases associated with the positive values of the correlation coefficients using an electronic representation of a hashing algorithm; generating a title for the electronic document, the title based on hash values generated by the hashing of the key phrases associated with the positive values of the correlation coefficients; associating the title in an electronic database to the electronic document; and providing the title as metadata for the crawling by the search engine. 16 . The memory device of claim 15 , wherein the operations further comprise determining a quotient matrix based on the textual words in the electronic document. 17 . The memory device of claim 15 , wherein the operations further comprise discarding a generic word associated with the electronic document. 18 . The memory device of claim 15 , wherein the operations further comprise determining a quotient matrix associated with the electronic document, the quotient matrix defined as a ratio of the title to a total number of the textual words in the electronic document. 19 . The memory device of claim 15 , wherein the operations further comprise discarding the key phrases associated with negative values of the correlation coefficients. 20 . The memory device of claim 15 , wherein the operations further comprise discarding the key phrases having no correlation.
Document management systems · CPC title
Selection or weighting of terms for indexing · CPC title
Indexing; Web crawling techniques · CPC title
Physics · mapped topic
Physics · mapped topic
Related publications grouped by family.
Answers are generated from the same data shown on this page.