Browse node creation using frequent pattern mining
US-10838984-B2 · Nov 17, 2020 · US
| Field | Value |
|---|---|
| Publication number | US-11397758-B2 |
| Application number | US-202017064538-A |
| Country | US |
| Kind code | B2 |
| Filing date | Oct 6, 2020 |
| Priority date | Aug 18, 2016 |
| Publication date | Jul 26, 2022 |
| Grant date | Jul 26, 2022 |
Official abstract text for this publication.
A browse node page is addressed by its path. As a result, a web crawler may be more likely to find the browse node than the corresponding parameter-based search page. The browse node may be further distinguished from a search result page by the use of a title or header metatag that indicates information about the browse node and distinguishes the browse node from a general search result page. The number of combinations of keywords, categories, and key-value pairs makes the creation of a browse node for every possible combination prohibitive in all but the simplest applications. Methods and systems to identify which search result pages should be converted to browse nodes are disclosed herein.
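As a rough illustration of the distinction the abstract draws, the sketch below contrasts a parameter-based search address with a path-based browse node address and a distinguishing title tag. The URL layout (`/search`, `/b/`), the `q` parameter, and all function names are assumptions made for this example; the patent text shown here does not specify them.

```python
from html import escape
from urllib.parse import quote, urlencode


def search_url(base, keywords, pairs):
    """Parameter-based search result page (hypothetical layout)."""
    params = {"q": keywords, **pairs}
    return f"{base}/search?{urlencode(params)}"


def browse_node_url(base, category, pairs):
    """Path-based browse node address: each component becomes a path segment.

    The key-value pair is kept in the path as "key-value" (an assumed format).
    """
    segments = [category] + [f"{k}-{v}" for k, v in sorted(pairs.items())]
    slugs = [quote(s.lower().replace(" ", "-"), safe="") for s in segments]
    return f"{base}/b/" + "/".join(slugs)


def browse_node_title(category, pairs):
    """Title tag that describes the node rather than a generic search page."""
    values = " ".join(v for _, v in sorted(pairs.items()))
    return f"<title>{escape(f'{values} {category}')}</title>"


if __name__ == "__main__":
    base = "https://example.com"
    print(search_url(base, "shoes", {"color": "red"}))
    print(browse_node_url(base, "Shoes", {"color": "Red"}))
    print(browse_node_title("Shoes", {"color": "Red"}))
```

Because the browse node address has no query string, a crawler that follows ordinary links can reach it like any other static page, which is the property the abstract relies on.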
Opening claim text (preview).
What is claimed is:
1. A method comprising: accessing data representative of a list of potential browse nodes, the list including at least a first potential browse node including a first set of items and a second potential browse node including a second set of items; determining a degree of overlap between the first set of items and the second set of items; determining that the degree of overlap is greater than a threshold degree of overlap; and based at least in part on the degree of overlap exceeding the threshold degree of overlap, generating a browse node by, at least in part, generating a uniform resource locator (URL) referencing the browse node and including a key-value pair associated with the second potential browse node.
2. The method of claim 1, wherein the generating the browse node further includes generating a title or header metatag indicating information about the browse node.
3. The method of claim 1, wherein the determining the degree of overlap is based at least in part on a count of the first set of items and the second set of items, a first percentage of the first set of items included in the second set of items, a second percentage of the second set of items included in the first set of items, or a combination thereof.
4. The method of claim 1, wherein the first potential browse node and the second potential browse node are associated with a same key-value pair.
5. The method of claim 1, wherein the first potential browse node and the second potential browse node are associated with a key-value pair, and the method further comprises: comparing a value of the key-value pair to a set of excluded values to determine that the value is not in the set of excluded values; and wherein the generating the browse node is further based at least in part on the determination that the value of the key-value pair is not in the set of excluded values.
6. The method of claim 1, wherein the first potential browse node and the second potential browse node are associated with a key-value pair, and the method further comprises: comparing a key of the key-value pair to a set of permitted keys to determine that the key is included in the set of permitted keys; and wherein the generating the browse node is further based at least in part on the determination that the key of the key-value pair is included in the set of permitted keys.
7. The method of claim 1, further comprising removing the first potential browse node from the list of potential browse nodes based at least in part on the degree of overlap exceeding the threshold degree of overlap.
8. A system comprising: one or more processors; and one or more memory devices storing instructions that, when executed by the one or more processors, cause the one or more processors to execute operations comprising: accessing data representative of a list of potential browse nodes, the list including at least a first potential browse node including a first set of items and a second potential browse node including a second set of items; determining a degree of overlap between the first set of items and the second set of items; determining that the degree of overlap is greater than a threshold degree of overlap; and based at least in part on the degree of overlap exceeding the threshold degree of overlap, generating a browse node by, at least in part, generating a uniform resource locator (URL) referencing the browse node and including a key-value pair associated with the second potential browse node.
9. The system of claim 8, wherein the generating the browse node further includes generating a title or header metatag indicating information about the browse node.
10. The system of claim 8, wherein the determining the degree of overlap is based at least in part on a count of the first set of items and the second set of items, a first percentage of the first set of items included in the second set of items, a second percentage of the second set of items included in the first set of items, or a combination thereof.
11. The system of claim 8, wherein the operations further comprise removing the first potential browse node from the list of potential browse nodes based at least in part on the degree of overlap exceeding the threshold degree of overlap.
12. The system of claim 8, wherein the first potential browse node and the second potential browse node are associated with a key-value pair, and the operations further comprise: comparing a value of the key-value pair to a set of excluded values to determine that the value is not in the set of excluded values; and wherein the generating the browse node is further based at least in part on the determination that the value of the key-value pair is not in the set of excluded values.
13. The system of claim 8, wherein the first potential browse node and the second potential browse node are associated with a key-value pair, and the operations further comprise: comparing a key of the key-value pair to a set of permitted keys to determine that the key is included in the set of permitted keys; and wherein the generating the browse node is further based at least in part on the determination that the key of the key-value pair is included in the set of permitted keys.
14. The system of claim 8, wherein the generating the browse node includes: determining an image associated with an item of the second set of items; selecting an image corresponding to the item; and associating the image with the browse node.
15. One or more non-transitory machine-readable storage medium comprising instructions that, when executed by one or more processors of a machine, cause the machine to perform operations comprising: accessing data representative of a list of potential browse nodes, the list including at least a first potential browse node including a first set of items and a second potential browse node including a second set of items; determining a degree of overlap between the first set of items and the second set of items; determining that the degree of overlap is greater than a threshold degree of overlap; and based at least in part on the degree of overlap exceeding the threshold degree of overlap, generating a browse node by, at least in part, generating a uniform resource locator (URL) referencing the browse node and including a key-value pair associated with the second potential browse node.
16. The one or more non-transitory machine-readable storage medium of claim 15, wherein the generating the browse node further includes generating a title or header metatag indicating information about the browse node.
17. The one or more non-transitory machine-readable storage medium of claim 15, wherein the determining the degree of overlap is based at least in part on a count of the first set of items and the second set of items, a first percentage of the first set of items included in the second set of items, a second percentage of the second set of items included in the first set of items, or a combination thereof.
18. The one or more non-transitory machine-readable storage medium of claim 15, wherein the first potential browse node and the second potential browse node are associated with a same key-value pair.
19. The one or more non-transitory machine-readable storage medium of claim 15, wherein the first potential browse node and the second potential browse node are associated with a key-value pair, and the operations further comprise: comparing a value of the key-value pair to a set of excluded values to determine that the value is
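The overlap test recited in claims 1, 3, and 5 through 7 can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the `PotentialNode` type, the `resolve` function, the filter sets, the 0.8 threshold, the choice of the smaller of the two percentages as the "degree of overlap", and the `/b/key-value` URL format are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical filter sets standing in for claims 5 and 6; values are invented.
PERMITTED_KEYS = frozenset({"color", "brand"})
EXCLUDED_VALUES = frozenset({"unknown", "other"})


@dataclass(frozen=True)
class PotentialNode:
    key: str
    value: str
    items: frozenset


def overlap(first, second):
    """Return (shared count, share of first in second, share of second in first)."""
    shared = len(first & second)
    pct_first = shared / len(first) if first else 0.0
    pct_second = shared / len(second) if second else 0.0
    return shared, pct_first, pct_second


def resolve(candidates, first, second, threshold=0.8) -> Optional[str]:
    """If the two candidates overlap heavily, drop the first from the list
    and return a URL path carrying the second node's key-value pair."""
    _, pct_first, pct_second = overlap(first.items, second.items)
    # Assumption: combine the two percentages by taking the smaller one.
    degree = min(pct_first, pct_second)
    if degree <= threshold:
        return None
    # Assumption: the key and value filters are applied to the second node.
    if second.key not in PERMITTED_KEYS or second.value in EXCLUDED_VALUES:
        return None
    candidates.remove(first)
    return f"/b/{second.key}-{second.value}"


if __name__ == "__main__":
    crimson = PotentialNode("color", "crimson", frozenset({1, 2, 3, 4, 5}))
    red = PotentialNode("color", "red", frozenset({1, 2, 3, 4, 5, 6}))
    nodes = [crimson, red]
    print(resolve(nodes, crimson, red), nodes)
```

Taking the smaller percentage is a conservative choice: it only treats two candidates as near-duplicates when each covers most of the other, so a small node that happens to sit inside a large one is not merged away.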
Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking · CPC title
Management thereof · CPC title
Indexing; Data structures therefor; Storage structures · CPC title
Clustering or classification · CPC title
Visual data mining; Browsing structured data · CPC title