System and a method of generating a training set of data for training a machine-learning algorithm
US-2024232709-A1 · Jul 11, 2024 · US
US9679316B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9679316-B2 |
| Application number | US-201213490035-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jun 6, 2012 |
| Priority date | Jun 6, 2011 |
| Publication date | Jun 13, 2017 |
| Grant date | Jun 13, 2017 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A system and method for selecting diverse product titles to display on a website are disclosed. In some example embodiments, the methods and systems described herein identify available products to be displayed, cluster the identified products based on their similarity to one another, select one or more products from each of the clusters, and display information, such as a title, associated with the randomly selected products.
Opening claim text (preview).
What is claimed is: 1. A system, comprising: one or more processors and executable instructions accessible on a computer-readable medium that, in response to being executed, cause the system to perform operations comprising: receiving, by a server over a network, a keyword query from a client machine, the keyword query including a display constraint of a display of the client machine; accessing a network database storing publication information; in response to receiving the keyword query, generating a publication set; grouping the publication set into a particular cluster of multiple clusters based on a similarity metric between each of the publications of the publication set and other publications, each cluster comprising a cluster of similar publications and each of the multiple clusters being associated with a different similarity metric value; determining similarity metrics between the publications; assigning the publications to one of the multiple clusters based on the determined similarity metrics satisfying a threshold value of clustering publications; distributing, at random, each publication of the multiple clusters to one of multiple distinguished buckets of publications, a quantity of the multiple distinguished buckets being based on the display constraint of the display; selecting, at random, one of the multiple distinguished buckets of publications; and transmitting, from a server over a network, instructions to cause the display of the client machine to generate a user interface that is configured to display descriptions for the publications of the randomly selected distinguished bucket in response to the received keyword query. 2. The system of claim 1 , wherein the operations further comprise: grouping the publication set based on a similarity metric between a title of a publication and titles of other publications; and transmitting, from a server over a network, instructions to cause the client machine to generate a user interface that is configured to display titles associated with the randomly selected publications. 3. The system of claim 1 , wherein the operations further comprise: grouping the publication set based on a similarity metric between a title of a publication and titles of other publications; and transmitting, from a server over a network, instructions to cause the client machine to generate a user interface that is configured to display non title information associated with the randomly selected publications. 4. The system of claim 1 , wherein the operations further comprise transmitting, from a server over a network, instructions to cause the display of the client machine to generate a user interface that is configured to display titles associated with the publications of the randomly distinguished bucket along a sidebar presented by a web page associated with an online retail environment. 5. The system of claim 1 , wherein the operations further comprise assigning a value for the similarity metric based on a number publications. 6. A computerized method, comprising: receiving, by a server over a network, a keyword query from a client machine, the keyword query including a display constraint of a display of the client machine; accessing a network database storing publication information; in response to receiving the keyword query, generating a publication set; grouping the publication set into a particular cluster of multiple clusters based on a similarity metric between each of the publications of the publication set and other publications, each cluster comprising a cluster of similar publications and each of the multiple clusters being associated with a different similarity metric value; determining similarity metrics between the publications and assigning the publications to one of the multiple clusters based on the determined similarity metrics satisfying a threshold value of clustering publications; distributing, at random, each publication of the multiple clusters to one of multiple distinguished buckets of publications, a quantity of the multiple distinguished buckets being based on the display constraint of the display; selecting, at random, one of the multiple distinguished buckets of publications; and transmitting, from a server over a network, instructions to cause the display of the client machine to generate a user interface that is configured to display descriptions for the publications of the randomly selected distinguished bucket in response to the received keyword query. 7. The computerized method of claim 6 , further comprising: comparing a title of the publication to titles of other publications within the publication set; and determining a similarity metric including calculating a cosine similarity score for each of the titles. 8. The computerized method of claim 6 , further comprising: receiving a runtime request to display publication descriptions associated with one or more publications available for selection on the website. 9. The computerized method of claim 6 , further comprising transmitting, from a server over a network, instructions to cause a client machine to generate a modified user interface that is configured to display, in a modified size format, publications available for selection on the website. 10. A non-transitory machine-readable storage medium having embodied thereon instructions, that in response to being executed by one or more processors of a machine, cause the machine to perform operations comprising: receiving, by a server over a network, a keyword query from a client machine, the keyword query including a display constraint of a display of the client machine; accessing a network based publication a network database storing publication information; in response to receiving the keyword query, generating a publication set; grouping the publication set into a particular cluster of multiple clusters based on a similarity metric between each of the publications of the publication set and other publications, each cluster comprising a cluster of similar publications and each of the multiple clusters being associated with a different similarity metric value; determining similarity metrics between the publications and assigning the publications to one of the multiple clusters based on the determined similarity metrics satisfying a threshold value of clustering publications; distributing, at random, each publication of the multiple clusters to one of multiple distinguished buckets of publications, a quantity of the multiple distinguished buckets being based on the display constraint of the display; selecting, at random, one of the multiple distinguished buckets of publications; and transmitting, from a server over a network, instructions to cause the display of the client machine to generate a user interface that is configured to display descriptions for the publications of the randomly selected distinguished bucket in response to the received keyword query. 11. The computer-readable storage medium of claim 10 , wherein the operations further comprise transmitting, from a server over a network, instructions to cause the display of the client machine to generate a user interface that is configured to display titles for the publications within the randomly selected grouping. 12. The computer-readable storage medium of claim 10 , wherein the operations further comprise transmitting, from a server over a network, instructions to cause the display of the client machine to generate a user interface that is configured to display an indication of other publications similar to the publications within the randomly selected grouping. 13. The computer-readable s
Electronic shopping [e-shopping] · CPC title
Rating or review of business operators or products · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.