Sorting documents according to comprehensibility scores determined for the documents
US-2024119078-A1 · Apr 11, 2024 · US
US2018113934A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2018113934-A1 |
| Application number | US-201615334808-A |
| Country | US |
| Kind code | A1 |
| Filing date | Oct 26, 2016 |
| Priority date | Oct 26, 2016 |
| Publication date | Apr 26, 2018 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A device may receive information that identifies an item to be categorized. The device may map the item to a first vector based on the information that identifies the item. The device may compare the first vector to a second vector based on mapping the item to the first vector. The device may determine a similarity value between the first vector and the second vector based on comparing the first vector and the second vector. The device may determine that the similarity value satisfies a threshold. The device may determine a category associated with the item based on the similarity value satisfying the threshold. The second vector may be associated with the category. The device may provide information that identifies the category associated with the item to cause an action to be performed.
Opening claim text (preview).
What is claimed is: 1 . A device, comprising: one or more processors to: receive information that identifies an item to be categorized, the item including a set of first terms; map the item to a first vector based on the set of first terms, the first vector including a set of first values that correspond to the set of first terms; compare the first vector and a second vector associated with a categorized item, the second vector including a set of second values that correspond to a set of second terms associated with the categorized item; determine an amount of the first values that match the second values based on comparing the first vector and the second vector; determine a similarity value between the first vector and the second vector based on the amount of the first values that match the second values; determine that the similarity value satisfies a threshold; determine a category associated with the item based on the similarity value satisfying the threshold, the categorized item being associated with the category; and provide information that identifies the category associated with the item to permit and/or cause an action to be performed. 2 . The device of claim 1 , where the one or more processors are further to: compare a third vector and the first vector, the third vector being associated with another item to be categorized; determine that another similarity value between the third vector and the first vector satisfies the threshold; determine that the other item is associated with the category based on the other similarity value satisfying the threshold; and where the one or more processors, when providing the information that identifies the category associated with the item, are to: provide the information that identifies the category associated with the item and the other item. 3 . The device of claim 1 , where the one or more processors are further to: determine a first value, of the set of first values, associated with a first term of the set of first terms, the first vector including the first value in a first element; determine a match between the first value and a second value of the set of second values, the second vector including the second value in a second element, the first element and the second element being a same element; and where the one or more processors, when comparing the first vector and the second vector, are to: compare the first value and the second value. 4 . The device of claim 1 , where the one or more processors are further to: compare the first vector and a first set of vectors, the first set of vectors including the second vector; identify a second set of vectors based on comparing the first vector and the first set of vectors, the second set of vectors including the second vector; and where the one or more processors, when determining the similarity value between the first vector and the second vector, are to: determine the similarity value based on identifying the second set of vectors. 5 . The device of claim 1 , where the one or more processors are further to: determine a hamming distance value between the first vector and the second vector; and where the one or more processors, when determining the similarity value, are to: determine the similarity value based on the hamming distance value. 6 . The device of claim 1 , where the one or more processors are further to: compare the first vector to a set of other vectors of other items to be categorized, the item and the other items being associated with a same dataset; and determine that the other items are associated with the category based on comparing the first vector to the set of other vectors. 7 . The device of claim 1 , where the item to be categorized is a first item, and where the similarity value is a first similarity value; and where the one or more processors are further to: compare a third vector and a fourth vector, the third vector being associated with a second item to be categorized, and the fourth vector being associated with a third item to be categorized; determine that a second similarity value between the third vector and the fourth vector satisfies the threshold; determine that a third similarity value between the second vector and the fourth vector does not satisfy the threshold; determine that a fourth similarity value between the third vector and the first vector satisfies the threshold; and categorize the third item based on the fourth similarity value satisfying the threshold, the third item being associated with the category. 8 . A method, comprising: receiving, by a device, information that identifies a first item to be categorized; mapping, by the device, the first item to a first vector, the first vector including one or more values that correspond to one or more terms of the first item; comparing, by the device, the first vector and a second vector, the second vector being associated with a second item; determining, by the device, a similarity value associated with the first vector and the second vector based on comparing the first vector and the second vector; determining, by the device, that the similarity value satisfies a threshold; determining, by the device, a category associated with the first item based on the similarity value satisfying the threshold, the second item being associated with the category; and providing, by the device, information that identifies the category associated with the first item to permit an action to be performed. 9 . The method of claim 8 , further comprising: identifying the one or more terms of the first item; determining the one or more values based on the one or more terms; assigning the one or more values to the one or more terms; and where mapping the first item to the first vector comprises: mapping the first item to the first vector based on assigning the one or more values to the one or more terms. 10 . The method of claim 8 , further comprising: comparing a first value, of the one or more values, of the first vector and a second value of the second vector, the first value being associated with a first element of the first vector, the second value being associated with a second element of the second vector, and the first element matching the second element; determining that the first value matches the second value; and where determining that the similarity value satisfies the threshold comprises: determining that the similarity value satisfies the threshold based on the first value matching the second value. 11 . The method of claim 8 , further comprising: identifying an amount of the one or more terms; and generating the first vector based on the one or more terms, the first vector including another amount of values that is different than the amount of the one or more terms. 12 . The method of claim 8 , further comprising: comparing a third vector and the first vector; determining that another similarity value between the third vector and the first vector satisfies the threshold; and categorizing the third vector based on the other similarity value satisfying the threshold, the third vector being categorized in association with the category. 13 . The method of claim 8 , further comprising: comparing the first vector and a first set of vectors, the first set of vectors including the second vector; identifying a second set of vectors based on comparing the first vector and the first set of vectors, the second set of vectors being different than the first set of vectors, and the second set of vecto
Parsing · CPC title
Query execution (filtering based on additional data G06F16/335) · CPC title
Clustering; Classification · CPC title
Machine learning · CPC title
Physics · mapped topic
Related publications grouped by family.
Answers are generated from the same data shown on this page.