Item to vector based categorization

US2018113934A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2018113934-A1
Application numberUS-201615334808-A
CountryUS
Kind codeA1
Filing dateOct 26, 2016
Priority dateOct 26, 2016
Publication dateApr 26, 2018
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A device may receive information that identifies an item to be categorized. The device may map the item to a first vector based on the information that identifies the item. The device may compare the first vector to a second vector based on mapping the item to the first vector. The device may determine a similarity value between the first vector and the second vector based on comparing the first vector and the second vector. The device may determine that the similarity value satisfies a threshold. The device may determine a category associated with the item based on the similarity value satisfying the threshold. The second vector may be associated with the category. The device may provide information that identifies the category associated with the item to cause an action to be performed.

First claim

Opening claim text (preview).

What is claimed is: 1 . A device, comprising: one or more processors to: receive information that identifies an item to be categorized, the item including a set of first terms; map the item to a first vector based on the set of first terms, the first vector including a set of first values that correspond to the set of first terms; compare the first vector and a second vector associated with a categorized item, the second vector including a set of second values that correspond to a set of second terms associated with the categorized item; determine an amount of the first values that match the second values based on comparing the first vector and the second vector; determine a similarity value between the first vector and the second vector based on the amount of the first values that match the second values; determine that the similarity value satisfies a threshold; determine a category associated with the item based on the similarity value satisfying the threshold, the categorized item being associated with the category; and provide information that identifies the category associated with the item to permit and/or cause an action to be performed. 2 . The device of claim 1 , where the one or more processors are further to: compare a third vector and the first vector, the third vector being associated with another item to be categorized; determine that another similarity value between the third vector and the first vector satisfies the threshold; determine that the other item is associated with the category based on the other similarity value satisfying the threshold; and where the one or more processors, when providing the information that identifies the category associated with the item, are to: provide the information that identifies the category associated with the item and the other item. 3 . The device of claim 1 , where the one or more processors are further to: determine a first value, of the set of first values, associated with a first term of the set of first terms, the first vector including the first value in a first element; determine a match between the first value and a second value of the set of second values, the second vector including the second value in a second element, the first element and the second element being a same element; and where the one or more processors, when comparing the first vector and the second vector, are to: compare the first value and the second value. 4 . The device of claim 1 , where the one or more processors are further to: compare the first vector and a first set of vectors, the first set of vectors including the second vector; identify a second set of vectors based on comparing the first vector and the first set of vectors, the second set of vectors including the second vector; and where the one or more processors, when determining the similarity value between the first vector and the second vector, are to: determine the similarity value based on identifying the second set of vectors. 5 . The device of claim 1 , where the one or more processors are further to: determine a hamming distance value between the first vector and the second vector; and where the one or more processors, when determining the similarity value, are to: determine the similarity value based on the hamming distance value. 6 . The device of claim 1 , where the one or more processors are further to: compare the first vector to a set of other vectors of other items to be categorized, the item and the other items being associated with a same dataset; and determine that the other items are associated with the category based on comparing the first vector to the set of other vectors. 7 . The device of claim 1 , where the item to be categorized is a first item, and where the similarity value is a first similarity value; and where the one or more processors are further to: compare a third vector and a fourth vector, the third vector being associated with a second item to be categorized, and the fourth vector being associated with a third item to be categorized; determine that a second similarity value between the third vector and the fourth vector satisfies the threshold; determine that a third similarity value between the second vector and the fourth vector does not satisfy the threshold; determine that a fourth similarity value between the third vector and the first vector satisfies the threshold; and categorize the third item based on the fourth similarity value satisfying the threshold, the third item being associated with the category. 8 . A method, comprising: receiving, by a device, information that identifies a first item to be categorized; mapping, by the device, the first item to a first vector, the first vector including one or more values that correspond to one or more terms of the first item; comparing, by the device, the first vector and a second vector, the second vector being associated with a second item; determining, by the device, a similarity value associated with the first vector and the second vector based on comparing the first vector and the second vector; determining, by the device, that the similarity value satisfies a threshold; determining, by the device, a category associated with the first item based on the similarity value satisfying the threshold, the second item being associated with the category; and providing, by the device, information that identifies the category associated with the first item to permit an action to be performed. 9 . The method of claim 8 , further comprising: identifying the one or more terms of the first item; determining the one or more values based on the one or more terms; assigning the one or more values to the one or more terms; and where mapping the first item to the first vector comprises: mapping the first item to the first vector based on assigning the one or more values to the one or more terms. 10 . The method of claim 8 , further comprising: comparing a first value, of the one or more values, of the first vector and a second value of the second vector, the first value being associated with a first element of the first vector, the second value being associated with a second element of the second vector, and the first element matching the second element; determining that the first value matches the second value; and where determining that the similarity value satisfies the threshold comprises: determining that the similarity value satisfies the threshold based on the first value matching the second value. 11 . The method of claim 8 , further comprising: identifying an amount of the one or more terms; and generating the first vector based on the one or more terms, the first vector including another amount of values that is different than the amount of the one or more terms. 12 . The method of claim 8 , further comprising: comparing a third vector and the first vector; determining that another similarity value between the third vector and the first vector satisfies the threshold; and categorizing the third vector based on the other similarity value satisfying the threshold, the third vector being categorized in association with the category. 13 . The method of claim 8 , further comprising: comparing the first vector and a first set of vectors, the first set of vectors including the second vector; identifying a second set of vectors based on comparing the first vector and the first set of vectors, the second set of vectors being different than the first set of vectors, and the second set of vecto

Assignees

Inventors

Classifications

  • Parsing · CPC title

  • Query execution (filtering based on additional data G06F16/335) · CPC title

  • G06F16/35Primary

    Clustering; Classification · CPC title

  • Machine learning · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2018113934A1 cover?
A device may receive information that identifies an item to be categorized. The device may map the item to a first vector based on the information that identifies the item. The device may compare the first vector to a second vector based on mapping the item to the first vector. The device may determine a similarity value between the first vector and the second vector based on comparing the firs…
Who is the assignee on this patent?
Accenture Global Solutions Ltd
What technology area does this patent fall under?
Primary CPC classification G06F16/35. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Apr 26 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).