Personalization engine for building a dynamic classification dictionary

US9619467B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9619467-B2
Application numberUS-201213555823-A
CountryUS
Kind codeB2
Filing dateJul 23, 2012
Priority dateJun 27, 2008
Publication dateApr 11, 2017
Grant dateApr 11, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A dynamic classification dictionary is built for use in profiling and targeting users for additional relevant content. Behavioral data is gathered from user activity, and user documents and actions are categorized. Author-generated document classification information is analyzed and assigned a first taxonomic noun to characterize the document. User-generated tags characterizing a portion of the document are assigned a second taxonomic noun. Search terms that resulted in the user accessing the document are identified and assigned a third taxonomic noun. Attributes related to the manner in which the document was accessed are evaluated and assigned a fourth taxonomic noun. The document is processed using pattern rules to extract a fifth taxonomic noun. The taxonomic nouns are aggregated into a composite set of taxonomic nouns, and the dynamic classification dictionary is built by storing the composite set of taxonomic nouns.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method executed by one or more computing devices for building a dynamic classification dictionary, the method comprising: extracting, by at least one of the one or more computing devices, one or more terms from information that characterizes a document; applying, by at least one of the one or more computing devices, a first pattern matching rule to the one or more terms to identify one or more taxonomic nouns in the one or more terms, the first pattern matching rule including comparing sequences of consecutive words in the document with known terms and, if a match exists, merging matching consecutive words into a multi-word term and, if a match does not exist, assigning a single word term as the one or more taxonomic nouns; applying, by at least one of the one or more computing devices, a second pattern matching rule to the one or more terms to determine at least one of a part-of-speech and a noun type associated with at least one taxonomic noun of the one or more taxonomic nouns, the second pattern matching rule being different from the first pattern matching rule; and building, by at least one of the one or more computing devices, a dynamic classification dictionary based on the one or more taxonomic nouns and at least one of the part-of-speech and noun type associated with the at least one taxonomic noun. 2. The method of claim 1 , wherein the information that characterizes the document comprises at least one of author-generated classification information regarding the document and a user-generated tag characterizing a portion of the document. 3. The method of claim 1 , wherein the information that characterizes the document comprises one or more search terms that resulted in a user accessing the document. 4. The method of claim 1 , wherein the information that characterizes the document comprises one or more attributes related to the manner in which a user accessed the document. 5. The method of claim 1 , wherein the second pattern matching rule is a contextual pattern matching rule and wherein applying the second pattern matching rule comprises: comparing the context of the at least one taxonomic noun to one or more known contexts which are indicative of at least one of a part-of-speech and a noun type, wherein the context of the at least one taxonomic noun includes terms surrounding the at least one taxonomic noun; and marking the at least one taxonomic noun as being associated with at least one of a part-of-speech and a noun type corresponding to a matching context in the one or more known contexts. 6. The method of claim 1 , wherein the second pattern matching rule includes weighing the one or more taxonomic nouns using a predetermined weighing scheme, wherein the weighing scheme accounts for systematic differences in predicted variability of taxonomic noun assignments if the assignments were repeatedly carried out. 7. An apparatus for building a dynamic classification dictionary, the apparatus comprising: one or more processors; and one or more memories operatively coupled to at least one of the one or more processors and having instructions stored thereon that, when executed by at least one of the one or more processors, cause at least one of the one or more processors to: extract one or more terms from information that characterizes a document; apply a first pattern matching rule to the one or more terms to identify one or more taxonomic nouns in the one or more terms, the first pattern matching rule including comparing sequences of consecutive words in the document with known terms and, if a match exists, merging matching consecutive words into a multi-word term and, if a match does not exist, assigning a single word term as the one or more taxonomic nouns; apply a second pattern matching rule to the one or more terms to determine at least one of a part-of-speech and a noun type associated with at least one taxonomic noun of the one or more taxonomic nouns, the second pattern matching rule being different from the first pattern matching rule; and build a dynamic classification dictionary based on the one or more taxonomic nouns and at least one of the part-of-speech and noun type associated with the at least one taxonomic noun. 8. The apparatus of claim 7 , wherein the information that characterizes the document comprises at least one of author-generated classification information regarding the document and a user-generated tag characterizing a portion of the document. 9. The apparatus of claim 7 , wherein the information that characterizes the document comprises one or more search terms that resulted in a user accessing the document. 10. The apparatus of claim 7 , wherein the information that characterizes the document comprises one or more attributes related to the manner in which a user accessed the document. 11. The apparatus of claim 7 , wherein the second pattern matching rule is a contextual pattern matching rule and wherein the instructions that, when executed by at least one of the one or more processors, cause at least one of the one or more processors to apply the second pattern matching rule further cause at least one of the one or more processors to: comparing the context of the at least one taxonomic noun to one or more known contexts which are indicative of at least one of a part-of-speech and a noun type, wherein the context of the at least one taxonomic noun includes terms surrounding the at least one taxonomic noun; and marking the at least one taxonomic noun as being associated with at least one of a part-of-speech and a noun type corresponding to a matching context in the one or more known contexts. 12. The apparatus of claim 7 , wherein the second pattern matching rule is a contextual pattern matching rule and wherein the instructions that, when executed by at least one of the one or more processors, cause at least one of the one or more processors to apply the second pattern matching rule further cause at least one of the one or more processors to: weighing the one or more taxonomic nouns using a predetermined weighing scheme, wherein the weighing scheme accounts for systematic differences in predicted variability of taxonomic noun assignments if the assignments were repeatedly carried out. 13. At least one non-transitory computer-readable medium storing computer-readable instructions that, when executed by one or more computing devices, cause at least one of the one or more computing devices to: extract one or more terms from information that characterizes a document; apply a first pattern matching rule to the one or more terms to identify one or more taxonomic nouns in the one or more terms, the first pattern matching rule including comparing sequences of consecutive words in the document with known terms and, if a match exists, merging matching consecutive words into a multi-word term and, if a match does not exist, assigning a single word term as the one or more taxonomic nouns; apply a second pattern matching rule to the one or more terms to determine at least one of a part-of-speech and a noun type associated with at least one taxonomic noun of the one or more taxonomic nouns, the second pattern matching rule being different from the first pattern matching rule; and build a dynamic classification dictionary based on the one or more taxonomic nouns and at least one of the part-of-speech and noun type associated with the at least one taxonomic noun. 14. The at least one non-transitory computer-readable medium of claim 13 , wherein the information that characterizes the document comprises at least one of author-generated classification informa

Assignees

Inventors

Classifications

  • Physics · mapped topic

  • Physics · mapped topic

  • Physics · mapped topic

  • Physics · mapped topic

  • G06F16/335Primary

    Filtering based on additional data, e.g. user or group profiles (filtering in web context G06F16/9535, G06F16/9536) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9619467B2 cover?
A dynamic classification dictionary is built for use in profiling and targeting users for additional relevant content. Behavioral data is gathered from user activity, and user documents and actions are categorized. Author-generated document classification information is analyzed and assigned a first taxonomic noun to characterize the document. User-generated tags characterizing a portion of the…
Who is the assignee on this patent?
Pradhan Tushar, Osborne Thomas, Potter John, and 1 more
What technology area does this patent fall under?
Primary CPC classification G06F17/30011. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 11 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).