Portable communication terminal for extracting subjects of interest to the user, and a method therefor

US9323845B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9323845-B2
Application numberUS-201113577149-A
CountryUS
Kind codeB2
Filing dateJan 31, 2011
Priority dateFeb 3, 2010
Publication dateApr 26, 2016
Grant dateApr 26, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A portable communication device for extracting a user interest comprises a term vector generation unit for generating, based on types of text data stored in the portable communication device, a term vector representing each text data, a subject classification tree storage unit for storing a subject classification tree, which is a tree structure in which multiple nodes, each including at least one training data and representing a subject, are connected to one another, and a similarity calculation unit for calculating a similarity between the term vector and the training data for each node in the subject classification tree. The similarity calculation unit extracts a node name representing the user interest from the subject classification tree based on the similarity.

First claim

Opening claim text (preview).

What is claimed is: 1. A portable communication device for extracting a user interest, the device comprising: a term vector generation unit for generating, based on types of text data stored in the portable communication device, a term vector representing each text data; a subject classification tree storage unit for storing a subject classification tree, which is a tree structure in which multiple nodes, each including at least one training data and representing a subject, are connected to one another; a subject classification tree generation unit for generating the subject classification tree by processing open directory data; a training data generation unit for generating the training data representing each directory based on text data information of a set of web sites included in the each directory of the open directory data; a classification unit for mapping the training data to a directory included in the subject classification tree; and a similarity calculation unit for calculating a similarity between the term vector and the training data for each node in the subject classification tree, wherein the similarity calculation unit extracts a node name representing the user interest from the subject classification tree based on the similarity. 2. The portable communication device for extracting a user interest of claim 1 , wherein the term vector generation unit comprises: a term extraction unit for extracting terms from the text data; and a term weight calculation unit for calculating a term weight based on usage frequency of each of the terms used in the text data, and generation time of the text data containing the terms. 3. The portable communication device for extracting a user interest of claim 1 , wherein the similarity calculation unit calculates the similarity between the term vector and the training data included in each node of the subject classification tree, and for each node of the subject classification tree, a parent node's similarity is calculated by summing up all its children's similarities. 4. The portable communication device for extracting a user interest of claim 1 , wherein the name of the node having the highest similarity in the subject classification tree is extracted as the user interest. 5. The portable communication device for extracting a user interest of claim 1 , wherein the text data are extracted from at least one of a text message, a file name, an e-mail, and a mobile web usage history generated in the portable communication device. 6. A method for extracting a user interest, the method comprising: a step in which a term extraction unit extracts terms from text data stored in a portable communication device; a step in which a term weight calculation unit calculates a term weight based on usage frequency of each of the terms used in the text data, and generation time of the text data containing the terms; a step in which a term vector generation unit, based on types of text data stored in the portable communication device, generates a term vector representing each text data; and a step in which an open directory data collection unit collects various open directories and information about web pages included in each of the directories; a step in which a subject classification tree generation unit generates a subject classification tree by processing the collected directory data; a step in which a training data generation unit generates a training data representing each directory based on text data information of a set of web sites included in the each directory of the collected directory data; a stage in which a classification unit maps the training data to a directory included in the subject classification tree; and a step in which a similarity calculation unit calculates a similarity between the term vector and the training data for each node included in a subject classification tree, which is a tree structure in which multiple nodes, each including at least one training data and representing a subject, are connected to one another; wherein the similarity calculation unit extracts a node name representing the user interest from the subject classification tree based on the calculated similarity. 7. The method for extracting a user interest of claim 6 , wherein the similarity calculation unit, for each node of the subject classification tree, calculates a parent node's similarity by summing up all its children's similarities. 8. The method for extracting a user interest of claim 6 , wherein the similarity calculation unit extracts the name of the node having the highest similarity in the subject classification tree as the user interest.

Assignees

Inventors

Classifications

  • Distances to cluster centroïds · CPC title

  • Tree-organised classifiers · CPC title

  • Relational databases · CPC title

  • into predefined classes · CPC title

  • Marketing; Price estimation or determination; Fundraising · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9323845B2 cover?
A portable communication device for extracting a user interest comprises a term vector generation unit for generating, based on types of text data stored in the portable communication device, a term vector representing each text data, a subject classification tree storage unit for storing a subject classification tree, which is a tree structure in which multiple nodes, each including at least o…
Who is the assignee on this patent?
Lee Sang Keun, Ha Jong Woo, Lee Jung Hyun, and 1 more
What technology area does this patent fall under?
Primary CPC classification G06F16/951. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 26 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).