Feature-set augmentation using knowledge engine

US2017337486A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2017337486-A1
Application numberUS-201615157138-A
CountryUS
Kind codeA1
Filing dateMay 17, 2016
Priority dateMay 17, 2016
Publication dateNov 23, 2017
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method includes receiving an original feature-set for training a machine learning system, the feature-set including multiple records each having a set of original features with original feature values and a result, querying a knowledge base based on the set of original features, receiving a set of knowledge features with knowledge feature values responsive to the querying of the knowledge base, generating a first augmented feature-set that includes the multiple records of the original feature set and the knowledge features for the multiple records, and training the machine learning system based on the first augmented feature-set.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method comprising: receiving an original feature-set for training a machine learning system, the feature-set including multiple records each having a set of original features with original feature values and a result; querying a knowledge base based on the set of original features; receiving a set of knowledge features with knowledge feature values responsive to the querying of the knowledge base; generating a first augmented feature-set that includes the multiple records of the original feature set and the knowledge features for the multiple records; and training the machine learning system based on the first augmented feature-set. 2 . The method of claim 1 and further comprising combining multiple values of a single feature to create at least one higher level feature having at least two clusters of higher level feature values. 3 . The method of claim 2 and further comprising selecting at least one higher level feature from a number of higher level features for a physical feature for inclusion in the first augmented feature set for training the machine learning system. 4 . The method of claim 2 wherein a feature value of each cluster is a function of a mean or median value of the feature values in the cluster. 5 . The method of claim 1 and further comprising creating high level feature values from mathematically combined knowledge features, or a group of knowledge features. 6 . The method of claim 4 wherein the mathematically combined features comprises a length and width, and wherein the length and width are multiplied to produce an area as the further feature value. 7 . The method of claim 4 wherein the high level feature values comprise numeric or nominal values. 8 . The method of claim 1 wherein the knowledge base comprises a networked knowledge base. 9 . The method of claim 1 wherein multiple feature values are combined into clusters of higher level feature values based on one or more of a Euclidean distance function, a Manhattan distance function, a Cosine distance function, or a Hamming distance function. 10 . The method of claim 1 wherein the knowledge base comprises the Internet, and wherein the original features comprise cellular phone information and the result comprises a carrier churn value. 11 . The method of claim 1 and further comprising providing an interface to select features to include in the augmented feature set. 12 . A non-transitory machine readable storage device having instructions for execution by one or more processors to perform operations comprising: receiving an original feature-set for training a machine learning system, the feature-set including multiple records each having a set of original features with original feature values and a result; querying a knowledge base based on the set of original features; receiving a set of knowledge features with knowledge feature values responsive to the querying of the knowledge base; generating a first augmented feature-set that includes the multiple records of the original feature set and the knowledge features for the multiple records; and training the machine learning system based on the first augmented feature-set. 13 . The non-transitory machine readable storage device of claim 12 wherein the operations further comprise combining multiple values of a single feature to create at least one higher level feature having at least one cluster of higher level feature values. 14 . The non-transitory machine readable storage device of claim 12 wherein multiple feature values are combined into clusters of higher level feature values based on one or more of a Euclidean distance function, a Manhattan distance function, a Cosine distance function, or a Hamming distance function to produce a further knowledge feature. 15 . The non-transitory machine readable storage device of claim 12 wherein the knowledge base comprises the Internet, and wherein the original features comprise cellular phone information and the result comprises a carrier churn value. 16 . A device comprising: a processor; and a memory device coupled to the processor and having a program stored thereon for execution by the processor to perform operations comprising: receiving an original feature-set for training a machine learning system, the feature-set including multiple records each having a set of original features with original feature values and a result; querying a knowledge base based on the set of original features; receiving a set of knowledge features with knowledge feature values responsive to the querying of the knowledge base; generating a first augmented feature-set that includes the multiple records of the original feature set and the knowledge features for the multiple records; and training the machine learning system based on the first augmented feature-set. 17 . The device of claim 16 wherein the operations further comprise combining multiple values of a single feature to create at least one higher level feature having at least one cluster of higher level feature values. 18 . The device of claim 17 wherein the multiple feature values are combined into clusters of higher level feature values based on one or more of a Euclidean distance function, a Manhattan distance function, a Cosine distance function, or a Hamming distance function to produce a further knowledge feature. 19 . The device of claim 16 wherein the operations further comprise creating high level feature values from mathematically combined knowledge features, wherein the mathematically combined features comprises a length and width, and wherein the length and width are multiplied to produce an area as the further feature value. 20 . The device of claim 16 wherein the knowledge base comprises the Internet, and wherein the original features comprise cellular phone information and the result comprises a carrier churn value.

Assignees

Inventors

Classifications

  • Interaction with lists of selectable items, e.g. menus · CPC title

  • Selection of displayed objects or displayed text elements (G06F3/0482 takes precedence) · CPC title

  • Query execution · CPC title

  • Knowledge engineering; Knowledge acquisition · CPC title

  • Knowledge representation; Symbolic representation · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2017337486A1 cover?
A method includes receiving an original feature-set for training a machine learning system, the feature-set including multiple records each having a set of original features with original feature values and a result, querying a knowledge base based on the set of original features, receiving a set of knowledge features with knowledge feature values responsive to the querying of the knowledge bas…
Who is the assignee on this patent?
Futurewei Technologies Inc
What technology area does this patent fall under?
Primary CPC classification G06N99/005. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Nov 23 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).