Inferring attributes of organizations using member graph

US10769426B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10769426-B2
Application numberUS-201514929128-A
CountryUS
Kind codeB2
Filing dateOct 30, 2015
Priority dateSep 30, 2015
Publication dateSep 8, 2020
Grant dateSep 8, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In an example embodiment, a member profile corresponding to a member of a social networking service is obtained. Usage information for the member is then obtained, and one or more member metrics are calculated based on the member profile and usage information for the corresponding member. A plurality of features are extracted from the member profile and the one or more member metrics. The plurality of features is inserted into an organization name confidence score model to obtain a confidence score for an organization name in the member profile.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising; obtaining, by a processor, a plurality of sample member profiles, each sample member profile corresponding to a sample member of a social networking service; for each of the plurality of sample member profiles: obtaining, by the processor, usage information for a corresponding sample member; calculating, by the processor, one or more member metrics based on the sample member profile and usage information for the corresponding sample member; extracting, by the processor, a first plurality of features from the sample member profile and the one or more member metrics; feeding, by the processor, the first plurality of features into a supervised machine learning organization confidence score algorithm to train an organization name confidence score model to calculate a confidence score for a particular member profile indicating a probability that an organization name in the particular member profile is accurate, the supervised machine learning organization confidence score algorithm executed by the computer to implement a supervised machine learning classifier; obtaining, by the processor, a member profile corresponding to a member of a social networking service; obtaining, by the processor, usage information for the member; calculating, by the processor, one or more member metrics based on the member profile and usage information for the corresponding member; extracting, by the processor, a second plurality of features from the member profile and the one or more member metrics; inputting, by the processor, the second plurality of features into the organization name confidence score model to obtain a confidence score for an organization name in the member profile. 2. The method of claim 1 , further comprising: based on the confidence score for the organization name in the member profile, combining a first organization record corresponding to the organization name with a second organization record. 3. The method of claim 1 , further comprising: based on the confidence score for the organization name in the member profile, adding the organization name to an organization record that is missing an organization name, the organization record having at least one field matching a field in the member profile. 4. The method of claim 1 , wherein the one or more member metrics include how frequently a member accesses the social networking service. 5. The method of claim 1 , wherein the one or more member metrics include how frequently a member updates a member profile on the social networking service. 6. The method of claim 1 , wherein the one or more member metrics include how frequently a member communicates with other members via the social networking service. 7. The method of claim 1 , wherein the first plurality of features and the second plurality of features include a power user score calculated based on the one or more member metrics. 8. The method of claim 1 , wherein the first plurality of features and the second plurality of features include a connection density score calculated on based on connections in the member profile, the connection density score indicating a level at which the member is connected to other members having member profiles in a group with the member profile. 9. The method of claim 1 , wherein the first plurality of features and the second plurality of features include an email connection density score calculated based on an email address in the member profile, the connection density score indicating a level at which the email address contains a domain that is shared with email addresses of other members having member profiles in a group with the member profile. 10. A system comprising: a processor; a computer-readable medium having instructions stored there on, which, when executed by the processor, cause the system to perform operations comprising: obtaining a plurality of sample member profiles, each sample member profile corresponding to a sample member of a social networking service; for each of the plurality of sample member profiles: obtaining usage information for a corresponding sample member; calculating one or more member metrics based on the sample member profile and usage information for the corresponding sample member; extracting a first plurality of features from the sample member profile and the one or more member metrics; feeding, using a computer, the first plurality of features into a supervised machine learning organization confidence score algorithm to train an organization name confidence score model to calculate a confidence score for a particular member profile indicating a probability that an organization name in the particular member profile is accurate, the supervised machine learning organization confidence score algorithm executed by the computer to implement a supervised machine learning classifier; obtaining a member profile corresponding to a member of a social networking service; obtaining usage information for the member; calculating one or more member metrics based on the member profile and usage information for the corresponding member; extracting a second plurality of features from the member profile and the one or more member metrics; inputting the second plurality of features into the organization name confidence score model to obtain a confidence score for an organization name in the member profile. 11. The system of claim 10 , wherein the operations further comprise: based on the confidence score for the organization name in the member profile, combining a first organization record corresponding to the organization name with a second organization record. 12. The system of claim 10 , wherein the operations further comprise: based on the confidence score for the organization name in the member profile, adding the organization name to an organization record that is missing an organization name, the organization record having at least one field matching a field in the member profile. 13. The system of claim 10 , wherein the one or more member metrics include how frequently a member accesses the social networking service. 14. The system of claim 10 , wherein the one or more member metrics include how frequently a member updates a member profile on the social networking service. 15. The system of claim 10 , wherein the one or more member metrics include how frequently a member communicates with other members via the social networking service. 16. The system of claim 10 , wherein the first plurality of features and the second plurality of features include a power user score calculated based on the one or more member metrics. 17. The system of claim 10 , wherein the first plurality of features and the second plurality of features include a connection density score calculated based on connections in the member profile, the connection density score indicating a level at which the member is connected to other members having member profiles in a group with the member profile. 18. The system of claim 10 , wherein the first plurality of features and the second plurality of features include an email connection density score calculated on based on an email address in the member profile, the connection density score indicating a level at which the email address contains a domain that is shared with email addresses of other members having member profiles in a group with the member profile. 19. A non-transitory machine-readable storage medium comprising instructions, which when implemented by one or more ma

Assignees

Inventors

Classifications

  • G06F16/215Primary

    Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors · CPC title

  • Active pattern learning · CPC title

  • Classification techniques · CPC title

  • Obtaining sets of training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

  • Proximity, similarity or dissimilarity measures · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10769426B2 cover?
In an example embodiment, a member profile corresponding to a member of a social networking service is obtained. Usage information for the member is then obtained, and one or more member metrics are calculated based on the member profile and usage information for the corresponding member. A plurality of features are extracted from the member profile and the one or more member metrics. The plura…
Who is the assignee on this patent?
Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification G06F16/215. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 08 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).