Calculating expertise confidence based on content and social proximity

US10146839B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10146839-B2
Application numberUS-201414573561-A
CountryUS
Kind codeB2
Filing dateDec 17, 2014
Priority dateDec 17, 2014
Publication dateDec 4, 2018
Grant dateDec 4, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method includes executing, via a processor, a document-oriented search based on a query in an index of documents to generate a set of document results, each document associated with at least one potential expert. The method includes analyzing the document results to produce a list of potential experts. The method includes calculating an expertise score for each potential expert based on a calculated content score and metadata score for each potential expert. The method includes calculating an evidence diversity score for each potential expert. The method includes calculating a confidence score for each potential expert based on a diversity-constrained content score and a diversity-constrained metadata score for each potential expert. The method includes displaying a list of potential experts with associated confidence scores.

First claim

Opening claim text (preview).

What is claimed is: 1. A system, comprising a processor configured to: execute a document-oriented search based on a query in an index of documents to generate a set of document results, each document in the set of document results is associated with at least one potential expert; analyze the document results to produce a list of potential experts; calculate an expertise score for each potential expert based on a content score and a metadata score for each potential expert; calculate a confidence score for each potential expert based on a diversity-constrained content score and a diversity-constrained metadata score for each potential expert, wherein the diversity-constrained content score is calculated using an evidence diversity score, comprising a predetermined threshold number of different activities associated with the potential expert, and the content score for the potential expert, the diversity-constrained metadata score is calculated using the evidence diversity score and the metadata score for the potential expert, the content score is calculated based on a number of different content document types and associations associated with the potential expert, the content document types and associations are gathered by parsing websites and stored in a data repository, the metadata score is calculated based on profile-related information associated with the potential expert, and the confidence score is further calculated based on a social score, wherein the processor is configured to generate a graph of connections between the predetermined number of selected experts, the social score for each selected expert is calculated using the graph and based on a number of connections to other selected experts; and send a list of experts with associated confidence scores that are above a confidence score threshold to a client device. 2. The system of claim 1 , the processor further configured to: select a predetermined number of potential experts with expertise scores above a threshold from the list of potential experts and calculate the evidence diversity score for each selected expert; and calculate the confidence score for each selected expert using the evidence diversity score for each selected expert. 3. The system of claim 1 , wherein the confidence scores are calculated based on preconfigured thresholds. 4. The system of claim 1 . wherein the query comprising an expertise, and the confidence score is used to indicate a level of certainty in the expertise for each potential expert. 5. The system of claim 1 , wherein the list of experts is filtered according to the confidence scores and sorted by the expertise scores. 6. A computer program product for calculating confidence scores, the computer program product comprising a computer-readable storage medium having program code embodied therewith, wherein the computer readable storage medium is not a transitory signal per se, the program code executable by a processor to cause the processor to: execute, via the processor, a document-oriented search based on a query in an index of documents to generate a set of document results, each document in the set of document results is associated with at least one potential expert; analyze, via the processor, the document results to produce a list of potential experts; calculate, via the processor, an expertise score for each potential expert based on a calculated content score and metadata score for each potential expert and sorting the potential experts by the expertise scores; select, via the processor, a predetermined number of potential experts with higher expertise scores from the list of potential experts; calculate, via the processor, an evidence diversity score for each selected expert, wherein the evidence diversity score comprising a predetermined threshold number of different activities associated with the selected expert; calculate a confidence score for each potential expert based on diversity constrained content score and a diversity-constrained metadata score for each potential expert, wherein the diversity-constrained content score is calculated using the evidence diversity score and the content score for the potential expert, the diversity-constrained metadata score is calculated using the evidence diversity score and the metadata score the potential expert, the content score is calculated based on a number of different content document types and associations associate with the potential expert, the content document types and associations are gathered by parsing websites and stored in a data repository, the metadata score is calculated based on profile-related information associated with the potential expert, and the confidence score is further calculated based on a social score, wherein the processor is configured to generate a graph of connections between the predetermined number of selected experts, the social score for each selected expert is calculated using the graph and based on a number of connections to other selected experts; and send a list of experts with associated confidence scores that are above a confidence score threshold to a client device. 7. The computer program product of claim 6 , wherein the query comprising an expertise, the confidence score is used to indicate a relative level of the expertise for each selected expert. 8. The computer program product of claim 6 , the content score is calculated based on contributions including wiki entries, comments, and likes stored in a data repository.

Assignees

Inventors

Classifications

  • G06Q10/40Primary

    Business processes related to social networking or social networking services · CPC title

  • using ranking · CPC title

  • Presentation of query results · CPC title

  • Indexing; Web crawling techniques · CPC title

  • G06F16/33Primary

    Querying · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10146839B2 cover?
A method includes executing, via a processor, a document-oriented search based on a query in an index of documents to generate a set of document results, each document associated with at least one potential expert. The method includes analyzing the document results to produce a list of potential experts. The method includes calculating an expertise score for each potential expert based on a cal…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06Q10/40. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 04 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).