Unintended bias detection in conversational agent platforms with machine learning model

US11270080B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11270080-B2
Application numberUS-202016743661-A
CountryUS
Kind codeB2
Filing dateJan 15, 2020
Priority dateJan 15, 2020
Publication dateMar 8, 2022
Grant dateMar 8, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A mechanism is provided for implementing a bias detection mechanism that mitigates unintended bias in a conversational agent by leveraging conversational agent definitions, a conversational agent chat logs, and user satisfaction statistics. One or more protected attributes are identified within an utterance from the conversational agent chat logs. Using the identified protected attributes, a replacement utterance with a replacement term is generated for at least one of the identified protected attributes in the utterance. A score is generated for the utterance and the replacement utterance using utterance level relative term importance for protected attributes and regular terms in the utterance and the replacement utterance. Utilizing the scoring, a determination is made as to whether unintended bias exists within the utterance. Responsive to unintended bias being detected, an action is implemented that causes a change to a machine learning model used by the conversational agent.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, in a data processing system, for comprising at least one processor and at least one memory, wherein the at least one memory comprises instructions that are executed by the at least one processor to configure the at least one processor to implement a bias detection mechanism that mitigates unintended bias in a conversational agent, the method comprising: identifying one or more protected attributes within a set of original utterances from the conversational agent chat logs; for each utterance in the set of original utterances, generating a replacement utterance with a replacement term for at least one of the identified protected attributes in the utterance to form a set of replacement utterances; determining a first subset of utterances where protected attributes exist and result in change in intents between the set of original utterances and the set of replacement utterances: determining a second subset of utterances from the first subset of utterances where protected attributes exist and result in change in confidences between the set of original utterances and the set of replacement utterances; and responsive to the change in confidences being above a predetermined threshold, implementing an action that causes a change to a machine learning model used by the conversational agent. 2. The method of claim 1 , wherein identifying the one or more protected attributes comprises: using regular expression matching or dictionary matching to identify commonly defined protected attribute types. 3. The method of claim 1 , wherein identifying the one or more protected attributes comprises: using machine learning based Named Entity Recognition (NER). 4. The method of claim 1 , wherein generating the replacement utterance with the replacement term for the at least one of the identified protected attributes comprises: generating the replacement term that replaces the at least one identified protected attribute with a common equivalent. 5. The method of claim 1 , wherein generating the replacement utterance with the replacement term for the at least one of the identified protected attributes comprises: generating the replacement term that replaces the at least one identified protected attribute with a random equivalent from a same identity type. 6. The method of claim 1 , further comprising: determining a percentage of utterances within the conversation agent chat logs that have protected attributes that are relatively important, wherein determining the percentage of utterances within the conversation agent chat logs that have protected attributes that are relatively important comprises: detecting a prevalence of utterances with protected attributes for a specific conversational agent; for those utterances with protected attributes associated with the specific conversational agent, detecting a prevalence of utterances with protected attributes with relative importance above a predetermined threshold; and responsive to detecting a subset of utterances with a relative importance above the predetermined threshold, identifying unintended bias within the subset of utterances. 7. The method of claim 1 , further comprising: aggregating utterances with protected attributes detected for the conversational agent with relative term importance analysis both with and without protected attribute replacement. 8. The method of claim 1 , further comprising: segmenting utterances where protected attributes are detected from utterances Where protected attributes are not detected for the conversational agent; calculating user satisfaction rates on chat sessions where protected attributes are detected versus chat sessions where protected attributes are not detected for the conversational agent; calculating user satisfaction rates on chat sessions where protected attributes are detected and are considered relatively important versus chat sessions where protected attributes are not detected or chat sessions where protected attributes are detected but are not considered relatively important as determined by the protected attribute analyzer; and responsive to determining a drop in satisfaction being greater than or equal to a predetermined threshold, identifying unintended bias. 9. The method of claim 1 , further comprising: generating a score for the utterance and the replacement utterance using utterance level relative term importance for protected attributes and regular terms in the utterance and the replacement utterance: aggregating statistics from the scoring for the utterance and the replacement utterance; and responsive to statistics associated with the utterance resulting in a significant score change between the utterance and the replacement utterance, identifying unintended bias. 10. The method of claim 1 , further comprising: utilizing a set of weights for the protected attributes in the utterance, re-weighing the protected attributes determined to be relatively more important for the conversational agent definition; and responsive to determining the re-weighing change improves protected attribute detection, identifying unintended bias. 11. A computer program product comprising a computer readable storage medium having a computer readable program stored therein, wherein the computer readable program, when executed on a data processing system, causes the data processing system to implement a bias detection mechanism that mitigates unintended bias in a conversational agent, wherein the computer readable program further causes the data processing system to: identify one or more protected attributes within a set of original utterances from the conversational agent chat logs; for each utterance in the set of original utterances, generate a replacement utterance with a replacement term for at least one of the identified protected attributes in the utterance to form a set of replacement utterances; determine a first subset of utterances where protected attributes exist and result in change in intents between the set of original utterances and the set of replacement utterances; determine a second subset of utterances from the first subset of utterances where protected attributes exist and result in change in confidences between the set of original utterances and the set of replacement utterances; and responsive to the change in confidences being above a predetermined threshold, implement an action that causes a change to a machine learning model used by the conversational agent. 12. The computer program product of claim 11 , wherein the computer readable program further causes the data processing system to: use regular expression matching or dictionary matching to identify commonly defined protected attribute types; or use machine learning based Named Entity Recognition (NER). 13. The computer program product of claim 11 , wherein the computer readable program further causes the data processing system to: generate the replacement term that replaces the at least one identified protected attribute with a common equivalent. 14. The computer program product of claim 11 , wherein the computer readable program further causes the data processing system to: generate the replacement term that replaces the at least one identified protected attribute with a random equivalent from a same identity type. 15. The computer program product of claim 11 , wherein the computer readable program further causes the data processing system to: determine a percentage of utterances within the conversation agent chat logs that have protected attributes that are relatively important, wherein the comp

Assignees

Inventors

Classifications

  • G06F40/30Primary

    Semantic analysis · CPC title

  • G06F40/35Primary

    Discourse or dialogue representation · CPC title

  • Natural language query formulation or dialogue systems · CPC title

  • Phrasal analysis, e.g. finite state techniques or chunking · CPC title

  • Query formulation · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11270080B2 cover?
A mechanism is provided for implementing a bias detection mechanism that mitigates unintended bias in a conversational agent by leveraging conversational agent definitions, a conversational agent chat logs, and user satisfaction statistics. One or more protected attributes are identified within an utterance from the conversational agent chat logs. Using the identified protected attributes, a re…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F40/30. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 08 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).