Optimizing speech to text conversion and text summarization using a medical provider workflow model

US11094322B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11094322-B2
Application numberUS-201916269795-A
CountryUS
Kind codeB2
Filing dateFeb 7, 2019
Priority dateFeb 7, 2019
Publication dateAug 17, 2021
Grant dateAug 17, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method, a system, and a computer program product are provided. Speech signals from a medical conversation between a medical provider and a patient are converted to text based on a first domain model associated with a medical scenario. The first domain model is selected from multiple domain models associated with a workflow of the medical provider. One or more triggers are detected, each of which indicates a respective change in the medical scenario. A corresponding second domain model is applied to the medical conversation to more accurately convert the speech signals to text in response to each of the detected one or more triggers. The corresponding second domain model is associated with a respective change in the medical scenario of the workflow of the medical provider. A clinical note is provided based on the text produced by converting the speech signals.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method of processing a medical conversation comprising: converting to text, via a processor, speech signals from the medical conversation between a medical provider and a patient for a workflow of the medical provider based on a first domain model of a plurality of domain models, wherein the workflow includes a plurality of medical scenarios and the plurality of domain models are each trained for a corresponding medical scenario of the workflow, and wherein the first domain model is associated with a current medical scenario of the medical conversation; detecting, via the processor during the converting to text, one or more triggers occurring during the medical conversation, each of the one or more triggers indicating a change to a different medical scenario of the workflow within the medical conversation; in response to each of the detected one or more triggers, applying, via the processor during the converting to text, a corresponding second domain model of the plurality of domain models to the speech signals of the medical conversation pertaining to the different medical scenario indicated by the detected trigger to convert the speech signals pertaining to the different medical scenario indicated by the detected trigger from the medical conversation to text, wherein the corresponding second domain model is trained for the different medical scenario indicated by the detected trigger; and providing, via the processor, a clinical note based on the text produced from the speech signals of the medical conversation. 2. The method of claim 1 , further comprising storing the speech signals from the medical conversation, wherein the applying the corresponding second domain model further comprises: applying a plurality of second domain models to the stored speech signals based on the detected one or more triggers to convert the stored speech signals to text. 3. The method of claim 2 , wherein the detecting of the one or more triggers is based on recognizing at least one of certain words, certain phrases and certain combinations of words in the medical conversation. 4. The method of claim 1 , further comprising: learning, via the processor, one or more domain models based on previous interactions between a medical provider and a patient. 5. The method of claim 1 , wherein the plurality of domain models includes domain models at different levels of granularity for corresponding medical scenarios. 6. The method of claim 1 , wherein the plurality of domain models includes a domain model for use with a plurality of different medical scenarios. 7. The method of claim 1 , further comprising: automatically performing, via the processor, an action included in the clinical note. 8. A system for processing a medical conversation, the system comprising: at least one processor; and at least one memory connected to the at least one processor, the at least one processor being configured to: convert to text speech signals from the medical conversation between a medical provider and a patient for a workflow of the medical provider based on a first domain model of a plurality of domain models, wherein the workflow includes a plurality of medical scenarios and the plurality of domain models are each trained for a corresponding medical scenario of the workflow, and wherein the first domain model is associated with a current medical scenario of the medical conversation; detect, during the converting to text, one or more triggers occurring during the medical conversation, each of the one or more triggers indicating a change to a different medical scenario of the workflow within the medical conversation; in response to each of the detected one or more triggers, apply during the converting to text a corresponding second domain model of the plurality of domain models to the speech signals of the medical conversation pertaining to the different medical scenario indicated by the detected trigger to convert the speech signals pertaining to the different medical scenario indicated by the detected trigger from the medical conversation to text, wherein the corresponding second domain model is trained for the different in the medical scenario indicated by the detected trigger; and provide a clinical note based on the text produced from the speech signals of the medical conversation. 9. The system of claim 8 , wherein the at least one processer is further configured to: store the speech signals from the medical conversation, wherein the at least one processor being configured to apply the corresponding second domain model further comprises the at least one processor being configured to: apply a plurality of second domain models to the stored speech signals based on the detected one or more triggers to convert the stored speech signals to text. 10. The system of claim 8 , wherein the detecting of the one or more triggers is based on at least one of recognizing one or more certain words in the medical conversation and receiving signals from at least one sensor associated with a medical device. 11. The system of claim 8 , wherein the at least one processor is further configured to: learn one or more domain models based on previous interactions between a medical provider and a patient. 12. The system of claim 8 , wherein the plurality of domain models includes domain models at different levels of granularity for corresponding medical scenarios. 13. The system of claim 8 , wherein the plurality of domain models includes a domain model for use with a plurality of different medical scenarios. 14. The system of claim 8 , wherein the clinical note is arranged according to a subjective, objective, assessment and plan format. 15. A computer program product for processing a medical conversation, the computer program product comprising at least one computer readable storage medium having computer readable program code embodied therewith for execution on at least one processor of a computing device, the computer readable program code being configured to: convert to text speech signals from the medical conversation between a medical provider and a patient for a workflow of the medical provider based on a first domain model of a plurality of domain models, wherein the workflow includes a plurality of medical scenarios and the plurality of domain models are each trained for a corresponding medical scenario of the workflow, and wherein the first domain model is associated with a current medical scenario of the medical conversation; detect, during the converting to text, one or more triggers occurring during the medical conversation, each of the one or more triggers indicating a change to a different medical scenario of the workflow within the medical conversation; in response to each of the detected one or more triggers, apply during the converting to text a corresponding second domain model of the plurality of domain models to the speech signals of the medical conversation pertaining to the different medical scenario indicated by the detected trigger to convert the speech signals pertaining to the different medical scenario indicated by the detected trigger from the medical conversation to text, wherein the corresponding second domain model is trained for the different medical scenario indicated by the detected trigger; and provide a clinical note based on the text produced from the speech signals of the medical conversation. 16. The computer program product of claim 15 , wherein the computer readable program code is further configured to: store the speech signals from the medical conversation, wh

Assignees

Inventors

Classifications

  • G10L15/22Primary

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • relating to drugs or medications, e.g. for ensuring correct administration to patients · CPC title

  • G10L15/183Primary

    using context dependencies, e.g. language models · CPC title

  • for data related to laboratory analysis, e.g. patient specimen analysis · CPC title

  • Semantic analysis · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11094322B2 cover?
A method, a system, and a computer program product are provided. Speech signals from a medical conversation between a medical provider and a patient are converted to text based on a first domain model associated with a medical scenario. The first domain model is selected from multiple domain models associated with a workflow of the medical provider. One or more triggers are detected, each of wh…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 17 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).