Real-time annotation of symptoms in telemedicine

US2019328300A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2019328300-A1
Application numberUS-201815964542-A
CountryUS
Kind codeA1
Filing dateApr 27, 2018
Priority dateApr 27, 2018
Publication dateOct 31, 2019
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A teleconferencing system includes a first terminal configured to acquire an audio signal and a video signal. A teleconferencing server in communication with the first terminal and a second terminal is configured to receive the video signal and the audio signal from the first terminal, in real-time, and transmit the video signal and the audio signal to the second terminal. A symptom recognition server in communication with the first terminal and the teleconferencing server is configured to receive the video signal and the audio signal from the first terminal, asynchronously, analyze the video signal and the audio signal to detect one or more indicia of illness, generate a diagnostic alert on detecting the one or more indicia of illness, and transmit the diagnostic alert to the teleconferencing server for display on the second terminal.

First claim

Opening claim text (preview).

1 . A teleconferencing system, comprising: a first terminal including a camera and a microphone configured to acquire an audio signal and a high-quality: video signal and convert the acquired high-quality video signal into a low-quality video signal of a Nitrate that is less than a hit rate of the high-quality video signal; a teleconferencing server in communication with the first terminal and a second terminal and configured to receive the low-quality video signal and the audio signal from the first terminal, in real-time and transmit the low-quality video signal and the audio signal to the second terminal; and a symptom recognition server in communication with the first terminal and the teleconferencing server and configured to receive the high-quality video signal and the audio signal from the first terminal, asynchronously, analyze the high-quality video signal and the audio signal to detect one or more indicia of illness, generate a diagnostic alert on detecting the one or more indicia of illness, and transmit the diagnostic alert to the teleconferencing server for display on the second terminal. 2 . The system of claim 1 , wherein the symptom recognition server is configured to detect the indicia of illness from the high-quality video signal and the audio signal using multimodal recurrent neural networks. 3 . The system of claim 2 , wherein the symptom recognition server is configured to detect the indicia of illness from the high-quality video signal by: detecting a face from the high-quality video signal; extracting action units from the detected face; detecting landmarks from the detected face; tracking the detected landmarks; performing semantic feature extraction using the tracked landmarks; and using the multimodal recurrent neural networks to detect the indicia of illness from the detected face, extracted action units, tracked landmarks, and extracted semantic features. 4 . The system of claim 2 , wherein the symptom recognition server is configured to detect the indicia of illness from the high-quality video signal by: detecting a body posture from the high-quality video signal; tracking head movements from the high-quality video signal; and using the multimodal recurrent neural networks to detect the indicia of illness from the detected body posture and tracked head movements. 5 . The system of claim 2 , wherein the symptom recognition server is configured to detect the indicia of illness from the audio signal by: detecting tone features from the audio signal; transcribing the audio signal to generate a transcription; performing natural language processing on the transcription; performing semantic analysis on the transcription; performing language structure extraction on the transcription; and using the recurrent neural networks to detect the indicia of illness from the detected tone features, the transcription, the results of the natural language processing, the results of the semantic analysis, and the results of the language structure extraction. 6 . The system of claim 1 , wherein the first terminal is configured to convert the video signal into a low-quality video signal of less bitrate by reducing a resolution of the high-quality signal, by reducing a framerate of the high-quality signal, or by compressing the high-quality signal. 7 . The system of claim 1 , wherein the symptom recognition server is part of or locally connected to the first terminal. 8 . The system of claim 1 , wherein the teleconferencing server is in communication with the first terminal and the second terminal over the Internet or another wide-area network. 9 . The system of claim 1 , wherein the second terminal is configured to display the low-quality video signal as part of a teleconference and the teleconferencing server is configured to overlay the diagnostic alert on the display of the second terminal. 10 . The system of claim 9 , wherein the teleconferencing server is configured to overlay the diagnostic alert on the display of the second terminal in the form of a textual alert. 11 . The system of claim 9 , wherein the teleconferencing server is configured to overlay the diagnostic alert on the display of the second terminal in the form of a graphic element that highlights or emphasizes a part of a face or body that the indicia of illness are based on. 12 . The system of claim 9 , wherein the teleconferencing server is configured to overlay the diagnostic alert on the display of the second terminal in the form of an annotation, highlighting, or other marking on a textual transcription of the audio signal. 13 . The system of claim 9 , wherein the teleconferencing server is configured to overlay the diagnostic alert on the display of the second terminal in the form of a picture-in-picture element that includes a replaying of a portion of the high-quality video signal that the indicia of illness are based on. 14 . A method for teleconferencing, comprising: acquiring an audio signal and a video signal from a first terminal; transmitting the video signal and the audio signal to a teleconferencing server in communication with the first terminal and a second terminal; transmitting the video signal and the audio signal to a symptom recognition server in communication with the first terminal and the teleconferencing server; detecting indicia of illness from the video signal and the audio signal using multimodal recurrent neural networks; generating a diagnostic alert for the detected indicia illness; annotating the video signal with the diagnostic alert; and displaying the annotated video signal on the second terminal. 15 . The method of claim 14 , wherein detecting the indicia of illness from the video signal comprises: detecting a face from the video signal; extracting action units from the detected face; detecting landmarks from the detected face; tracking the detected landmarks; performing semantic feature extraction using the tracked landmarks; and using the multimodal recurrent neural networks to detect the indicia of illness from the detected face, extracted action units, tracked landmarks, and extracted semantic features. 16 . The method of claim 14 , wherein detecting the indicia of illness from the video signal comprises: detecting a body posture from the video signal; tracking head movements from the video signal; and using the multimodal recurrent neural networks to detect the indicia of illness from the detected body posture and tracked head movements. 17 . The method of claim 14 , wherein detecting the indicia of illness from the audio signal comprises; detecting tone features from the audio signal; transcribing the audio signal to generate a transcription; performing natural language processing on the transcription; performing semantic analysis on the transcription; performing language structure extraction on the transcription; and using the recurrent neural networks to detect the indicia of illness from the detected tone features, the transcription, the results of the natural language processing, the results of the semantic analysis, and the results of the language structure extraction. 18 . The method of claim 14 , wherein a bit rate of the video signal is reduced prior to transmitting the video signal to the symptom recognition server. 19 . The method of claim 14 , wherein the video signal is up-sampled prior to detecting the indicia of illness from the video signal. 20 . A computer program product for detecting indicia of illness

Assignees

Inventors

Classifications

  • for computer-aided diagnosis, e.g. based on medical expert systems · CPC title

  • for remote operation · CPC title

  • ICT specially adapted for facilitating communication between medical practitioners or patients, e.g. for collaborative diagnosis, therapy or health monitoring · CPC title

  • for processing medical images, e.g. editing · CPC title

  • Semantic analysis · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2019328300A1 cover?
A teleconferencing system includes a first terminal configured to acquire an audio signal and a video signal. A teleconferencing server in communication with the first terminal and a second terminal is configured to receive the video signal and the audio signal from the first terminal, in real-time, and transmit the video signal and the audio signal to the second terminal. A symptom recognition…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification A61B5/0022. Mapped technology areas include Human Necessities.
When was this patent published?
Publication date Thu Oct 31 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).