Active learning on biological sounds for determing presence of medical condition

US2025099067A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2025099067-A1
Application numberUS-202318372934-A
CountryUS
Kind codeA1
Filing dateSep 26, 2023
Priority dateSep 26, 2023
Publication dateMar 27, 2025
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods and systems for training an audio-based machine learning model to predict a health condition based on biological sounds emitted by a person. Audio data corresponding to biological sounds produced by the person is generated from a microphone. The audio data is segmented into a plurality of segments, each segment associated with a respective sound event. An audio-based machine learning model is executed on the plurality of segments. The audio-based machine learning model is configured to output, for each segment, a label of a medical condition and an associated a confidence score. The model is trained via active learning, in which a subset of the plurality of segments are selected based on their confidence score being below a threshold, and provided to a human for annotation.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method for training an audio-based machine learning model with active learning, the method comprising: receiving audio data corresponding to biological sounds produced by a body of a patient; segmenting the audio data into a plurality of segments; executing an audio-based machine learning model on the plurality of segments, wherein the audio-based machine learning model is configured to output, for each segment, a label of a medical condition and an associated a confidence score; storing the labels and associated confidence scores in storage; and training the audio-based machine learning model via active learning, wherein the training includes: retrieving a subset of the plurality of segments from the storage; receiving annotations from a human annotator, wherein the annotations are associated with medical conditions; and training the audio-based machine learning model based on the annotations until convergence to yield a trained sound-based machine learning model configured to output a predicted medical condition associated with input biological sounds. 2 . The method of claim 1 , wherein the segmenting is performed via a pre-trained audio neural network (PANN), and each of the plurality of segments is associated with a respective audio event corresponding to an audio-based biomarker. 3 . The method of claim 1 , wherein the plurality of segments are each associated with breathing, the predicted medical condition includes asthma. 4 . The method of claim 1 , wherein the plurality of segments are each associated with heartbeats, the predicted medical condition includes a heart murmur. 5 . The method of claim 1 , wherein the subset of the plurality of segments are selected for retrieval based upon confidence scores associated with the plurality of segments being under a threshold. 6 . The method of claim 1 , wherein the microphone is attached to a stethoscope. 7 . The method of claim 1 , wherein the audio data is a continuous stream of audio data. 8 . A system comprising: a processors; and a non-transitory memory coupled to the processor comprising instructions executable by the processor, the processor operable when executing the instructions to: receive audio data generated from a microphone and corresponding to biological sounds produced by a body of a patient; segment the audio data into a plurality of segments; execute an audio-based machine learning model on the plurality of segments, wherein the audio-based machine learning model is configured to output, for each segment, a label of a medical condition and an associated a confidence score; store the labels and associated confidence scores; and train the audio-based machine learning model via active learning, wherein the training includes: retrieving a subset of the plurality of segments; receiving annotations from a human annotator, wherein the annotations are associated with medical conditions; and training the audio-based machine learning model based on the annotations until convergence to yield a trained sound-based machine learning model configured to output a predicted medical condition. 9 . The system of claim 8 , wherein the audio data is a continuous stream of audio data. 10 . The system of claim 9 , wherein the segmenting of the audio data is performed via a pre-trained audio neural network (PANN), and each of the plurality of segments is associated with a respective audio event corresponding to an audio-based biomarker. 11 . The system of claim 8 , wherein the plurality of segments are each associated with breathing, the predicted medical condition includes asthma. 12 . The system of claim 8 , wherein the plurality of segments are each associated with heartbeats, the predicted medical condition includes a heart murmur. 13 . The system of claim 8 , wherein the subset of the plurality of segments are selected for retrieval based upon confidence scores associated with the plurality of segments being under a threshold. 14 . The system of claim 8 , wherein the microphone is attached to a stethoscope. 15 . A computer-readable non-transitory storage medium embodying software that is operable, when executed, to: receive audio data generated from a microphone and corresponding to biological sounds produced by a body of a patient; segment the audio data into a plurality of segments, execute an audio-based machine learning model on the plurality of segments, wherein the audio-based machine learning model is configured to output, for each segment, a label of a medical condition and an associated a confidence score; store the labels and associated confidence scores; and train the audio-based machine learning model via active learning, wherein the training includes: retrieving a subset of the plurality of segments; receiving annotations from a human annotator, wherein the annotations are associated with medical conditions; and training the audio-based machine learning model based on the annotations until convergence to yield a trained sound-based machine learning model configured to output a predicted medical condition. 16 . The computer-readable non-transitory storage medium of claim 15 , wherein the audio data is a continuous stream of audio data. 17 . The computer-readable non-transitory storage medium of claim 16 , wherein the segmenting of the audio data is performed via a pre-trained audio neural network (PANN), and each of the plurality of segments is associated with a respective audio event corresponding to an audio-based biomarker. 18 . The computer-readable non-transitory storage medium of claim 15 , wherein the plurality of segments are each associated with breathing, the predicted medical condition includes asthma. 19 . The computer-readable non-transitory storage medium of claim 15 , wherein the plurality of segments are each associated with heartbeats, the predicted medical condition includes a heart murmur. 20 . The computer-readable non-transitory storage medium of claim 15 , wherein the subset of the plurality of segments are selected for retrieval based upon confidence scores associated with the plurality of segments being under a threshold.

Assignees

Inventors

Classifications

  • Combinations of networks · CPC title

  • Learning methods · CPC title

  • Electric stethoscopes · CPC title

  • Evaluation by breath analysis, e.g. determination of the chemical composition of exhaled breath (A61B5/083, A61B5/091 take precedence) · CPC title

  • Active learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2025099067A1 cover?
Methods and systems for training an audio-based machine learning model to predict a health condition based on biological sounds emitted by a person. Audio data corresponding to biological sounds produced by the person is generated from a microphone. The audio data is segmented into a plurality of segments, each segment associated with a respective sound event. An audio-based machine learning mo…
Who is the assignee on this patent?
Bosch Gmbh Robert, Allegheny Singer Res Institute, Highmark Health
What technology area does this patent fall under?
Primary CPC classification A61B7/003. Mapped technology areas include Human Necessities.
When was this patent published?
Publication date Thu Mar 27 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).