What technology area does this patent fall under?

Primary CPC classification G10L25/87. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Feb 21 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Automated verbal fluency assessment

US9576593B2 · US · B2

Patent metadata
Field	Value
Publication number	US-9576593-B2
Application number	US-201314379654-A
Country	US
Kind code	B2
Filing date	Mar 14, 2013
Priority date	Mar 15, 2012
Publication date	Feb 21, 2017
Grant date	Feb 21, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques are described for calculating one or more verbal fluency scores for a person. An example method includes classifying, by a computing device, samples of audio data of speech of a person, based on amplitudes of the samples, into a first class of samples including speech or sound and a second class of samples including silence. The method further includes analyzing the first class of samples to determine a number of words spoken by the person, and calculating a verbal fluency score for the person based at least in part on the determined number of words spoken by the person.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: obtaining, by a computing device via a speech analyzer, a waveform representing a digital recording of audio speech of a person, the speech analyzer comprising at least one of a microphone, an interface to a sound recorder, an interface to a database, or an interface to a data storage system; measuring, by the computing device, amplitudes of waves within the waveform, the waves corresponding to samples of the digital recording of the audio speech of the person; classifying, by a silence detector of the computing device, the samples of the digital recording of the audio data of the speech of the person, based on the measured amplitudes of the samples and on a silence threshold, into a first class of samples including speech or sound and a second class of samples including silence, wherein classifying the samples comprises: sorting, by the silence detector, the samples of the audio data in an order defined by the amplitudes of the samples of the audio data; determining, by the silence detector, the silence threshold based on the amplitudes of the samples of the audio data, wherein determining the silence threshold comprises: calculating, by the silence detector, linear regressions of the sorted samples in the sorted order; and determining, by the silence detector, the silence threshold as the amplitude of one of the samples for which a slope of the calculated linear regression exceeds a predetermined value; classifying, by the silence detector, samples having amplitudes above the silence threshold as belonging to the first class; and classifying, by the silence detector, samples having amplitudes below the silence threshold as belonging to the second class; analyzing, by the computing device, the first class of samples to determine a number of words spoken by the person; calculating, by the computing device, a verbal fluency score for the person based at least in part on the determined number of words spoken by the person, and outputting, by the computing device, the verbal fluency score. 2. The method of claim 1 , wherein the predetermined value comprises −0.2. 3. The method of claim 1 , wherein analyzing the first class of samples comprises determining a first subset of samples of the first class including speech sound and a second subset of samples of the first class including non-speech sound. 4. The method of claim 3 , further comprising determining the number of words as a number of contiguous samples in the audio data belonging to the first subset that start with a sample above the silence threshold and end with a sample below the silence threshold. 5. The method of claim 3 , wherein determining the first subset and the second subset comprises: classifying contiguous samples in the first class of the audio data for which a fundamental frequency can be calculated as belonging to the first subset; and classifying contiguous samples in the first class of the audio data for which a fundamental frequency cannot be calculated as belonging to the second subset. 6. The method of claim 1 , further comprising: determining a number of pauses as a number of contiguous samples in the second class that start with a sample below the silence threshold and end with a sample below the silence threshold. 7. The method of claim 6 , further comprising: measuring a duration associated with each pause of the pauses; calculating an average duration comprising a mean value of the measured durations; and calculating a standard deviation of the measured durations from the average duration. 8. The method of claim 1 , further comprising: classifying, by the computing device, second samples of second audio data of speech of the person, based on second amplitudes of the second samples and on a second silence threshold, into the first class and the second class; calculating a second verbal fluency score based at least in part on the number of words spoken by the person; and calculating a learning score based at least in part on a change from the verbal fluency score to the second verbal fluency score. 9. The method of claim 8 , wherein the silence threshold and the second silence threshold comprise equal values. 10. The method of claim 8 , wherein calculating the learning score further comprises: plotting at least the verbal fluency score and the second verbal fluency score on a graph; and calculating a slope associated with the graph. 11. The method of claim 1 , further comprising: receiving the samples as at least a portion of a verbal fluency test of the person. 12. The method of claim 11 , further comprising: outputting the verbal fluency score. 13. The method of claim 1 , wherein analyzing the first class of samples comprises excluding non-speech sounds in the first class of samples from the number of words spoken by the person, comprising: calculating an average duration of the samples in the first class of samples; calculating a standard deviation of durations of the samples in the first class of samples; and classifying samples having durations that deviate from the average duration by at least one standard deviation as non-speech sounds. 14. The device of claim 13 , wherein to analyze the first class of samples, the one or more processors are configured to exclude non-speech sounds in the first class of samples from the number of words spoken by the person, and wherein to exclude the non-speech sounds, the one or more processors are configured to: calculate an average duration of the samples in the first class of samples; calculate a standard deviation of durations of the samples in the first class of samples; and classify samples having durations that deviate from the average duration by at least one standard deviation as non-speech sounds. 15. A device comprising: a memory storing instructions defining at least a silence detector; a speech analyzer comprising at least one of a microphone, an interface to a sound recorder, an interface to a database, or an interface to a data storage system, wherein the speech analyzer is configured to obtain a waveform representing a digital recording of audio speech of a person; one or more processors configured to execute the instructions, wherein execution of the instructions causes the one or more processors to: measure amplitudes of waves within the waveform, the waves corresponding to samples of the digital recording of the audio speech of the person; execute the silence detector to classify the samples of the digital recording of the audio data of the speech of the person, based on the measured amplitudes of the samples and on a silence threshold, into a first class of samples including speech or sound and a second class of samples including silence, wherein to classify the samples, the silence detector is configured to: sort the samples of the audio data in an order defined by the amplitudes of the samples of the audio data; determine the silence threshold based on the amplitudes of the samples of the audio data, wherein to determine the silence threshold, the silence detector is configured to: calculate linear regressions of the sorted samples in the sorted order; and determine the silence threshold as the amplitude of one of the samples for which a slope of the calculated linear regression exceeds a predetermined value; classify samples having amplitudes above the silence threshold as belonging to the first class; and classify samples having amplitudes below the silence threshold as belonging to the second class; analyze the first class of samples to determine a number of words sp

Assignees

Univ Minnesota

Inventors

Classifications

G10L25/78
Detection of presence or absence of voice signals (switching of direction of transmission by voice frequency in two-way loud-speaking telephone systems H04M9/10) · CPC title
G10L25/87Primary
Detection of discrete points within a voice signal · CPC title
G10L15/02
Feature extraction for speech recognition; Selection of recognition unit · CPC title
G10L15/08
Speech classification or search · CPC title
G10L25/66
for extracting parameters related to health condition (detecting or measuring for diagnostic purposes A61B5/00) · CPC title

Patent family

Related publications grouped by family.

View patent family 48048205

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9576593B2 cover?: Techniques are described for calculating one or more verbal fluency scores for a person. An example method includes classifying, by a computing device, samples of audio data of speech of a person, based on amplitudes of the samples, into a first class of samples including speech or sound and a second class of samples including silence. The method further includes analyzing the first class of sa…
Who is the assignee on this patent?: Univ Minnesota
What technology area does this patent fall under?: Primary CPC classification G10L25/87. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Feb 21 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).