Cluster analysis of participant responses for test generation or teaching

US10388177B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10388177-B2
Application numberUS-201313871627-A
CountryUS
Kind codeB2
Filing dateApr 26, 2013
Priority dateApr 27, 2012
Publication dateAug 20, 2019
Grant dateAug 20, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Textual responses to open-ended (i.e., free-response) items provided by participants (e.g., by means of mobile wireless devices) are automatically classified, enabling an instructor to assess the responses in a convenient, organized fashion and adjust instruction accordingly.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of determining a plurality of textual responses to generate a future test option to correspond with a response to a free response item, the method comprising the steps of: determining, at a server computer, one or more participant devices and one or more instructor devices, wherein the one or more participant devices and the one or more instructor devices are geographically dispersed and communicate via a network, wherein each of the one or more participant devices is operated by a respective participant and each respective participant is associated with a respective characteristic; receiving, at the server computer, the free response item from one of the one or more instructor devices; receiving, at the server computer, the plurality of textual responses from the one or more participant devices; removing punctuation from the plurality of textual responses; translating into a numerical dataset the plurality of textual responses resulting from the removing punctuation step; applying, at the server computer, a clustering algorithm to the numerical dataset to generate one or more clusters, wherein the clustering algorithm is based at least in part on the respective characteristic; generating the one or more clusters from the clustering algorithm; determining a single cluster as a weighted average of the one or more clusters, wherein the determined single cluster is used to generate the future test option based at least in part on the plurality of textual responses and the determined single cluster; and wirelessly providing the future test option by the server computer to the one or more instructor devices. 2. The method of claim 1 , further comprising: after generating the one or more clusters from the clustering algorithm, determining a distance metric associated with the one or more clusters to identify similarities between the one or more clusters, wherein the one or more clusters include at least one incorrect answer to the free response item. 3. The method of claim 2 , wherein the distance metric analyzes a number of the plurality of textual responses not found in the same cluster. 4. The method of claim 2 , wherein the distance metric is associated with a number between zero and log(a number of the one or more clusters). 5. The method of claim 2 , further comprising: ranking the one or more clusters based in part on the distance metric; and transmitting the ranked clusters to the one or more instructor devices. 6. The method of claim 1 , wherein the plurality of textual responses are translated to the numerical dataset by: stemming one or more words in the plurality of textual responses; computing 1, 2, 3, 4, and 5-grams; removing stemmed n-grams that appear in fewer than 1-percent or more than 99-percent of the plurality of textual responses; and generating the stemmed n-grams as a vector. 7. The method of claim 1 , further comprising: removing a second cluster from the one or more clusters, wherein the distance metric associated with the second cluster exceeds a threshold. 8. The method of claim 1 , further comprising: storing, by the server computer, participant device information in a participant database. 9. The method of claim 1 , wherein four clusters are generated by the clustering algorithm to the numerical dataset. 10. One or more non-transitory machine-readable medium having machine-executable instructions configured to perform the machine-implementable method for: determining, at a server computer, one or more participant devices and one or more instructor devices, wherein the one or more participant devices and the one or more instructor devices are geographically dispersed and communicate via a network, wherein each of the one or more participant devices is operated by a respective participant and each respective participant is associated with a respective characteristic; receiving, at the server computer, a free response item from one of the one or more instructor devices; receiving, at the server computer, the plurality of textual responses from the one or more participant devices; removing punctuation from the plurality of textual responses; translating into a numerical dataset the plurality of textual responses resulting from the removing punctuation step; applying, at the server computer, a clustering algorithm to the numerical dataset to generate one or more clusters, wherein the clustering algorithm is based at least in part on the respective characteristic; generating the one or more clusters from the clustering algorithm; determining a single cluster as a weighted average of the one or more clusters, wherein the determined single cluster is used to generate the future test option based at least in part on the plurality of textual responses and the determined single cluster; and wirelessly providing the future test option by the server computer to the one or more instructor devices. 11. The one or more non-transitory machine-readable medium of claim 10 , wherein the method further comprises: after generating the one or more clusters from the clustering algorithm, determining a distance metric associated with the one or more clusters to identify similarities between the one or more clusters, wherein the one or more clusters include at least one incorrect answer to the free response item. 12. The one or more non-transitory machine-readable medium of claim 11 , wherein the distance metric analyzes a number of the plurality of textual responses not found in the same cluster. 13. The one or more non-transitory machine-readable medium of claim 11 , wherein the distance metric is associated with a number between zero and log(a number of the one or more clusters). 14. The one or more non-transitory machine-readable medium of claim 11 , wherein the method further comprises: ranking the one or more clusters based in part on the distance metric; and transmitting the ranked clusters to the one or more instructor devices. 15. The one or more non-transitory machine-readable medium of claim 10 , wherein the plurality of textual responses are translated to the numerical dataset by: stemming one or more words in the plurality of textual responses; computing 1, 2, 3, 4, and 5-grams; removing stemmed n-grams that appear in fewer than 1-percent or more than 99-percent of the plurality of textual responses; and generating the stemmed n-grams as a vector. 16. A computer system for determining a plurality of textual responses to generate a future test option to correspond with a response to a free response item, the system comprising: a processor; and one or memories coupled with said one or more processors, wherein the one or more processors and one or more memories are configured to; determine, at a server computer, one or more participant devices and one or more instructor devices, wherein the one or more participant devices and the one or more instructor devices are geographically dispersed and communicate via a network, wherein each of the one or more participant devices is operated by a respective participant and each respective participant is associated with a respective characteristic; receive, at the server computer, the free response item from one of the one or more instructor devices; receive, at the server computer, the plurality of textual responses from the one or more participant devices; remove punctuation from the plurality of textual responses; translate into a numerical dataset the plurality of textual responses resulting from the remove punctuation step; apply, at the server computer, a

Assignees

Inventors

Classifications

  • G09B7/00Primary

    Electrically-operated teaching apparatus or devices working with questions and answers (mechanically operated G09B3/00; computing arrangements G06F) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10388177B2 cover?
Textual responses to open-ended (i.e., free-response) items provided by participants (e.g., by means of mobile wireless devices) are automatically classified, enabling an instructor to assess the responses in a convenient, organized fashion and adjust instruction accordingly.
Who is the assignee on this patent?
Harvard College
What technology area does this patent fall under?
Primary CPC classification G09B7/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 20 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).