Intelligent voice interaction method, device and computer readable storage medium

US11189183B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11189183-B2
Application numberUS-201916566099-A
CountryUS
Kind codeB2
Filing dateSep 10, 2019
Priority dateOct 25, 2018
Publication dateNov 30, 2021
Grant dateNov 30, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments of the present disclosure provide a method, a device, and a computer readable storage medium for intelligent voice interaction. The method includes: obtaining text information or drawing information input by a user on a digitizer tablet; identifying the text information or the drawing information to obtain an identification result; and transmitting audio information corresponding to the identification result to the digitizer tablet, enabling the digitizer tablet to play the audio information. According to the embodiments of the present disclosure, the digitizer tablet performs an intelligent voice interaction with the user according to the text information or the drawing information input by the user when the user practices calligraphy or drawing, thus increasing an interactivity between the user, especially a child, with the digitizer tablet, and enhancing the child's interest in learning.

First claim

Opening claim text (preview).

What is claimed is: 1. An intelligent voice interaction method, comprising: obtaining text information or drawing information input by a user on a digitizer tablet; identifying the text information or the drawing information to obtain an identification result; and transmitting audio information corresponding to the identification result to the digitizer tablet, enabling the digitizer tablet to play the audio information, wherein the audio information comprises whether a stroke order of a character written by the user is correct, or whether an object drawn by the user meets a drawing requirement proposed by the digitizer tablet to the user. 2. The method according to claim 1 , wherein the obtaining text information input by a user on a digitizer tablet comprises: receiving dot matrix data of each stroke of the character and dot matrix data of the character as a whole written by the user on the digitizer tablet. 3. The method according to claim 2 , wherein the identifying the text information to obtain an identification result comprises: determining whether a stroke order of the character written by the user is correct according to the dot matrix data of each stroke of the character written by the user on the digitizer tablet that is transmitted by the digitizer tablet; and determining whether the character written by the user is standard according to the dot matrix data of the character as a whole. 4. The method according to claim 3 , wherein the determining whether a stroke order of the character written by the user is correct according to the dot matrix data of each stroke of the character written by the user on the digitizer tablet that is transmitted by the digitizer tablet comprises: determining an image corresponding to each stroke of the character according to the dot matrix data of each stroke of the character; and comparing the image corresponding to each stroke of the character with a standard stroke image of the character to determine whether the stroke order of the character written by the user is correct. 5. The method according to claim 3 , wherein the determining whether the character written by the user is standard according to the dot matrix data of the character as a whole comprises: determining an image corresponding to the character according to the dot matrix data of the character as a whole; and comparing the image corresponding to the character and a standard image of the character to determine whether the character written by the user is standard. 6. The method according to claim 1 , wherein the obtaining drawing information input by a user on a digitizer tablet comprises: receiving dot matrix data of each stroke, transmitted by the digitizer tablet, when the user draws on the digitizer tablet. 7. The method according to claim 6 , wherein the identifying the drawing information to obtain an identification result comprises: determining the object drawn by the user according to the dot matrix data of each stroke when the user draws on the digitizer tablet and an image database of simple drawings. 8. The method according to claim 7 , wherein the determining an object drawn by the user according to the dot matrix data of each stroke when the user draws on the digitizer tablet and an image database of simple drawings comprises: determining a drawing image according to the dot matrix data of each stroke when the user draws on the digitizer tablet; and identifying the drawing image through a neural network model pre-trained by the image database of simple drawings to determine the object drawn by the user. 9. A server, comprising: a memory; a processor; a communication interface; and a computer program; wherein the computer program is stored in the memory and is configured to be implemented by the processor to: obtain text information or drawing information input by a user on a digitizer tablet; identify the text information or the drawing information to obtain an identification result; and transmit, by the communication interface, audio information corresponding to the identification result to the digitizer tablet, enabling the digitizer tablet to play the audio information, wherein the audio information comprises whether a stroke order of a character written by the user is correct, or whether an object drawn by the user meets a drawing requirement proposed by the digitizer tablet to the user. 10. The server according to claim 9 , wherein when the processor obtains text information input by the user on the digitizer tablet, the processor is configured to: receive dot matrix data of each stroke of the character and dot matrix data of the character as a whole written by the user on the digitizer tablet through the communication interface. 11. The server according to claim 10 , wherein when the processor identifies the text information to obtain the identification result, the processor is configured to: determine whether a stroke order of the character written by the user is correct according to the dot matrix data of each stroke of the character written by the user on the digitizer tablet that is transmitted by the digitizer tablet; and determine whether the character written by the user is standard according to the dot matrix data of the character as a whole. 12. The server according to claim 11 , wherein when the processor determines whether the stroke order of the character written by the user is correct according to the dot matrix data of each stroke of the character written by the user on the digitizer tablet that is transmitted by the digitizer tablet, the processor is configured to: determine an image corresponding to each stroke of the character according to the dot matrix data of each stroke of the character; and compare the image corresponding to each stroke of the character with a standard stroke image of the character to determine whether the stroke order of the character written by the user is correct. 13. The server according to claim 11 , wherein the processor determines whether the character written by the user is standard according to the dot matrix data of the character as a whole, the processor is configured to: determine an image corresponding to the character according to the dot matrix data of the character as a whole; and compare the image corresponding to the character and a standard image of the character to determine whether the character written by the user is standard. 14. The server according to claim 9 , wherein when the processor obtains the drawing information input by the user on the digitizer tablet, the processor is configured to: receive dot matrix data of each stroke that is transmitted by the digitizer tablet through the communication interface when the user draws on the digitizer tablet. 15. The server according to claim 14 , wherein when the processor identifies the drawing information to obtain the identification result, the processor is configured to: determine the object drawn by the user according to the dot matrix data of each stroke when the user draws on the digitizer tablet and an image database of simple drawings. 16. The server according to claim 15 , wherein when the processor determines the object drawn by the user according to the dot matrix data of each stroke when the user draws on the digitizer tablet and the image database of simple drawings, the processor is configured to: determine a drawing image according to the dot matrix data of each stroke when the user draws on the digitizer tablet; and identify the drawing image through a neural network model pre-trained by the

Assignees

Inventors

Classifications

  • Sampling; Contour coding; Stroke extraction · CPC title

  • for inputting data by handwriting, e.g. gesture or text · CPC title

  • with both visual and audible presentation of the material to be studied · CPC title

  • G09B5/04Primary

    with audible presentation of the material to be studied (sound recording or reproducing G11B) · CPC title

  • Management of the audio stream, e.g. setting of volume, audio stream path · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11189183B2 cover?
Embodiments of the present disclosure provide a method, a device, and a computer readable storage medium for intelligent voice interaction. The method includes: obtaining text information or drawing information input by a user on a digitizer tablet; identifying the text information or the drawing information to obtain an identification result; and transmitting audio information corresponding to…
Who is the assignee on this patent?
Baidu online network technology beijing co ltd, Shanghai Xiaodu Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06F3/04883. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 30 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).