Electronic device and method for performing task using external device by electronic device
US-10778830-B2 · Sep 15, 2020 · US
US11164571B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11164571-B2 |
| Application number | US-201715858968-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 29, 2017 |
| Priority date | Nov 16, 2017 |
| Publication date | Nov 2, 2021 |
| Grant date | Nov 2, 2021 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
The present disclosure provides a content recognizing method and apparatus, a device and a computer storage medium, wherein the method comprises: a smart multimedia device performing speech recognition and intention parsing for a speech instruction; if a content recognition intention is obtained from the parsing, internally recording multimedia content that is being played by the smart multimedia device; sending internally-recorded media data to a server side, and obtaining a content recognition result returned by the server side for the media data. The user may implement recognition of multimedia content through speech interaction with the smart multimedia device, and operations are simple without depending on other smart devices.
Opening claim text (preview).
What is claimed is: 1. A content recognizing method, wherein the method comprises: a smart multimedia device receiving a speech instruction, and sending the speech instruction to a server side which performs speech recognition and intention parsing for the speech instruction; the smart multimedia device receiving the parsed intention from the server side and internally recording the content of the part of multimedia content that is being played by the smart multimedia device, which has a type corresponding to the parsed intention, according to a pre-configured correspondence relationship between the intention and the type of the internally recorded content, wherein the intention comprises audio recognition and image recognition, and wherein the internally recorded content which has the type corresponding to the parsed intention comprises an audio stream or video frames that is being played by the smart multimedia device; the smart multimedia device sending the internally-recorded media data to the server side, and obtaining a content recognition result returned by the server side for the media data. 2. The method according to claim 1 , wherein if a content recognition intention is obtained from the intention parsing, performing internal recording of multimedia content that is being played by the smart multimedia device. 3. The method according to claim 1 , wherein the smart multimedia device comprises a smart TV set, a smart acoustic enclosure or a smart projector. 4. The method according to claim 1 , wherein the smart multimedia device performing speech recognition and intention parsing for the speech instruction comprises: the smart multimedia device sending the speech instruction to the server side, and obtaining a result after the server side performs speech recognition and intention parsing for the speech instruction. 5. The method according to claim 1 , wherein the internally recording multimedia content that is being played by the smart multimedia device comprises: collecting video frames from a graphics card of the smart multimedia device; or collecting audio stream from a sound card of the smart multimedia device. 6. The method according to claim 1 , wherein the method further comprises: displaying the content recognition result in the form of speech; or displaying the content recognition result on a display screen. 7. A smart multimedia device, wherein the smart multimedia device comprises: one or more processors; a memory for storing one or more programs; when said one or more programs are executed by said one or more processors, said one or more processors are enabled to implement the following operation: receiving a speech instruction, and sending the speech instruction to a server side which performs speech recognition and intention parsing for the speech instruction; receiving the parsed intention from the server side and internally recording the content of the part of multimedia content that is being played by the smart multimedia device, which has a type corresponding to the parsed intention, according to a pre-configured correspondence relationship between the intention and the type of the internally recorded content, wherein the intention comprises audio recognition and image recognition, and wherein the internally recorded content which has the type corresponding to the parsed intention comprises an audio stream or video frames that is being played by the smart multimedia device; sending the internally-recorded media data to the server side, and obtaining a content recognition result returned by the server side for the media data. 8. The smart multimedia device according to claim 7 , wherein if a content recognition intention is obtained from the intention parsing, performing internal recording of multimedia content that is being played by the smart multimedia device. 9. The smart multimedia device according to claim 7 , wherein the smart multimedia device comprises a smart TV set, a smart acoustic enclosure or a smart projector. 10. The smart multimedia device according to claim 7 , wherein the operation of performing speech recognition and intention parsing for the speech instruction comprises: sending the speech instruction to the server side, and obtaining a result after the server side performs speech recognition and intention parsing for the speech instruction. 11. The smart multimedia device according to claim 7 , wherein the internally recording multimedia content that is being played by the smart multimedia device comprises: collecting video frames from a graphics card of the smart multimedia device; or collecting audio stream from a sound card of the smart multimedia device. 12. The smart multimedia device according to claim 7 , wherein the operation further comprises: displaying the content recognition result in the form of speech; or displaying the content recognition result on a display screen. 13. A non-transitory computer storage medium in which one or more programs are stored, an apparatus being enabled to execute the following operation when said one or more programs are executed by the apparatus: receiving a speech instruction, and sending the speech instruction to a server side which performs speech recognition and intention parsing for the speech instruction; receiving the parsed intention from the server side and internally recording the content of the part of multimedia content that is being played by a smart multimedia device, which has a type corresponding to the parsed intention, according to a pre-configured correspondence relationship between the intention and the type of internally recorded content, wherein the intention comprises audio recognition and image recognition, and wherein the internally recorded content which has the type corresponding to the parsed intention comprises an audio stream or video frames that is being played by the smart multimedia device; sending the internally-recorded media data to the server side, and obtaining a content recognition result returned by the server side for the media data. 14. The non-transitory computer storage medium according to claim 13 , wherein if a content recognition intention is obtained from the intention parsing, performing internal recording of multimedia content that is being played by the smart multimedia device. 15. The non-transitory computer storage medium according to claim 13 , wherein the smart multimedia device comprises a smart TV set, a smart acoustic enclosure or a smart projector. 16. The non-transitory computer storage medium according to claim 13 , wherein the operation of performing speech recognition and intention parsing for the speech instruction comprises: sending the speech instruction to the server side, and obtaining a result after the server side performs speech recognition and intention parsing for the speech instruction. 17. The non-transitory computer storage medium according to claim 13 , wherein the internally recording multimedia content that is being played by a smart multimedia device comprises: collecting video frames from a graphics card of the smart multimedia device; or collecting audio stream from a sound card of the smart multimedia device. 18. The non-transitory computer storage medium according to claim 13 , wherein the operation further comprises: displaying the content recognition result in the form of speech; or displaying the content recognition result on a display screen.
sound input device, e.g. microphone · CPC title
Recording operations (recording of a television signal H04N5/76; arrangements for recording or accumulating broadcast information or broadcast-related information H04H60/27) · CPC title
Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title
Additional components integrated in the remote control device, e.g. timer, speaker, sensors for detecting position, direction or movement of the remote control, microphone or battery charging device · CPC title
Interfacing the upstream path of the transmission network, e.g. for transmitting client requests to a VOD server {(flow control in data networks H04L47/10; streaming protocols, e.g. RTP or RTCP, H04L65/65; scheduling or organising the servicing of application requests in data packet switching networks H04L67/60)} · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.