Method and device for searching according to speech based on artificial intelligence

US10157619B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10157619-B2
Application numberUS-201715823663-A
CountryUS
Kind codeB2
Filing dateNov 28, 2017
Priority dateNov 29, 2016
Publication dateDec 18, 2018
Grant dateDec 18, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and a device for searching according to a speech based on artificial intelligence are provided. The method includes: identifying an input speech of a user to determine whether the input speech is a child speech; filtrating a searched result obtained according to the input speech to obtain a filtrated searched result, if the input speech is the child speech; and feeding the filtrated searched result back to the user.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for searching according to a speech based on artificial intelligence, comprising: acquiring, by at least one computing device, sample speeches for training a preset classifier; removing, by the at least one computing device, a silent speech from the sample speeches by performing a speech activity detection on the sample speeches, to obtain training speeches; extracting, by the at least one computing device, acoustic features of each training speech; and training, by the at least one computing device, the preset classifier by inputting the acoustic features of the each training speech into the preset classifier, to obtain a target classifier; identifying, by at least one computing device, an input speech of a user to determine whether the input speech is a child speech; filtrating, by the at least one computing device, a searched result obtained according to the input speech to obtain a filtrated searched result, if the input speech is the child speech; and feeding, by the at least one computing device, the filtrated searched result to the user, wherein removing, by the at least one computing device, the silent speech from the sample speeches by performing a speech activity detection on the sample speeches, to obtain training speeches comprises: dividing, by the at least one computing device, each sample speech into frames by a preset first step size, and removing, by the at least one computing device, the silent speech from each frame of the each sample speech by performing the speech activity detection on the each frame of the each sample speech, to obtain the each training speech; wherein extracting, by the at least one computing device, the acoustic features of each training speech comprises: dividing, by the at least one computing device, the each training speech by a preset second step size; and extracting, by the at least one computing device, by a preset third step size, the acoustic features of the each training speech after dividing by the preset second step size. 2. The method according to claim 1 , wherein filtrating, by the at least one computing device, a searched result obtained according to the input speech comprises: converting, by the at least one computing device, the input speech into a text content; obtaining, by the at least one computing device, the searched result by searching according to the text content; and filtrating, by the at least one computing device, the searched result to remove a sensitive content unsuitable for a child. 3. The method according to claim 2 , wherein obtaining, by the at least one computing device, the searched result by searching according to the text content comprises: searching, by the at least one computing device, according to the text content in a first database pre-established for children; and searching, by the at least one computing device, according to the text content in a second database to obtain the searched result, if no content related to the input speech is searched in the first database. 4. The method according to claim 1 , wherein identifying, by the at least one computing device, an input speech of a user to determine whether the input speech is a child speech comprises: removing, by the at least one computing device, a silent speech from the input speech by performing the speech activity detection on the input speech, to obtain a tested speech; extracting, by the at least one computing device, acoustic features of the tested speech; and identifying, by the at least one computing device, the acoustic features of the tested speech by inputting the acoustic features of the tested speech into the target classifier, to determine whether the input speech is the child speech. 5. The method according to claim 4 , wherein removing, by the at least one computing device, the silent speech from the input speech by performing the speech activity detection on the input speech, to obtain a tested speech comprises: dividing, by the at least one computing device, the input speech into frames by a preset first step size, and removing, by the at least one computing device, the silent speech from each frame of the input speech by performing the speech activity detection on the each frame of the input speech, to obtain the tested speech; extracting, by the at least one computing device, acoustic features of the tested speech comprises: dividing, by the at least one computing device, the tested speech by a preset second step size; and extracting, by the at least one computing device, by a preset third step size, the acoustic features of the tested speech after dividing by the preset second step size; and, identifying, by the at least one computing device, the acoustic features of the tested speech by inputting the acoustic features of the tested speech into the target classifier, to determine whether the input speech is the child speech comprises: grading, by the at least one computing device, the acoustic features of the tested speech by inputting the acoustic features of the tested speech into the target classifier; acquiring, by the at least one computing device, an average value of the tested speech; and determining, by the at least one computing device, that the input speech is the child speech if the average value is greater than a preset threshold. 6. A device for searching according to a speech based on artificial intelligence, comprising: a processor; and a memory, configured to store instructions executable by the processor, wherein the processor is configured to: acquire sample speeches for training a preset classifier; remove a silent speech from the sample speeches by performing a speech activity detection on the sample speeches, to obtain training speeches; extract acoustic features of each training speech; and train the preset classifier by inputting the acoustic features of the each training speech into the preset classifier, to obtain a target classifier; identify an input speech of a user to determine whether the input speech is a child speech; filtrate a searched result obtained according to the input speech to obtain a filtrated searched result, if the input speech is the child speech; and feed the filtrated searched result to the user, wherein the processor is configured to remove a silent speech from the sample speeches by performing a speech activity detection on the sample speeches, to obtain training speeches by acts of: dividing each sample speech into frames by a preset first step size, and removing the silent speech from each frame of the each sample speech by performing the speech activity detection on the each frame of the each sample speech, to obtain the each training speech; and the processor is configured to extract the acoustic features of each training speech by acts of: dividing the each training speech by a preset second step size; and extracting by a preset third step size, the acoustic features of the each training speech after dividing by the preset second step size. 7. The device according to claim 6 , wherein the processor is configured to filtrate the searched result obtained according to the input speech by acts of: converting the input speech into a text content; obtaining the searched result by searching according to the text content; and filtrating the searched result to remove a sensitive content unsuitable for a child. 8. The device according to claim 7 , wherein the processor is configured to obtain the searched result by searching according to the text content by acts of: searching according to the text content in a first database pre-established for children; and searching according to the text content in a second database to obtain the searched result, if no

Assignees

Inventors

Classifications

  • Adaptive threshold · CPC title

  • Feature extraction for speech recognition; Selection of recognition unit · CPC title

  • characterised by the analysis technique · CPC title

  • Discriminating between voiced and unvoiced parts of speech signals (G10L25/90 takes precedence) · CPC title

  • using neural networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10157619B2 cover?
A method and a device for searching according to a speech based on artificial intelligence are provided. The method includes: identifying an input speech of a user to determine whether the input speech is a child speech; filtrating a searched result obtained according to the input speech to obtain a filtrated searched result, if the input speech is the child speech; and feeding the filtrated se…
Who is the assignee on this patent?
Baidu online network technology beijing co ltd
What technology area does this patent fall under?
Primary CPC classification G10L17/26. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 18 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).