Voice search system, voice search method, and computer-readable storage medium

US10489451B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10489451-B2
Application numberUS-201314907877-A
CountryUS
Kind codeB2
Filing dateSep 11, 2013
Priority dateSep 11, 2013
Publication dateNov 26, 2019
Grant dateNov 26, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Provided is a voice search technology that can efficiently find and check a problematic call. To this end, a voice search system of the present invention includes a call search database that stores, for each of a reception channel and a transmission channel of each of a plurality of pieces of recorded call voice data, voice section sequences in association with predetermined keywords and time information. The call search database is searched based on an input search keyword, so that a voice section sequence that contains the search keyword is obtained. More specifically, the voice search system obtains, as a keyword search result, a voice section sequence that contains the search keyword and the appearance time thereof from the plurality of pieces of recorded call voice data, and obtains, based on the appearance time in the keyword search result, the start time of a voice section sequence of another channel immediately before the voice section sequence obtained as the keyword search result, and thus determines the start time as the playback start position for playing back the recorded voice. Then, the playback start position is output as a voice search result.

First claim

Opening claim text (preview).

The invention claimed is: 1. A voice search system comprising: a recording device including: a receiver configured to receive voice data; and a memory configured to store the voice data; a search device including: a search database that stores, for each of a reception channel and a transmission channel of each of a plurality of pieces of recorded voice data, voice section sequences in association with predetermined keywords and time information; and a processor configured to search the search database based on a search keyword, and obtain a voice section sequence that contains the search keyword, wherein the processor is configured to: obtain, as a keyword search result, a voice section sequence that contains the search keyword and an appearance time of the voice section sequence from the plurality of pieces of recorded voice data, obtain, based on the appearance time in the keyword search result, a start time of a voice section sequence of another channel immediately before the voice section sequence obtained as the keyword search result, and determine the start time as a playback start position for playing back the recorded voice, and output the playback start position as a voice search result on a graphical user interface; wherein the search database further stores a non-verbal information score, indicating a seriousness of a problem discussed in the voice data, of each voice section sequence, and wherein the processor is configured to determine, based on the seriousness of the problem indicated by the non-verbal information score, a priority of the voice search result for which the playback start position has been determined, and rearrange the voice search result based on the seriousness of the problem obtained from the non-verbal information score; and a search terminal device including the graphical user interface that includes a display screen and a user input device through which a user inputs the search keyword. 2. The voice search system according to claim 1 , wherein the processor is configured to output voice search results to the graphical user interface in descending order of the priority for display on the display screen. 3. The voice search system according to claim 2 , wherein the processor is configured to allow, other than the start time of the voice section sequence of the other channel immediately before the voice section sequence obtained as the keyword search result, the start time of the voice section sequence obtained as the keyword search result or a start time of a voice section sequence of another channel immediately after the voice section sequence obtained as the keyword search result, to be selected as a playback start position for playing back the recorded voice, and display the playback start position on the display screen. 4. The voice search system according to claim 1 , wherein the non-verbal information score is an emotion score that is obtained by determining an emotion in the voice section sequence, and the emotion score is associated with a start time of the voice section sequence. 5. A voice search method for searching a search database based on a search keyword input by a user through a graphical user interface including a display screen and a user input device, and obtaining a voice section sequence that contains the search keyword, the search database being configured to store, for each of a reception channel and a transmission channel of each of a plurality of pieces of voice data, voice section sequences in association with predetermined keywords and time information, the method comprising causing a processor to: receive and record the voice data; receive the search keyword input by the user through the graphical user interface; search the search database based on the search keyword; obtain, as a keyword search result, a voice section sequence that contains the search keyword and an appearance time of the voice section sequence from the plurality of pieces of recorded voice data; obtain, based on the appearance time in the keyword search result, a start time of a voice section sequence of another channel immediately before the voice section sequence obtained as the keyword search result, and determine the start time as a playback start position for playing back the recorded voice; and output the playback start position as a voice search result on the graphical user interface; wherein the search database further stores a non-verbal information score, indicating a seriousness of a problem discussed in the voice data, of each voice section sequence, and wherein the method further comprises causing the processor to determine, based on the seriousness of the problem indicated by the non-verbal information score, a priority of the voice search result for which the playback start position has been determined, and rearrange the voice search result based on the seriousness of the problem obtained from the non-verbal information score. 6. The voice search method according to claim 5 , further comprising causing the processor to output voice search results to the graphical user interface in descending order of the priority for display on the display screen. 7. The voice search method according to claim 6 , further comprising, in the step of displaying the voice search results, causing the processor to allow, other than the start time of the voice section sequence of the other channel immediately before the voice section sequence obtained as the keyword search result, the start time of the voice section sequence obtained as the keyword search result or a start time of a voice section sequence of another channel immediately after the voice section sequence obtained as the keyword search result, to be selected as a playback start position for playing back the recorded voice, and display the playback start position on the display screen. 8. The voice search method according to claim 5 , wherein the non-verbal information score is an emotion score that is obtained by determining an emotion in the voice section sequence, and the emotion score is associated with a start time of the voice section sequence. 9. A non-transitory computer-readable storage medium that stores a program for causing a computer to execute the voice search method according to claim 5 .

Assignees

Inventors

Classifications

  • Word spotting · CPC title

  • G06F16/638Primary

    Presentation of query results · CPC title

  • Browsing; Visualisation therefor (generation of a list or set of audio data G06F16/638) · CPC title

  • using speech recognition · CPC title

  • Information retrieval; Database structures therefor; File system structures therefor · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10489451B2 cover?
Provided is a voice search technology that can efficiently find and check a problematic call. To this end, a voice search system of the present invention includes a call search database that stores, for each of a reception channel and a transmission channel of each of a plurality of pieces of recorded call voice data, voice section sequences in association with predetermined keywords and time i…
Who is the assignee on this patent?
Hitachi Ltd
What technology area does this patent fall under?
Primary CPC classification G06F16/638. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 26 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).