Voice-responsive building management system
US-2015120297-A1 · Apr 30, 2015 · US
US9786276B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9786276-B2 |
| Application number | US-201414467242-A |
| Country | US |
| Kind code | B2 |
| Filing date | Aug 25, 2014 |
| Priority date | Aug 25, 2014 |
| Publication date | Oct 10, 2017 |
| Grant date | Oct 10, 2017 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A speech-enabled management system is described herein. One system includes a grammar building tool configured to create a set of grammar keys based on ontology analytics corresponding to data received from a digital video manager (DVM) server, a speech recognition engine configured to recognize a speech command from a set of grammar files, a command translator configured to translate the recognized speech command to an executable command, and a processor configured to execute the speech command based on a particular grammar key from the set of grammar keys.
Opening claim text (preview).
What is claimed: 1. A speech-enabled surveillance management system, comprising: a grammar building tool configured to create a set of grammar keys based on ontology analytics corresponding to data received from a digital video manager (DVM) server and including a mapping table that associates each of a plurality of locations to at least one camera located within a particular facility, wherein the set of grammar keys corresponds to the particular facility, and wherein a first set of the grammar keys are for applications of the particular facility that remain constant during application execution and a second set of the grammar keys changes based on a change of the particular facility from a first facility to a second facility; a speech recognition engine configured to recognize a speech command from a set of grammar files; a control dialog manager configured to determine, upon recognizing the speech command, that the recognized speech command is applicable for a current facility context; a command translator configured to translate, upon determining the recognized speech command is applicable, the recognized applicable speech command to an executable command by mapping the speech command to a location and a particular camera associated with the location based on the mapping table and recognized speech command, wherein the location is a physical location within the particular facility and the recognized speech command comprises the location; and a processor configured to: execute the speech command; display a video feed of a portion of the facility by a monitor based on the executed speech command and the particular camera associated with the location. 2. The system of claim 1 , further comprising a speech synthesizer configured to identify and select a pronunciation lexicon based on pronunciation phonemes associated with domain terms. 3. The system of claim 1 , wherein the speech recognition engine is based on operator voice training profile or a speech pattern. 4. The system of claim 1 , wherein the ontology analytics are based on ontological factors, including inferences and associations between two data elements. 5. The system of claim 1 , wherein the DVM server includes camera configuration data, location data, and system configuration data. 6. The system of claim 5 , wherein the set of grammar keys is configured to: correspond to a camera located within a particular area and control the camera in a sequential or mapping order; and control a set of operations, wherein the set of operations include pan, tilt, zoom, start, stop, recording, clear, monitor, and tile features. 7. The system of claim 1 , wherein the executed speech command is performed at a workstation that includes a surveillance monitor, video, console, or microphone. 8. A method for operating a speech-enabled surveillance management system, comprising: creating a set of grammar keys from a plurality of grammar files, wherein the set of grammar keys corresponds to a particular facility, and wherein a first set of the grammar keys are for applications of the particular facility that remain constant during application execution and a second set of the grammar keys changes based on a change of the particular facility from a first facility to a second facility, and wherein creating the set of grammar keys is based on ontology analytics including a mapping table that associates each of a plurality of locations to at least one camera located within the particular facility; identifying a speech command; determining, upon identifying the speech command, whether the speech command is applicable for a current facility context; and upon determining that the identified speech command is applicable for the current facility context: translating a grammar key from the set of grammar keys based on the speech command, wherein translating the grammar key includes mapping the speech command to a location and a particular camera associated with the location based on the mapping table and identified speech command, wherein the location is a physical location within the particular facility and the identified speech command comprises the location; and executing the speech command based on the translated grammar key; and displaying a video feed of a portion of the facility by a monitor based on the executed speech command and the particular camera associated with the location. 9. The method of claim 8 , wherein the method includes identifying the speech command by deciphering the speech command from a plurality of pronunciation speech lexicons. 10. The method of claim 8 , wherein executing the speech command includes commanding a particular camera, view, audit, recording, or operational task. 11. A speech-enabled surveillance management system, comprising: a grammar building tool configured to create a set of grammar keys based on ontology analytics corresponding to a set of data received from a DVM server and including a mapping table that associates each of a plurality of locations to at least one camera located within a particular facility, wherein the set of grammar keys corresponds to the particular facility, and wherein a first set of the grammar keys are for applications of the particular facility that remain constant during application execution and a second set of the grammar keys changes based on a change of the particular facility from a first facility to a second facility; a speech recognition engine configured to recognize a speech command from a set of grammar files; a control dialog manager configured to determine, upon recognizing the speech command, that the recognized speech command is applicable for a current facility context; a command translator configured to translate, upon determining the recognized speech command is applicable, the recognized applicable speech command to an executable speech command by mapping the speech command to a location and a particular camera associated with the location based on the mapping table and recognized speech command, wherein the location is a physical location within the particular facility and the recognized speech command comprises the location; and a processor configured to: execute the speech command; and display a video feed of a portion of the facility by a monitor based on the executed speech command and the particular camera associated with the location. 12. The system of claim 11 , wherein the grammar building tool includes a plurality of grammar files associated with recognition grammar, features, and location. 13. The system of claim 11 , wherein the speech synthesizer is configured to synthesize text to speech signals and transfer the speech signals to a speaker. 14. The system of claim 11 , wherein the speech recognition engine is configured to identify the speech command based on phonology, morphology, syntax, semantics, and lexicon language aspects. 15. The system of claim 11 , further comprising displaying camera views on surveillance monitors and automatically changing a number of camera tile views on the surveillance monitors based on a number of cameras. 16. The system of claim 11 , wherein the mapping table associates a location in a key table with a camera or tile location in a camera table.
by using electronic viewfinders · CPC title
Execution procedure of a spoken command · CPC title
Electricity · mapped topic
Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules · CPC title
for receiving images from a plurality of remote sources · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.