What technology area does this patent fall under?

Primary CPC classification G06F3/167. Mapped technology areas include Physics.

When was this patent published?

Publication date Thu May 05 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

System and method for initiating multi-modal speech recognition using a long-touch gesture

US2016124706A1 · US · A1

Patent metadata
Field	Value
Publication number	US-2016124706-A1
Application number	US-201414529766-A
Country	US
Kind code	A1
Filing date	Oct 31, 2014
Priority date	Oct 31, 2014
Publication date	May 5, 2016
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system, method and computer-readable storage devices are disclosed for multi-modal interactions with a system via a long-touch gesture on a touch-sensitive display. A system operating per this disclosure can receive a multi-modal input comprising speech and a touch on a display, wherein the speech comprises a pronoun. When the touch on the display has a duration longer than a threshold duration, the system can identify an object within a threshold distance of the touch, associate the object with the pronoun in the speech, to yield an association, and perform an action based on the speech and the association.

First claim

Opening claim text (preview).

We claim: 1 . A method comprising: receiving a multi-modal input comprising speech and a touch on a display; and when the touch on the display has a duration longer than a threshold duration: identifying an object within a threshold distance of the touch; associating the object with the pronoun in the speech, to yield an association; and performing an action based on the speech and the association. 2 . The method of claim 1 , wherein the speech comprises a pronoun. 3 . The method of claim 1 , wherein a pronoun is implied in the speech. 4 . The method of claim 1 , wherein: the display is presenting a computer-aided design program; and the action modifies a design within the computer-aided design program. 5 . The method of claim 4 , further comprising: receiving a second touch on the display, wherein the action requires the second touch. 6 . The method of claim 1 , wherein the threshold duration is based on a context for the touch on the display. 7 . The method of claim 1 , wherein the threshold duration is based on a recognition certainty of a command recognized in the speech. 8 . The method of claim 2 , wherein the object is identified based, at least in part, on the pronoun. 9 . The method of claim 1 , wherein the speech of the multi-modal input is received simultaneously with initiation of the touch on the display. 10 . The method of claim 1 , wherein the speech of the multi-modal input is received after a duration of the touch on the display is determined to meet a long touch threshold. 11 . The method of claim 1 , wherein the speech of the multi-modal input is received after a duration of the touch on the display is determined to meet a press and hold threshold. 12 . A system comprising: a processor; and a computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising: receiving a multi-modal input comprising speech and a touch on a display, wherein the speech comprises a pronoun; and when the touch on the display has a duration longer than a threshold duration: identifying an object within a threshold distance of the touch; associating the object with the pronoun in the speech, to yield an association; and performing an action based on the speech and the association. 13 . The system of claim 12 , wherein the threshold duration is based on a context for the touch on the display. 14 . The system of claim 12 , wherein the threshold duration is based on a recognition certainty of a command recognized in the speech. 15 . The system of claim 12 , wherein the object is identified based, at least in part, on the pronoun. 16 . The system of claim 12 , wherein the speech of the multi-modal input is received simultaneously with initiation of the touch on the display. 17 . The system of claim 12 , wherein the speech of the multi-modal input is received after a duration of the touch on the display is determined to meet a long touch threshold. 18 . The system of claim 12 , wherein the speech of the multi-modal input is received after a duration of the touch on the display is determined to meet a press and hold threshold. 19 . A computer-readable storage device having instructions stored which, when executed by a computing device, cause the computing device to perform operations comprising: receiving a multi-modal input comprising speech and a touch on a display, wherein the speech comprises a pronoun; and when the touch on the display has a duration longer than a threshold duration: identifying an object within a threshold distance of the touch; associating the object with the pronoun in the speech, to yield an association; and performing an action based on the speech and the association. 20 . The computer-readable storage device of claim 19 , wherein: the display is presenting a computer-aided design program; and the action modifies a design within the computer-aided design program.

Assignees

At & T Ip I Lp

Inventors

Classifications

G10L2015/228
of application context · CPC title
G06F3/04842
Selection of displayed objects or displayed text elements (G06F3/0482 takes precedence) · CPC title
G06F3/167Primary
Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title
G10L2015/223
Execution procedure of a spoken command · CPC title
G10L15/22Primary
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

Patent family

Related publications grouped by family.

View patent family 55852720

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016124706A1 cover?: A system, method and computer-readable storage devices are disclosed for multi-modal interactions with a system via a long-touch gesture on a touch-sensitive display. A system operating per this disclosure can receive a multi-modal input comprising speech and a touch on a display, wherein the speech comprises a pronoun. When the touch on the display has a duration longer than a threshold durati…
Who is the assignee on this patent?: At & T Ip I Lp
What technology area does this patent fall under?: Primary CPC classification G06F3/167. Mapped technology areas include Physics.
When was this patent published?: Publication date Thu May 05 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).