Automated closed captioning using temporal data

US2016357746A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016357746-A1
Application numberUS-201514728201-A
CountryUS
Kind codeA1
Filing dateJun 2, 2015
Priority dateJun 2, 2015
Publication dateDec 8, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

One or more systems and/or techniques are provided for automatic closed captioning for media content. In an example, real-time content, occurring within a threshold timespan of a broadcast of media content (e.g., social network posts occurring during and an hour before a live broadcast of an interview), may be accessed. A list of named entities, occurring within the social network data, may be generated (e.g., Interviewer Jon, Interviewee Kathy, Husband Dave, Son Jack, etc.). A ranked list of named entities may be created based upon trending named entities within the list of named entities (e.g., a named entity may be ranked higher based upon a more frequent occurrence within the social network posts). A dynamic grammar (e.g., library, etc.) may be built based upon the ranked list of named entities. Speech recognition may be performed upon the broadcast of media content utilizing the dynamic grammar to create closed caption text.

First claim

Opening claim text (preview).

1 . A system for increasing accuracy of computer speech recognition comprising: a dynamic grammar builder computing device comprising one or more processing units and one or more computer-readable media comprising computer-executable instructions which, when executed by the one or more processing units, cause the dynamic grammar builder computing device to: obtain social network data occurring within a threshold timespan of a broadcast of media content; identify named entities from the obtained social network data that are trending within the obtained social network data; rank the identified named entities based upon the trending; and build a dynamic grammar comprising at least some of the named entities based upon the ranking; and a speech recognition computing device comprising one or more processing units and one or more computer-readable media comprising computer-executable instructions which, when executed by the one or more processing units, cause the speech recognition computing device to: perform speech recognition of spoken words, spoken by the broadcast of the media content, utilizing the dynamic grammar to create closed caption text for the broadcast of the media content. 2 . The system of claim 1 , wherein the computer-readable media of the dynamic grammar builder computing device comprise additional computer-executable instructions which, when executed by the one or more processing units of the dynamic grammar builder computing device cause the dynamic grammar builder computing device to: access supplemental content associated with at least one of the named entities; identify a context associated with the at least one of the named entities or with an event occurring within the media content; and build the dynamic grammar based upon the identified context. 3 . A method for increasing accuracy of computer speech recognition comprising: obtaining, by a computing device, social network data occurring within a threshold timespan of a broadcast of media content; identifying, on the computing device, named entities from the obtained social network data that are trending within the obtained social network data; ranking, on the computing device, the identified named entities based upon the trending; building, on the computing device, a dynamic grammar comprising at least some of the named entities based upon the ranking; and performing computer speech recognition of spoken words, spoken by the broadcast of the media content, utilizing the dynamic grammar to create closed caption text for the broadcast of the media content. 4 . The method of claim 3 , further comprising: accessing, by the computing device, supplemental content associated with at least one of the named entities; identifying, with the computing device, a context associated with either the at least one of the named entities or with an event occurring within the media content; and building the dynamic grammar based upon the identified context. 5 . The method of claim 3 , wherein the trending comprises occurrence metrics identifying a number of times a particular named entity is referenced in the obtained social network data within trending timespan thresholds. 6 . The method of claim 3 , wherein the identifying the named entities comprises: executing entity recognition functionality upon the social network data. 7 . The method of claim 3 , wherein the ranking comprises: responsive to a refresh timer expiring, re-ranking the named entities based upon the trending within updated social network data. 8 . (canceled) 9 . The method of claim 3 , further comprising: defining the threshold timespan based upon a type of event occurring within the media content. 10 . The method of claim 3 , wherein the media content is one of a live interview, a live telecast, or an online videogame. 11 . The method of claim 3 , wherein the dynamic grammar is a speech recognition grammar specification file. 12 - 13 . (canceled) 14 . The method of claim 3 , further comprising: filtering the social network data based upon a context of an event occurring within the media content. 15 . (canceled) 16 . The method of claim 3 , the social network data comprises at least one of a query log, a message, a microblog, a forum post, or user created data that is updated in real-time. 17 . The method of claim 3 , wherein the performing the speech recognition comprises: loading the dynamic grammar into memory of a speech recognition server; and invoking speech recognition functionality, of the speech recognition server, to utilize the dynamic grammar as a library against which an audio fragment, of the broadcast of the media content, is compared for determining a probability that the audio fragment is a named entity. 18 . The method of claim 3 , further comprising: correcting user generated closed captioning for the broadcast of the media content based upon the dynamic grammar. 19 . The method of claim 3 , wherein the performing speech recognition comprises: utilizing the dynamic grammar to either increase or decrease a probability that an audio fragment, of the broadcast of the media content, is a named entity. 20 . A computing device comprising: one or more processing units; and one or more computer-readable media comprising computer-executable instructions which, when executed by the one or more processing units, cause the computing device to: obtain real-time content occurring within a threshold timespan of a broadcast of media content; identify named entities from the obtained real-time content that are trending within the obtained real-time content; ranking the identified named entities based upon the trending; building a dynamic grammar comprising at least some of the named entities based upon the ranking; and utilizing the dynamic grammar for correcting user generated closed captioning for the broadcast of the media content. 21 . The system of claim 1 , wherein the trending comprises occurrence metrics identifying a number of times a particular named entity is referenced in the obtained social network data within trending timespan thresholds. 22 . The system of claim 1 , wherein the identifying the named entities comprises executing entity recognition functionality upon the social network data. 23 . The system of claim 1 , wherein the ranking comprises responsive to a refresh timer expiring, re-ranking the named entities based upon the trending within updated social network data. 24 . The system of claim 1 , wherein the computer-readable media of the dynamic grammar builder computing device comprise additional computer-executable instructions which, when executed by the one or more processing units of the dynamic grammar builder computing device cause the dynamic grammar builder computing device to: define the threshold timespan based upon a type of event occurring within the media content.

Assignees

Inventors

Classifications

  • using context dependencies, e.g. language models · CPC title

  • for displaying teletext characters · CPC title

  • using ranking · CPC title

  • Named entity recognition · CPC title

  • Search customisation based on user profiles and personalisation · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016357746A1 cover?
One or more systems and/or techniques are provided for automatic closed captioning for media content. In an example, real-time content, occurring within a threshold timespan of a broadcast of media content (e.g., social network posts occurring during and an hour before a live broadcast of an interview), may be accessed. A list of named entities, occurring within the social network data, may be …
Who is the assignee on this patent?
Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification G06F16/24578. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Dec 08 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).