Time ordered indexing of an information stream
US-8972840-B2 · Mar 3, 2015 · US
US10127306B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10127306-B2 |
| Application number | US-201213687270-A |
| Country | US |
| Kind code | B2 |
| Filing date | Nov 28, 2012 |
| Priority date | Nov 28, 2012 |
| Publication date | Nov 13, 2018 |
| Grant date | Nov 13, 2018 |
Official abstract text for this publication.
A method and system for searching alternative data sources include monitoring a first communications source broadcasting unstructured data, and a second communications source broadcasting structured data. The method further includes generating text from the unstructured data and from the structured data collected, and parsing the generated text. The method also includes defining a search phrase, and analyzing the generated or parsed text for semantically relevant text in relation to the search phrase. The method also includes selecting the semantically relevant text.
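The selection step described in the abstract can be sketched as follows. This is a minimal illustration, not the patented method: it assumes text has already been generated from every source, and it substitutes plain token overlap for the semantic analysis, which the abstract does not specify. The names `Record`, `select_relevant`, and `min_overlap` are hypothetical.

```python
import re
from dataclasses import dataclass


@dataclass
class Record:
    source: str  # hypothetical label such as "radio", "tv", or "email"
    text: str    # text already generated from that source


def tokens(text):
    """Lowercased word tokens of a string, as a set."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def select_relevant(records, search_phrase, min_overlap=0.5):
    """Return records sharing enough tokens with the search phrase.

    Token overlap is an assumed stand-in for semantic relevance.
    """
    query = tokens(search_phrase)
    if not query:
        return []
    hits = []
    for rec in records:
        score = len(query & tokens(rec.text)) / len(query)
        if score >= min_overlap:
            hits.append((score, rec))
    # Highest-scoring records first; ties keep their input order.
    hits.sort(key=lambda pair: pair[0], reverse=True)
    return [rec for _, rec in hits]


records = [
    Record("radio", "Flash sale on winter tires this weekend only"),
    Record("email", "Quarterly report attached for review"),
    Record("tv", "Winter storm warning issued for the weekend"),
]
results = select_relevant(records, "winter tires")
```

With the sample records, the radio item matches both query tokens and the television item matches one, so both are selected and the email is dropped.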
Opening claim text (preview).
What is claimed is:

1. A computer system for searching data sources, comprising: one or more computer processors, one or more computer-readable storage media, and program instructions stored on one or more of the computer-readable storage media for execution by at least one of the one or more processors, the program instructions comprising:
program instructions to capture unstructured broadcast data from a broadcast source by using a listening device, wherein the broadcast source includes one or more of a television broadcast source and a radio broadcast source;
program instructions to convert the captured unstructured broadcast data to a first structured data, the converting comprising detecting an audio component of the captured unstructured broadcast data and converting the audio component to structured text;
program instructions to receive a second structured data from a source different from the broadcast source;
program instructions to capture a digital image from a source different from the second structured data source, wherein the digital image includes graphical text elements recognizable as text by optical character recognition software;
program instructions to convert the graphical text elements of the digital image to a third structured data, the third structured data comprising structured text from the graphical text elements;
program instructions to generate text of contents of each of the first structured data, the second structured data, and the third structured data; and to store the generated text on a searchable data storage device;
program instructions to parse the stored text by (i) collecting only phrases relevant to a specified field and (ii) grouping phrases recognized as comprising temporal references;
program instructions to receive a search phrase and a selection of a particular language from a user;
program instructions to semantically analyze the parsed text by ignoring any text not in the particular language;
program instructions to search the semantically analyzed text using the received search phrase;
program instructions to generate search results based on searching the semantically analyzed text; and
program instructions to provide the search results for communication to the user.

2. The computer system of claim 1, wherein generating the text further comprises: program instructions to identify one or more advertisement components of the stored text, wherein the searching is based on the one or more advertisement components.

3. The computer system of claim 1, wherein the unstructured broadcast data includes one or more of analog audio and analog video having an analog audio component.

4. A computer program product for searching sources of data, comprising a non-transitory computer-readable storage medium having program code embodied therewith, the program code executable by a processor of a computer to perform a method comprising:
capturing, by the processor, unstructured broadcast data from a broadcast source by using a listening device, wherein the broadcast source includes one or more of a television broadcast source and a radio broadcast source;
converting, by the processor, the captured unstructured broadcast data to a first structured data, the converting comprising detecting an audio component of the captured unstructured broadcast data and converting the audio component to structured text;
receiving, by the processor, a second structured data from a source different from the broadcast source;
capturing, by the processor, a digital image from a source different from the second structured data source, wherein the digital image includes graphical text elements recognizable as text by optical character recognition software;
converting, by the processor, the graphical text elements of the digital image to a third structured data, the third structured data comprising structured text from the graphical text elements;
generating text of contents of each of the first structured data, the second structured data, and the third structured data and storing the generated text on a searchable data storage device;
parsing, by the processor, the stored text by (i) collecting only phrases relevant to a specified field and (ii) grouping phrases recognized as comprising temporal references;
receiving, by the processor, a search phrase and a selection of a particular language from a user;
semantically analyzing, by the processor, the parsed text by ignoring any text not in the particular language;
searching, by the processor, the semantically analyzed text using the received search phrase;
generating, by the processor, search results based on searching the semantically analyzed text; and
providing, by the processor, the search results for communication to the user.

5. The computer program product of claim 4, wherein the method further comprises: identifying, by the processor, one or more advertisement components of the stored text, wherein the searching is based on the one or more advertisement components.

6. The computer program product of claim 4, wherein the second structured data includes one or more of streaming audio and streaming video.

7. The computer program product of claim 4, wherein the unstructured broadcast data includes one or more of analog audio and analog video having an analog audio component.

8. The computer program product of claim 4, wherein the second structured data includes one or more emails.
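The parsing, language-filtering, and search steps recited in claims 1 and 4 might look roughly like the sketch below. It rests on several assumptions not found in the claims: each stored phrase already carries a language tag (`lang`), field relevance is a simple keyword test, temporal references are recognized by a small English-only regular expression, and the search is a substring match. The names `parse`, `search`, `TEMPORAL`, and `field_terms` are hypothetical.

```python
import re

# Assumed, deliberately small pattern for English temporal references.
TEMPORAL = re.compile(
    r"\b(today|tomorrow|tonight|yesterday|this weekend|"
    r"(?:mon|tues|wednes|thurs|fri|satur|sun)day|"
    r"\d{1,2}(?::\d{2})?\s?(?:am|pm))\b",
    re.IGNORECASE,
)


def parse(stored, field_terms):
    """(i) Keep only phrases relevant to the field, and
    (ii) group the kept phrases that contain a temporal reference."""
    relevant, temporal = [], []
    for phrase in stored:
        lowered = phrase["text"].lower()
        if not any(term in lowered for term in field_terms):
            continue  # not relevant to the specified field
        relevant.append(phrase)
        if TEMPORAL.search(phrase["text"]):
            temporal.append(phrase)
    return relevant, temporal


def search(parsed, language, search_phrase):
    """Ignore text not in the selected language, then match the phrase."""
    needle = search_phrase.lower()
    return [
        p for p in parsed
        if p["lang"] == language and needle in p["text"].lower()
    ]


stored = [
    {"text": "Tire sale ends Friday at 9 pm", "lang": "en"},
    {"text": "Oferta de neumáticos, tire sale hoy", "lang": "es"},
    {"text": "Local team wins the championship", "lang": "en"},
    {"text": "New tire sale announced", "lang": "en"},
]
relevant, temporal = parse(stored, field_terms=("sale", "discount"))
results = search(relevant, language="en", search_phrase="tire sale")
```

Here the championship item is dropped as irrelevant to the field, only the first item lands in the temporal group, and the Spanish-tagged item is ignored at search time because the user selected English.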
Physics · mapped topic
Indexing; Data structures therefor; Storage structures · CPC title
Querying · CPC title