Searching alternative data sources

US10127306B2 · US · B2

Patent metadata
Publication number: US-10127306-B2
Application number: US-201213687270-A
Country: US
Kind code: B2
Filing date: Nov 28, 2012
Priority date: Nov 28, 2012
Publication date: Nov 13, 2018
Grant date: Nov 13, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and system for searching alternative data sources include monitoring a first communications source broadcasting unstructured data, and a second communications source broadcasting structured data. The method further includes generating text from the unstructured data and from the structured data collected, and parsing the generated text. The method also includes defining a search phrase, and analyzing the generated or parsed text for semantically relevant text in relation to the search phrase. The method also includes selecting the semantically relevant text.
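
The flow in the abstract (generate text from several sources, parse it, define a search phrase, select the relevant text) can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the `Document` type, the source labels, the sentence splitter, and the token-overlap score used as a stand-in for semantic relevance are all assumptions made for this example.

```python
import re
from dataclasses import dataclass


@dataclass
class Document:
    source: str  # hypothetical label for where the text came from
    text: str    # text already generated from that source


def tokenize(text):
    """Lowercase word tokens; a crude stand-in for real semantic analysis."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def split_sentences(text):
    """Parse generated text into sentences on '.', '!' or '?' boundaries."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def relevance(sentence, phrase):
    """Fraction of search-phrase tokens that also appear in the sentence."""
    wanted = tokenize(phrase)
    if not wanted:
        return 0.0
    return len(wanted & tokenize(sentence)) / len(wanted)


def select_relevant(docs, phrase, threshold=0.5):
    """Return (source, sentence) pairs whose score meets the threshold."""
    hits = []
    for doc in docs:
        for sentence in split_sentences(doc.text):
            if relevance(sentence, phrase) >= threshold:
                hits.append((doc.source, sentence))
    return hits


# Made-up sample text standing in for transcribed broadcast audio and an email.
docs = [
    Document("radio", "Traffic is heavy downtown. A flood warning was issued for the river valley."),
    Document("email", "The quarterly report is attached. Flood insurance renews next month."),
]
results = select_relevant(docs, "flood warning")
for source, sentence in results:
    print(source, "->", sentence)
```

With the default threshold of 0.5, a sentence qualifies when it contains at least half of the search-phrase tokens; raising the threshold to 1.0 would require every token to be present.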

First claim

Opening claim text (preview).

What is claimed is:

1. A computer system for searching data sources, comprising: one or more computer processors, one or more computer-readable storage media, and program instructions stored on one or more of the computer-readable storage media for execution by at least one of the one or more processors, the program instructions comprising:
    program instructions to capture unstructured broadcast data from a broadcast source by using a listening device, wherein the broadcast source includes one or more of a television broadcast source and a radio broadcast source;
    program instructions to convert the captured unstructured broadcast data to a first structured data, the converting comprising detecting an audio component of the captured unstructured broadcast data and converting the audio component to structured text;
    program instructions to receive a second structured data from a source different from the broadcast source;
    program instructions to capture a digital image from a source different from the second structured data source, wherein the digital image includes graphical text elements recognizable as text by optical character recognition software;
    program instructions to convert the graphical text elements of the digital image to a third structured data, the third structured data comprising structured text from the graphical text elements;
    program instructions to generate text of contents of each of the first structured data, the second structured data, and the third structured data; and to store the generated text on a searchable data storage device;
    program instructions to parse the stored text by (i) collecting only phrases relevant to a specified field and (ii) grouping phrases recognized as comprising temporal references;
    program instructions to receive a search phrase and a selection of a particular language from a user;
    program instructions to semantically analyze the parsed text by ignoring any text not in the particular language;
    program instructions to search the semantically analyzed text using the received search phrase;
    program instructions to generate search results based on searching the semantically analyzed text; and
    program instructions to provide the search results for communication to the user.

2. The computer system of claim 1, wherein generating the text further comprises: program instructions to identify one or more advertisement components of the stored text, wherein the searching is based on the one or more advertisement components.

3. The computer system of claim 1, wherein the unstructured broadcast data includes one or more of analog audio and analog video having an analog audio component.

4. A computer program product for searching sources of data, comprising a non-transitory computer-readable storage medium having program code embodied therewith, the program code executable by a processor of a computer to perform a method comprising:
    capturing, by the processor, unstructured broadcast data from a broadcast source by using a listening device, wherein the broadcast source includes one or more of a television broadcast source and a radio broadcast source;
    converting, by the processor, the captured unstructured broadcast data to a first structured data, the converting comprising detecting an audio component of the captured unstructured broadcast data and converting the audio component to structured text;
    receiving, by the processor, a second structured data from a source different from the broadcast source;
    capturing, by the processor, a digital image from a source different from the second structured data source, wherein the digital image includes graphical text elements recognizable as text by optical character recognition software;
    converting, by the processor, the graphical text elements of the digital image to a third structured data, the third structured data comprising structured text from the graphical text elements;
    generating text of contents of each of the first structured data, the second structured data, and the third structured data and storing the generated text on a searchable data storage device;
    parsing, by the processor, the stored text by (i) collecting only phrases relevant to a specified field and (ii) grouping phrases recognized as comprising temporal references;
    receiving, by the processor, a search phrase and a selection of a particular language from a user;
    semantically analyzing, by the processor, the parsed text by ignoring any text not in the particular language;
    searching, by the processor, the semantically analyzed text using the received search phrase;
    generating, by the processor, search results based on searching the semantically analyzed text; and
    providing, by the processor, the search results for communication to the user.

5. The computer program product of claim 4, wherein the method further comprises: identifying, by the processor, one or more advertisement components of the stored text, wherein the searching is based on the one or more advertisement components.

6. The computer program product of claim 4, wherein the second structured data includes one or more of streaming audio and streaming video.

7. The computer program product of claim 4, wherein the unstructured broadcast data includes one or more of analog audio and analog video having an analog audio component.

8. The computer program product of claim 4, wherein the second structured data includes one or more emails.
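
The parsing, language-selection, and search steps recited in claims 1 and 4 can be illustrated with a small sketch. The claims do not say how field relevance, temporal references, or language are detected, so everything here is an assumption for illustration: the `FIELD_TERMS` vocabulary, the `TEMPORAL` pattern, the stopword-based `guess_language` heuristic, and the sample phrases are all hypothetical.

```python
import re

# Hypothetical vocabularies; none of these come from the patent text.
FIELD_TERMS = {"weather": {"rain", "storm", "flood", "tormenta"}}
TEMPORAL = re.compile(
    r"\b(today|tomorrow|tonight|yesterday|\d{1,2}\s?(?:am|pm))\b",
    re.IGNORECASE,
)
STOPWORDS = {
    "en": {"the", "is", "and", "for", "at", "in", "of"},
    "es": {"el", "la", "es", "y", "para", "de"},
}


def words(text):
    """Lowercase word tokens, including accented letters."""
    return re.findall(r"\w+", text.lower())


def guess_language(phrase):
    """Pick the language whose stopwords occur most often; None if no hits."""
    tokens = words(phrase)
    counts = {lang: sum(t in sw for t in tokens) for lang, sw in STOPWORDS.items()}
    best = max(counts, key=counts.get)
    return best if counts[best] > 0 else None


def parse(phrases, field):
    """(i) Keep only field-relevant phrases, (ii) group by temporal reference."""
    terms = FIELD_TERMS[field]
    grouped = {}
    for phrase in phrases:
        if not terms & set(words(phrase)):
            continue  # not relevant to the specified field
        match = TEMPORAL.search(phrase)
        key = match.group(1).lower() if match else None
        grouped.setdefault(key, []).append(phrase)
    return grouped


def search(grouped, query, language):
    """Ignore text not in the selected language, then match the search phrase."""
    wanted = set(words(query))
    return [
        phrase
        for group in grouped.values()
        for phrase in group
        if guess_language(phrase) == language and wanted <= set(words(phrase))
    ]


# Made-up stored text standing in for transcribed audio, OCR output, and email.
stored = [
    "Heavy rain is expected tomorrow in the valley.",
    "The storm radar is offline tonight.",
    "El radar de la tormenta es nuevo.",
    "The council meets on Monday.",
]
grouped = parse(stored, "weather")
print(search(grouped, "radar", "en"))
```

Phrases without a recognized temporal reference are grouped under `None` rather than discarded, which is a design choice of this sketch; the claims only require that temporally marked phrases be grouped.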

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10127306B2 cover?
A method and system for searching alternative data sources include monitoring a first communications source broadcasting unstructured data, and a second communications source broadcasting structured data. The method further includes generating text from the unstructured data and from the structured data collected, and parsing the generated text. The method also includes defining a search phrase…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F17/30634. Mapped technology areas include Physics.
When was this patent published?
Publication date Nov 13, 2018 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).