System and Method for Parsing Regulatory and Other Documents for Machine Scoring Background
US-2024296188-A1 · Sep 5, 2024 · US
US2016004707A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2016004707-A1 |
| Application number | US-201514733188-A |
| Country | US |
| Kind code | A1 |
| Filing date | Jun 8, 2015 |
| Priority date | May 12, 2011 |
| Publication date | Jan 7, 2016 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Natural language query translation may be provided. A statistical model may be trained to detect domains according to a plurality of query click log data. Upon receiving a natural language query, the statistical model may be used to translate the natural language query into an action. The action may then be performed and at least one result associated with performing the action may be provided.
Opening claim text (preview).
We claim: 1 . A method for providing natural language query translation, the method comprising: training a statistical model according to a plurality of query click log data; receiving a natural language query; translating the natural language query into a search query according to the statistical model; performing the search query; and providing at least one result associated with performing the search query. 2 . The method of claim 1 , wherein the natural language query is received as text. 3 . The method of claim 1 , wherein the natural language query is received as speech. 4 . The method of claim 1 , wherein training the statistical model comprises identifying a plurality of domain independent salient phrases. 5 . The method of claim 4 , wherein each of the plurality of domain independent salient phrases comprises at least one word indicating that an associated search query comprises a natural language search query. 6 . The method of claim 4 , wherein training the plurality of query click log data is associated with a plurality of search engine results. 7 . The method of claim 6 , further comprising identifying a plurality of search queries associated with the plurality of query click log data that comprise natural language search queries according to the plurality of domain independent salient phrases. 8 . The method of claim 7 , further comprising generating a query pair by correlating at least one of the plurality of natural language search queries to at least one keyword-based search query. 9 . The method of claim 8 , wherein the correlation between the at least one of the plurality of natural language search queries and the at least one keyword-based search query is associated with a Uniform Resource Locator (URL) distribution. 10 . The method of claim 9 , wherein performing the search query comprises: searching the plurality of query click log data for a query pair corresponding to the search query; and identifying a domain associated with the search query according to the URL distribution. 11 . A system for providing natural language query translation, the system comprising: a memory storage; and a processing unit coupled to the memory storage, wherein the processing unit is operable to: receive a query from a user, determine whether the query comprises a natural language query, and in response to determining that the query comprises the natural language query: map the natural language query into a keyword-based query; perform a search according to a query pair comprising the natural language query and the keyword-based query; and provide a plurality of results associated with the search to the user. 12 . The system of claim 11 , wherein being operative to map the natural language query into the keyword-based query comprises being operative to: detect a domain associated with the natural language query; and strip at least one domain-independent word from the natural language query. 13 . The system of claim 12 , wherein being operative to detect the domain associated with the natural language query comprises being operative to: identify a subset of a plurality of possible domains according to at least one feature of the natural language query 14 . The system of claim 13 , wherein the at least one feature of the natural language query comprises at least one of the following: a lexical feature, a contextual feature, a semantic feature, a syntactic feature, and a topical feature. 15 . The system of claim 13 , wherein being operative to map the natural language query into the keyword-based query comprises being further operative to: convert the natural language query into the keyword-based query according to a trained statistical machine translation model. 16 . The system of claim 15 , wherein the statistical machine translation model is trained according to a plurality of mined query pairs each comprising a previous natural language query and an associated previous keyword-based query. 17 . The system of claim 16 , wherein the previous natural language query is identified according to a domain independent salient phrase. 18 . The system of claim 16 , wherein the previous natural language query and the previous keyword-based query are associated according to a weighted Uniform Resource Locator (URL) click graph. 19 . The system of claim 16 , wherein the mined query pairs each comprise a statistical weighting according to a semantic correlation between the previous natural language query and the previous keyword-based query. 20 . A computer-readable medium which stores a set of instructions which when executed performs a method for providing natural language query translation, the method executed by the set of instructions comprising: training a statistical machine translation model according to a plurality of mined query pairs, wherein training the statistical machine translation model comprises: identifying a plurality of domain independent salient phrases (DISPs), identifying a plurality of previous natural language queries according to the plurality of DISPs, associating each of the plurality of previous natural language queries with a previous keyword-based query into a mined query pair of the plurality of mined query pairs according to a uniform resource locator (URL) click graph, wherein the URL click graph comprises a weighted distribution of URLs selected in response to the a previous natural language queries and previous keyword-based queries, and extracting a plurality of common features for each of the mined query pairs; receiving a new query from a user, determining whether the new query comprises a new natural language query, in response to determining that the query comprises the natural language query, mapping the new natural language query into a keyword-based query according to the trained statistical machine translation model; performing a search according to the new query; and providing a plurality of results associated with the search to the user.
Mapping; Conversion · CPC title
Machine learning · CPC title
Query translation · CPC title
Query expansion · CPC title
Selection or weighting of terms from queries, including natural language queries · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.