Token Based Dynamic Data Indexing With Integrated Security

US2025173319A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2025173319-A1
Application numberUS-202519040034-A
CountryUS
Kind codeA1
Filing dateJan 29, 2025
Priority dateJul 17, 2013
Publication dateMay 29, 2025
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Semantic analysis to resolve ambiguous user input data with respect to a request for data includes identifying tokens based on the text string, identifying a tables in a database, wherein a token indicates a column from a table and a token indicates a column from another table, identifying candidate join paths for joining tables, obtaining respective candidate results using the candidate join paths, outputting combined candidate results including values from the respective candidate results, obtaining second user input data indicating a selected value, identifying the request for data as unambiguous and identifying a selected join path based on the selected value, and, in response to identifying the request for data as unambiguous, outputting data responsive to the request for data using the selected join path.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method comprising: obtaining, by an information retrieval system, from a database management system, enterprise data; automatically generating, by the information retrieval system, enterprise data tokens representing the enterprise data; automatically indexing, by the information retrieval system, the enterprise data tokens in an index structure of the information retrieval system; obtaining, by the information retrieval system, first user input data including a natural language string expressing a request for data from the information retrieval system; and automatically generating, by the information retrieval system, a semantic representation of the natural language string, in a form that differs from a structured query language of the database management system, wherein generating the semantic representation includes: traversing the index structure to match a portion of the natural language string to an enterprise data token from the enterprise data tokens; and obtaining the semantic representation in accordance with the enterprise data token using a natural language processor. 2 . The method of claim 1 , wherein indexing the enterprise data tokens includes: including, in the index structure, a root node; including, in the index structure, a first branch depending from the root node, the first branch representing a first symbol from the enterprise data token; and including, in the index structure, a security bitmask for the enterprise data token, such that a security bitmask for the first branch at the root node is a hierarchical logical disjunction based on the security bitmask for the enterprise data token. 3 . The method of claim 2 , wherein: obtaining the first user input data includes obtaining a security bitmask for the first user input data; and automatically generating the semantic representation includes: determining that the first symbol matches a symbol from the portion of the natural language string; and determining that a horizontal logical disjunction of a vertical logical conjunction of the security bitmask for the first branch and the security bitmask for the first user input data indicates authorization. 4 . The method of claim 1 , wherein: automatically indexing the enterprise data tokens includes: including, in the index structure, data indicating an association between the enterprise data token and a column of a table stored in the database management system; and automatically generating the semantic representation includes: in response to determining, by a finite state machine of the information retrieval system, that the column is a measure column, including, in the semantic representation, data indicating an aggregation operation with respect to the measure column. 5 . The method of claim 1 , further comprising: automatically converting, by the information retrieval system, the semantic representation into a structured query language query expressing the request for data; obtaining, by the information retrieval system, from the database management system, in response to the structured query language query, results data responsive to the request for data; and automatically outputting, for presentation to a user, the results data. 6 . The method of claim 5 , wherein: obtaining the first user input data includes obtaining the first user input data from a user device; and automatically outputting the results data includes outputting the results data to the user device. 7 . The method of claim 5 , wherein: automatically indexing the enterprise data tokens includes: including, in the index structure, data indicating an association between the enterprise data token and a first table stored in the database management system; and including, in the index structure, data indicating an association between a second enterprise data token from the enterprise data tokens and a second table stored in the database management system; automatically generating the semantic representation includes: traversing the index structure to match a second portion of the natural language string to the second enterprise data token; including the second enterprise data token in the semantic representation; identifying a join path for joining data from the first table with data from the second table; including data indicating the join path in the semantic representation; and automatically converting the semantic representation includes: including data indicating the join path in the structured query language query. 8 . The method of claim 7 , wherein: obtaining the enterprise data includes: obtaining relationship data indicating a relationship between the first table and the second table stored; and storing the relationship data in the information retrieval system; and identifying the join path includes using the relationship data. 9 . The method of claim 5 , wherein: automatically indexing the enterprise data tokens includes: including, in the index structure, data indicating an association between the enterprise data token and a first column of a first table stored in the database management system; automatically generating the semantic representation includes: identifying candidate joint paths, wherein identifying the candidate join paths includes: identifying a first candidate join path for joining data from the first column with data from a second column from a second table stored in the database management system; and identifying a second candidate join path for joining data from the first column with data from a third column from the second table; obtaining second user input data indicating a join path from the candidate join paths; including data indicating the join path in the semantic representation; and automatically converting the semantic representation includes: including data indicating the join path in the structured query language query. 10 . A non-transitory computer-readable storage medium, comprising executable instructions that, when executed by a processor, facilitate performance of operations, comprising: obtaining, by an information retrieval system, from a database management system, enterprise data; automatically generating, by the information retrieval system, enterprise data tokens representing the enterprise data; automatically indexing, by the information retrieval system, the enterprise data tokens in an index structure of the information retrieval system; obtaining, by the information retrieval system, first user input data including a natural language string expressing a request for data from the information retrieval system; and automatically generating, by the information retrieval system, a semantic representation of the natural language string, in a form that differs from a structured query language of the database management system, wherein generating the semantic representation includes: traversing the index structure to match a portion of the natural language string to an enterprise data token from the enterprise data tokens; and obtaining the semantic representation in accordance with the enterprise data token using a natural language processor. 11 . The non-transitory computer-readable storage medium of claim 10 , wherein: indexing the enterprise data tokens includes: including, in the index structure, a root node; including, in the index structure, a first branch depending from the root node, the first branch representing a first symbol from the enterprise data token; and including, in the index structure, a security bitmask for the enterprise data token, such that a security bitmask for the first branch at the roo

Assignees

Inventors

Classifications

  • Inference or reasoning models · CPC title

  • Natural language query formulation · CPC title

  • Translation of natural language queries to structured queries · CPC title

  • Natural language query formulation · CPC title

  • Selection or weighting of terms from queries, including natural language queries · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2025173319A1 cover?
Semantic analysis to resolve ambiguous user input data with respect to a request for data includes identifying tokens based on the text string, identifying a tables in a database, wherein a token indicates a column from a table and a token indicates a column from another table, identifying candidate join paths for joining tables, obtaining respective candidate results using the candidate join p…
Who is the assignee on this patent?
Thoughtspot Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/2423. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu May 29 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).