Index configuration for searchable data in network

US9411839B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9411839-B2
Application numberUS-201514728109-A
CountryUS
Kind codeB2
Filing dateJun 2, 2015
Priority dateOct 12, 2012
Publication dateAug 9, 2016
Grant dateAug 9, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An entity using a computing device can upload searchable data to a network service to be indexed and stored. The data can include a plurality of data fields, each data field having one or more associated values. The network service can analyze the data fields and their respectively associated values to determine data field types for the data fields and search options to be enabled for the data fields. Based at least in part on the data field types and the search options, the network service can generate a search index configuration/schema. Based at least in part on the generated search index configuration/schema, the network service can generate a search index for the data. In some embodiments, the network service can also convert the data into a format compatible with the search index.

First claim

Opening claim text (preview).

What is claimed is: 1. A system comprising: at least one communications transceiver; one or more storage allocations; at least one processor; and a memory device including instructions that, when executed by the at least one processor, cause the system to: receive, via the at least one communications transceiver, data of a first format to be indexed, the data including a plurality of data fields; analyze data in each of the plurality of data fields to determine a value associated with each data field, each respective value corresponding to one of an amount of integer characters reaching an integer threshold, an amount of alphabetic characters being within a specified literal amount range, or an amount of integer and alphabetic characters above a text amount threshold; determine from a plurality of types of data fields, a type of a data field for each of the plurality of data fields; determine one or more search options to be enabled for each data field of the plurality of data fields based at least in part on the type of the data field for each of the plurality of data fields, the one or more search options including an option to include a respective data field in a search index; generate an index configuration for the data based at least in part on the one or more search options; generate a search index for the data based at least in part on the index configuration for the data; and convert the data from the first format to a second format to generate converted data, the second format being compatible with the search index, wherein the converted data is stored in at least one storage location. 2. The system of claim 1 , wherein the instructions to convert the data from the first format to the second format cause the system to further: compare the first format with the second format; and modify at least one data tag associated with the first format to correspond to at least one data tag associated with the second format. 3. The system of claim 1 , wherein the instructions to determine the type of the data field cause the system to further: determine that a value associated with the data field has an amount of characters above a specified character-type threshold; and determine the type of the data field to be a type corresponding to the character-type threshold. 4. The system of claim 3 , wherein the plurality of types of data fields includes at least one of an integer type, a text type, a literal type, a geolocation type, a time type, a date type, or a floating point number type. 5. The system of claim 4 , wherein the character-type threshold includes: an integer amount threshold; a text amount threshold; a text distinct amount threshold; a text distinct percentage threshold; and a text length threshold. 6. The system of claim 1 , wherein the one or more search options include at least one of an option to include the data field in the search index to be generated, an option to calculate a facet count for the data field, or an option to provide a value associated with the data field in response to a relevant search query. 7. The system of claim 6 , wherein the determining the one or more search options to be enabled comprises: enabling the option to include the data field in the search index to be generated, based at least in part on receiving an input indicating that the data field is to be included in the search index or determining a type of the data field to be a literal type; enabling the option to calculate the facet count for the data field, based at least in part on determining that a quantity for a distribution of a plurality of values associated with the data field is below a specified facet count upper threshold; or enabling the option to provide the value associated with the data field in response to the relevant search query, based at least in part on at least one of receiving a signal included in the data field indicating that the value associated with the data field is to be provided or determining that a length of the value associated with the data field is below a specified return value length threshold. 8. A computer-implemented method comprising: receiving data to be indexed, the data including a plurality of data fields; analyzing data in a first format in each of the plurality of data fields to determine a value associated with each data field, each respective value corresponding to one of an amount of integer characters reaching an integer threshold, an amount of alphabetic characters being within a specified literal amount range, or an amount of integer and alphabetic characters above a text amount threshold; determining from a plurality of types of data fields, a type of a data field for each of the plurality of data fields; determining one or more search options to be enabled for each data field of the plurality of data fields based at least in part on the type of the data field for each of the plurality of data fields, the one or more search options including an option to include a respective data field in a search index; generating an index configuration for the data based at least in part on the one or more search options; generating a search index for the data based at least in part on the index configuration for the data; and converting the data from the first format to a second format to generate converted data, the second format being compatible with the search index, wherein the converted data is stored in at least one storage location. 9. The computer-implemented method of claim 8 , wherein the converting the data from the first format to the second format comprises: comparing the first format with the second format; and modifying at least one data tag associated with the first format to correspond to at least one data tag associated with the second format. 10. The computer-implemented method of claim 8 , wherein the determining the type of the data field comprises: determining that a value associated with the data field has an amount of characters above a specified character-type threshold; and determining the type of the data field to be a type corresponding to the character-type threshold. 11. The computer-implemented method of claim 10 , wherein the plurality of types of data fields includes at least one of an integer type, a text type, a literal type, a geolocation type, a time type, a date type, or a floating point number type. 12. The computer-implemented method of claim 11 , wherein the character-type threshold includes: an integer amount threshold; a text amount threshold; a text distinct amount threshold; a text distinct percentage threshold; and a text length threshold. 13. The computer-implemented method of claim 8 , wherein the one or more search options include at least one of an option to include the data field in the search index to be generated, an option to calculate a facet count for the data field, or an option to provide a value associated with the data field in response to a relevant search query. 14. The computer-implemented method of claim 13 , wherein the determining the one or more search options to be enabled comprises: enabling the option to include the data field in the search index to be generated, based at least in part on receiving an input indicating that the data field is to be included in the search index or determining a type of the data field to be a literal type; enabling the option to calculate the facet count for the data field, based at least in part on determining that a quantity for a distribution of a plurality of values associated with the data field is below a specified facet count upper threshold; or

Assignees

Inventors

Classifications

  • Physics · mapped topic

  • Physics · mapped topic

  • Physics · mapped topic

  • Physics · mapped topic

  • in which an application is distributed across nodes in the network (software deployment G06F8/60; multiprogramming arrangements G06F9/46) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9411839B2 cover?
An entity using a computing device can upload searchable data to a network service to be indexed and stored. The data can include a plurality of data fields, each data field having one or more associated values. The network service can analyze the data fields and their respectively associated values to determine data field types for the data fields and search options to be enabled for the data …
Who is the assignee on this patent?
A9 Com Inc
What technology area does this patent fall under?
Primary CPC classification G06F17/30321. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 09 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).