System and method for preparing a data set for searching

US10685000B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-10685000-B1
Application numberUS-201916518385-A
CountryUS
Kind codeB1
Filing dateJul 22, 2019
Priority dateJul 22, 2019
Publication dateJun 16, 2020
Grant dateJun 16, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments of the present disclosure relate to systems and methods for preparing a data set for searching. In addition, embodiments of the present disclosure relate to solutions for configuring a storage infrastructure and indexing process for a data set.

First claim

Opening claim text (preview).

The invention claimed is: 1. A computer-implemented method for preparing a data set for searching, the method comprising: receiving, by a computer system comprising one or more processors, a request from a client device to upload a data set for use with an application; determining, by the computer system, whether the client device is authorized to upload the data set; sending, by the computer system, a request to the client device instructing a user to configure settings for the application; receiving, by the computer system, the settings for the application, the settings configuring a storage infrastructure for storing data associated with the application, and configuring an indexing process for indexing the data associated with the application, in response to selection of the settings by the user; receiving, by the computer system, the data set; processing, by the computer system, the data set to store the data set as an indexed data set in accordance with the configuration for the storage infrastructure and in accordance with the configuration for indexing the data; providing, by the computer system, a notification of a status of the processing of the data set for display to the user on the client device; identifying, by the computer system, a first field of the indexed data set that is searched more frequently than a second field of the indexed data set; and adding, by the computer system in response to identifying the first field as being searched more frequently than the second field, an entry to a graphical user interface for searching for an attribute in the first field. 2. The method of claim 1 , wherein the settings include one or more of a format of the data set, a frequency in which data sets will be loaded, a volume of data to be stored and searched, and a preferred notification channel for providing status updates to the user. 3. The method of claim 2 , wherein the settings include a format of the data set, and the format of the data set is one of a comma-separated values (CSV) format, a JavaScript Object Notation (JSON) format, a portable document file (PDF) format, a text file format, a hypertext markup language (HTML) format, or an extensible markup language (XML) format. 4. The method of claim 1 , wherein the data set is a first data set, comprising: receiving, by the computer system, a request to upload a second data set for use with the application; identifying, by the computer system, that the settings have already been configured; receiving, by the computer system, the second data set; processing, by the computer system, the second data set to store the second data set as an indexed second data set in accordance with the settings; and providing, by the computer system, a notification of a status of the processing of the second data set for display to the user on the client device. 5. The method of claim 1 , wherein the computer system is a first computer system, comprising receiving, by the first computer system, the data set from a second computer system, the second computer system receiving a portion of the data set from each of a plurality of client devices and combining the portions into the data set for providing to the first computer system. 6. The method of claim 1 , comprising: determining, by the computer system, after processing the data set, that at least a portion of the data set was not stored or was not indexed; and providing, by the computer system, the notification in response to determining that the at least a portion of the data set was not stored or was not indexed, the notification indicating that the processing of the data set was unsuccessful. 7. The method of claim 1 , wherein processing the data set to store the data set as an indexed data set in accordance with the determined configuration for the storage infrastructure and in accordance with the determined configuration for indexing the data includes breaking the data set into portions, and processing the portions in parallel across one or more processors. 8. The method of claim 1 , wherein representational state transfer (REST) application programming interfaces (APIs) are used by the computer system to store data into the storage infrastructure, to modify indexed data, and/or to retrieve indexed data in response to a search. 9. The method of claim 1 , wherein the data set includes data representing a vehicle inventory, further comprising: receiving, by the computer system, a search for data regarding vehicles within a proximity of a geographical location; and providing, by the computer system, a list of search results identifying vehicles within the proximity of the geographical location, the list of search results identifying the vehicles in an order based on the proximity of the vehicle to the geographical location. 10. The method of claim 9 , wherein the client device is a first client device, the search is received from a second client device executing the application, and the list of search results are provided to the second client device. 11. The method of claim 1 , comprising selecting, by the computer system, a number of computing devices on which to store the indexed data set, the number of computing devices selected being based at least in part on one or more of the settings received for configuring the storage infrastructure and the settings received for configuring the indexing. 12. The method of claim 11 , wherein each of the computing devices is one of a processor or a storage device. 13. The method of claim 1 , wherein the computer system identifies the first field as being searched more frequently than the second field by executing a machine learning algorithm across the indexed data set in the storage infrastructure. 14. A computer system for preparing a data set for searching, the computer system comprising: one or more memories storing instructions; and one or more processors that, when executing the instructions, are configured to: receive a request from a client device to upload a data set for use with an application; determine whether the client device is authorized to upload the data set; send a request to the client device instructing a user to configure settings for the application; receive the settings for the application, the settings configuring a storage infrastructure for storing data associated with the application, and configuring an indexing process for indexing the data associated with the application, in response to selection of the settings by the user; receive the data set; process the data set to store the data set as an indexed data set in accordance with the configuration for the storage infrastructure, and in accordance with the configuration for indexing the data; provide a notification of a status of the processing of the data set for display to the user on the client device; identify a first field of the indexed data set that is searched more frequently than a second field of the indexed data set; and add, in response to identifying the first field as being searched more frequently than the second field, an entry to a graphical user interface for searching for an attribute in the first field. 15. The computer system of claim 14 , wherein the settings include one or more of a format of the data set, a frequency in which the data sets will be loaded, a volume of data to be stored and searched, and a preferred notification channel for providing status updates to the user. 16. The computer system of claim 14 , wherein the one or more processors, when executing the instructions, are configured to: receive a request to upload

Assignees

Inventors

Classifications

  • Probabilistic graphical models, e.g. probabilistic networks · CPC title

  • Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound · CPC title

  • between a Database Management System and a front-end application · CPC title

  • Management thereof · CPC title

  • Extracting rules from data · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10685000B1 cover?
Embodiments of the present disclosure relate to systems and methods for preparing a data set for searching. In addition, embodiments of the present disclosure relate to solutions for configuring a storage infrastructure and indexing process for a data set.
Who is the assignee on this patent?
Capital One Services Llc
What technology area does this patent fall under?
Primary CPC classification G06F16/2272. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 16 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).