Systems and methods for optimizing data for a serverless file system

US12499090B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12499090-B2
Application numberUS-202318472430-A
CountryUS
Kind codeB2
Filing dateSep 22, 2023
Priority dateSep 22, 2023
Publication dateDec 16, 2025
Grant dateDec 16, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed embodiments may include a system for optimizing data for a serverless file system. The system may receive input data and determine whether the input data is appropriate for optimization. If so, the system may optimize the input data to reduce a size to create optimized data, group the optimized data, apply metadata, versioning, tags, enterprise governance and compliance requirements to the optimized data and store the optimized data in a data repository. In response to receiving an input to upload the optimized data to the serverless file system and determining, from the metadata, that the optimized data is not stored in the serverless file system, the system may upload the optimized data to the serverless file system via an application programming interface. The system may update metadata of the optimized data to indicate a new location at which the optimized data is stored on the serverless file system.

First claim

Opening claim text (preview).

What is claimed is: 1 . A system comprising: one or more processors; memory in communication with the one or more processors and storing instructions that are configured to cause the system to: receive input data; determine a maximum size threshold based on an upload size restriction of a serverless file system and an expected percentage in data reduction from optimizing data for upload to the serverless file system; automatically determine whether the input data is appropriate for optimization by comparing a size of the input data to a minimum size threshold and the maximum size threshold; responsive to determining that the input data is appropriate for optimization: optimize the input data to reduce a size to create optimized data; group the optimized data; apply metadata to the optimized data, wherein the metadata is configured to accommodate an indication of all locations the optimized data is stored in the serverless file system; apply enterprise governance and compliance requirements to the optimized data; store the optimized data and the metadata in a data repository, the data repository being configured to locally store optimized data in a condition to be uploaded to the serverless file system upon command; receive an input to upload the optimized data to the serverless file system; determine, from the metadata, whether the optimized data is stored in the serverless file system; responsive to determining that the optimized data is not stored in the serverless file system: upload, from the data repository and via an application programming interface, to the serverless file system, the optimized data; and update, in the data repository, the metadata associated with the optimized data that is stored in the data repository to indicate a new location at which the optimized data is stored on the serverless file system. 2 . The system of claim 1 , wherein determining whether the input data is appropriate for optimization comprises: identifying whether one or more dependencies exist; responsive to identifying that the one or more dependencies exist: determining whether a sum of a total size of the one or more dependencies is greater than 50 megabytes and less than 250 megabytes; and responsive to determining that the sum of the total size of the one or more dependencies is greater than 50 megabytes and less than 250 megabytes: determining that the input data is appropriate for optimization. 3 . The system of claim 1 , wherein optimizing the input data further comprises: determining a programming language of the input data; responsive to determining the programming language of the input data: completing language-specific size optimizations of the input data; stripping object symbols from .so files; deleting test folders; and compressing the input data. 4 . The system of claim 3 , wherein the memory stores further instructions that are configured to cause the system to: responsive to determining that the optimized data is not stored in the serverless file system: decompress the optimized data. 5 . The system of claim 4 , wherein optimizing the input data is completed by a first machine learning model, wherein the first machine learning model is trained to complete language-specific optimization of the input data based on prior examples of the input data and the optimized data. 6 . The system of claim 3 , wherein completing the language-specific size optimizations of the input data comprises deleting object files in python, minifying files in JavaScript, or combinations thereof. 7 . The system of claim 1 , wherein grouping the optimized data comprises: locating dependencies in the optimized data; and grouping, using heuristics, the optimized data such that inputs and outputs of the dependencies are located in a proximity to one another. 8 . The system of claim 7 , wherein the grouping of the optimized data is completed by a second machine learning model based on prior examples of the input data and the optimized data. 9 . The system of claim 1 , wherein the memory stores further instructions that are configured to cause the system to: responsive to determining that the optimized data is stored in the serverless file system: reference the optimized data in the serverless file system. 10 . A system comprising: one or more processors; memory in communication with the one or more processors and storing instructions that are configured to cause the system to: receive input data; determine a maximum size threshold based on an upload size restriction of a serverless file system and an expected percentage in data reduction from optimizing data for upload to the serverless file system; automatically determine whether the input data is appropriate for optimization by comparing a size of the input data to a minimum size threshold and the maximum size threshold; responsive to determining that the input data is appropriate for optimization: optimize the input data to reduce a size to create optimized data; group the optimized data; apply metadata to the optimized data, wherein the metadata is configured to accommodate an indication of all locations the optimized data is stored in the serverless file system; apply enterprise governance and compliance requirements to the optimized data; store the optimized data and the metadata in a data repository, the data repository being configured to locally store optimized data in a condition to be uploaded to the serverless file system upon command; generate a first graphical user interface comprising search input terms; transmit the first graphical user interface to a user device; receive a search command from the user device; generate, based on the search command and the metadata, an interactive search list, wherein the interactive search list shows results including the optimized data; generate a second graphical user interface comprising the interactive search list; transmit the second graphical user interface to the user device; and receive a selection from the user device indicating to upload the optimized data. 11 . The system of claim 10 , wherein the results are entries further comprising information regarding the entry, an architecture, a last updated date, contributors, or combinations thereof. 12 . The system of claim 10 , wherein the interactive search list further comprises filters used to sort results and access control limitations that require user authentication to view certain results. 13 . The system of claim 10 , wherein the memory stores further instructions that are configured to cause the system to: upload, via an application programming interface, to the serverless file system, the optimized data. 14 . The system of claim 10 , wherein the memory stores further instructions that are configured to cause the system to: determine, from the metadata, whether the optimized data is stored in the serverless file system; responsive to determining that the optimized data is not stored in the serverless file system: upload, via an application programming interface, to the serverless file system, the optimized data; update the metadata of the optimized data to indicate a new location at which the optimized data is stored on the serverless file system; and responsive to determining that the optimized data is stored in the serverless file system: reference the optimized data in the serverless file system. 15 . The system of claim 10 , wherein optimizing the input data is completed by a first machine learning model, wherein the first machine learning mod

Assignees

Inventors

Classifications

  • G06F16/164Primary

    File meta data generation · CPC title

  • G06F16/172Primary

    Caching, prefetching or hoarding of files · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12499090B2 cover?
Disclosed embodiments may include a system for optimizing data for a serverless file system. The system may receive input data and determine whether the input data is appropriate for optimization. If so, the system may optimize the input data to reduce a size to create optimized data, group the optimized data, apply metadata, versioning, tags, enterprise governance and compliance requirements t…
Who is the assignee on this patent?
Capital One Services Llc
What technology area does this patent fall under?
Primary CPC classification G06F16/164. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 16 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).