Methods, apparatuses and computer program products for generating synthetic data

US12511560B2 · US · B2

Patent metadata
Field: Value
Publication number: US-12511560-B2
Application number: US-202217701102-A
Country: US
Kind code: B2
Filing date: Mar 22, 2022
Priority date: Mar 22, 2022
Publication date: Dec 30, 2025
Grant date: Dec 30, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system for generating simulated data is disclosed. The system may determine items of content utilized by a network. The system may also retrieve one or more data patterns associated with one or more features associated with the content. The system may also determine a plurality of indices associated with the data patterns. The system may also generate, based on the data patterns and the plurality of indices, simulated data associated with the content.

First claim

Opening claim text (preview).

What is claimed:
1. A method comprising: determining items of content utilized by a network; retrieving one or more data patterns associated with one or more features associated with the content, the one or more data patterns including a number of indices; determining rare indices that appear less than threshold number of times in the indices; and generating, based on the data patterns and the indices, simulated data associated with the content, wherein the simulated data includes the rare indices, and wherein any remaining indices of the simulated data are generated using a Markov chain.
2. The method of claim 1, further comprising: enabling usage of the simulated data to facilitate generation of at least one item of hardware or at least one application.
3. The method of claim 1, wherein the simulated data is associated with one or more characteristics associated with the content.
4. The method of claim 1, further comprising: determining one or more memory access patterns based on the indices.
5. The method of claim 1, further comprising determining indices that appear greater than or equal to the threshold number of times in the indices.
6. The method of claim 1, wherein generating the simulated data comprises applying a random permutation associated with the rare indices and arranging the rare indices in a sequence.
7. The method of claim 1, wherein the simulated data includes a same number of indices as the one or more data patterns.
8. A system comprising: a device comprising one or more processors; and at least one memory storing instructions, that when executed by the one or more processors, cause the device to: determine items of content utilized by a network; retrieve one or more data patterns associated with one or more features associated with the content, the one or more data patterns including a number of indices; determining rare indices that appear less than threshold number of times in the indices; and generate, based on the data patterns and the indices, simulated data associated with the content, wherein the simulated data includes the rare indices, and wherein any remaining indices to fill the number of indices are generated using a Markov chain.
9. The system of claim 8, wherein when the one or more processors further execute the instructions, further causes the device to: enable usage of the simulated data to facilitate generation of at least one item of hardware or at least one application.
10. The system of claim 8, wherein the simulated data is associated with one or more characteristics associated with the content.
11. The system of claim 8, wherein when the one or more processors further execute the instructions, further causes the device to: determine one or more memory access patterns based on the indices.
12. The system of claim 8, wherein when the one or more processors further execute the instructions, further causes the device to determine indices that appear greater than or equal to the threshold number of times in the indices.
13. A computer-readable medium storing instructions that, when executed, cause: determining items of content utilized by a network; retrieving one or more data patterns associated with one or more features associated with the content, the one or more data patterns including a number of indices; determining rare indices that appear less than threshold number of times in the indices; and generating, based on the data patterns and the indices, simulated data associated with the content, wherein the simulated data includes the rare indices, and wherein any remaining indices to fill the number of indices are generated using a Markov chain.
14. The computer-readable medium of claim 13, wherein the instructions, when executed, further cause: enabling usage of the simulated data to facilitate generation of at least one item of hardware or at least one application.
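
The steps recited in claims 1, 6 and 7 can be illustrated with a short Python sketch. This is not the patented implementation, only one possible reading of the claim language. The function names, the first-order Markov chain, the placement of the rare indices at the front of the output, and the fallback used when no index meets the threshold are all assumptions made for illustration.

```python
import random
from collections import Counter, defaultdict


def split_rare(indices, threshold):
    """Split distinct index values into rare (< threshold occurrences) and frequent."""
    counts = Counter(indices)
    rare = [i for i, c in counts.items() if c < threshold]
    frequent = [i for i, c in counts.items() if c >= threshold]
    return rare, frequent


def build_transitions(indices, allowed):
    """Collect observed successors between consecutive allowed indices."""
    seq = [i for i in indices if i in allowed]
    transitions = defaultdict(list)
    for cur, nxt in zip(seq, seq[1:]):
        transitions[cur].append(nxt)
    return seq, transitions


def generate_simulated(indices, threshold, seed=0):
    """Return a simulated sequence with the same length as `indices`."""
    rng = random.Random(seed)
    rare, frequent = split_rare(indices, threshold)
    rng.shuffle(rare)  # random permutation of the rare indices (claim 6)

    # Assumption: each rare value is emitted once; the rest is filled by the chain.
    n_remaining = len(indices) - len(rare)
    # Assumption: if nothing is frequent, fall back to modelling all indices.
    allowed = set(frequent) or set(indices)
    seq, transitions = build_transitions(indices, allowed)

    chain = []
    if seq and n_remaining > 0:
        state = rng.choice(seq)
        for _ in range(n_remaining):
            chain.append(state)
            successors = transitions.get(state)
            # Dead end (no observed successor): restart from a random observed index.
            state = rng.choice(successors) if successors else rng.choice(seq)
    return rare + chain


# Hypothetical trace of indices; 9 and 42 each occur once, so they are rare.
trace = [3, 7, 3, 7, 3, 9, 7, 3, 42, 7, 3, 7]
simulated = generate_simulated(trace, threshold=2, seed=1)
print(simulated)
```

Sampling successors from the observed list reproduces the empirical transition frequencies without building an explicit probability matrix, which keeps the sketch short. A real system would likely need a more careful policy for where the rare indices are placed in the sequence.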

Assignees

Meta Platforms Inc

Inventors

Classifications

  • based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO] · CPC title

  • Markov models or related models, e.g. semi-Markov models; Markov random fields; Networks embedding Markov models · CPC title

  • Threshold monitoring · CPC title

  • using statistical or mathematical methods · CPC title

  • involving simulating, designing, planning or modelling of a network · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12511560B2 cover?
A system for generating simulated data is disclosed. The system may determine items of content utilized by a network. The system may also retrieve one or more data patterns associated with one or more features associated with the content. The system may also determine a plurality of indices associated with the data patterns. The system may also generate, based on the data patterns and the plura…
Who is the assignee on this patent?
Meta Platforms Inc
What technology area does this patent fall under?
Primary CPC classification G06N7/01. Mapped technology areas include Physics.
When was this patent published?
Publication date Dec 30, 2025 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).