Evaluating models that rely on aggregate historical data

US10719521B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10719521-B2
Application numberUS-201715707594-A
CountryUS
Kind codeB2
Filing dateSep 18, 2017
Priority dateSep 18, 2017
Publication dateJul 21, 2020
Grant dateJul 21, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for model validation includes generating a first and a second time series of segmentation states for a data set representative of a simulated population, e.g., a collection of membership counts corresponding to respective segments of the simulated population. The first and second time series of segmentation states are generated by respectively processing the data set through a first and a second simulation each comprising iterative application of a plurality of event functions. The first and the second simulation differ in at least one capacity, e.g., one including a first event function configured with a first parameter, and the second not. Analysis of differences between the first and second time series may be compared to analysis of one of the time series using a subject model. The comparison is then used to validate the model or demonstrate accuracies, inaccuracies, and/or model bias with respect to a performance metric.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for model validation, the method comprising: generating a data set comprising a collection of membership counts corresponding to segments; generating a first time series of states including a plurality of first membership counts corresponding to the segments at a plurality of points in time by processing the data set through a first simulation comprising application of a first plurality of functions including a first function configured with a first parameter, wherein processing the data set through the first simulation causes the collection of membership counts to change to one of the plurality of first membership counts at each of the plurality of points in time; generating a second time series of states including a plurality of second membership counts corresponding to the segments at the plurality of points in time by processing the data set through a second simulation comprising application of a second plurality of functions, wherein the second plurality of functions does not include the first function configured with the first parameter, wherein processing the data set through the second simulation causes the collection of membership counts to change to one of the plurality of second membership counts at each of the plurality of points in time; identifying a first value for a metric by analyzing differences in the plurality of first membership counts and the plurality of second membership counts; identifying, for a subject model, a second value for the metric, the second value representative of an output from application of the subject model to one of the first time series or the second first time series; and determining, by comparison of the first value to the second value, a score for the subject model. 2. The method of claim 1 , wherein the second plurality of functions includes the first function configured with a second parameter different from the first parameter. 3. The method of claim 1 , wherein the second plurality of functions does not include the first function. 4. The method of claim 1 , wherein the first plurality of functions includes a natural migration event. 5. The method of claim 1 , wherein the subject model is a media mix model. 6. The method of claim 5 , wherein the media mix model includes a time series multivariate ordinary least square (“OLS”) regression. 7. The method of claim 1 , comprising generating the data set at random. 8. A system for model validation, the system comprising: a computer-readable memory storing instructions; and a processor configured to execute instructions from the memory to: generate a data set comprising a collection of membership counts corresponding to segments; generate a first time series of states including a plurality of first membership counts corresponding to the segments at a plurality of points in time by processing the data set through a first simulation comprising application of a first plurality of functions including a first function configured with a first parameter, wherein processing the data set through the first simulation causes the collection of membership counts to change to one of the plurality of first membership counts at each of the plurality of points in time; generate a second time series of states including a plurality of second membership counts corresponding to the segments at the plurality of points in time by processing the data set through a second simulation comprising application of a second plurality of functions, wherein the second plurality of functions does not include the first function configured with the first parameter, wherein processing the data set through the second simulation causes the collection of membership counts to change to one of the plurality of second membership counts at each of the plurality of points in time; identify a first value for a metric by analyzing differences in the plurality of first membership counts and the plurality of second membership counts; and identify, for a subject model, a second value for the metric, the second value representative of an output from application of the subject model to one of the first time series or the second first time series; determine, by comparison of the first value to the second value, a score for the subject model. 9. The system of claim 8 , wherein the second plurality of functions includes the first function configured with a second parameter different from the first parameter. 10. The system of claim 8 , wherein the second plurality of functions does not include the first function. 11. The system of claim 8 , wherein the first plurality of functions includes a natural migration event. 12. The system of claim 8 , wherein the subject model is a media mix model. 13. The system of claim 12 , wherein the media mix model includes a time series multivariate ordinary least square (“OLS”) regression. 14. The system of claim 8 , wherein the processor is configured to generate the data set at random. 15. A non-transitory computer-readable memory storing instructions that cause a processor executing the instructions to: generate a data set comprising a collection of membership counts corresponding to segments; generate a first time series of states including a plurality of first membership counts corresponding to the segments at a plurality of points in time by processing the data set through a first simulation with a first parameter, wherein processing the data set through the first simulation causes the collection of membership counts to change to one of the plurality of first membership counts at each of the plurality of points in time; generate a second time series of states including a plurality of second membership counts corresponding to the segments at the plurality of points in time by processing the data set through a second simulation with a second parameter, wherein processing the data set through the second simulation causes the collection of membership counts to change to one of the plurality of second membership counts at each of the plurality of points in time; identify a first value for a metric by analyzing differences in the plurality of first membership counts and the plurality of second membership counts; and identify, for a subject model, a second value for the metric, the second value representative of an output from application of the subject model to one of the first time series or the second first time series; determine, by comparison of the first value to the second value, a score for the subject model. 16. The non-transitory computer-readable memory of claim 15 , wherein the first simulation comprises applying a first plurality of functions and the second simulation comprises applying a second plurality of functions, wherein applying the first plurality of functions comprises applying a first function of the plurality of functions configured with the first parameter; wherein the second plurality of functions includes the first function of the first plurality of functions configured with the second parameter, wherein the second parameter is different than the first parameter. 17. The non-transitory computer-readable memory of claim 15 , wherein the first simulation comprises applying a first plurality of functions and the second simulation comprises applying a second plurality of functions, wherein applying the first plurality of functions comprises applying a first function of the plurality of functions configured with the first parameter; wherein the second plurality of functions does not include the first function. 18. Th

Assignees

Inventors

Classifications

  • G06Q30/02Primary

    Marketing; Price estimation or determination; Fundraising · CPC title

  • Market modelling; Market analysis; Collecting market data · CPC title

  • Grouping and aggregation · CPC title

  • Query processing support for facilitating data mining operations in structured databases · CPC title

  • Matrix or vector computation {, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization (matrix transposition G06F7/78)} · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10719521B2 cover?
Systems and methods for model validation includes generating a first and a second time series of segmentation states for a data set representative of a simulated population, e.g., a collection of membership counts corresponding to respective segments of the simulated population. The first and second time series of segmentation states are generated by respectively processing the data set through…
Who is the assignee on this patent?
Google Inc, Google Llc
What technology area does this patent fall under?
Primary CPC classification G06Q30/02. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 21 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).