Model-driven data insights for latent topic materiality

US12423386B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12423386-B2
Application numberUS-202418418171-A
CountryUS
Kind codeB2
Filing dateJan 19, 2024
Priority dateJan 21, 2023
Publication dateSep 23, 2025
Grant dateSep 23, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Described herein are machine learning methods and systems for locating and tracking performance of latent themes in changing data from disparate sources. Themes may be indirect goals or consequential impacts indicated by latent topics. Identifying performance indicators of latent themes in large changing data sets uncovers underlying trends or previously concealed behaviors that may be accelerating or undermining goals.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: ingesting data from a from disparate data sources to identify data portions of multiple data sets with information that relate to the one or more issues; evaluating the information for each data set against an ensemble model to determine a set of topics for the one or more issues; generate a topic materiality score for each data portion that indicate relevancy of the information to a particular topic in the set of topics; training a time series aggregation model with the topic materiality scores to the one or more issues over time for the data portions of multiple data sets; utilizing the time series aggregation model to generate insights analysis for a selected issue of the one or more issues; applying the insight analysis to updated data from the disparate data sources to output a prediction or recommend for the selected issue; executing one or more actions in response to the prediction or recommendation, wherein the one or more actions include assigning computing power to an initiative. 2. The method of claim 1 , wherein the data include at least one data point selected comprising a measurement metric related to the one or more topics of the set of topics, and a performance measure related to the one or more topics of the set of topics. 3. The method of claim 1 , wherein the time series aggregation model is configured to determine a respective performance score for each respective topic of the associated of topics, wherein the respective performance score indicates a contribution of the initiative to overall performance for the respective issue. 4. The method of claim 1 wherein ingesting data from a from disparate data sources uses canonical type declarations with metadata to normalize data portions from different data sources. 5. The method of claim 1 , wherein the disparate data sources include enterprise data sources of an organization, wherein the set of topics are latent, and wherein the one or more issues comprises Environmental, Social, and Governance (ESG) issues for the organization. 6. The method of claim 1 , wherein the initiative is designed to offset carbon emissions. 7. The method of claim 1 , wherein the one or more actions include instructing one or more artificial intelligence applications to generate action items for improving progress of the initiative. 8. A system comprising: one or more processors; and memory storing instructions that, when executed by the one or more processors, cause the system to perform: ingesting data from a from disparate data sources to identify data portions of multiple data sets with information that relate to the one or more issues; evaluating the information for each data set against an ensemble model to determine a set of topics for the one or more issues; generate a topic materiality score for each data portion that indicate relevancy of the information to a particular topic in the set of topics; training a time series aggregation model with the topic materiality scores to the one or more issues over time for the data portions of multiple data sets; utilizing the time series aggregation model to generate insights analysis for a selected issue of the one or more issues; applying the insight analysis to updated data from the disparate data sources to output a prediction or recommend for the selected issue; executing one or more actions in response to the prediction or recommendation, wherein the one or more actions include assigning computing power to an initiative. 9. The system of claim 8 , wherein the data include at least one data point selected comprising a measurement metric related to the one or more topics of the set of topics, and a performance measure related to the one or more topics of the set of topics. 10. The system of claim 8 , wherein the time series aggregation model is configured to determine a respective performance score for each respective topic of the associated of topics, wherein the respective performance score indicates a contribution of the initiative to overall performance for the respective issue. 11. The system of claim 8 wherein ingesting data from a from disparate data sources uses canonical type declarations with metadata to normalize data portions from different data sources. 12. The system of claim 8 , wherein the disparate data sources include enterprise data sources of an organization, wherein the set of topics are latent, and wherein the one or more issues comprises Environmental, Social, and Governance (ESG) issues for the organization. 13. The system of claim 8 , wherein the initiative is designed to offset carbon emissions. 14. The system of claim 8 , wherein the one or more actions include instructing one or more artificial intelligence applications to generate action items for improving progress of the initiative. 15. A non-transitory computer readable medium comprising instructions that, when executed, cause one or more processors to perform: ingesting data from a from disparate data sources to identify data portions of multiple data sets with information that relate to the one or more issues; evaluating the information for each data set against an ensemble model to determine a set of topics for the one or more issues; generate a topic materiality score for each data portion that indicate relevancy of the information to a particular topic in the set of topics; training a time series aggregation model with the topic materiality scores to the one or more issues over time for the data portions of multiple data sets; utilizing the time series aggregation model to generate insights analysis for a selected issue of the one or more issues; applying the insight analysis to updated data from the disparate data sources to output a prediction or recommend for the selected issue; executing one or more actions in response to the prediction or recommendation, wherein the one or more actions include assigning computing power to an initiative. 16. The non-transitory computer readable medium of claim 15 , wherein the data include at least one data point selected comprising a measurement metric related to the one or more topics of the set of topics, and a performance measure related to the one or more topics of the set of topics. 17. The non-transitory computer readable medium of claim 15 , wherein the time series aggregation model is configured to determine a respective performance score for each respective topic of the associated of topics, wherein the respective performance score indicates a contribution of the initiative to overall performance for the respective issue. 18. The non-transitory computer readable medium of claim 15 wherein ingesting data from a from disparate data sources uses canonical type declarations with metadata to normalize data portions from different data sources. 19. The non-transitory computer readable medium of claim 15 , wherein the disparate data sources include enterprise data sources of an organization, wherein the set of topics are latent, and wherein the one or more issues comprises Environmental, Social, and Governance (ESG) issues for the organization. 20. The non-transitory computer readable medium of claim 15 , wherein the one or more actions include instructing one or more artificial intelligence applications to generate action items for improving progress of the initiative.

Assignees

Inventors

Classifications

  • Government or public services (business processes related to the transportation industry G06Q50/40) · CPC title

  • based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12423386B2 cover?
Described herein are machine learning methods and systems for locating and tracking performance of latent themes in changing data from disparate sources. Themes may be indirect goals or consequential impacts indicated by latent topics. Identifying performance indicators of latent themes in large changing data sets uncovers underlying trends or previously concealed behaviors that may be accelera…
Who is the assignee on this patent?
C3 Ai Inc
What technology area does this patent fall under?
Primary CPC classification G06F18/2415. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 23 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).