Extracting and surfacing topic descriptions from regionally separated data stores

US11586662B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11586662-B2
Application numberUS-202117210414-A
CountryUS
Kind codeB2
Filing dateMar 23, 2021
Priority dateMar 5, 2021
Publication dateFeb 21, 2023
Grant dateFeb 21, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Extracting and surfacing information corresponding to individual logical topics from enterprise data stores that are separated across multiple geographic regions. A clustering service creates, by utilizing machine learning toolkits that are agnostic to the region in which data is stored, individual topics that have references to multiple shards of data that are stored in different geographic regions. The clustering service also shards the knowledge base state according to the regions from which pieces of data for the particular logical topic was extracted. For example, a first shard containing information extracted from a first document may be stored in a first region whereas a second shard containing information extracted from a second document may be stored in a second region. Responsive to user activity associated with the topic, a serving platform may identify and reconstitute these shards that are stored in different regions so as to surface the regionally extracted and sharded information on that topic to a user.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method, the method comprising: receiving a corpus that includes at least: first enterprise data that is stored within at least one first enterprise computing resource that is operating within a first region, and second enterprise data that is stored within at least one second enterprise computing resource that is operating within a second region; inputting the corpus into a machine learning (ML) model that is configured to output data extracts corresponding to a plurality of topics; receiving, from the ML model, an output that includes at least: first data extracts that are extracted from the first enterprise data, and second data extracts that are extracted from the second enterprise data; generating, based on the output, a knowledge base (KB) state that defines references between a particular topic and at least: a first KB shard that is generated based on the first data extracts, and a second KB shard that is generated based on the second data extracts; causing the first KB shard to be stored within the at least one first enterprise computing resource; and causing the second KB shard to be stored within the at least one second enterprise computing resource. 2. The computer-implemented method of claim 1 , further comprising: generating a site-region map that includes at least: an association between the first KB shard and a first data object from which first information in the first KB shard was extracted by the ML model, and an indication that the first data object is stored within the at least one first enterprise computing resource that is operating within the first region. 3. The computer-implemented method of claim 2 , further comprising: detecting a migration of the first data object from the at least one first enterprise computing resource that is operating within the first region to at least one third enterprise computing resource that is operating within a third region; responsive to the migration: causing the first KB shard to be stored within the at least one third enterprise computing resource that is operating within the third region; and updating a topic-region map to indicate that the first data object is stored within the at least one third enterprise computing resource that is operating within the third region. 4. The computer-implemented method of claim 1 , further comprising: providing the KB state to at least one serving platform that is configured to surface the particular topic by reconstituting the first KB shard, that is stored within the at least one first enterprise computing resource, and the second KB shard that is stored within the at least one second enterprise computing resource. 5. The computer-implemented method of claim 4 , wherein the at least one serving platform is operating within the first region, and wherein the at least one serving platform is further configured to transmit a region-specific query to the at least one second enterprise computing resource to request the second KB shard. 6. The computer-implemented method of claim 1 , wherein: the first KB shard is not stored in the at least one second enterprise computing resource; and the second KB shard is not stored in the at least one first enterprise computing resource. 7. The computer-implemented method of claim 1 , further comprising: receiving, at the first region, a request that corresponds to the particular topic from a client device; in response to the request, transmitting a region-specific query to the at least one second enterprise computing resource to request the second KB shard; receiving, at the first region, the second KB shard from the at least one second enterprise computing resource; and reconstituting, at the first region, the first KB shard and the second KB shard into the particular topic. 8. A system comprising: one or more processors; and at least one computer storage medium having computer executable instructions stored thereon which, when executed by the one or more processors, cause the system to: receive a corpus that includes at least: first enterprise data that is stored within at least one first enterprise computing resource that is operating within a first region, and second enterprise data that is stored within at least one second enterprise computing resource that is operating within a second region; input the corpus into a machine learning (ML) model that is configured to output data extracts corresponding to a plurality of topics; generate, based on the output, a knowledge base (KB) state that defines references between a particular topic and at least: a first KB shard, and a second KB shard; cause the first KB shard to be stored within the at least one first enterprise computing resource; and cause the second KB shard to be stored within the at least one second enterprise computing resource. 9. The system of claim 8 , wherein the computer executable instructions further cause the system to: generate a site-region map that includes at least: an association between the first KB shard and a first data object from which first information in the first KB shard was extracted by the ML model, and an indication that the first data object is stored within the at least one first enterprise computing resource that is operating within the first region. 10. The system of claim 9 , wherein the computer executable instructions further cause the system to: detect a migration of the first data object from the at least one first enterprise computing resource that is operating within the first region to at least one third enterprise computing resource that is operating within a third region; responsive to the migration: cause the first KB shard to be stored within the at least one third enterprise computing resource that is operating within the third region; and update a topic-region map to indicate that the first data object is stored within the at least one third enterprise computing resource that is operating within the third region. 11. The system of claim 8 , wherein the computer executable instructions further cause the system to: provide the KB state to at least one serving platform that is configured to surface the particular topic by reconstituting the first KB shard, that is stored within the at least one first enterprise computing resource, and the second KB shard that is stored within the at least one second enterprise computing resource. 12. The system of claim 11 , wherein the at least one serving platform is operating within the first region, and wherein the at least one serving platform is further configured to transmit a region-specific query to the at least one second enterprise computing resource to request the second KB. 13. The system of claim 8 , wherein the computer executable instructions further cause the system to: receive, from the ML model, an output that includes at least: first data extracts that are extracted from the first enterprise data, and second data extracts that are extracted from the second enterprise data, wherein: the first KB shard is generated based on the first data extracts, and the second KB shard is generated based on the second data extracts. 14. The system of claim 8 , wherein the computer executable instructions further cause the system to: receive, at the first region, a request that corresponds to the particular topic from a client device; in response to the request, transmit a region-specific query to the at least one second enterprise computing resource to request the second KB shard; receiving, at the first region, the second KB shard from the at least one second enterpri

Assignees

Inventors

Classifications

  • Supervised learning · CPC title

  • G06N5/022Primary

    Knowledge engineering; Knowledge acquisition · CPC title

  • Combinations of networks · CPC title

  • Machine learning · CPC title

  • Database migration support · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11586662B2 cover?
Extracting and surfacing information corresponding to individual logical topics from enterprise data stores that are separated across multiple geographic regions. A clustering service creates, by utilizing machine learning toolkits that are agnostic to the region in which data is stored, individual topics that have references to multiple shards of data that are stored in different geographic re…
Who is the assignee on this patent?
Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification G06N5/022. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 21 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).