Method for detection of anomolous operation of a system
US-2023123872-A1 · Apr 20, 2023 · US
US12182516B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12182516-B2 |
| Application number | US-202117527798-A |
| Country | US |
| Kind code | B2 |
| Filing date | Nov 16, 2021 |
| Priority date | Oct 21, 2021 |
| Publication date | Dec 31, 2024 |
| Grant date | Dec 31, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Embodiments of the present disclosure relate to a computer-implemented method, a device, and a computer program product. The method includes extracting respective themes of a set of documents with release time within a first period; determining respective semantic information of the themes and frequencies of the themes appearing in the set of documents; and determining the number of documents associated with the themes within a second period according to a prediction model and based on the semantic information and frequencies of the themes. The second period is after the first period. Embodiments of the present disclosure can better predict the tendency of the themes appearing in the future based on the semantic information and frequencies of the themes.
Opening claim text (preview).
What is claimed is: 1. A computer-implemented method, comprising: extracting respective themes of a set of documents with release time within a first period; determining respective semantic information of the themes and frequencies of the themes appearing in the set of documents utilizing at least a first machine learning model, the first machine learning model comprising a transformer-based model; and determining the number of documents associated with the themes within a second period according to a second machine learning model, different than the first machine learning model, the second machine learning model comprising a time sequence prediction model and based on the semantic information and frequencies of the themes, wherein the second period is after the first period; wherein the first machine learning model is configured to encode the semantic information of the themes and the frequencies of the themes appearing in the set of documents; wherein results of the encoding of the semantic information of the themes and the frequencies of the themes comprises, for each of at least a subset of the themes, (i) a semantic encoding component comprising at least a classification token for the semantic information of the theme, and (ii) a frequency encoding component comprising a plurality of frequency-related values generated for the frequency of appearance of the theme based on a designated probability function; wherein the second machine learning model is configured to process the semantic encoding components and the frequency encoding components to determine the number of documents associated with the themes within the second period; wherein determining the frequencies comprises determining a time sequence of frequency representations of the themes within the first period; wherein determining the time sequence of frequency representations comprises: for a time interval point within the first period, determining a frequency representation of the time sequence of frequency representations at the time interval point based on the number of documents corresponding to the themes; and wherein determining the frequency representation at the time interval point comprises: determining the frequency representation by using a position extending code based on the number of documents corresponding to the themes. 2. The method according to claim 1 , wherein determining the semantic information comprises: determining a time sequence of semantic representations of the themes within the first period. 3. The method according to claim 2 , wherein determining the time sequence of semantic representations comprises: for a time interval point within the first period, determining a semantic representation of the time sequence of semantic representations at the time interval point according to a semantic encoding model and based on words or words in phrases corresponding to the themes in documents with release time not later than the time interval point in the set of documents. 4. The method according to claim 1 , wherein determining the frequency representation of the time sequence of frequency representations at the time interval point is based on the number of documents corresponding to the themes in documents with release time not later than the time interval point in the set of documents. 5. The method according to claim 1 , wherein the frequency representation has a predefined dimension which is greater than one dimension. 6. The method according to claim 1 , wherein extracting the respective themes of the set of documents comprises: extracting a predefined number of respective themes of the set of documents by using a theme classifying model. 7. The method according to claim 1 , wherein determining the number of the documents associated with the themes within the second period comprises: determining a number time sequence of the themes within the second period, wherein the number time sequence comprises the number of documents associated with the themes at each time interval point within the second period. 8. An electronic device, comprising: at least one processor; and at least one memory storing computer program instructions, wherein the computer program instructions, when executed by the at least one processor, cause the electronic device to perform actions comprising: extracting respective themes of a set of documents with release time within a first period; determining respective semantic information of the themes and frequencies of the themes appearing in the set of documents utilizing at least a first machine learning model, the first machine learning model comprising a transformer-based model; and determining the number of documents associated with the themes within a second period according to a second machine learning model, different than the first machine learning model, the second machine learning model comprising a time sequence prediction model and based on the semantic information and frequencies of the themes, wherein the second period is after the first period; wherein the first machine learning model is configured to encode the semantic information of the themes and the frequencies of the themes appearing in the set of documents; wherein results of the encoding of the semantic information of the themes and the frequencies of the themes comprises, for each of at least a subset of the themes, (i) a semantic encoding component comprising at least a classification token for the semantic information of the theme, and (ii) a frequency encoding component comprising a plurality of frequency-related values generated for the frequency of appearance of the theme based on a designated probability function; wherein the second machine learning model is configured to process the semantic encoding components and the frequency encoding components to determine the number of documents associated with the themes within the second period; wherein determining the frequencies comprises determining a time sequence of frequency representations of the themes within the first period; wherein determining the time sequence of frequency representations comprises: for a time interval point within the first period, determining a frequency representation of the time sequence of frequency representations at the time interval point based on the number of documents corresponding to the themes; and wherein determining the frequency representation at the time interval point comprises: determining the frequency representation by using a position extending code based on the number of documents corresponding to the themes. 9. The electronic device according to claim 8 , wherein determining the semantic information comprises: determining a time sequence of semantic representations of the themes within the first period. 10. The electronic device according to claim 9 , wherein determining the time sequence of semantic representations comprises: for a time interval point within the first period, determining a semantic representation of the time sequence of semantic representations at the time interval point according to a semantic encoding model and based on words or words in phrases corresponding to the themes in documents with release time not later than the time interval point in the set of documents. 11. The electronic device according to claim 8 , wherein determining the frequency representation of the time sequence of frequency representations at the time interval point is based on the number of documents corresponding to the themes in documents with release time not later than the time interval point in the set of documents. 12. The electronic device according to claim 8 , where
Knowledge representation; Symbolic representation · CPC title
Combinations of networks · CPC title
characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title
Semantic analysis · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.