Summarization of email on a client computing device based on content contribution to an email thread using classification and word frequency considerations

US10353994B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10353994-B2
Application numberUS-201615337897-A
CountryUS
Kind codeB2
Filing dateOct 28, 2016
Priority dateNov 3, 2015
Publication dateJul 16, 2019
Grant dateJul 16, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems, methods, and computer-readable media are disclosed for enhancing an email application to automatically analyze an email thread and generate a compact content summary. The content summary is based on relative content contributions provided by the constituent email messages in the email thread. The content summary may be presented in a special window without disturbing or modifying the email thread or its constituent email messages. The distinctive content summary disclosed herein comprises certain sentences that are automatically gleaned from the email thread, analyzed relative to other sentences, and presented in a chronological sequence so that the user can quickly determine what the email thread is about and/or the current status of the conversation. The content summary is based on email weights, word weights, and intersecting sentence pairs.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: classifying, by a computing device executing an email application, a baseline set of email messages received and sent during a predefined baseline period of time, wherein each email message in the baseline set is classified as either important or not-important; wherein a given email message in the baseline set is more likely to be classified as important rather than not-important, when the given email message was addressed personally to a user of the computing device executing the email application, and not to a group comprising the user; by the computing device executing the email application, generating a first word-frequency table comprising words in email messages classified as important within the baseline set, wherein each word in the first word-frequency table is associated with a measure of frequency of occurrence of the word in the email messages classified as important; by the computing device executing the email application, generating a second word-frequency table comprising words in email messages classified as not-important within the baseline set, which are not in the first word-frequency table, wherein each word in the second word-frequency table is associated with a measure of frequency of occurrence of the word in the email messages classified as not-important; by the computing device executing the email application, generating a content summary of an email-thread comprising a plurality of email messages which are distinct from and created after the baseline set of email messages, wherein the content summary of the email-thread is generated based on, for each email message in the email-thread: (i) an email-weight assigned to the email message in the email-thread, based at least in part on a proximity of the email message in the email-thread to a chronological median of the email-thread, and (ii) a word-weight assigned to some words in the email message in the email-thread, wherein for any given word the word-weight is based at least in part on: (a) a measure of frequency of the given word obtained from one of the first word-frequency table and the second word-frequency table, (b) whether the given word is in the first word-frequency table versus in the second word-frequency table, and (c) the email-weight of the email message in the email-thread comprising the given word. 2. The method of claim 1 further comprising: by the computing device executing the email application, identifying a first set of intersecting sentence pairs from sentences in the email-thread, wherein a first sentence in a given intersecting sentence pair has a second set of words in common with a second sentence in the given intersecting sentence pair; and by the computing device executing the email application, assigning an intersection-score to each intersecting sentence pair in the first set, wherein the intersection-score is based on: (a) word-weights of the words in the second set belonging to the first sentence in the given intersecting sentence pair, which are weighted by the email-weight of the email message in the email-thread comprising the first sentence, and (b) word-weights of the words in the second set belonging to the second sentence in the given intersecting sentence pair, which are weighted by the email-weight of the email message in the email-thread comprising the second sentence. 3. The method of claim 2 further comprising: by the computing device executing the email application, generating the content summary of the email-thread, which lists in chronological order unique sentences from highest-scoring intersecting sentence pairs. 4. The method of claim 1 wherein the given email message in the baseline set is further more likely to be classified as important rather than not-important when the given email message in the baseline set was read by the user of the computing device executing the email application. 5. The method of claim 1 wherein the given email message in the baseline set is further more likely to be classified as important rather than not-important when the user of the computing device executing the email application responded to the given email message in the baseline set. 6. The method of claim 1 wherein the given email message in the baseline set is further more likely to be classified as important rather than not-important when a follow-up flag was entered for the given email message in the baseline set. 7. The method of claim 1 wherein the word-weight for a given word is higher when the given word appears in the first word-frequency table, as compared to when the given word appears in the second word-frequency table. 8. The method of claim 1 further comprising: by the computing device executing the email application, identifying a first set of intersecting sentence pairs among sentences in the email-thread, wherein a first sentence in a given intersecting sentence pair has a second set of words in common with a second sentence in the given intersecting sentence pair; and by the computing device executing the email application, generating the content summary of the email-thread, which lists in chronological order unique sentences from highest-scoring intersecting sentence pairs. 9. The method of claim 1 further comprising: by the computing device executing the email application, identifying a first set of intersecting sentence pairs among sentences in the email-thread, wherein a first sentence in a given intersecting sentence pair has a second set of words in common with a second sentence in the given intersecting sentence pair; and by the computing device executing the email application, assigning an intersection-score to each intersecting sentence pair in the first set, wherein the intersection-score is based on: (a) word-weights of the words in the second set belonging to the first sentence in the given intersecting sentence pair, and (b) word-weights of the words in the second set belonging to the second sentence in the given intersecting sentence pair. 10. The method of claim 1 further comprising: by the computing device executing the email application, generating the content summary of the email-thread, which lists in chronological order unique sentences from intersecting sentence pairs among sentences in the email-thread, wherein a first sentence in a given intersecting sentence pair has words in common with a second sentence in the given intersecting sentence pair. 11. A non-transitory computer-readable medium storing instructions that, when executed by a computing device comprising one or more processors and computer-readable memory, cause the computing device to perform operations comprising: executing an email application; classifying a baseline set of email messages received and sent during a predefined baseline period of time, wherein each email message in the baseline set is classified as either important or not-important; wherein a given email message in the baseline set is more likely to be classified as important rather than not-important when the given email message was addressed personally to a user of the computing device executing the email application, and not to a group comprising the user; generating a first word-frequency table comprising words in email messages classified as important within the baseline set, wherein each word in the first word-frequency table is associated with a measure of frequency of occurrence of the word in the email messages classified as important; generating a second word-frequency table comprising words in email messages classified as not-important within the baseline set, which are not in the first word-frequency table, wherein eac

Assignees

Inventors

Classifications

  • G06F40/166Primary

    Editing, e.g. inserting or deleting · CPC title

  • Computer-aided management of electronic mailing [e-mailing] · CPC title

  • Phrasal analysis, e.g. finite state techniques or chunking · CPC title

  • Interoperability with other network applications or services · CPC title

  • Summarisation for human users · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10353994B2 cover?
Systems, methods, and computer-readable media are disclosed for enhancing an email application to automatically analyze an email thread and generate a compact content summary. The content summary is based on relative content contributions provided by the constituent email messages in the email thread. The content summary may be presented in a special window without disturbing or modifying the e…
Who is the assignee on this patent?
Commvault Systems Inc
What technology area does this patent fall under?
Primary CPC classification G06F40/166. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 16 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 7 related publications on this page (citations in our corpus or others sharing the same primary CPC).