Protecting confidential information

US10032046B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-10032046-B1
Application numberUS-201715826858-A
CountryUS
Kind codeB1
Filing dateNov 30, 2017
Priority dateJun 28, 2017
Publication dateJul 24, 2018
Grant dateJul 24, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An approach using a computer, receives from a first computer, text generated by a user and identifies in the text generated by the user, confidential information registered in a dictionary that contains registered confidential information and substitute words corresponding to the registered confidential information. The approach includes retrieving, from the dictionary, substitute words corresponding to each identified registered confidential information and identifying, in the text generated by the user, potentially confidential words based on a text analysis of the text generated by the user. The approach includes sending to the first computer, a proposed protected text including the text generated by the user with each of the identified registered confidential information marked along with each of the retrieved substitute words to replace the identified confidential information, and each of the potentially confidential words marked along with each of one or more generated words to replace the potentially confidential words.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: receiving, by one or more computer processors, from a first computer, text generated by a user wherein the text generated by the user is one of: text input on an external web service generated from a plug-in to a client program that is a browser on the first computer, an email from a plug-in to the client program that is one of an email program or an email application on the first computer, and a message from a plug-in to the client program that is one of a messaging program or a messaging application on the first computer; identifying, by one or more computer processors, in the text generated by the user, one or more confidential information registered in a dictionary, wherein the dictionary contains a plurality of registered confidential information and a plurality of substitute words corresponding to the plurality of registered confidential information; retrieving, by one or more computer processors, from the dictionary, one or more substitute words corresponding to each identified registered confidential information of the one or more confidential information registered in the dictionary; identifying, by one or more computer processors, in the text generated by the user, whether one or more words are potentially confidential based, at least in part, on a text analysis of the text generated by the user; generating, by one or more computer processors, one or more words for each of the one or more potentially confidential words, wherein the one or more generated words are determined based, at least in part, on determining an edit distance is less than a threshold edit distance; determining, by one or more computer processors, for each of the one or more potentially confidential words with the edit distance less than the threshold edit distance, the registered confidential information associated with a shortest edit distance; retrieving, by one or more computer processors, from the dictionary, the one or more substitute words corresponding to the registered confidential information with the shortest edit distance; determining, by one or more computer processors, a category of the one or more substitute words corresponding to the registered confidential information associated with the shortest edit distance; retrieving, by one or more computer processors, a list of unused words in the category of the one or more words corresponding to the registered confidential information associated with the shortest edit distance; selecting, by one or more computer processors, one or more words from the list of unused words in the category of the one or more retrieved substitute words corresponding to the registered confidential information with the shortest edit distance based, at least in part, the text analysis identifying a highest topic index of the one or more words from the list of unused words in the category of the one or more retrieved substitute words corresponding to the registered confidential information associated with the shortest edit distance; sending, by one or more computer processors, to the first computer, a proposed protected text, wherein the proposed protected text includes the text generated by the user with each of the identified registered confidential information included with each of the one or more retrieved substitute words to replace the identified confidential information and each of the one or more potentially confidential words included with each of the one or more generated words to replace the one or more potentially confidential words; receiving, by one or more computer processors, from the first computer, at least one of: one of more edits to the proposed protected text input by the user and an indication of an approval by the user of the proposed protected text; responsive to receiving, from the first computer, the one of more edits to the proposed protected text input by the user, performing, by one or more computer processors, the one or more edits to the proposed protected text input by the user; generating, by the computer, one or more substitute words for each of the one of more edits to the proposed protected text input by the user, wherein the one or more generated substitute words are determined based, at least in part, on determining an edit distance is less than a threshold edit distance; responsive to receiving, from the first computer, the indication of the approval by the user of the proposed protected text, creating, by one or more computer processors, a user approved protected text, wherein the user approved protected text includes replacing each of the identified registered confidential information in the proposed protected text with the one or more retrieved substitute words corresponding to the identified registered confidential information of the one or more confidential information registered in the dictionary, replacing each of the one or more potentially confidential words in the proposed protected text with the one or more generated words to replace each of the one or more potentially confidential words, and replacing the additional registered confidential information indicated by the one or more edits to the proposed protected text input by the user with the one or more generated substitute words for each of the additional registered confidential information indicated by the one of more edits to the proposed protected text input by the user; sending, by one or more computer processors, the user approved protected text to the first computer; identifying, by one or more computer processors, each of the one or more potentially confidential words replaced in the user approved protected text as registered confidential information with the one or more generated words replacing each of the one or more potentially confidential words in the user approved protected text and each of the additional registered confidential information indicated by the one or more edits to the proposed protected text input by the user with the one or more generated substitute words for each of the additional registered confidential information indicated by one of more edits to the proposed protected text input the by the user; and updating, by one or more computers processors, the dictionary to include each of the one or more potentially confidential words replaced in the user approved protected text as registered confidential information with the one or more generated words replacing each of the one or more potentially confidential words in the user approved protected text and each of the additional registered confidential information in the proposed protected text identified by the one of one of more edits as registered confidential information with the with the one or more generated substitute words replacing each of the additional registered confidential information in the user approved proposed protected text.

Assignees

Inventors

Classifications

  • Dictionaries · CPC title

  • Filtering by information in the payload · CPC title

  • Thesauruses; Synonyms · CPC title

  • Computer-aided management of electronic mailing [e-mailing] · CPC title

  • where protection concerns the structure of data, e.g. records, types, queries · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10032046B1 cover?
An approach using a computer, receives from a first computer, text generated by a user and identifies in the text generated by the user, confidential information registered in a dictionary that contains registered confidential information and substitute words corresponding to the registered confidential information. The approach includes retrieving, from the dictionary, substitute words corresp…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F21/6245. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 24 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).