Removal of personality signatures

US11551006B2 · US · B2

Patent metadata
Field               Value
Publication number  US-11551006-B2
Application number  US-201916564013-A
Country             US
Kind code           B2
Filing date         Sep 9, 2019
Priority date       Sep 9, 2019
Publication date    Jan 10, 2023
Grant date          Jan 10, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments relate to an intelligent computer platform to selectively amend one or more document elements. A first document is subjected to natural language processing (NLP) and two or more document characteristics are subjected to an assessment to produce a characteristic value. The document characteristics and corresponding characteristic values are analyzed to produce a characteristic profile for each identified document characteristic. Upon receipt of a new document, document characteristic data and corresponding characteristic value(s) are identified. The corresponding characteristic value(s) of the new document is applied against the produced characteristic profile. New document characteristic data is selectively amended responsive to the comparison, and a new document version is created from the selective amendment.
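The profiling step described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the three characteristics (average sentence length, average word length, type-token ratio), the function names `characteristic_values` and `build_profile`, and the choice of mean and standard deviation as the "characteristic profile" are all hypothetical stand-ins for whatever the platform actually measures.

```python
import re
import statistics


def characteristic_values(text: str) -> dict[str, float]:
    """Assess one document and return a value per (hypothetical) characteristic."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text.lower())
    if not sentences or not words:
        raise ValueError("document contains no assessable text")
    return {
        "avg_sentence_length": len(words) / len(sentences),
        "avg_word_length": sum(len(w) for w in words) / len(words),
        "type_token_ratio": len(set(words)) / len(words),
    }


def build_profile(documents: list[str]) -> dict[str, tuple[float, float]]:
    """Summarize each characteristic across the document set as (mean, stdev).

    Assumption: the profile is a simple mean/standard-deviation pair; the
    abstract only says the values are "analyzed" to produce a profile.
    """
    if len(documents) < 2:
        raise ValueError("need two or more documents to build a profile")
    per_doc = [characteristic_values(d) for d in documents]
    profile = {}
    for name in per_doc[0]:
        values = [v[name] for v in per_doc]
        profile[name] = (statistics.mean(values), statistics.stdev(values))
    return profile


if __name__ == "__main__":
    docs = ["The cat sat. The dog ran.", "A bird flew away. It sang."]
    for name, (mean, stdev) in build_profile(docs).items():
        print(f"{name}: mean={mean:.3f} stdev={stdev:.3f}")
```

A new document would then be scored with the same `characteristic_values` function and compared against each (mean, stdev) pair to decide whether it matches the profile.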

First claim

Opening claim text (preview).

What is claimed is:

1. A computer system comprising: a processing unit operatively coupled to memory; and an artificial intelligence (AI) platform in communication with the processing unit, the AI platform including one or more tools to identify and amend one or more document characteristic values, including: a natural language (NL) manager configured to subject a document set comprising two or more documents authored by a common author to natural language processing (NLP), including identify two or more document characteristics within the two or more documents of the document set, and subject the identified two or more document characteristics to an assessment, the assessment to produce a corresponding characteristic value for each of the identified two or more document characteristics in each of the two or more documents authored by the common author; a profile manager, operatively coupled to the NL manager, the profile manager configured to statistically analyze the identified two or more document characteristics and the corresponding characteristic values across the two or more documents authored by the common author, the statistical analysis to produce a characteristic profile for each identified document characteristic; a document manager, operatively coupled to the profile manager, the document manager configured to detect receipt of a new document authored by the common author, and statistically analyze the new document with respect to the identified two or more document characteristics, including identify document characteristic data and a corresponding characteristic value; a director, operatively coupled to the document manager, the director configured to: selectively identify an intersection or a non-intersection of one or more vector component values of the new document and one or more vector scores from the document set; responsive to the intersection, selectively amend one or more tokens of text of the new document to change the characteristic value of the new document to fall outside of the characteristic profile; and create a new document version from the selective amendment.

2. The computer system of claim 1, wherein the selective amendment of one or more tokens of text of the new document to change the characteristic value comprises the director configured to modify the one or more tokens, the modification comprising replacement of the one or more tokens with one or more new tokens.

3. The computer system of claim 2, wherein the profile manager is further configured to subject the new document version to a characteristic assessment, including produce a new characteristic value for each modified token.

4. The computer system of claim 3, wherein the document manager is further configured to accept the new document amendment responsive to identification of divergence of the new document characteristic assessment from the characteristic values across the two or more documents authored by the common author.

5. The computer system of claim 1, wherein the director configured to selectively amend tokens of text of the new document provides authorship anonymity with respect to the new document version.

6. The computer system of claim 1, wherein the characteristic value for each document characteristic in each of the documents is a composite value.

7. The computer system of claim 1, wherein the director is further configured to, responsive to the non-intersection, not selectively amend the new document.

8. A computer program product to identify and amend one or more document characteristic values, the computer program product comprising: a computer readable storage medium having program code embodied therewith, the program code executable by a processor to: subject a document set comprising two or more documents authored by a common author to natural language processing (NLP), including identify two or more document characteristics within the two or more documents of the document set, and subject the identified two or more document characteristics to an assessment, the assessment producing a corresponding characteristic value for each of the identified two or more document characteristics in each of the two or more documents authored by the common author; statistically analyze the identified two or more document characteristics and the corresponding characteristic values across the two or more documents authored by the common author, the statistical analysis producing a characteristic profile for each identified document characteristic; responsive to receiving a new document authored by the common author, statistically analyze the new document with respect to the identified two or more document characteristics, including identify document characteristic data and a corresponding characteristic value; selectively identify an intersection or a non-intersection of one or more vector component values of the new document and one or more vector scores from the document set; responsive to the intersection, selectively amend one or more tokens of text of the new document to change the characteristic value of the new document to fall outside of the characteristic profile; and create a new document version from the selective amendment.

9. The computer program product of claim 8, wherein the selective amendment of one or more tokens of text of the new document to change the characteristic value comprises program code executable by the processor to modify the one or more tokens, the modification comprising replacement of the one or more tokens with one or more new tokens.

10. The computer program product of claim 9, further comprising program code executable by the processor to subject the new document version to a characteristic assessment, including produce a new characteristic value for each modified token.

11. The computer program product of claim 10, further comprising program code executable by the processor to accept the new document amendments responsive to identifying divergence of the new document characteristic assessment from the characteristic value of the document set.

12. The computer program product of claim 8, wherein the characteristic value for each document characteristic in each of the documents is a composite value.

13. The computer program product of claim 8, further comprising program code executable by the processor to, responsive to the non-intersection, not selectively amend the new document.

14. A method comprising: using a computer processor to support an artificial intelligence (AI) platform to identify and amend one or more tokens, including: subjecting a document set comprising two or more documents authored by a common author to natural language processing (NLP), including identifying two or more document characteristics within the two or more documents of the document set, and subjecting the identified two or more document characteristics to an assessment, the assessment producing a corresponding characteristic value for each of the identified two or more document characteristics in each of the two or more documents authored by the common author; statistically analyzing the identified two or more document characteristics and the corresponding characteristic values across the two or more documents authored by the common author, the statistical analysis producing a characteristic profile for each identified document characteristic; responsive to receiving a new document authored by the common author, statistically analyzing the new document with respect to the identified two or more document characteristics, including identifying document characteristic data and a corresponding characte…
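The director's behavior in the claims (detect an intersection with the author's profile, replace tokens, re-assess, and accept the amendment only if the value diverges) can be sketched as below. Everything concrete here is an assumption made for illustration: the claims speak of vector component values and vector scores, whereas this sketch uses a single scalar characteristic (average word length), models the profile as a hypothetical (low, high) range, and draws new tokens from a made-up `REPLACEMENTS` table.

```python
import re

# Hypothetical replacement table; the claims do not say how new tokens are chosen.
REPLACEMENTS = {"utilize": "use", "commence": "start", "approximately": "about"}


def avg_word_length(text: str) -> float:
    """Stand-in characteristic assessment: mean length of alphabetic words."""
    words = re.findall(r"[A-Za-z']+", text)
    return sum(len(w) for w in words) / len(words) if words else 0.0


def intersects(value: float, profile: tuple[float, float]) -> bool:
    """Assumed intersection test: the value lies inside the (low, high) range."""
    low, high = profile
    return low <= value <= high


def amend(text: str, profile: tuple[float, float]) -> str:
    """Return a new document version whose value falls outside the profile.

    Non-intersection leaves the text unchanged (claims 7 and 13). Otherwise
    tokens are replaced one at a time and the document is re-assessed after
    each change (claims 2 to 4); the original is returned if no amendment
    produces divergence. Replacement tokens are emitted in lowercase.
    """
    if not intersects(avg_word_length(text), profile):
        return text
    tokens = text.split(" ")
    for i, token in enumerate(tokens):
        match = re.fullmatch(r"([A-Za-z']+)(\W*)", token)
        if match and match.group(1).lower() in REPLACEMENTS:
            tokens[i] = REPLACEMENTS[match.group(1).lower()] + match.group(2)
            candidate = " ".join(tokens)
            if not intersects(avg_word_length(candidate), profile):
                return candidate  # divergence reached: accept the amendment
    return text  # no amendment moved the value outside the profile


if __name__ == "__main__":
    print(amend("We utilize approximately ten tools.", (5.5, 6.5)))
```

Stopping at the first amendment that leaves the profile keeps the edit minimal; a fuller version would presumably track several characteristics at once and require all of them to diverge.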

Assignees

IBM

Inventors

Classifications

  • G06F40/30 · Primary

    Semantic analysis · CPC title

  • Lexical analysis, e.g. tokenisation or collocates · CPC title

  • by anonymising data, e.g. decorrelating personal data from the owner's identification · CPC title

  • Abduction · CPC title

  • Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F40/30. Mapped technology areas include Physics.
When was this patent published?
Publication date Jan 10, 2023 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).