Systems and methods for structuring data from unstructured electronic data files

US9984428B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9984428-B2
Application numberUS-201514923712-A
CountryUS
Kind codeB2
Filing dateOct 27, 2015
Priority dateSep 4, 2015
Publication dateMay 29, 2018
Grant dateMay 29, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Computer implemented systems and methods are disclosed for structuring data from unstructured electronic data files. In accordance with some embodiments, an electronic data file including unstructured content associated with a legal process return is received and the unstructured content parsed. The unstructured content is parsed to identify one or more objects and properties based on a database ontology that are processed to generate an object model. A data report may be generated based on the identified objects and properties.

First claim

Opening claim text (preview).

What is claimed is: 1. A data structuring system for structuring data from electronic data files of a social media platform received in association with a legal process return, the system comprising: a memory device that stores a set of instructions; at least one processing device configured to execute the instructions to: receive an electronic data file, the electronic data file including unstructured content of the social media platform received in association with the legal process return; determine, based on scanning a header included in the electronic data file, that the electronic data file includes data received from the social media platform; identify, from a plurality of database ontologies corresponding to a plurality of social media platforms, a database ontology corresponding to the social media platform associated with the electronic data file, the database ontology defining known data objects and corresponding property types for unstructured data received from the social media platform; parse the unstructured content to identify a first string in the unstructured content; compare the first string to the known data objects defined in the database ontology corresponding to the social media platform, yielding a comparison; determine, based on the comparison, that the first string matches a first known data object defined in the database ontology corresponding to the social media platform, the first data object being of a first data object type that is associated with a first property type; in response to determining that the first string matches the first known data object, identify the first string as a first identified object in the unstructured content and assign the first object type to the first string; parse the electronic data file to identify a second string that follows the first string in the unstructured content; identify the second string as a first identified property of the first identified object in the unstructured content and assign the first property type to the second string; and generate a data report based on the first identified data object and the first identified property. 2. The system of claim 1 , wherein the electronic data file associated with the legal process return is received in response to a legal process. 3. The system of claim 2 , wherein the legal process includes at least one of a warrant, a national security letter, and a subpoena. 4. The system of claim 1 , wherein the first identified data object is a private message, and the first identified property is an identifier included in the private message. 5. The system of claim 4 , wherein the at least one processing device is configured to execute the instructions to: parse the identifier to normalize the identifier. 6. The system of claim 1 , wherein the data report includes at least one of a list of histogramed telephone number data report, a conversation reconstructed from one or more private messages, a login information data report, a picture mapping data report, and a shared IP address data report. 7. A method of structuring data from unstructured electronic data files of a social media platform received in association with a legal process return, the method performed by at least one processing device and comprising: receiving an electronic data file, the electronic data file including unstructured content of the social media platform received in association with the legal process return; determining, based on scanning a header included in the electronic data file, that the electronic data file includes data received from the social media platform; identifying, from a plurality of database ontologies corresponding to a plurality of social media platforms, a database ontology corresponding to the social media platform associated with the electronic data file, the database ontology defining known data objects and corresponding property types for unstructured data received from the social media platform; parsing the unstructured content to identify a first string in the unstructured content; comparing the first string to the known data objects defined in the database ontology corresponding to the social media platform, yielding a comparison; determining, based on the comparison, that the first string matches a first known data object defined in the database ontology corresponding to the social media platform, the first data object being of a first data object type that is associated with a first property type; in response to determining that the first string matches the first known data object, identifying the first string as a first identified object in the unstructured content and assign the first object type to the first string; parse the electronic data file to identify a second string that follows the first string in the unstructured content; identifying the second string as a first identified property of the first identified object in the unstructured content and assign the first property type to the second string; and generating a data report based on the first identified data object and the first identified property. 8. The method of claim 7 , wherein the electronic data file associated with the legal process return is received in response to a legal process. 9. The method of claim 8 , wherein the legal process includes at least one of a warrant, a national security letter, and a subpoena. 10. The method of claim 7 , wherein the data report is an interactive data report. 11. The method of claim 10 , wherein the first identified data object is a private message, and the first identified property is an identifier included in the private message. 12. The method of claim 7 , wherein the report includes at least one of a list of histogramed telephone number data report, a conversation reconstructed from one or more private messages, a login information data report, a picture mapping data report, and a shared IP address data report. 13. A non-transitory computer-readable medium storing a set of instructions that, when executed by one or more processors, cause the one or more processing devices to perform a method of structuring data from unstructured electronic data files of a social media platform received in association with a legal process return, the method comprising: receiving an electronic data file, the electronic data file including unstructured content of the social media platform received in association with the legal process return; determining, based on scanning a header included in the electronic data file, that the electronic data file includes data received from the social media platform; identifying, from a plurality of database ontologies corresponding to a plurality of social media platforms, a database ontology corresponding to the social media platform associated with the electronic data file, the database ontology defining known data objects and corresponding property types for unstructured data received from the social media platform; parsing the unstructured content to identify a first string in the unstructured content; comparing the first string to the known data objects defined in the database ontology corresponding to the social media platform, yielding a comparison; determining, based on the comparison, that the first string matches a first known data object defined in the database ontology corresponding to the social media platform, the first data object being of a first data object type that is associated with a first property type; in response to determining that the first string matches the first known data object, identifying the first string as a first identified object in the unstructured co

Assignees

Inventors

Classifications

  • Business processes related to social networking or social networking services · CPC title

  • Natural language query formulation · CPC title

  • G06Q50/18Primary

    Legal services · CPC title

  • Office automation; Time management · CPC title

  • Presentation of query results · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9984428B2 cover?
Computer implemented systems and methods are disclosed for structuring data from unstructured electronic data files. In accordance with some embodiments, an electronic data file including unstructured content associated with a legal process return is received and the unstructured content parsed. The unstructured content is parsed to identify one or more objects and properties based on a databas…
Who is the assignee on this patent?
Palantir Technologies Inc
What technology area does this patent fall under?
Primary CPC classification G06Q50/18. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 29 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).