Automatic selection of different visualizations for the organization of multivariate data
US-9299173-B2 · Mar 29, 2016 · US
US9824469B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9824469-B2 |
| Application number | US-201213610198-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 11, 2012 |
| Priority date | Sep 11, 2012 |
| Publication date | Nov 21, 2017 |
| Grant date | Nov 21, 2017 |
Official abstract text for this publication.
According to one embodiment of the present invention, a system generates an alternative visualization of a data set based on a specification of a selected first visualization of the data set and parameters comprising information about a data visualization goal and statistical relations between two or more variables in the data set. Embodiments of the present invention further include a computer program product for generating an alternative visualization in substantially the same manners described above.
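As a rough illustration of how "statistical relations" and their strengths might steer the choice of an alternative chart, the sketch below ranks variable pairs by absolute Pearson correlation and swaps the x variable of a chart specification for the variable most strongly related to the y variable. This is an assumption-laden toy, not the patented method: the claims call for a multivariate model, whereas plain pairwise correlation is substituted here for brevity, and the dictionary-based specification format and the names `pearson`, `relation_strengths`, and `suggest_alternative` are all hypothetical.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    # A constant column has no defined correlation; treat it as unrelated.
    return cov / (sx * sy) if sx and sy else 0.0


def relation_strengths(data):
    """Return (var_a, var_b, |r|) for every variable pair, strongest first."""
    pairs = [
        (a, b, abs(pearson(data[a], data[b])))
        for a, b in combinations(data, 2)
    ]
    return sorted(pairs, key=lambda p: p[2], reverse=True)


def suggest_alternative(spec, data):
    """Hypothetical rule: keep the y variable, put its strongest partner on x."""
    target = spec["y"]
    partners = [
        (a if b == target else b, strength)
        for a, b, strength in relation_strengths(data)
        if target in (a, b)
    ]
    best, strength = partners[0]
    return {
        **spec,
        "x": best,
        "explanation": f"{best} has the strongest relation to {target} (|r|={strength:.2f})",
    }


# Illustrative data only; the values are made up for this sketch.
data = {
    "sales": [1, 2, 3, 4],
    "price": [2, 4, 6, 8.5],
    "noise": [5, 1, 4, 2],
}
first_spec = {"element": "point", "x": "noise", "y": "sales"}
alternative = suggest_alternative(first_spec, data)
```

Here the first chart plots `sales` against `noise`; because `price` correlates far more strongly with `sales`, the sketch proposes `price` for the x axis and attaches a short explanation string.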
Opening claim text (preview).
What is claimed:
1. A system comprising: a computer system including at least one processor configured to: analyze a specification of a selected first visualization of a data set to determine information about a data visualization goal, wherein the specification describes the selected first visualization; perform statistical analysis of the data set to determine statistical relations and strengths of the statistical relations between two or more variables in the data set, wherein performing the statistical analysis comprises: identifying variables associated with dimensions of the selected first visualization based on statistics of individual variables in the data set; building a multivariate model to predict dependence of a first variable on other variables of the data set based on the identified association of the first variable with a visualization dimension; and determining a relation of the first variable and a second variable based on the multivariate model; and generate an alternative visualization of the data set based on the specification of the selected first visualization of the data set and parameters comprising the determined information about the data visualization goal and the determined statistical relations and strengths, wherein the strengths indicate a mapping between the variables and visualization dimensions.
2. The system of claim 1, wherein the parameters further comprise metadata for the data set and statistical relations among more than two of the variables in the data set.
3. The system of claim 1, wherein the selected first visualization comprises a chart; analyzing the specification comprises determining an association of the first variable with a predetermined axis of the chart; and generating the alternative visualization comprises generating a representation of the relation of the first variable and the second variable based on the determination of the relation.
4. The system of claim 1, wherein the at least one processor is further configured to: generate an explanation for the alternative visualization.
5. The system of claim 1, wherein the specification of the first visualization comprises a set of visualization components, and a specification of an alternative visualization is generated by applying a pre-defined heuristic rule to create a modified instance of the set of visualization components.
6. The system of claim 1, wherein the at least one processor is further configured to: generate specifications of a plurality of new visualizations by applying heuristic rules to a specification of an initial visualization and re-applying the heuristic rules to the generated specifications of the plurality of new visualizations to produce specifications for one or more additional new visualizations.
7. The system of claim 1, wherein the selected first visualization comprises a chart and determining the information about a data visualization goal comprises: determining an interest in a dependence of the first variable of the data set on the second variable of the data set based on an association of the first and second variables with respective axes of the chart.
8. The system of claim 7, wherein the dependence of the first variable on the second variable is conditioned on a third variable of the data set.
9. The system of claim 1, wherein performing the statistical analysis comprises: calculating univariate statistics for individual variables of the data set; calculating bivariate statistics for pairs of the individual variables, wherein the pairs are selected based on the analysis of the specification of the selected first visualization; and determining multivariate statistical relationships between more than two of the variables.
10. The system of claim 1, wherein the specification of the selected first visualization comprises metadata specifying variable position, variable aesthetic representation, graphical elements, faceting, and pre-calculated statistics.
11. A computer program product comprising: a computer readable storage device having computer readable program code embodied therewith, the computer readable program code comprising computer readable program code configured to: analyze a specification of a selected first visualization of a data set to determine information about a data visualization goal, wherein the specification describes the selected first visualization; perform statistical analysis of the data set to determine statistical relations and strengths of the statistical relations between two or more variables in the data set, wherein performing the statistical analysis comprises: identifying variables associated with dimensions of the selected first visualization based on statistics of individual variables in the data set; building a multivariate model to predict dependence of a first variable on other variables of the data set based on the identified association of the first variable with a visualization dimension; and determining a relation of the first variable and a second variable based on the multivariate model; and generate an alternative visualization of the data set based on the specification of the selected first visualization of the data set and parameters comprising the determined information about the data visualization goal and the determined statistical relations and strengths, wherein the strengths indicate a mapping between the variables and visualization dimensions.
12. The computer program product of claim 11, wherein the parameters further comprise metadata for the data set and statistical relations among more than two of the variables in the data set.
13. The computer program product of claim 11, wherein analyzing the specification comprises determining an association of the first variable with a predetermined axis of a chart; and generating the alternative visualization comprises generating a representation of the relation of the first variable and the second variable based on the determination of the relation.
14. The computer program product of claim 11, further comprising computer readable program code configured to: generate an explanation for the alternative visualization.
15. The computer program product of claim 11, wherein the specification of the first visualization comprises a set of visualization components, and a specification of an alternative visualization is generated by applying a pre-defined heuristic rule to create a modified instance of the set of visualization components.
16. The computer program product of claim 11, further comprising computer readable program code configured to: generate specifications of a plurality of new visualizations by applying heuristic rules to a specification of an initial visualization and re-applying the heuristic rules to the generated specifications of the plurality of new visualizations to produce specifications for one or more additional new visualizations.
17. The computer program product of claim 11, wherein the selected first visualization comprises a chart and determining the information about a data visualization goal comprises: determining an interest in a dependence of the first variable of the data set on the second variable of the data set based on an association of the first and second variables with respective axes of the chart.
18. The computer program product of claim 17, wherein the dependence of the first variable on the second variable is conditioned on a third variable of the data set.
19. The computer program product of claim 11, whe
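Claims 5 and 6 (mirrored in claims 15 and 16) describe producing alternative specifications by applying heuristic rules to a set of visualization components, then re-applying the rules to the results. A minimal sketch of that idea follows. The dictionary specification format, the two rules `swap_axes` and `points_to_lines`, and the `expand` helper are hypothetical illustrations; the patent text shown here does not name any particular rules.

```python
def swap_axes(spec):
    """Hypothetical rule: exchange the x and y variables."""
    return {**spec, "x": spec["y"], "y": spec["x"]}


def points_to_lines(spec):
    """Hypothetical rule: draw points as a line; not applicable otherwise."""
    if spec["element"] != "point":
        return None
    return {**spec, "element": "line"}


RULES = [swap_axes, points_to_lines]


def expand(initial, rules, rounds=2):
    """Apply every rule to the initial spec, then re-apply to the results.

    Returns the new specifications in discovery order, without duplicates
    and without the initial specification itself.
    """
    seen = [initial]
    frontier = [initial]
    for _ in range(rounds):
        next_frontier = []
        for spec in frontier:
            for rule in rules:
                candidate = rule(spec)
                if candidate is not None and candidate not in seen:
                    seen.append(candidate)
                    next_frontier.append(candidate)
        frontier = next_frontier
    return seen[1:]


initial_spec = {"element": "point", "x": "price", "y": "sales"}
alternatives = expand(initial_spec, RULES)
```

With these two rules and two rounds, the first round yields an axis-swapped scatter plot and a line chart, and the second round adds the axis-swapped line chart. Each rule returns a modified copy, so the initial specification is never mutated.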
Drawing of charts or graphs · CPC title
Physics · mapped topic
Presentation of query results · CPC title