Information processing apparatus, information processing method and program

US9922040B2 · US · B2

Patent metadata
Publication number: US-9922040-B2
Application number: US-201414561603-A
Country: US
Kind code: B2
Filing date: Dec 5, 2014
Priority date: Dec 9, 2013
Publication date: Mar 20, 2018
Grant date: Mar 20, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present invention aims to automatically determine an encoding parameter in consideration of a condition of compression efficiency and a memory usage and performing encoding based on the determined encoding parameter. To do so, an information processing method of an information processing apparatus comprises: estimating a memory usage concerning holding of a correspondence table of a part of structured data and codes; and estimating a compression effect obtained when the structured data is encoded, by holding the correspondence table, wherein a data size of the correspondence table is variable according to a value of a parameter, and the information processing method further comprises, under a condition of the memory usage, determining the value of the parameter on the basis of the estimated memory usage and the estimated compression effect, and encoding the structured data on the basis of the determined value of the parameter.
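The flow described in the abstract (estimate memory usage, estimate compression effect, then pick a parameter value under a memory condition) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and does not come from the patent text: the cost model (`ENTRY_OVERHEAD`, `CODE_SIZE`), the function names, and the choice of a maximum string length as the tuned parameter.

```python
from collections import Counter

ENTRY_OVERHEAD = 8  # assumed bookkeeping bytes per table entry (hypothetical)
CODE_SIZE = 2       # assumed bytes of the code that replaces a string (hypothetical)


def estimate_memory(strings, max_len):
    """Estimated bytes to hold table entries for distinct strings up to max_len."""
    distinct = {s for s in strings if len(s) <= max_len}
    return sum(len(s) + ENTRY_OVERHEAD for s in distinct)


def estimate_effect(strings, max_len):
    """Estimated bytes saved: every repeat of a tabled string shrinks to a code."""
    counts = Counter(s for s in strings if len(s) <= max_len)
    return sum((n - 1) * max(len(s) - CODE_SIZE, 0) for s, n in counts.items())


def choose_max_length(strings, memory_limit):
    """Return (max_len, effect) with the best effect whose table stays below the limit."""
    best_len, best_effect = 0, 0
    for max_len in sorted({len(s) for s in strings}):
        # Memory grows with max_len, so stop at the first value that no longer fits.
        if estimate_memory(strings, max_len) >= memory_limit:
            break
        effect = estimate_effect(strings, max_len)
        if effect > best_effect:
            best_len, best_effect = max_len, effect
    return best_len, best_effect
```

With a tight limit the sketch settles on a short maximum length, and with a loose limit it admits longer strings that save more bytes per repeat. A real encoder would replace the toy cost model with sizes derived from its own table layout.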

First claim

Opening claim text (preview).

What is claimed is:

1. An information processing apparatus comprising: one or more hardware processors; and a memory which stores instructions to be executed by the one or more hardware processors, wherein in accordance with the instructions executed by the one or more hardware processors, the information processing apparatus performs: (1) obtaining structured data comprising character strings to be subjected to encoding; (2) determining at least one of (a) a first parameter for limiting a length of a character string of the structured data to be subjected to encoding and (b) a second parameter for limiting a number of attributes of character strings of the structured data to be subjected to encoding, so that a data size of information indicating correspondence between character strings of the structured data to be subjected to encoding and codes for encoding the character strings of the structured data to be subjected to encoding is less than a predetermined size; (3) storing information indicating correspondence between the character strings of the structured data to be subjected to encoding and the codes for encoding the character strings of the structured data to be subjected to encoding, wherein the information is generated according to at least one of the first parameter and the second parameter; (4) encoding the character strings of the structured data to be subjected to encoding using the information indicating correspondence between the character strings of the structured data to be subjected to encoding and the codes for encoding the character strings of the structured data to be subjected to encoding, wherein the information indicating correspondence between the character strings of the structured data to be subjected to encoding and the codes for encoding the character strings of the structured data to be subjected to encoding comprises a correspondence table; (5) analyzing the obtained structured data; and (6) estimating a data size of each entry of the correspondence table or estimating a compression effect for each entry of the correspondence table, wherein at least one of the first parameter and the second parameter is determined based on a result of the estimating.

2. The information processing apparatus according to claim 1, wherein in the determining, under a condition of a memory usage, a process of (a) selecting, among entries of the correspondence table, the entry of which an estimated compression effect is highest as the entry to be held on the correspondence table, and (b) updating the condition of the memory usage on the basis of an estimated data size of the selected entry is repeated until the entry which satisfies the condition of the memory usage disappears.

3. The information processing apparatus according to claim 1, wherein in the determining, on the basis of (i) a threshold of a compression effect, (ii) an estimated memory usage, and (iii) an estimated compression effect in a case where there is no entry of which the compression effect is within the threshold and the memory usage satisfies a condition of a memory reduction, a process of (a) selecting the entry of which the compression effect is within the threshold and the memory usage is highest as the entry to be not held on the correspondence table, (b) deleting the selected entry, and (c) updating the condition of the memory reduction on the basis of the memory usage of the deleted entry is repeated until the entry of which the compression effect is within the threshold and which satisfies the condition of the memory reduction appears.

4. The information processing apparatus according to claim 1, wherein the parameter determined in the determining indicates a maximum length of a character string to be subjected to encoding.

5. The information processing apparatus according to claim 1, wherein the parameter determined in the determining indicates a maximum number of entries for elements and attributes of character strings to be subjected to encoding.

6. The information processing apparatus according to claim 1, wherein the structured data includes XML (Extensible Markup Language) data, and wherein in the encoding, the structured data is encoded into EXI (Efficient XML Interchange) data.

7. The information processing apparatus according to claim 1, wherein in the determining, both the first parameter and the second parameter are determined.

8. The information processing apparatus according to claim 1, wherein the information processing apparatus further performs: detecting a length of each of a plurality of character strings included in the obtained structured data; and detecting a number of times of appearance for each of the plurality of character strings in the obtained structured data, wherein in the determining, at least one of the first parameter and the second parameter are determined based on (i) the detected length and (ii) the detected number of times of appearance.

9. The information processing apparatus according to claim 1, wherein in the estimating, the data size of an entry is estimated based on both a length of a character string corresponding to the entry and whether the entry corresponds to an attribute or an element.

10. The information processing apparatus according to claim 1, wherein in the estimating, the compression effect for an entry is estimated based on (1) a length of a character string corresponding to the entry, (2) a number of times of appearance of the character string in the obtained structured data, and (3) whether the entry corresponds to an attribute or an element.

11. The information processing apparatus according to claim 1, wherein the predetermined size is set according to at least one of a user operation and a setting file.

12. An information processing method to be performed by an information processing apparatus, the information processing method comprising: obtaining structured data comprising character strings to be subjected to encoding; determining at least one of (a) a first parameter for limiting a length of a character string of the structured data to be subjected to encoding and (b) a second parameter for limiting a number of attributes of character strings of the structured data to be subjected to encoding, so that a data size of information indicating correspondence between character strings of the structured data to be subjected to encoding and codes for encoding the character strings of the structured data to be subjected to encoding is less than a predetermined size; storing the information indicating correspondence between the character strings of the structured data to be subjected to encoding and the codes for encoding the character strings of the structured data to be subjected to encoding, wherein the information is generated according to at least one of the first parameter and the second parameter; encoding the character strings of the structured data to be subjected to encoding using the information indicating correspondence between the character strings of the structured data to be subjected to encoding and the codes for encoding the character strings of the structured data to be subjected to encoding, wherein the information indicating correspondence between the character strings of the structured data to be subjected to encoding and the codes for encoding the character strings of the structured data to be subjected to encoding comprises a correspondence table; analyzing the obtained structured data; and estimating a data size of each entry of the correspondence table or estimating a compression effect for each entry of the correspondence table, wherein at least one of the first parameter and the second pa
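The repeated selection recited in claim 2 (take the entry with the highest estimated compression effect, update the memory condition by that entry's estimated size, and repeat until no entry satisfies the condition) reads like a greedy loop. The following Python sketch is one possible interpretation, not the patented implementation. The `Entry` type, its field names, and the use of a simple remaining-bytes budget as the "condition of the memory usage" are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    text: str    # character string the entry would encode
    size: int    # estimated bytes needed to hold the entry (assumed model)
    effect: int  # estimated bytes saved by holding the entry (assumed model)


def select_entries(candidates, memory_budget):
    """Greedily keep the highest-effect entries that still fit the budget."""
    remaining = memory_budget
    pool = list(candidates)
    held = []
    while True:
        # Only entries that satisfy the current memory condition are eligible.
        fitting = [e for e in pool if e.size <= remaining]
        if not fitting:
            break  # no entry satisfies the condition any more
        best = max(fitting, key=lambda e: e.effect)
        held.append(best)
        pool.remove(best)
        remaining -= best.size  # update the condition with the chosen entry's size
    return held, remaining
```

A greedy pass of this kind is simple and fast, but it is not guaranteed to find the best possible set of entries for a given budget; that trade-off is an observation about the sketch, not a statement from the claims.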

Assignees

Canon Kk

Inventors

Classifications

  • H03M7/607 · Primary

    Selection between different types of compressors · CPC title

  • Physics · mapped topic

  • Selection of Compressor · CPC title

  • Structured documents, e.g. XML · CPC title

  • H03M7/30 · Primary

    Compression (speech analysis-synthesis for redundancy reduction G10L19/00; for image communication H04N); Expansion; Suppression of unnecessary data, e.g. redundancy reduction · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9922040B2 cover?
The present invention aims to automatically determine an encoding parameter in consideration of a condition of compression efficiency and a memory usage and performing encoding based on the determined encoding parameter. To do so, an information processing method of an information processing apparatus comprises: estimating a memory usage concerning holding of a correspondence table of a part of…
Who is the assignee on this patent?
Canon Kk
What technology area does this patent fall under?
Primary CPC classification H03M7/607. Mapped technology areas include Electricity.
When was this patent published?
Publication date: Mar 20, 2018 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).