Method and apparatus for compressing webpage text

US9542373B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9542373-B2
Application numberUS-201314388529-A
CountryUS
Kind codeB2
Filing dateSep 4, 2013
Priority dateDec 31, 2012
Publication dateJan 10, 2017
Grant dateJan 10, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods and apparatus for compressing webpage text are provided. According to a webpage-opening request of a mobile terminal browser, a current language environment of the terminal is obtained. According to the language environment, a character container corresponding to the language environment is initialized. Requested webpage data are received. The received webpage data are parsed. The webpage data are merged and stored using the corresponding character container. During the initializing of the corresponding character container according to the language environment, it is determined whether the language environment belongs to Latin languages. When the language environment belongs to Latin languages, the character container is initialized to store utf-8 encoded data. Otherwise, the character container is initialized to store utf-16 encoded data. The present disclosure can intelligently determine a storage method according to the language environment, optimize and adapt stored content, and effectively reduce the browser memory occupied by the webpage data.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for compressing webpage text, comprising: according to a webpage-opening request of a mobile terminal browser, obtaining a current language environment of a terminal; according to the language environment, initializing a character container corresponding to the language environment, wherein: the character container is a storage space for storing related webpage data, and is implemented by applying for a “type” in software code to store the webpage data, when the language environment belongs to Latin languages, the character container is initialized to store utf-8 encoded data, and when the language environment does not belong to Latin languages, the character container is initialized to store utf-16 encoded data; and receiving webpage data that are requested, parsing the webpage data, and merging and storing the webpage data by using the corresponding character container, including: merging a plurality of text objects in source code of the webpage data that are originally stored in multiple character containers, and storing the merged plurality of text objects in the initialized character container, wherein the merged plurality of text objects are connected end to end, and adding an adapting index for the initialized character container, such that each of the plurality of text object is able to be retrieved in the initialized character container according to a corresponding index value of the adapting index wherein each of the plurality of text object is able to be retrieved in the initialized character container according to a corresponding index value of the adapting index without using the utf-8 encoded data or the utf-16 encoded data. 2. The method according to claim 1 , wherein the language environment belonging to Latin languages comprises English, German, or French. 3. The method according to claim 1 , wherein during a process of applying for the “type” in software code to store the webpage data, the method further comprises: first, applying for an extra space in order to store a head of the type for indicating additional descriptive information; and next, applying for an actual space in order to store the webpage data. 4. The method according to claim 1 , wherein the character container is implemented by applying for a String type in Java language to store the webpage text. 5. An apparatus for compressing webpage text, comprising one or more processors, memory, and one or more program units stored in the memory and to be executed by the one or more processors, the one or more program units comprising: a language-environment-determining unit configured to, according to a webpage-opening request of a mobile terminal browser, obtain a current language environment of a terminal; a character-container-initializing unit configured to, according to the language environment, initialize a character container corresponding to the language environment, wherein: the character container is a storage space for storing related webpage data, and is implemented by applying for a “type” in software code to store the webpage data, when the language environment belongs to Latin languages, the character container is initialized to store utf-8 encoded data, and when the language environment does not belong to Latin languages, the character container is initialized to store utf-16 encoded data; a data-transmitting unit configured to receive webpage data that are requested; and a data-parsing-storing unit configured to parse the received webpage data, and merge and store the webpage data by using the corresponding character container, wherein the data-parsing-storing unit comprises: a content-optimized-storage unit configured to merge a plurality of text objects in source code of the webpage data that are originally stored in multiple character containers, and storing the merged plurality of text objects in the initialized character container, wherein the merged plurality of text objects are connected end to end, and an adapting unit configured to add an adapting index for the initialized character container, such that each of the plurality of text object is able to be retrieved in the initialized character container according to a corresponding index value of the adapting index wherein each of the plurality of text object is able to be retrieved in the initialized character container according to a corresponding index value of the adapting index without using the utf-8 encoded data or the utf-16 encoded data. 6. The apparatus according to claim 5 , wherein, during a process of applying for the “type” in software code to store the webpage data: an extra space is applied for first, in order to store a head of the type for indicating additional descriptive information; and an actual space is applied for next, in order to store the webpage data. 7. The apparatus according to claim 5 , wherein the language environment belonging to Latin languages comprises English, German, or French. 8. The apparatus according to claim 5 , wherein the character container is implemented by applying for a String type in Java language to store the webpage text. 9. A non-transitory computer-readable storage medium having a program stored thereon, the program being executed by a processor, wherein, when executed by the processor, the program causes the processor to perform a process for compressing webpage text, the process comprising: according to a webpage-opening request of a mobile terminal browser, obtaining a current language environment of a terminal; according to the language environment, initializing a character container corresponding to the language environment, wherein: the character container is a storage space for storing related webpage data, and is implemented by applying for a “type” in software code to store the webpage data, when the language environment belongs to Latin languages, the character container is initialized to store utf-8 encoded data, and when the language environment does not belong to Latin languages, the character container is initialized to store utf-16 encoded data; and receiving webpage data that are requested, parsing the webpage data, and merging and storing the webpage data by using the corresponding character container, including: merging a plurality of text objects in source code of the webpage data that are originally stored in multiple character containers, and storing the merged plurality of text objects in the initialized character container, wherein the merged plurality of text objects are connected end to end, and adding an adapting index for the initialized character container, such that each of the plurality of text object is able to be retrieved in the initialized character container according to a corresponding index value of the adapting index wherein each of the plurality of text object is able to be retrieved in the initialized character container according to a corresponding index value of the adapting index without using the utf-8 encoded data or the utf-16 encoded data. 10. The non-transitory computer-readable storage medium according to claim 9 , wherein the language environment belonging to Latin languages comprises English, German, or French. 11. The non-transitory computer-readable storage medium according to claim 9 , wherein the program further causes the processor the process comprising: during a process of applying for the “type” in software code to store the webpage data, first, applying for an extra space in order to store a head of the type for indicating additional descriptive information; and next, applying for an actual space in order to store the webpage data. 12. The non-tr

Assignees

Inventors

Classifications

  • Browsing optimisation, e.g. caching or content distillation · CPC title

  • G06F40/146Primary

    Coding or compression of tree-structured data · CPC title

  • G06F40/126Primary

    Character encoding · CPC title

  • Parsing markup language streams (streaming G06F40/149) · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9542373B2 cover?
Methods and apparatus for compressing webpage text are provided. According to a webpage-opening request of a mobile terminal browser, a current language environment of the terminal is obtained. According to the language environment, a character container corresponding to the language environment is initialized. Requested webpage data are received. The received webpage data are parsed. The webpa…
Who is the assignee on this patent?
Guangzhou Ucweb Computer Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06F40/146. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 10 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).