Export data from tables into partitioned folders on an external data lake
US-2024211491-A1 · Jun 27, 2024 · US
US2016203151A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2016203151-A1 |
| Application number | US-201514594464-A |
| Country | US |
| Kind code | A1 |
| Filing date | Jan 12, 2015 |
| Priority date | Jan 12, 2015 |
| Publication date | Jul 14, 2016 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Technology is disclosed herein for compressing, encoding, and otherwise reducing the size of resource files. In at least one implementation, similarity compression is employed to reduce the size of a resource file. In another implementation, map-less encoding is employed to reduce the number of bytes used to represent a resource string. Bit-level compression is employed in another implementation to reduce the quantity of bits used to encode each character in a string. In addition, implementations are disclosed related to technology for naming strings and accelerated string location and retrieval.
Opening claim text (preview).
1 . A method to facilitate enhanced resource file naming and compression comprising: ordering a set of resource strings in a resource file to produce an ordered set of resource strings in the resource file, wherein the resource file comprises the ordered set of resource strings and a resource name corresponding to each of the ordered set of resource strings; for each of the ordered set of resources strings, hashing the resource name to generate a hash value; and replacing the resource name with a resource identifier that comprises the hash value. 2 . The method of claim 1 further comprising reducing a size of the resource file by, for any of the ordered set of resource strings that qualify for similarity compression, at least: identifying a similarity value representative of an extent to which an initial portion of a resource string is similar to a next resource string in the ordered set of resource strings; and replacing the initial portion of the resource string in the resource file with the similarity value while retaining in the file a remaining portion of the resource string that was not replaced by the similarity value. 3 . The method of claim 2 wherein each of the ordered set of resource strings comprises a set of characters and wherein the method further comprises further reducing the size of the resource file by, for any of the ordered set of resource strings that qualify for map-less encoding, at least: identifying a double-byte Unicode representation of each character in the set of characters in the resource string, wherein the double-byte Unicode representation comprises a lower byte and an upper byte; identifying at least one character in the set of characters for which the upper byte of the one character comprises a non-zero value, wherein the non-zero value indicates an occurrence of non-Latin characters; setting a value of an encoding byte to the non-zero value and retaining the encoding byte in the resource file to reflect the occurrence of the non-Latin characters; discarding the upper byte from the resource file for each of the set of characters; and retaining the lower byte in the resource file for each of the set of characters. 4 . The method of claim 3 further comprising determining whether or not any of the ordered set of resource strings qualifies for the map-less encoding based at least in part on whether or not the set of characters for any given resource string of the ordered set of resource strings includes characters from more than two character ranges corresponding to more than two different languages. 5 . The method of claim 4 further comprising, when a value of the lower byte of any of the non-Latin characters falls within a lower half of a range of possible values for the lower byte, shifting a value of the lower byte of any Latin characters into an upper half of the range of possible values for the lower byte. 6 . The method of claim 5 further comprising shifting the encoding byte to reflect the shifting of the value of the lower byte of the Latin characters into the upper half of the range of the possible values for the lower byte. 7 . The method of claim 3 wherein the lower byte retained in the resource file for each of the set of characters comprises an initial quantity of bits and wherein the method further comprises further reducing the size of the resource file by, for any of the ordered set of resource strings that qualify for bit-level compression, at least: defining a dictionary specific to the resource string to include one or more characters of the set of characters in the resource string; and for each of the set of characters in the resource string, encoding the character in the resource file in a subsequent quantity of bits that is less than the initial quantity of bits and that represents a position of the character in either the dictionary or in a range of characters not included in the dictionary. 8 . The method of claim 7 wherein the lower byte comprises eight bits initially and five bits subsequent to the encoding. 9 . The method of claim 7 further comprising determining whether or not any of the ordered set of resource strings qualify for bit-level compression based at least in part on a length of a given string of the ordered set of resource strings. 10 . The method of claim 1 wherein the resource file comprises a one of a plurality of files associated with a productivity application and wherein the ordered set of resource strings describe features in the productivity application. 11 . The method of claim 1 wherein ordering the set of resource strings in the resource file comprises alphabetizing the set of resource strings. 12 . An apparatus comprising: one or more computer readable storage media; and program instructions stored on the one or more computer readable storage media for facilitating enhanced resource file operations that, when executed by a processing system, direct the processing system to at least: when a resource string is invoked in a program using a hash value associated with the resource string, identify a location in a resource file from where to retrieve the resource string based on a relationship between the hash value, a total number of possible hash values, and a total number of resource strings in the resource file; and return the resource string to the program. 13 . The apparatus of claim 12 wherein the program instructions further direct the processing system to, when the resource string has been compressed using similarity compression, decompress the resource string with respect to the similarity compression prior to returning the resource string to the program. 14 . The apparatus of claim 13 wherein the program instructions further direct the processing system to, when the resource string has been encoded using map-less encoding, decode the resource string with respect to the map-less encoding prior to returning the resource string to the program. 15 . The apparatus of claim 14 wherein the program instructions further direct the processing system to, when the resource string has been compressed using bit-level compression decompress the resource string with respect to the bit-level compression prior to returning the resource string to the program. 16 . The apparatus of claim 12 wherein the program instructions further direct the processing system to, when the resource string has been encoded using map-less encoding, decode the resource string with respect to the map-less encoding prior to returning the resource string to the program. 17 . The apparatus of claim 16 wherein the program instructions further direct the processing system to, when the resource string has been compressed using bit-level compression decompress the resource string with respect to the bit-level compression prior to returning the resource string to the program. 18 . The apparatus of claim 12 further comprising the processing system operatively coupled to the computer readable storage media and configured to read and execute the program instructions. 19 . A system to facilitate enhanced resource file compression comprising: a storage system having software stored thereon: a processing system operatively coupled to the storage system that reads and executes the software; and program instructions stored on the storage system that, when read and executed by the processing system, directs the processing system to at least: decompress at least a resource string of a plurality of resource strings in a resource file based at least in part
Hash-based (content-based indexing of textual data G06F16/31) · CPC title
employing the use of a dictionary, e.g. LZ78 · CPC title
Unicode · CPC title
Sorting · CPC title
Physics · mapped topic
Related publications grouped by family.
Answers are generated from the same data shown on this page.