#Lutece0337. Just The Simple Fax
Just The Simple Fax
Migrated from Lutece 337 Just The Simple Fax
All parts of this problem, including description, images, samples, data and checker, might be broken. If you find bugs in this problem, please contact the admins.
Description
Fax machines use a form of compression based on run-length encoding. Run-length encoding (RLE) is a very simple form of data compression in which runs of data (that is, sequences in which the same data value occurs in many consecutive data elements) are stored as a single data value and count, rather than as the original run. This is most useful on data that contains many such runs: for example, relatively simple graphic images such as icons, text, and line drawings. It is not useful with files that don't have many runs as it could potentially double the file size (photograph, for example)
For this problem, you will write a program that encodes a block of data using a very simple RLE algorithm. A run is encoded using a two byte sequence. The first byte of the sequence contains the count, and the second contains the value to repeat. The count is encoded using an bit value with the high order bit set to . The remaining bits represent the count-. This gives a maximum run count of per byte sequence. (This implies that the minimum run count is ). Bytes that are not part of a run are encoded as-is with a prefix byte indicating the count of bytes in the non-run minus , through , representing a range of (the high order bit will always be in the case of nonrun data).
If a run contains more than bytes, then it must be encoded using multiple sequences, the first of which will always be a run of . All runs of or more must be encoded as a run. If a non-run contains more than bytes, then multiple non-run sequences must be used. For example, a run of would be encoded as two runs of followed by a non-run of .
Input
The first line of input contains a single integer , (), which is the number of data sets that follow. Each data set consists of multiple lines. The first line contains two () decimal integer values: the problem number, followed by a space, followed by the number of bytes , (), to encode. The remaining line(s) contain(s) the data to be encoded. Each line of data to encode will contain hexadecimal digits (except the last line, which may contain less). hexadecimal digits are used to represent each byte. Hexadecimal digits are: 0
,1
,2
,3
,4
,5
,6
,7
,8
,9
,A
,B
,C
,D
,E
,F
.
Output
For each data set, there are multiple lines of output. The first line contains a decimal integer giving the data set number followed by a single space, followed by a decimal integer giving the total number of encoded bytes. The remaining lines contain the encoded data each with hexadecimal digits, except the last, which may contain less.
Samples
4
1 1
07
2 5
F4A5A5A5A5
3 44
0000000000000000FFFFFF66665A5A5A5A5A71727374758008011011135555555555555501020399
777777CC
4 40
68686868686868686868686868686868686868686868686868686868686868686868686868686868
1 2
0007
2 4
00F481A5
3 32
850080FF016666825A0A717273747580080110111384550301020399807700CC
4 2
A568
Resources
Greater New York 2010