项目作者: Michael98Liu

项目描述 :
Compress numerical data using variable byte encoding.
高级语言: C++
项目地址: git://github.com/Michael98Liu/Variable-Byte-VarByte-Encoding.git
创建时间: 2017-07-24T00:01:14Z
项目社区:https://github.com/Michael98Liu/Variable-Byte-VarByte-Encoding

开源协议:GNU General Public License v3.0

下载


Variable-Byte-VarByte-Encoding

Compress numerical data using variable byte encoding.

Test

I test the algorithm with first 10000 documents from ClueWeb09B dataset.

Speed

Compress: 7.40MB/s

  1. michael916@moa:~/Variable-Byte-VarByte-Encoding$ g++ compress_test.cpp -std=c++11 -o comp.out
  2. michael916@moa:~/Variable-Byte-VarByte-Encoding$ ./comp.out
  3. Compressed in: 3.57952 seconds.
  4. michael916@moa:~/Variable-Byte-VarByte-Encoding$ ls -l
  5. total 37624
  6. -rwxrwxr-x 1 michael916 michael916 37731 Jul 24 19:15 comp.out
  7. -rw-rw-r-- 1 michael916 michael916 1162 Jul 24 19:08 compress_test.cpp
  8. -rw-rw-r-- 1 michael916 michael916 1632 Jul 24 19:06 decompress_test.cpp
  9. -rw-rw-r-- 1 michael916 michael916 35141 Jul 24 17:21 LICENSE
  10. -rw-rw-r-- 1 michael916 michael916 923 Jul 24 19:06 README.md
  11. -rw-rw-r-- 1 michael916 michael916 10565973 Jul 24 19:15 test_comp
  12. -rwxrwxr-x 1 michael916 michael916 62715 Jul 24 19:13 test.out
  13. -rw-rw-r-- 1 michael916 michael916 27796432 Jul 24 19:13 test_uncomp
  14. -rw-rw-r-- 1 michael916 michael916 1712 Jul 24 17:21 varByte.cpp

Decompress: 10.28MB/s

  1. michael916@moa:~/Variable-Byte-VarByte-Encoding$ g++ decompress_test.cpp -std=c++11 -o test.out
  2. michael916@moa:~/Variable-Byte-VarByte-Encoding$ ./test.out
  3. Decompressed in: 2.55497 seconds.
  4. michael916@moa:~/Variable-Byte-VarByte-Encoding$ ls -l
  5. total 37624
  6. -rwxrwxr-x 1 michael916 michael916 37731 Jul 24 19:08 comp.out
  7. -rw-rw-r-- 1 michael916 michael916 1162 Jul 24 19:08 compress_test.cpp
  8. -rw-rw-r-- 1 michael916 michael916 1632 Jul 24 19:06 decompress_test.cpp
  9. -rw-rw-r-- 1 michael916 michael916 35141 Jul 24 17:21 LICENSE
  10. -rw-rw-r-- 1 michael916 michael916 923 Jul 24 19:06 README.md
  11. -rw-rw-r-- 1 michael916 michael916 10565973 Jul 24 19:08 test_comp
  12. -rwxrwxr-x 1 michael916 michael916 62715 Jul 24 19:13 test.out
  13. -rw-rw-r-- 1 michael916 michael916 27796432 Jul 24 19:13 test_uncomp
  14. -rw-rw-r-- 1 michael916 michael916 1712 Jul 24 17:21 varByte.cpp