Installing and Configuring the Snappy Algorithm for HBase on Cloudera
Source: Internet  Published by: 淘宝科尔沃面具  Editor: 程序博客网  Time: 2024/06/04 19:08
Snappy is a compression/decompression library. It aims for very high speeds and reasonable compression, rather than maximum compression or compatibility with other compression libraries.
Snappy Installation
Snappy is provided in the native package along with the other native libraries (such as native gzip compression). If you are already using this package, there is no additional installation to do. Otherwise, follow these installation instructions:
To install Snappy on Ubuntu, Red Hat, or SUSE systems, install the native package with the platform's package manager (apt-get, yum, or zypper, respectively).
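The install commands did not survive in this copy of the text. As a hedged sketch, assuming the native package carries the CDH3-era name `hadoop-0.20-native` (an assumption; check the package name for your release), the command for each platform would look like this. The helper `snappy_install_cmd` is hypothetical and only prints the command instead of running it.

```shell
# Assumption: the native package is named hadoop-0.20-native (CDH3-era name).
PKG="hadoop-0.20-native"

# Hypothetical helper: print (do not run) the install command for a platform.
snappy_install_cmd() {
  case "$1" in
    ubuntu) echo "sudo apt-get install $PKG" ;;
    redhat) echo "sudo yum install $PKG" ;;
    suse)   echo "sudo zypper install $PKG" ;;
    *)      echo "unsupported platform: $1" >&2; return 1 ;;
  esac
}

snappy_install_cmd ubuntu
snappy_install_cmd redhat
snappy_install_cmd suse
```

Run the printed line that matches your distribution.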
To take advantage of Snappy compression you need to set certain configuration properties, which are explained in the following sections.
Using Snappy for MapReduce Compression
It's very common to enable MapReduce intermediate compression, since this can make jobs run faster without you having to make any application changes. Only the temporary intermediate files created by Hadoop for the shuffle phase are compressed (the final output may or may not be compressed). Snappy is ideal in this case because it compresses and decompresses very fast compared to other compression algorithms, such as Gzip.
To enable Snappy for MapReduce intermediate compression for the whole cluster, set the following properties in mapred-site.xml:
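The property listing is missing from this copy. A minimal sketch, assuming the classic (MR1) property names `mapred.compress.map.output` and `mapred.map.output.compression.codec`, which are not confirmed by the text above: the hypothetical helper below prints a fragment that could be pasted inside the `<configuration>` element of mapred-site.xml.

```shell
# Assumption: MR1-era property names; the original listing was not preserved.
# Hypothetical helper: print the fragment for mapred-site.xml.
snappy_intermediate_props() {
  cat <<'EOF'
<property>
  <name>mapred.compress.map.output</name>
  <value>true</value>
</property>
<property>
  <name>mapred.map.output.compression.codec</name>
  <value>org.apache.hadoop.io.compress.SnappyCodec</value>
</property>
EOF
}

snappy_intermediate_props
```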
You can also set these properties on a per-job basis.
Use the properties in the following table to compress the final output of a MapReduce job. These are usually set on a per-job basis.
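The table itself is not present in this copy. As a sketch under the assumption that it listed the classic (MR1) output-compression properties, the per-job form could be passed as generic `-D` options. The names `myjob.jar`, `MyJob`, `/input`, and `/output` are placeholders, and the `hadoop` command is printed rather than executed.

```shell
# Assumption: MR1-era property names; the original table was not preserved.
# Hypothetical helper: print one generic -D option per line.
snappy_output_opts() {
  printf '%s\n' \
    "-Dmapred.output.compress=true" \
    "-Dmapred.output.compression.codec=org.apache.hadoop.io.compress.SnappyCodec" \
    "-Dmapred.output.compression.type=BLOCK"
}

# Placeholders only: show how the options slot in before the job arguments.
echo hadoop jar myjob.jar MyJob $(snappy_output_opts) /input /output
```

Generic `-D` options are only picked up by jobs that parse them (for example, jobs launched through `ToolRunner`), and `mapred.output.compression.type` matters only for SequenceFile output.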
Using Snappy for Pig Compression
Set the same properties for Pig as for MapReduce (see the table in the previous section).
Using Snappy for Hive Compression
To enable Snappy compression for Hive output when creating SequenceFile outputs, use the following settings:
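The settings are missing from this copy. A hedged sketch, assuming they are the usual Hive output-compression switch plus the classic MapReduce output properties: the hypothetical helper below prints the `SET` statements, which could be typed at the Hive prompt or saved to a file.

```shell
# Assumption: these three settings are what the original listing contained.
# Hypothetical helper: print the Hive SET statements.
snappy_hive_settings() {
  cat <<'EOF'
SET hive.exec.compress.output=true;
SET mapred.output.compression.codec=org.apache.hadoop.io.compress.SnappyCodec;
SET mapred.output.compression.type=BLOCK;
EOF
}

snappy_hive_settings
```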
Configuring Flume to use Snappy Compression
Depending on the architecture of the machine you are installing on, add one of the following lines to /usr/lib/flume/bin/flume-env.sh:
- For 32-bit platforms:
- For 64-bit platforms:
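The two lines themselves are missing here. As a sketch, assuming the variable is `JAVA_LIBRARY_PATH` and the Hadoop native libraries live under `/usr/lib/hadoop/lib/native/` (both are assumptions based on the CDH3-era layout), the line for each architecture would be as printed by the hypothetical helper below.

```shell
# Assumptions: JAVA_LIBRARY_PATH is the variable Flume reads, and the native
# libraries use the CDH3-era directory layout.
flume_env_line() {
  case "$1" in
    i386|i686)    echo "export JAVA_LIBRARY_PATH=/usr/lib/hadoop/lib/native/Linux-i386-32" ;;
    x86_64|amd64) echo "export JAVA_LIBRARY_PATH=/usr/lib/hadoop/lib/native/Linux-amd64-64" ;;
    *)            echo "unrecognized architecture: $1" >&2; return 1 ;;
  esac
}

flume_env_line i386     # 32-bit platforms
flume_env_line x86_64   # 64-bit platforms
```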
The following section explains how to take advantage of Snappy compression.
Using Snappy compression in Flume Sinks
You can specify Snappy as a compression codec in Flume's configuration language. For example, the following specifies a Snappy-compressed SequenceFile sink on HDFS:
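The sink specification is missing from this copy. A tentative sketch, assuming the Flume 0.9.x configuration language with a `customdfs` sink and a `seqfile` output format that takes the codec name (this syntax is an assumption; `namenode` and `/path` are placeholders):

```shell
# Assumption: Flume 0.9.x sink syntax; "namenode" and "/path" are placeholders.
SNAPPY_SINK='customdfs("hdfs://namenode/path", seqfile("snappy"))'

# Print the sink specification to use in the node's configuration.
echo "$SNAPPY_SINK"
```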
Using Snappy compression in Sqoop Imports
On the command line, use the following option to enable Snappy compression:
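The option is not shown in this copy. As a sketch, assuming it is Sqoop's `--compression-codec` with the Hadoop Snappy codec class, an import might be assembled as follows. The JDBC URL and table name are placeholders, and the command is printed rather than executed.

```shell
# Assumption: the option is --compression-codec with the Snappy codec class.
SNAPPY_OPT="--compression-codec org.apache.hadoop.io.compress.SnappyCodec"

# Placeholders only: dbhost, mydb, and mytable are not from the original text.
echo sqoop import --connect jdbc:mysql://dbhost/mydb --table mytable \
  --as-sequencefile $SNAPPY_OPT
```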
It is a good idea to use the --as-sequencefile option with this compression option.
Configuring HBase to use Snappy Compression
Depending on the architecture of the machine you are installing on, add one of the following lines to /etc/hbase/conf/hbase-env.sh:
- For 32-bit platforms:
- For 64-bit platforms:
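The two lines are missing here as well. A sketch, assuming the variable is `HBASE_LIBRARY_PATH` and the same CDH3-era native library directories as for Hadoop (both are assumptions); the helper `hbase_env_line` is hypothetical.

```shell
# Assumptions: HBASE_LIBRARY_PATH is the variable HBase reads, and the native
# libraries use the CDH3-era directory layout.
hbase_env_line() {
  case "$1" in
    i386|i686)    echo "export HBASE_LIBRARY_PATH=/usr/lib/hadoop/lib/native/Linux-i386-32" ;;
    x86_64|amd64) echo "export HBASE_LIBRARY_PATH=/usr/lib/hadoop/lib/native/Linux-amd64-64" ;;
    *)            echo "unrecognized architecture: $1" >&2; return 1 ;;
  esac
}

hbase_env_line i386     # 32-bit platforms
hbase_env_line x86_64   # 64-bit platforms
```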
To use Snappy compression in HBase Tables, specify the column family compression as snappy. For example, in the shell:
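The shell example is missing from this copy. A minimal sketch, with `mytable` and `mycolumnfamily` as placeholder names: the hypothetical helper below prints an HBase shell `create` statement that sets the column family's `COMPRESSION` attribute. The lower-case value follows the wording above; some HBase versions may expect `'SNAPPY'` instead.

```shell
# Placeholders: 'mytable' and 'mycolumnfamily' are illustrative names.
# Hypothetical helper: print the HBase shell statement.
snappy_table_ddl() {
  cat <<'EOF'
create 'mytable', {NAME => 'mycolumnfamily', COMPRESSION => 'snappy'}
EOF
}

snappy_table_ddl
```

On a configured node, the printed statement can be typed at the `hbase shell` prompt.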