Hue 可视化的Hadoop架构工具

来源:互联网 发布:启示录2优化补丁 编辑:程序博客网 时间:2024/05/20 18:16

转载地址:http://www.datawhy.net/index.php?m=blog&f=view&id=42

1)简介

大数据可视化工具

2)编译

下载地址
编译帮助文档

  1. ##解压
  2. $ tar -zxvf hue-3.7.0-cdh5.3.6.tar.gz -C /opt/cdh/
  3. $ cd /opt/cdh
  4. $ mv hue-3.7.0-cdh5.3.6hue-3.7.0
  5. ##yum安装相关组件(联网)
  6. $sudo yum install ant asciidoc cyrus-sasl-devel cyrus-sasl-gssapi gcc gcc-c++ krb5-devel libtidy libxml2-devel libxslt-devel openldap-devel python-devel sqlite-devel openssl-devel mysql-devel gmp-devel
  7. ##编译
  8. $ cd /opt/cdh/hue-3.7.0
  9. $ make apps
  10. ##编译好的hue
  11. $ cd /opt/cdh/hue-3.7.0/apps

3)安装

配置文件目录:$ cd /opt/cdh/hue-3.7.0/desktop/conf  ---> hue.ini

Hue.ini位于 Hue的 desktop/conf/ 目录下

配置如下(desktop): 

[desktop]  # Set this to a random string, the longer the better.  # This is used for secure hashing in the session store.  secret_key=jFE93j;2[290-eiw.KEiwN2s3['d;/.q[eIW^y#e=+Iei*@Mn



启动:

  1. $ cd /opt/cdh/hue-3.7.0/
  2. $ build/env/bin/supervisor

访问:

http://bigdata.caiwei.me:8888/

4)集成hdfs

配置文档HDFS

  1. ##hdfs-site.xml
  2. dfs.webhdfs.enabledname>
  3. truevalue>
  4. property>
  5. ##core-site.xml
  6. hadoop.proxyuser.hue.hostsname>
  7. *value>
  8. property>
  9. hadoop.proxyuser.hue.groupsname>
  10. *value>
  11. property>
  12. ##集群之外访问:httpfs-site.xml
  13. httpfs.proxyuser.hue.hostsname>
  14. *value>
  15. property>
  16. httpfs.proxyuser.hue.groupsname>
  17. *value>
  18. property>

配置文档hue.ini

  1. fs_defaultfs=hdfs://bigdata.caiwei.me:8020
  2. # Default port is 14000 for HttpFs.
  3. webhdfs_url=http://bigdata.caiwei.me:50070/webhdfs/v1
  4. # Directory of the Hadoop configuration
  5. hadoop_conf_dir=/opt/cdh/hadoop-2.5.0/etc/hadoop
  6. # Directory of the Hadoop home
  7. hadoop_hdfs_home=/opt/cdh/hadoop-2.5.0/
  8. # Directory of the Hadoop bin
  9. hadoop_bin=/opt/cdh/hadoop-2.5.0/bin


6)集成yarn

需要启动 hiveserver2$ nohup hiveserver2 &

配置文档hue.ini

  1. # Enter the host on which you are running the ResourceManager
  2. resourcemanager_host=bigdata.caiwei.me.com
  3. # The port where the ResourceManager IPC listens on
  4. resourcemanager_port=8032
  5. # Whether to submit jobs to this cluster
  6. submit_to=True
  7. # URL of the ResourceManager API
  8. resourcemanager_api_url=http://bigdata.caiwei.me:8088
  9. # URL of the ProxyServer API
  10. proxy_api_url=http://bigdata.caiwei.me:8088
  11. # URL of the HistoryServer API
  12. history_server_api_url=http://bigdata.caiwei.me:19888


7)集成hive

配置文档hue.ini

注意:本地metastore不用配置。不是本地需要配置:https://cwiki.apache.org/confluence/display/Hive/AdminManual+MetastoreAdmin#AdminManualMetastoreAdmin-Local/EmbeddedMetastoreDatabase(Derby)
  1. # If Kerberos security is enabled, use fully-qualified domain name (FQDN).
  2. hive_server_host=bigdata.caiwei.me
  3. # Port where HiveServer2 Thrift server runs on.
  4. hive_server_port=10000
  5. # Hive configuration directory, where hive-site.xml is located
  6. hive_conf_dir=/opt/cdh/hive-0.13.1/conf
  7. # Timeout in seconds for thrift calls to Hive service
  8. server_conn_timeout=120


5)集成RDBMS

  [[databases]]
    # mysql, oracle, or postgresql configuration.
    [[[mysql]]]
      # Name to show in the UI.
      nice_name="My SQL DB"

      # For MySQL and PostgreSQL, name is the name of the database.
      # For Oracle, Name is instance of the Oracle server. For express edition
      # this is 'xe' by default.
      name=test

      # Database backend to use. This can be:
      # 1. mysql
      # 2. postgresql
      # 3. oracle
      engine=mysql

      # IP or hostname of the database to connect to.
      host=bigdata.caiwei.me

      # Port the database server is listening to. Defaults are:
      # 1. MySQL: 3306
      # 2. PostgreSQL: 5432
      # 3. Oracle Express Edition: 1521
      port=3306

      # Username to authenticate with when connecting to the database.
      user=root

      # Password matching the username to authenticate with when
      # connecting to the database.
      password=123456


0 0
原创粉丝点击