Using Sqoop

Source: Internet  Editor: 程序博客网  Date: 2024/04/29 21:58

Sqoop User Guide:
http://sqoop.apache.org/docs/1.4.2/SqoopUserGuide.html#_syntax_11

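Sqoop is a tool for transferring data between Hadoop and relational databases. It can import data from a relational database (for example MySQL, Oracle, or Postgres) into Hadoop's HDFS, and it can export data from HDFS back into a relational database.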

usage: sqoop COMMAND [ARGS]

Available commands:
  codegen            Generate code to interact with database records
  create-hive-table  Import a table definition into Hive
  eval               Evaluate a SQL statement and display the results
  export             Export an HDFS directory to a database table
  help               List available commands
  import             Import a table from a database to HDFS
  import-all-tables  Import tables from a database to HDFS
  job                Work with saved jobs
  list-databases     List available databases on a server
  list-tables        List available tables in a database
  merge              Merge results of incremental imports
  metastore          Run a standalone Sqoop metastore
  version            Display version information

Options files let you package option settings in a file. The two invocations below are equivalent:

$ sqoop import --connect jdbc:mysql://localhost/db --username foo --table TEST

$ sqoop --options-file /users/homer/work/import.txt --table TEST

where the options file /users/homer/work/import.txt contains the following:

import
--connect
jdbc:mysql://localhost/db
--username
foo

An options file may contain blank lines and comments:

#
# Options file for Sqoop import
#

# Specifies the tool being invoked
import

# Connect parameter and value
--connect
jdbc:mysql://localhost/db

# Username parameter and value
--username
foo

#
# Remaining options should be specified in the command line.
#
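To make the relationship between an options file and the command line concrete, here is a minimal sketch that writes the commented options file to a temporary location and flattens it back into a single argument list. The helper name expand_options is hypothetical and not part of Sqoop; it only handles blank lines and lines starting with #, and ignores the quoting rules a real options file may use.

```shell
#!/bin/sh
# Sketch: expand a Sqoop-style options file into a flat argument list.
# expand_options is a hypothetical helper, not a Sqoop command.

options_file="$(mktemp)"
cat > "$options_file" <<'EOF'
#
# Options file for Sqoop import
#

# Specifies the tool being invoked
import

# Connect parameter and value
--connect
jdbc:mysql://localhost/db

# Username parameter and value
--username
foo
EOF

expand_options() {
    # Drop comment lines and blank lines, then join the rest with spaces.
    grep -v -e '^[[:space:]]*#' -e '^[[:space:]]*$' "$1" | tr '\n' ' ' | sed 's/ $//'
}

expanded="$(expand_options "$options_file")"

# Remaining options (here --table) are still given on the command line.
echo "sqoop $expanded --table TEST"

rm -f "$options_file"
```

The printed line matches the first invocation shown earlier, which is why the two forms can be used interchangeably.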

1. sqoop-import
Imports a table from an RDBMS into HDFS. The common arguments are:

Argument                              Description
--connect <jdbc-uri>                  Specify JDBC connect string
--connection-manager <class-name>     Specify connection manager class to use
--driver <class-name>                 Manually specify JDBC driver class to use
--hadoop-home <dir>                   Override $HADOOP_HOME
--help                                Print usage instructions
-P                                    Read password from console
--password <password>                 Set authentication password
--username <username>                 Set authentication username
--verbose                             Print more information while working
--connection-param-file <filename>    Optional properties file that provides connection parameters

To import a table, you first need to set up the database connection.
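As a rough illustration of how the connection arguments fit together, the sketch below assembles a sqoop-import command line from --connect, --username, and -P, then prints it for review. The JDBC URL, username, and table name are the placeholder values from the earlier examples, and RUN_SQOOP is a hypothetical environment variable introduced here only to guard the real invocation.

```shell
#!/bin/sh
# Sketch: assemble a sqoop-import command from the common connection arguments.
# Values are placeholders from the earlier examples; RUN_SQOOP is hypothetical.

connect_uri="jdbc:mysql://localhost/db"
db_user="foo"
table="TEST"

# -P reads the password from the console, so it never appears in the command line.
set -- import --connect "$connect_uri" --username "$db_user" -P --table "$table"

# Print the command for review before running anything.
import_cmd="sqoop $*"
echo "$import_cmd"

# Only execute when explicitly requested (requires sqoop on PATH).
if [ "${RUN_SQOOP:-0}" = "1" ]; then
    sqoop "$@"
fi
```

Using -P rather than --password keeps the password out of the shell history and the process list, at the cost of needing an interactive console.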
