sparksql语法,通过编程方式读txt
来源:互联网 发布:qq远程监控软件 编辑:程序博客网 时间:2024/06/03 17:26
Programmatically Specifying the Schema(以编程的方式指定schema)scala> val people = sc.textFile("hdfs://node1:8020/test/input/people.txt")people: org.apache.spark.rdd.RDD[String] = MapPartitionsRDD[1] at textFile at <console>:21scala> val schemaString = "name age"schemaString: String = name agescala> import org.apache.spark.sql.types.{StructType,StructField,StringType};import org.apache.spark.sql.types.{StructType, StructField, StringType}scala> import org.apache.spark.sql.Row;import org.apache.spark.sql.Rowscala> val schema = StructType(schemaString.split(" ").map(fieldName => StructField(fieldName, StringType, true)))schema: org.apache.spark.sql.types.StructType = StructType(StructField(name,StringType,true), StructField(age,StringType,true))scala> val rowRDD = people.map(_.split(",")).map(p => Row(p(0), p(1).trim))rowRDD: org.apache.spark.rdd.RDD[org.apache.spark.sql.Row] = MapPartitionsRDD[3] at map at <console>:25scala> val peopleDataFrame = sqlContext.createDataFrame(rowRDD, schema)peopleDataFrame: org.apache.spark.sql.DataFrame = [name: string, age: string]scala> peopleDataFrame.registerTempTable("people")scala> val results = sqlContext.sql("SELECT name FROM people")15/12/15 09:46:17 INFO parse.ParseDriver: Parsing command: SELECT name FROM people15/12/15 09:46:18 INFO parse.ParseDriver: Parse Completedresults: org.apache.spark.sql.DataFrame = [name: string]scala> results.map(t => "Name: " + t(0)).collect().foreach(println)Name: MichaelName: AndyName: Justin
0 0
- sparksql语法,通过编程方式读txt
- sparksql语法,通过映射方式读txt
- sparksql语法,读json
- Sparksql语法,读json
- sparksql语法,读parquet,load,save
- SparkSQL相关语法总结
- SparkSQL编程指导
- 通过编程方式获取backtrace
- 通过编程方式获取backtrace
- txt内容通过另存为方式导入到word中
- 通过修改注册表改变txt文件的默认打开方式
- 自定义SparkSql语法的一般步骤
- SPARKSQL读SPARK表
- sparkSQL
- SparkSQL
- SparkSQL
- SparkSQL
- SparkSQL 实现UDF的两种方式
- oracle创建一个函数例子
- codevs 2471 表达式的转换--二叉树
- MFC入门教程
- VS2010 MFC中实现printf调试功能,即MFC程序利用控制台输出调试信息
- Spring boot 手动配置redis
- sparksql语法,通过编程方式读txt
- SQL Server 管理数据收集
- Java中的继承
- 如何生成不规则形状的mask,以解决对图像不规则区域设置ROI的问题
- iOS调试技巧
- C/C++内存分配
- Material design
- 2016.1.8 个人总结
- “可访问性不一致”问题处理