MapReduce Pig Bug
来源:互联网 发布:读三毛梦里花落知多少 编辑:程序博客网 时间:2024/06/05 18:47
问题,在运行pig代码时候出现以下错误,强制类型转换的错误,但是在pig中并没有出现强制转换的语句(引用了UDF)。
AttemptID:attempt_1499336294438_15015_m_000001_0 Info:Error: java.lang.ClassCastException: java.lang.Integer cannot be cast to java.lang.String at org.apache.pig.backend.hadoop.HDataType.getWritableComparableTypes(HDataType.java:106) at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigGenericMapReduce$Map.collect(PigGenericMapReduce.java:111) at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigGenericMapBase.runPipeline(PigGenericMapBase.java:284) at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigGenericMapBase.map(PigGenericMapBase.java:277) at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigGenericMapBase.map(PigGenericMapBase.java:64) at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:145) at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:787) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:341) at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:163) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:415) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1671) at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:158)
解决方案和问题原因
data_grp = FOREACH Data GENERATE FLATTEN(grp(type,lon,lat)) AS grpnumber:double;
这里调用UDF并且另命名为其他列,注意这里的数据类型要和UDF中返回的数据类型一致,如果不一致将会出现上面的那个错误。
package cn.sibat.taxi.sum;import cn.sibat.taxi.sum.util.LocationUtil;import org.apache.pig.EvalFunc;import org.apache.pig.data.Tuple;import org.apache.pig.data.TupleFactory;import java.io.IOException;import java.util.ArrayList;import java.util.List;/** * Created by User on 2017/9/20. */public class Divide extends EvalFunc<Tuple> { //粤B093YU,2016-12-31T23:02:20.000Z,113.95858,22.543633,1 static int groupnumber = 0; static List<Tuple> store= new ArrayList<Tuple>(); static double sum =0; static double final_sum = 0; @Override public Tuple exec(Tuple tuple) throws IOException { TupleFactory tupleFactory = TupleFactory.getInstance(); Tuple out = tupleFactory.newTuple(); if (tuple.get(0) != null) { int type = Integer.parseInt(tuple.get(0).toString()); if (store.size() == 0 || type == 1) { store.add(tuple); // out.append("-1"); // return out; } else if (type == 0) { groupnumber++; for (int i = 0; i < store.size() - 1; i++) { double distance = LocationUtil.distance(Double.parseDouble(store.get(i).get(1).toString()), Double.parseDouble(store.get(i).get(2).toString()), Double.parseDouble(store.get(i + 1).get(1).toString()), Double.parseDouble(store.get(i + 1).get(2).toString())); if (distance == 0 || distance > 1500) {//上下两点的距离等于0或大于1500米,过滤 continue; } //System.out.println("d:" + distance); sum += distance; } final_sum=final_sum+sum; out.append(final_sum);// final_sum 是double类型所以在pig中也是double类型 store.clear(); //final_sum=final_sum+sum; sum = 0; } } return out; } }
阅读全文
0 0
- MapReduce Pig Bug
- Yahoo持续的Pig/Hadoop(MapReduce)工作流
- pig在MapReduce下的实例
- Pig在MapReduce下的实例
- pig 安装部署及mapreduce模式下测试
- Pig、Hive、MapReduce 解决分组 Top K 问题
- Pig、Hive、MapReduce 解决分组 Top K 问题
- Pig、Hive、MapReduce 解决分组 Top K 问题
- Pig、Hive、MapReduce 解决分组 Top K 问题
- pig
- pig
- Pig
- Pig
- Pig
- Pig
- Pig
- pig
- mongodb 3.2 mapreduce shard collection bug
- Java的字节流和字符流
- redis-事务
- String、StringBuffer与StringBuilder之间区别
- 微信网站扫码登录过期与不过期的作用
- 如何在同一台电脑上安装python2与3
- MapReduce Pig Bug
- MySQL 乱码问题
- 自定义toolbar布局不居中(已解决)
- PHP怎样打印半边金字塔、实心金字塔、空心金字塔、空心菱形
- Mac iterm2 开启密钥转发功能
- Jedis的Publish/Subscribe功能的运用
- 设计模式阅读备忘笔记-原则
- ajax请求成功,但执行error回调
- Kotlin 第一章: Kotlin简介以及插件安装