How-to: resolve "java.io.NotSerializableException" issue during spark reading hbase table
来源:互联网 发布:我是淘宝黑名单买家 编辑:程序博客网 时间:2024/06/04 19:24
During reading htable via spark scala code, the following error happened:
15/10/28 16:39:00 WARN scheduler.TaskSetManager: Lost task 0.0 in stage 10.0 (TID 2536, slave14.dc.tj): java.lang.RuntimeException: java.io.NotSerializableException: org.apache.hadoop.hbase.io.ImmutableBytesWritable
Serialization stack:
- object not serializable (class: org.apache.hadoop.hbase.io.ImmutableBytesWritable, value: 30 30 5f 39 39 39 38 38 33)
- field (class: scala.Tuple2, name: _1, type: class java.lang.Object)
- object (class scala.Tuple2, (30 30 5f 39 39 39 38 38 33,keyval......
15/10/28 16:39:00 WARN scheduler.TaskSetManager: Lost task 0.0 in stage 10.0 (TID 2536, slave14.dc.tj): java.lang.RuntimeException: java.io.NotSerializableException: org.apache.hadoop.hbase.io.ImmutableBytesWritable
Serialization stack:
- object not serializable (class: org.apache.hadoop.hbase.io.ImmutableBytesWritable, value: 30 30 5f 39 39 39 38 38 33)
- field (class: scala.Tuple2, name: _1, type: class java.lang.Object)
- object (class scala.Tuple2, (30 30 5f 39 39 39 38 38 33,keyval......
at org.apache.spark.serializer.SerializationDebugger$.improveException(SerializationDebugger.scala:40)
at org.apache.spark.serializer.JavaSerializationStream.writeObject(JavaSerializer.scala:47)
at org.apache.spark.serializer.SerializationStream.writeAll(Serializer.scala:153)
at org.apache.spark.storage.BlockManager.dataSerializeStream(BlockManager.scala:1190)
at org.apache.spark.serializer.JavaSerializationStream.writeObject(JavaSerializer.scala:47)
at org.apache.spark.serializer.SerializationStream.writeAll(Serializer.scala:153)
at org.apache.spark.storage.BlockManager.dataSerializeStream(BlockManager.scala:1190)
......
The solution is turn spark to use KryoSerializer:
sparkconf.set("spark.serializer", "org.apache.spark.serializer.KryoSerializer")
Reference:
http://spark.apache.org/docs/latest/tuning.html#data-serialization
0 0
- How-to: resolve "java.io.NotSerializableException" issue during spark reading hbase table
- spark<java.io.NotSerializableException>
- How-to: resolve "Unapproved licenses:" issue during building with mvn+rat
- Spark学习-SparkSQL--06-spark读取HBase数据报异常java.io.NotSerializableException
- how to resolve "key values mismatch" issue
- How-to: resolve spark "/usr/bin/python: No module named pyspark" issue
- Redis on spark java.io.NotSerializableException
- SPARK java.io.NotSerializableException 异常处理
- how-to: resolve "java.lang.OutOfMemoryError: Java heap space" during using beeline && hiveserver2
- how-to: resolve "Connection refused" during connext hiveserver2 via beeline
- Caused by: java.io.NotSerializableException: org.apache.spark.SparkContext
- How-to: resolve " java.lang.OutOfMemoryError: unable to create new native thread" for hbase thrift
- Android开发时图片Bitmap序列化问题(How to fix a java.io.NotSerializableException: android.graphics.Bitmap)
- HBase Table Enabling issue
- java.io.NotSerializableException翻译
- java.io.NotSerializableException 解决办法
- java.io.NotSerializableException
- java.io.NotSerializableException
- linux目录介绍和文件操作
- 【旅游类APP源码】开发者案例开源,走过路过不可错过!
- 项目经验分享
- 解决qt移植中镜像过大问题
- iOS QQ第三方登实现
- How-to: resolve "java.io.NotSerializableException" issue during spark reading hbase table
- Android 编程下的代码混淆
- 海尔T520智能扫地机器人使用评测
- activiti与spring集成(maven)
- linux权限篇【一】
- MFC中获取各个窗体之间的句柄或者指针对象的方法 .
- WebLogic Tuxedo Connector 快速入门指南
- Python vs Matlab—— find 与 np.where
- 定制Android系统开发之六——系统服务JNI的实现分析