我也想说很尴尬, 搞到显现 发现 原来是个版本的bug
spark 1.6.0 有个BUG
希望更多人看到
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/Users/jzhang/github/spark/python/pyspark/sql/context.py", line 430, in createDataFrame
jdf = self._ssql_ctx.applySchemaToPythonRDD(jrdd.rdd(), schema.json())
File "/Users/jzhang/github/spark/python/pyspark/sql/context.py", line 691, in _ssql_ctx
"build/sbt assembly", e)
Exception: ("You must build Spark with Hive. Export 'SPARK_HIVE=true' and run build/sbt assembly", Py4JJavaError(u'An error occurred while calling None.org.apache.spark.sql.hive.HiveContext.
', JavaObject id=o34))
虽然是一个很明确的错误,但是在网上找了好久都没解决,最后居然发现是spark1.6.0的一个bug, 更新到1.6.1就没这个问题了,我也是醉了,唉,还是要记得多多保持软件的更新啊!!!
原文参考 http://shellbye.com/blog/tech_world/spark-bug-lead-to-error-note/