Resolved by following these reference posts:
Installing Jupyter:
https://blog.csdn.net/lanyuelvyun/article/details/93499423
Running PySpark:
https://www.cnblogs.com/chenxiangzhen/p/10706258.html
Integrating Scala with Jupyter:
https://blog.csdn.net/u014612752/article/details/51789233
Deploying Spark and Jupyter on Windows 10:
https://www.cnblogs.com/wubdut/p/11552059.html
https://www.cnblogs.com/xuliangxing/p/7279662.html
Switching Python versions on Linux:
https://blog.csdn.net/weixin_43645287/article/details/109776871
PySpark error: TypeError: an integer is required (got type bytes):
Summary:
Install Python; install Spark; copy the pyspark folder from Spark's python folder into Lib/site-packages under the local Python directory; install Hadoop and winutils.exe; install Jupyter; install py4j.
Python 3.8 does not work; uninstall it and reinstall Python 3.7:
https://blog.csdn.net/weixin_43645287/article/details/109776235
Then run: pip3 install py4j -i http://pypi.douban.com/simple --trusted-host pypi.douban.com
Done.
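A quick way to sanity-check the setup summarized above is a small script that reports the Python version and whether pyspark and py4j can be found. This is only a minimal sketch: the helper names python_version_ok and missing_modules are hypothetical, and the "below 3.8" rule simply encodes the observation in these notes, not a general Spark requirement.

```python
import importlib.util
import sys


def python_version_ok(version_info=sys.version_info):
    """Return True for Python 3 releases older than 3.8.

    Assumption: this mirrors the note above (3.8 failed, 3.7 worked);
    other Spark releases may behave differently.
    """
    major, minor = version_info[0], version_info[1]
    return major == 3 and minor < 8


def missing_modules(names=("pyspark", "py4j")):
    """Return the names from `names` that cannot be found on the import path."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    print("Python version:", sys.version.split()[0])
    print("Version below 3.8:", python_version_ok())
    print("Missing modules:", missing_modules() or "none")
```

If pyspark shows up as missing, the copy into Lib/site-packages probably did not land in the interpreter that is actually running; if py4j is missing, rerun the pip3 command above.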
Next step: how to run Spark + Scala in Jupyter