一、RDD论文(Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing)阅读笔记
two kind of operations on RDD:
1. transformation: map,filter, and join
2. action: persistent, count, collect, save
一、RDD论文(Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing)阅读笔记
two kind of operations on RDD:
1. transformation: map,filter, and join
2. action: persistent, count, collect, save