spark 将数据序列化存放内存
在spark shell客户端启动后执行scala> var rdd = sc.textFile("hdfs://mycluster/spark/data/acc.txt")rdd: org.apache.spark.rdd.RDD[String] = hdfs://mycluster/spark/data/acc.txt MapPartitionsRDD[6] at textFile at <console>:24scala&am