经典的美国气象数据统计每年最高温spark集群scala命令实现
步骤一: 读取hdfs上存储的气象数据val rddall = sc.textFile("hdfs://hadoop01:9000/ncdc/197*/*")rddall: org.apache.spark.rdd.RDD[String] = hdfs://hadoop01:9000/ncdc/* MapPartitionsRDD[93] at textFile at <console...