2.3.3 Starting spark-shell in Standalone mode

bin/spark-shell \
--master spark://hadoop201:7077

Explanation:

  • --master spark://hadoop201:7077 specifies the master of the cluster to connect to

Run the wordcount program

sc.textFile("input/").flatMap(_.split(" ")).map((_,1)).reduceByKey(_+_).collect
res4: Array[(String, Int)] = Array((are,2), (how,2), (hello,4), (atguigu,2), (world,2), (you,2))
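
To make the stages of the one-liner above easier to follow, here is a minimal sketch that reproduces the same pipeline on plain Scala collections, so it can be pasted into any Scala REPL without a cluster. The sample lines are hypothetical stand-ins for the files under input/, and the groupBy step is only a local approximation of what reduceByKey(_+_) does on an RDD, not Spark's actual implementation.

```scala
// Hypothetical sample lines standing in for the contents of input/
val lines = Seq("hello world", "hello atguigu", "how are you")

// flatMap(_.split(" ")): split each line into words and flatten into one sequence
val words = lines.flatMap(_.split(" "))

// map((_, 1)): pair every word with an initial count of 1
val pairs = words.map((_, 1))

// Local stand-in for reduceByKey(_ + _): group pairs by word, then sum the counts
val counts = pairs.groupBy(_._1).map { case (word, ps) => (word, ps.map(_._2).sum) }
```

On a cluster, reduceByKey performs the per-key summation across partitions, and collect brings the resulting pairs back to the driver as an Array.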

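Note:

  • Every worker node must have the same input/ folder, because the tasks read this path on the worker where they run; otherwise a file-not-found exception is thrown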
Copyright © 尚硅谷大数据 2019, all rights reserved. Powered by Gitbook
Last revised: 2019-08-09 00:21:43
