问题描述:
在输入文件中,有多个,其中每个输入文件代表一个学生的各科成绩,其中每行的数据形式为<科目,成绩>,你需要将每个文件中的每科目的成绩进行统计,然后求平均值。
输入文件格式:
这里有三个学生:
输出文件格式:
实例代码:
复制代码
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50package com.test; import java.io.IOException; import org.apache.hadoop.conf.Configuration; import org.apache.hadoop.fs.Path; import org.apache.hadoop.io.IntWritable; import org.apache.hadoop.io.LongWritable; import org.apache.hadoop.io.Text; import org.apache.hadoop.mapreduce.Job; import org.apache.hadoop.mapreduce.Mapper; import org.apache.hadoop.mapreduce.Reducer; import org.apache.hadoop.mapreduce.lib.input.FileInputFormat; import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat; public class StudentAverage { public static void main(String[] args) throws IllegalArgumentException, IOException, ClassNotFoundException, InterruptedException { @SuppressWarnings("deprecation") Job job = new Job(new Configuration(), "StudentAverage"); job.setJarByClass(StudentAverage.class); job.setMapperClass(Map.class); job.setReducerClass(Reduce.class); job.setMapOutputKeyClass(Text.class); job.setMapOutputValueClass(IntWritable.class); job.setOutputKeyClass(Text.class); job.setOutputValueClass(IntWritable.class); FileInputFormat.setInputPaths(job, new Path("hdfs://localhost:9000/Student/input")); FileOutputFormat.setOutputPath(job, new Path("hdfs://localhost:9000/Student/output")); job.waitForCompletion(true); System.out.println("运行结束!"); } public static class Map extends Mapper<LongWritable, Text, Text, IntWritable>{ protected void map(LongWritable key, Text value, org.apache.hadoop.mapreduce.Mapper<LongWritable, Text, Text, IntWritable>.Context context) throws java.io.IOException, InterruptedException { String[] data = value.toString().split(" "); context.write(new Text(data[0]), new IntWritable(Integer.parseInt(data[1]))); }; } public static class Reduce extends Reducer<Text, IntWritable, Text, IntWritable> { protected void reduce(Text key, java.lang.Iterable<IntWritable> values, Context context) throws java.io.IOException, InterruptedException { int average = 0; int sum = 0; for (IntWritable value : values) { sum += value.get(); } average = sum / 3; context.write(new Text(key), new IntWritable(average)); }; } }
转载于:https://www.cnblogs.com/zhou-jun/p/10195749.html
最后
以上就是土豪悟空最近收集整理的关于MapReduce编程练习(四),统计多个输入文件学生的平均成绩,的全部内容,更多相关MapReduce编程练习(四),统计多个输入文件学生内容请搜索靠谱客的其他文章。
本图文内容来源于网友提供,作为学习参考使用,或来自网络收集整理,版权属于原作者所有。
发表评论 取消回复