MapReduce 14: Student Scores (Enhanced Version), Requirement 3

Author: neixi_0592045 | Source: Internet | 2023-10-10 16:32


Problem Description

The earlier student-score exercises covered entry-level requirements. This post extends those requirements. First, note how the data has changed:

computer,huangxiaoming,85,86,41,75,93,42,85
computer,xuzheng,54,52,86,91,42
computer,huangbo,85,42,96,38
english,zhaobenshan,54,52,86,91,42,85,75
english,liuyifei,85,41,75,21,85,96,14
algorithm,liuyifei,75,85,62,48,54,96,15
computer,huangjiaju,85,75,86,85,85
english,liuyifei,76,95,86,74,68,74,48
english,huangdatou,48,58,67,86,15,33,85
algorithm,huanglei,76,95,86,74,68,74,48
algorithm,huangjiaju,85,75,86,85,85,74,86
computer,huangdatou,48,58,67,86,15,33,85
english,zhouqi,85,86,41,75,93,42,85,75,55,47,22
english,huangbo,85,42,96,38,55,47,22
algorithm,liutao,85,75,85,99,66
computer,huangzitao,85,86,41,75,93,42,85
math,wangbaoqiang,85,86,41,75,93,42,85
computer,liujialing,85,41,75,21,85,96,14,74,86
computer,liuyifei,75,85,62,48,54,96,15
computer,liutao,85,75,85,99,66,88,75,91
computer,huanglei,76,95,86,74,68,74,48
english,liujialing,75,85,62,48,54,96,15
math,huanglei,76,95,86,74,68,74,48
math,huangjiaju,85,75,86,85,85,74,86
math,liutao,48,58,67,86,15,33,85
english,huanglei,85,75,85,99,66,88,75,91
math,xuzheng,54,52,86,91,42,85,75
math,huangxiaoming,85,75,85,99,66,88,75,91
math,liujialing,85,86,41,75,93,42,85,75
english,huangxiaoming,85,86,41,75,93,42,85
algorithm,huangdatou,48,58,67,86,15,33,85
algorithm,huangzitao,85,86,41,75,93,42,85,75

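1. Data Explanation

The number of fields per record is not fixed:
The first field is the course name. There are four courses in total: computer, math, english, and algorithm.
The second field is the student's name, followed by the score from each exam.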
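
Because the number of score fields varies per record, any parser has to treat everything after the second field as scores. The following is a minimal plain-Java sketch of that idea, with no Hadoop dependency. The class name RecordParser and its nested Parsed type are hypothetical helpers introduced only for illustration; they are not part of the original job.

```java
import java.util.Arrays;

// Hypothetical helper for illustration; not part of the original MR program.
class RecordParser {

    /** One parsed input line: course, student name, and the mean of all scores. */
    static final class Parsed {
        final String course;
        final String name;
        final double average;

        Parsed(String course, String name, double average) {
            this.course = course;
            this.name = name;
            this.average = average;
        }
    }

    /** Parses "course,name,score[,score...]" with any number of scores (at least one). */
    static Parsed parse(String line) {
        String[] fields = line.split(",");
        if (fields.length < 3) {
            throw new IllegalArgumentException("expected course,name,score[,score...]: " + line);
        }
        // Fields from index 2 onward are exam scores.
        double average = Arrays.stream(fields, 2, fields.length)
                .mapToInt(s -> Integer.parseInt(s.trim()))
                .average()
                .getAsDouble(); // safe: at least one score is present
        return new Parsed(fields[0].trim(), fields[1].trim(), average);
    }

    public static void main(String[] args) {
        Parsed p = parse("computer,xuzheng,54,52,86,91,42");
        // (54 + 52 + 86 + 91 + 42) / 5 = 65.0
        System.out.println(p.course + "\t" + p.name + "\t" + p.average);
    }
}
```

Rejecting records with fewer than three fields is a design choice of this sketch; it avoids dividing by zero when a line carries no scores.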

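2. Requirements:

(1) For each course, compute the number of students who took the exams and the course average.

(2) For each course, compute every participating student's average score and write the results to separate files, one result file per course, sorted by average score from high to low, with scores kept to one decimal place.

(3) For each course, find the 2 students with the highest average score and output the course, the name, and the average score.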

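3. Approach

Mapper output:

key: CourseScore

value: NullWritable

Reducer output:

key: CourseScore

value: NullWritable

Main difficulty:

The grouping condition (course) differs from the sort order (course, then score), so a custom grouping comparator is required. Records are sorted by course and then by average score in descending order, while grouping compares the course only, so each reduce() call receives one course's records with the highest averages first.

The custom grouping code, CourseScoreGroupComparator, is defined inside the MR program as a nested class.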
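
To make the difference between the sort order and the grouping rule concrete, here is a small plain-Java simulation of what the shuffle phase and the reducer do together. It is only a sketch under stated assumptions: the class SecondarySortSketch, its Entry type, and the sample names and averages in main are hypothetical and do not come from the original program or data.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

// Hypothetical simulation for illustration; not part of the original MR program.
class SecondarySortSketch {

    static final class Entry {
        final String course;
        final String name;
        final double score;

        Entry(String course, String name, double score) {
            this.course = course;
            this.name = name;
            this.score = score;
        }
    }

    // Sort rule: course ascending, then score descending.
    static final Comparator<Entry> SORT =
            Comparator.comparing((Entry e) -> e.course)
                    .thenComparing(Comparator.comparingDouble((Entry e) -> e.score).reversed());

    // Grouping rule: course only, so all entries of one course form one group.
    static final Comparator<Entry> GROUP = Comparator.comparing((Entry e) -> e.course);

    /** Sorts with SORT, walks the groups defined by GROUP, keeps the first topN of each. */
    static List<Entry> topPerGroup(List<Entry> input, int topN) {
        List<Entry> sorted = new ArrayList<>(input);
        sorted.sort(SORT);
        List<Entry> result = new ArrayList<>();
        Entry groupHead = null;
        int taken = 0;
        for (Entry e : sorted) {
            if (groupHead == null || GROUP.compare(groupHead, e) != 0) {
                groupHead = e; // a new group starts here
                taken = 0;
            }
            if (taken < topN) {
                result.add(e);
                taken++;
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Made-up averages, used only to show the mechanics.
        List<Entry> input = Arrays.asList(
                new Entry("math", "s1", 70.0),
                new Entry("math", "s2", 90.0),
                new Entry("math", "s3", 80.0),
                new Entry("english", "s4", 60.0));
        for (Entry e : topPerGroup(input, 2)) {
            System.out.println(e.course + "\t" + e.name + "\t" + e.score);
        }
    }
}
```

If grouping used the full sort rule instead, every distinct (course, score) pair would form its own group, and taking the first two entries per group would no longer yield the top two students per course.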

4. Code Implementation

package com.ghgj.mazh.mapreduce.exercise.coursescore3;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.*;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

import java.io.IOException;

public class CourseScoreMR_Pro_03 {

    public static void main(String[] args) throws Exception {
        // Initialize the input and output paths
        String inputPath = "D:\\bigdata\\coursescore2\\input";
        String outputPath = "D:\\bigdata\\coursescore2\\output3";

        // Create the Job object
        Configuration conf = new Configuration();
        Job job = Job.getInstance(conf);

        // Set the class used to locate the job jar
        job.setJarByClass(CourseScoreMR_Pro_03.class);

        // Set the mapper, the reducer, and the other business-logic components
        job.setMapperClass(Mapper_CS.class);
        job.setReducerClass(Reducer_CS.class);

        // Output types of the map task
        job.setMapOutputKeyClass(CourseScore.class);
        job.setMapOutputValueClass(NullWritable.class);

        // Output types of the reduce task
        job.setOutputKeyClass(CourseScore.class);
        job.setOutputValueClass(NullWritable.class);

        // Group by course only, while sorting still uses CourseScore.compareTo
        job.setGroupingComparatorClass(CourseScoreGroupComparator.class);

        // Set the input and output paths; delete the output directory if it exists
        Path input = new Path(inputPath);
        Path output = new Path(outputPath);
        FileSystem fs = FileSystem.get(conf);
        if (fs.exists(output)) {
            fs.delete(output, true);
        }
        FileInputFormat.setInputPaths(job, input);
        FileOutputFormat.setOutputPath(job, output);

        // Submit the job and wait for it to finish
        boolean waitForCompletion = job.waitForCompletion(true);
        System.exit(waitForCompletion ? 0 : 1);
    }

    /**
     * Mapper component:
     *
     * Input key: LongWritable (offset of the line)
     * Input value: computer,xuzheng,54,52,86,91,42
     *
     * Output key: CourseScore
     * Output value: NullWritable
     */
    private static class Mapper_CS extends Mapper<LongWritable, Text, CourseScore, NullWritable> {

        CourseScore keyOut = new CourseScore();

        @Override
        protected void map(LongWritable key, Text value, Context context) throws IOException, InterruptedException {
            String[] splits = value.toString().split(",");
            String course = splits[0];
            String name = splits[1];

            int sum = 0;
            int num = 0;
            // The source text is cut off after "for (int i = 2; i". The rest of this
            // method is reconstructed from context: sum the scores, then emit the average.
            for (int i = 2; i < splits.length; i++) {
                sum += Integer.parseInt(splits[i]);
                num++;
            }
            double avgScore = (double) sum / num;

            keyOut.setCourse(course);
            keyOut.setName(name);
            keyOut.setScore(avgScore);
            context.write(keyOut, NullWritable.get());
        }
    }

    /**
     * Reducer component:
     *
     * Input key: CourseScore
     * Input values: NullWritable
     *
     * Output key: CourseScore
     * Output value: NullWritable
     */
    private static class Reducer_CS extends Reducer<CourseScore, NullWritable, CourseScore, NullWritable> {

        // Number of top-scoring students to output per course
        int topN = 2;

        @Override
        protected void reduce(CourseScore key, Iterable<NullWritable> values, Context context) throws IOException, InterruptedException {
            int number = 0;
            // Hadoop updates the key object as the values are iterated, so each
            // iteration sees the next CourseScore of the same course in sorted order.
            for (NullWritable nvl : values) {
                context.write(key, nvl);
                number++;
                if (number == topN) {
                    break;
                }
            }
        }
    }

    /**
     * Custom grouping component: two keys belong to the same group when their courses are equal.
     */
    public static class CourseScoreGroupComparator extends WritableComparator {

        CourseScoreGroupComparator() {
            super(CourseScore.class, true);
        }

        @Override
        public int compare(WritableComparable a, WritableComparable b) {
            CourseScore cs1 = (CourseScore) a;
            CourseScore cs2 = (CourseScore) b;
            return cs1.getCourse().compareTo(cs2.getCourse());
        }
    }
}

The CourseScore class is implemented as follows:

package com.ghgj.mazh.mapreduce.exercise.coursescore3;

import org.apache.hadoop.io.WritableComparable;

import java.io.DataInput;
import java.io.DataOutput;
import java.io.IOException;

public class CourseScore implements WritableComparable<CourseScore> {

    private String course;
    private String name;
    private double score;

    public CourseScore(String course, String name, double score) {
        this.course = course;
        this.name = name;
        this.score = score;
    }

    // Hadoop needs the no-argument constructor to create instances during deserialization
    public CourseScore() {
    }

    public String getCourse() {
        return course;
    }

    public void setCourse(String course) {
        this.course = course;
    }

    public String getName() {
        return name;
    }

    public void setName(String name) {
        this.name = name;
    }

    public double getScore() {
        return score;
    }

    public void setScore(double score) {
        this.score = score;
    }

    @Override
    public void write(DataOutput out) throws IOException {
        out.writeUTF(course);
        out.writeUTF(name);
        out.writeDouble(score);
    }

    @Override
    public void readFields(DataInput in) throws IOException {
        // Fields must be read in the same order in which write() wrote them
        this.course = in.readUTF();
        this.name = in.readUTF();
        this.score = in.readDouble();
    }

    /**
     * Sort rule: course ascending, then score descending.
     * compareTo serves as the sort rule and, unless a grouping comparator is set,
     * also as the grouping rule.
     */
    @Override
    public int compareTo(CourseScore cs) {
        int courseDiff = this.course.compareTo(cs.getCourse());
        if (courseDiff == 0) {
            double diff = cs.getScore() - this.score;
            if (diff == 0) {
                return 0;
            } else {
                return diff > 0 ? 1 : -1;
            }
        } else {
            return courseDiff > 0 ? 1 : -1;
        }
    }

    @Override
    public String toString() {
        return course + "\t" + name + "\t" + score;
    }
}
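
The write and readFields methods above must handle the fields in the same order, because the serialized form carries no field names. The following sketch shows that round trip using only java.io, so it runs without Hadoop. The class WritableRoundTrip and the sample values are hypothetical and serve only as an illustration of the same writeUTF/writeDouble pattern.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

// Hypothetical illustration of the field-order contract; not part of the original code.
class WritableRoundTrip {

    /** Serializes the three fields in the order course, name, score. */
    static byte[] write(String course, String name, double score) {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(buffer)) {
            out.writeUTF(course);
            out.writeUTF(name);
            out.writeDouble(score);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return buffer.toByteArray();
    }

    /** Reads the fields back in the same order and formats them tab-separated. */
    static String read(byte[] bytes) {
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(bytes))) {
            String course = in.readUTF();
            String name = in.readUTF();
            double score = in.readDouble();
            return course + "\t" + name + "\t" + score;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Made-up values, used only to show that the round trip preserves the fields.
        System.out.println(read(write("math", "student1", 61.5)));
    }
}
```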

5. Execution Result

The final result: the top 2 students by average score in each course.

And with that, the job is done.
