二分类模型评价指标-Python实现

网友投稿 1470 2022-09-02

二分类模型评价指标-Python实现

Sklearn的metrics模块下有多个计算模型评价指标的函数，本文只介绍二分类的指标函数。

1.准确率

1.1参数说明

sklearn.metrics.accuracy_score(y_true, y_pred, normalize=True, sample_weight=None)

	解释
参数	1. y_true 真实的label，一维数组格式 2.y_pred 模型预测的label，一维数组 3.normalize 默认True，返回正确预测的比例，False返回预测正确的个数 4.sample_weight 样本权重
返回结果	score:返回正确的比例或者个数，由normalize指定

计算公式如下，详情参考应用

import pandas as pdimport numpy as npfrom sklearn import metricsy_pred = [0,1,0,1]y_true = [0,0,0,0]

## 返回正确率metrics.accuracy_score(y_pred,y_true)

0.5

## 返回正确个数metrics.accuracy_score(y_pred,y_true,normalize=False)

2.混淆矩阵

2.1 参数说明


参数	1. y_true 真实的label，一维数组格式，列名 2.y_pred 模型预测的label，一维数组，行名 3.labels 默认不指定，此时y_true、y_pred取并集，升序，做label 4.sample_weight 样本权重
返回结果	C:返回混淆矩阵，注意label

2.2 应用

y_true = [2, 0, 2, 0, 0, 1]y_pred = [0, 0, 2, 2, 3, 2]metrics.confusion_matrix(y_true, y_pred)

array([[1, 0, 1, 1], [0, 0, 1, 0], [1, 0, 1, 0], [0, 0, 0, 0]], dtype=int64)

简单说明:y_true和y_pred的并集为[0,1,2,3]，对角线为每一类的预测准确率。[1,3]第一行第三例为1，意为实际为2，正确预测为2的样本有1个。这样看还是不直观。官网上有一个可视化混淆矩阵的函数，可以参看itertoolsimport numpy as npimport matplotlib.pyplot as pltfrom sklearn import svm, datasetsfrom sklearn.model_selection import train_test_splitfrom sklearn.metrics import confusion_matrixdef plot_confusion_matrix(cm, classes, normalize=False, title='Confusion matrix', cmap=plt.cm.Blues): """ This function prints and plots the confusion matrix. Normalization can be applied by setting `normalize=True`. """ if normalize: cm = cm.astype('float') / cm.sum(axis=1)[:, np.newaxis] print("Normalized confusion matrix") else: print('Confusion matrix, without normalization') print(cm) plt.imshow(cm, interpolation='nearest', cmap=cmap) plt.title(title) plt.colorbar() tick_marks = np.arange(len(classes)) plt.xticks(tick_marks, classes, rotation=45) plt.yticks(tick_marks, classes) fmt = '.2f' if normalize else 'd' thresh = cm.max() / 2. for i, j in itertools.product(range(cm.shape[0]), range(cm.shape[1])): plt.text(j, i, format(cm[i, j], fmt), horizontalalignment="center", color="white" if cm[i, j] > thresh else "black") plt.ylabel('True label') plt.xlabel('Predicted label') plt.tight_layout()

y_test = [1, 1, 1, 0]y_pred = [1, 1, 0, 0]# Compute confusion matrix# #注意混淆矩阵要求预测结果在第一个位置cnf_matrix = confusion_matrix(y_test, y_pred)## 限制两位小数np.set_printoptions(precision=2)

# Plot non-normalized confusion matrixplt.figure()plot_confusion_matrix(cnf_matrix, classes=[0, 1], title='Confusion matrix, without normalization')plt.show()

Confusion matrix, without normalization[[1 0] [1 2]]

# Plot normalized confusion matrixplt.figure()plot_confusion_matrix( cnf_matrix, classes=[0, 1], normalize=True, title='Normalized confusion matrix')plt.show()

Normalized confusion matrix[[ 1. 0. ] [ 0.33 0.67]]

3. Recall&Precision

3.1 参数说明

参数

1. y_true 真实的label，一维数组格式，列名

2.y_pred 模型预测的label，一维数组，行名

3.labels 默认不指定，此时y_true、y_pred取并集，升序，做label

4.sample_weight 样本权重

5.target_names 行标签，顺序和label的要一致

6.digits int，小数的位数

7. output_dict 输出格式，默认False，如果True，返回字典

返回结果

report:返回计算结果，形式依赖于output_dict

文档

应用

要注意y_true和y_pred的位置顺序。

y_true = [0, 1, 2, 2, 2]y_pred = [0, 0, 2, 2, 1]target_names = ['class 0', 'class 1', 'class 2']print(metrics.classification_report(y_true, y_pred))

precision recall f1-score support 0 0.50 1.00 0.67 1 1 0.00 0.00 0.00 1 2 1.00 0.67 0.80 3avg / total 0.70 0.60 0.61 5

print(metrics.classification_report(y_true, y_pred, target_names=target_names))

precision recall f1-score support class 0 0.50 1.00 0.67 1 class 1 0.00 0.00 0.00 1 class 2 1.00 0.67 0.80 3avg / total 0.70 0.60 0.61 5

4.Roc&Auc

计算AUC，画ROC曲线的貌似没有直接的函数。

4.1 参数


参数	1. y_true 真实的label，一维数组格式，列名 2.y_pred 模型预测的label，一维数组，行名 3.average 有多个参数可选，一般默认即可 4.sample_weight 样本权重 5.max_fpr 取值范围[0，1),如果不是None，则会标准化，使得最大值=max_fpr
返回结果	report:返回计算结果，形式依赖于output_dict
文档	= np.array([0, 0, 1, 1])y_scores = np.array([0.1, 0.4, 0.35, 0.8])metrics.roc_auc_score(y_true, y_scores) 0.75 2018-07-16 于南京市建邺区新城科技园版权声明：本文内容由网络用户投稿，版权归原作者所有，本站不拥有其著作权，亦不承担相应法律责任。如果您发现本站中有涉嫌抄袭或描述失实的内容，请联系我们jiasou666@gmail.com 处理，核实后本网站将在24小时内删除侵权内容。标签：参数上一篇:CentOS7.2与Nginx配置虚拟主机详解（centos7.6安装nginx）下一篇:PysparkNote001---Mac下JupyterNoteBook配置pyspark 相关文章小程序页面之间进行传值的操作办法 1470 2022-09-02 关于小程序网页能否进行登入的相关探讨 1470 2022-09-02 小程序生成参数二维码的具体方法全面介绍 1470 2022-09-02 发表评论取消回复暂时没有评论，来抢沙发吧~ 最近发表洞察探索小游戏大厅如何提升用户体验与企业转型效率洞察探讨小游戏SDK接入的最佳实践以及对企业跨平台开发的优势洞察如何通过小程序容器技术提升游戏推广效果在数字化转型的浪潮下，小程序游戏如何成为企业吸引年轻用户的重要工具？洞察探索小程序容器技术助力证券软件开发的最佳实践洞察了解如何通过开放银行应用提升金融服务效率与用户体验洞察探索open banking最新技术如何推动跨平台小程序开发，引领数字化转型新趋势洞察探索open banking如何通过小程序容器技术助力金融企业实现数据安全和数字化转型洞察如何提升企业数字化转型的灵活性，支持跨终端的小程序开发策略如何通过一键生成 App 加速企业数字化转型？更多内容小程序SDK Finclip技术文档小程序开发小程序容器小程序框架 Finclip小程序平台 Finclip用户投稿车联网推荐文章小程序SDK是什么意思？小程序sdk和插件有什么区别？小程序支付功能怎么实现？企业app开发流程是什么？ app运营模式有哪些？小程序多端引流怎么做？小程序生态分析的机会和威胁 Flutter入门这一篇效率文章就够了原生与跨平台解决方案分析,跨平台软件开发技术方案热更新技术：让软件更新变得更加轻松快速解决方案银行解决方案证券解决方案互联网解决方案政企OA解决方案科技解决方案 loT解决方案信任解决方案热评文章 AppCan:基于混合模式的移动应用开发,移动混合模 Hybrid App混合模式开发的了解小程序容器技术助力券商数字营销突围，小程序容器化的意用mpvue开发微信小程序基础知识（vue.js开发小程序多端框架全面测评对比，强烈推荐！券商app架构 - 解析券商应用程序的构建与设计小程序容器技术微信生态支持小程序生命周期管理开发生态工具完善小程序SDK 技术生态交流与合作地址：深圳市福田区凯丰路 10号国际金融科技城 19层电话：0755-86967467 邮箱：contact@finogeeks.com 粤ICP备2023123045号-2

二分类模型评价指标-Python实现

最近发表

更多内容

推荐文章

解决方案

热评文章