A Machine Learning Testing Framework Based on Scikit-Learn and Pandas - FinClip Official Site

User submission 556 2022-10-28

ML Testing

The goal of this module is to provide a flexible, easy-to-use way to test machine learning models, specifically those in scikit-learn.

The tests are meant to be readable enough that anyone can extend them to other frameworks and APIs while keeping the major notions the same. This library itself, however, will not be extended past the scikit-learn API.

You can read the docs for a more detailed explanation.

Tests Covered

- Testing Against Metrics
  - Classification Tests
    - Rule Based Testing:
      - precision lower boundary
      - recall lower boundary
      - f1 score lower boundary
      - AUC lower boundary
      - precision lower boundary per class
      - recall lower boundary per class
      - f1 score lower boundary per class
      - AUC lower boundary per class
    - Decision Based Testing:
      - precision fold below average
      - recall fold below average
      - f1 fold below average
      - AUC fold below average
      - precision fold below average per class
      - recall fold below average per class
      - f1 fold below average per class
      - AUC fold below average per class
    - Against New Predictions
      - proportion of predictions per class
      - class imbalance tests
      - probability distribution similarity tests
      - calibration tests
    - environmental impact tests
      - energyusage upper bound test
  - Regression Tests
    - Rule Based Testing:
      - Mean Squared Error upper boundary
      - Median Absolute Error upper boundary
    - Decision Based Testing:
      - Mean Squared Error fold above average
      - Median Absolute Error fold above average
- Testing Against Run Time Performance
  - prediction run time for simulated samples of size X
- Testing Against Input Data
  - percentage of correct imputes for any columns requiring imputation
  - dataset testing - http://vldb.org/pvldb/vol11/p1781-schelter.pdf
- Memoryful Tests
  - cluster testing - this is about the overall structure of the data. If the number of clusters increases or decreases substantially, that should be an indicator that the data has changed enough that things should possibly be rerun.
  - correlation testing - this is about ensuring that the correlation of a given column with previously collected data does not change very much. If the data does change, then the model should possibly be rerun.
  - shape testing - this is about ensuring that the general shape of a given column does not change much over time. The idea here is the same as for the correlation tests.
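
To make the rule-based and decision-based classification tests listed above more concrete, here is a minimal sketch built only on scikit-learn's `cross_val_score`. The function names, the `max_drop` parameter, and the reading of "fold below average" as "no fold may score more than `max_drop` below the mean of the folds" are assumptions made for illustration; they are not taken from this library's actual API.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score


def precision_lower_boundary(clf, X, y, lower_boundary, cv=3):
    """Rule-based test (hypothetical helper): the mean cross-validated
    precision must reach a boundary chosen by a person."""
    scores = cross_val_score(clf, X, y, cv=cv, scoring="precision")
    return bool(scores.mean() >= lower_boundary)


def precision_fold_below_average(clf, X, y, cv=3, max_drop=0.1):
    """Decision-based test (hypothetical helper): the threshold comes from
    the scores themselves. Fails if any fold's precision is more than
    `max_drop` below the mean precision over all folds."""
    scores = cross_val_score(clf, X, y, cv=cv, scoring="precision")
    threshold = scores.mean() - max_drop
    return bool(np.all(scores >= threshold))


# Synthetic binary classification data, used only to exercise the sketch.
X, y = make_classification(n_samples=300, n_features=10, random_state=0)
clf = LogisticRegression(max_iter=1000)

print(precision_lower_boundary(clf, X, y, lower_boundary=0.7))
print(precision_fold_below_average(clf, X, y, max_drop=0.1))
```

The same pattern should carry over to recall, f1, and AUC by changing the `scoring` string (for example `"recall"`, `"f1"`, or `"roc_auc"`), and to the regression tests by flipping the comparison so that the error must stay below an upper boundary.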

Possible Issues

There are some known issues. Any machine learning tests are going to require human interaction, because statistical tests are subject to type 1 and type 2 errors. Additionally, a model needs to be interrogated from many angles, not just one. So please use with care!
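
The type 1 error concern can be illustrated with a small simulation. The sketch below repeatedly compares two samples drawn from the same distribution, so any "the data has changed" alarm is by construction a false one. It uses SciPy's two-sample Kolmogorov-Smirnov test as a stand-in; which statistical tests this library actually runs is not stated here, so the choice of test and the helper name `false_alarm_rate` are assumptions.

```python
import numpy as np
from scipy.stats import ks_2samp


def false_alarm_rate(n_trials=200, n_samples=100, alpha=0.05, seed=0):
    """Fraction of trials in which the KS test flags a difference even
    though both samples come from the same standard normal distribution."""
    rng = np.random.default_rng(seed)
    alarms = 0
    for _ in range(n_trials):
        old = rng.normal(size=n_samples)  # stands in for historical data
        new = rng.normal(size=n_samples)  # stands in for newly arrived data
        if ks_2samp(old, new).pvalue < alpha:
            alarms += 1
    return alarms / n_trials


print(false_alarm_rate())
```

With `alpha=0.05`, roughly one comparison in twenty can be expected to raise an alarm when nothing has changed, which is why a failing test is better treated as a prompt for a person to look at the data than as an automatic verdict.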

Future Features

- cross validation score testing
- add custom loss function
- add custom accuracy function
- add these tests: https://datasciencecentral.com/profiles/blogs/a-plethora-of-original-underused-statistical-tests
- clustering for classification
- Unsupervised and semi supervised tests
  - verify similarity in clusters to similarity in labels
  - generate a small representative set of labels and then propagate other labels

