TorchCV - 基于PyTorch的计算机视觉深度学习框架-FinClip官网

TorchCV - 基于PyTorch的计算机视觉深度学习框架

网友投稿 868 2022-10-30

TorchCV - 基于PyTorch的计算机视觉深度学习框架

TorchCV: A PyTorch-Based Framework for Deep Learning in Computer Vision

@misc{you2019torchcv, author = {Ansheng You and Xiangtai Li and Zhen Zhu and Yunhai Tong}, title = {TorchCV: A PyTorch-Based Framework for Deep Learning in Computer Vision}, howpublished = {\url{https://github.com/donnyyou/torchcv}}, year = {2019}}

This repository provides source code for most deep learning based cv problems. We'll do our best to keep this repository up-to-date. If you do find a problem about this repository, please raise an issue or submit a pull request.

Implemented Papers

Image ClassificationVGG: Very Deep Convolutional Networks for Large-Scale Image RecognitionResNet: Deep Residual Learning for Image RecognitionDenseNet: Densely Connected Convolutional NetworksShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile DevicesShuffleNet V2: Practical Guidelines for Ecient CNN Architecture DesignPartial Order Pruning: for Best Speed/Accuracy Trade-off in Neural Architecture Search Semantic SegmentationDeepLabV3: Rethinking Atrous Convolution for Semantic Image SegmentationPSPNet: Pyramid Scene Parsing NetworkDenseASPP: DenseASPP for Semantic Segmentation in Street ScenesAsymmetric Non-local Neural Networks for Semantic Segmentation Object DetectionSSD: Single Shot MultiBox DetectorFaster R-CNN: Towards Real-Time Object Detection with Region Proposal NetworksYOLOv3: An Incremental ImprovementFPN: Feature Pyramid Networks for Object Detection Pose EstimationCPM: Convolutional Pose MachinesOpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields Instance SegmentationMask R-CNN Generative Adversarial NetworksPix2pix: Image-to-Image Translation with Conditional Adversarial NetsCycleGAN: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks.

QuickStart with TorchCV

Now only support Python3.x, pytorch 1.3.

pip3 install -r requirements.txtcd lib/extssh make.sh

Performances with TorchCV

All the performances showed below fully reimplemented the papers' results.

Image Classification

ImageNet (Center Crop Test): 224x224

Model	Train	Test	Top-1	Top-5	BS	Iters	Scripts
ResNet50	train	val	77.54	93.59	512	30W	ResNet50
ResNet101	train	val	78.94	94.56	512	30W	ResNet101
ShuffleNetV2x0.5	train	val	60.90	82.54	1024	40W	ShuffleNetV2x0.5
ShuffleNetV2x1.0	train	val	69.71	88.91	1024	40W	ShuffleNetV2x1.0
DFNetV1	train	val	70.99	89.68	1024	40W	DFNetV1
DFNetV2	train	val	74.22	91.61	1024	40W	DFNetV2

Semantic Segmentation

Cityscapes (Single Scale Whole Image Test): Base LR 0.01, Crop Size 769

Model	Backbone	Train	Test	mIOU	BS	Iters	Scripts
PSPNet	3x3-Res101	train	val	78.20	8	4W	PSPNet
DeepLabV3	3x3-Res101	train	val	79.13	8	4W	DeepLabV3

ADE20K (Single Scale Whole Image Test): Base LR 0.02, Crop Size 520

Model	Backbone	Train	Test	mIOU	PixelACC	BS	Iters	Scripts
PSPNet	3x3-Res50	train	val	41.52	80.09	16	15W	PSPNet
DeepLabv3	3x3-Res50	train	val	42.16	80.36	16	15W	DeepLabV3
PSPNet	3x3-Res101	train	val	43.60	81.30	16	15W	PSPNet
DeepLabv3	3x3-Res101	train	val	44.13	81.42	16	15W	DeepLabV3

Object Detection

Pascal VOC2007/2012 (Single Scale Test): 20 Classes

Model	Backbone	Train	Test	mAP	BS	Epochs	Scripts
SSD300	VGG16	07+12_trainval	07_test	0.786	32	235	SSD300
SSD512	VGG16	07+12_trainval	07_test	0.808	32	235	SSD512
Faster R-CNN	VGG16	07_trainval	07_test	0.706	1	15	Faster R-CNN

Pose Estimation

OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields

Instance Segmentation

Mask R-CNN

Generative Adversarial Networks

Pix2pixCycleGAN

DataSets with TorchCV

TorchCV has defined the dataset format of all the tasks which you could check in the subdirs of data. Following is an example dataset directory trees for training semantic segmentation. You could preprocess the open datasets with the scripts in folder data/seg/preprocess

Dataset train image 00001.jpg/png 00002.jpg/png ... label 00001.png 00002.png ... val image 00001.jpg/png 00002.jpg/png ... label 00001.png 00002.png ...

Commands with TorchCV

Take PSPNet as an example. ("tag" could be any string, include an empty one.)

Training

cd scripts/seg/cityscapes/bash run_fs_pspnet_cityscapes_seg.sh train tag

Resume Training

cd scripts/seg/cityscapes/bash run_fs_pspnet_cityscapes_seg.sh train tag

Validate

cd scripts/seg/cityscapes/bash run_fs_pspnet_cityscapes_seg.sh val tag

Testing:

cd scripts/seg/cityscapes/bash run_fs_pspnet_cityscapes_seg.sh test tag

Demos with TorchCV

国产化驱动经济自主性与科技创新的未来之路

868 2022-10-30

TorchCV - 基于PyTorch的计算机视觉深度学习框架

政务桌面应用系统开发提升政府服务效率的关键所在

国产化驱动经济自主性与科技创新的未来之路

国产操作系统生态圈推动信息安全与技术自主发展的新机遇

最近发表

更多内容

小程序SDK

Finclip技术文档

小程序开发

小程序容器

小程序框架

Finclip小程序平台

Finclip用户投稿

车联网

推荐文章

小程序SDK是什么意思？小程序sdk和插件有什么区别？

小程序支付功能怎么实现？

企业app开发流程是什么？

app运营模式有哪些？

小程序多端引流怎么做？

小程序生态分析的机会和威胁

Flutter入门这一篇效率文章就够了

原生与跨平台解决方案分析,跨平台软件开发技术方案

热更新技术：让软件更新变得更加轻松快速

解决方案

银行解决方案

证券解决方案

互联网解决方案

政企OA解决方案

科技解决方案

loT解决方案

信任解决方案

热评文章

AppCan:基于混合模式的移动应用开发,移动混合模

Hybrid App混合模式开发的了解

小程序容器技术助力券商数字营销突围，小程序容器化的意

用mpvue开发微信小程序基础知识（vue.js开发

小程序多端框架全面测评对比，强烈推荐！

券商app架构 - 解析券商应用程序的构建与设计