Neural Network Compression Framework (NNCF)

This repository contains a PyTorch*-based framework and samples for neural network compression.

The framework is organized as a Python* package that can be built and used in a standalone mode. The framework architecture is unified to make it easy to add different compression methods.

The samples demonstrate the usage of compression algorithms for three different use cases on public models and datasets: Image Classification, Object Detection and Semantic Segmentation. Compression results achievable with the NNCF-powered samples are reported in the "NNCF compression results" section below.

Key Features

- Support of various compression algorithms, applied during a model fine-tuning process to achieve the best compression parameters and accuracy:
  - Quantization
  - Binarization
  - Sparsity
  - Filter pruning
- Automatic, configurable model graph transformation to obtain the compressed model. The source model is wrapped by the custom class and additional compression-specific layers are inserted in the graph.
- Common interface for compression methods
- GPU-accelerated layers for faster compressed model fine-tuning
- Distributed training support
- Configuration file examples for each supported compression algorithm (a minimal configuration sketch is shown after this list)
- Git patches for prominent third-party repositories (mmdetection, huggingface-transformers) demonstrating the process of integrating NNCF into custom training pipelines
- Exporting compressed models to ONNX* checkpoints ready for usage with the OpenVINO™ toolkit
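
As an illustration of the configuration-driven workflow, the sketch below shows a minimal INT8 quantization configuration expressed as a Python dictionary; a JSON file such as the resnet50_int8.json used in the Usage section below would carry the same kind of structure. The field names follow the NNCF configuration schema, but the exact options vary between releases, so treat this as an assumption-laden example rather than a canonical config.

```python
# A minimal sketch of an NNCF compression configuration (assumed field names;
# check the NNCF configuration schema of your release before relying on it).
nncf_config_dict = {
    "input_info": {
        # Shape of one input sample; NNCF traces the model graph with it.
        "sample_size": [1, 3, 224, 224]
    },
    "compression": {
        # Uniform INT8 quantization of weights and activations.
        "algorithm": "quantization"
    }
}
```

The same dictionary can be written out as a .json file and loaded with NNCFConfig.from_json, as shown in the Usage section.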

Usage

NNCF is organized as a regular Python package that can be imported into your target training pipeline script. The basic workflow is to load a JSON configuration file containing NNCF-specific parameters that determine the compression to be applied to your model, and then pass your model along with the configuration file to the nncf.create_compressed_model function. This function returns a wrapped model ready for compression fine-tuning, together with a handle to an object that allows you to control the compression during the training process:

```python
import torch
import nncf
from nncf import create_compressed_model, Config as NNCFConfig
# Depending on the NNCF version, register_default_init_args may live in
# nncf or nncf.initialization; adjust the import accordingly.
from nncf import register_default_init_args

# Instantiate your uncompressed model
from torchvision.models.resnet import resnet50
model = resnet50()

# Load a configuration file to specify compression
nncf_config = NNCFConfig.from_json("resnet50_int8.json")

# Provide data loaders for compression algorithm initialization, if necessary
nncf_config = register_default_init_args(nncf_config, loss_criterion, train_loader)

# Apply the specified compression algorithms to the model
comp_ctrl, compressed_model = create_compressed_model(model, nncf_config)

# Now use compressed_model as a usual torch.nn.Module to fine-tune compression
# parameters along with the model weights
# ... the rest of the usual PyTorch-powered training pipeline

# Export to ONNX or .pth when done fine-tuning
comp_ctrl.export_model("compressed_model.onnx")
torch.save(compressed_model.state_dict(), "compressed_model.pth")
```

For a more detailed description of NNCF usage in your training code, see Usage.md. For in-depth examples of NNCF integration, browse the sample scripts code, or the example patches to third-party repositories.
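
To make the "rest of the usual PyTorch-powered training pipeline" step concrete, here is a minimal sketch of a compression-aware fine-tuning loop. It assumes the comp_ctrl and compressed_model objects from the snippet above, plus user-supplied train_loader, loss_criterion, optimizer, and num_epochs objects (hypothetical names); the scheduler and loss hooks shown follow the common NNCF controller pattern, but their exact names and placement should be verified against Usage.md for your NNCF version.

```python
# Minimal compression-aware fine-tuning loop (a sketch under the assumptions
# stated above, not a verbatim NNCF sample).
for epoch in range(num_epochs):
    comp_ctrl.scheduler.epoch_step()   # advance the compression schedule once per epoch
    compressed_model.train()
    for inputs, targets in train_loader:
        comp_ctrl.scheduler.step()     # advance the compression schedule once per iteration
        optimizer.zero_grad()
        outputs = compressed_model(inputs)
        # Task loss plus the compression-specific loss term (e.g. a sparsity penalty)
        loss = loss_criterion(outputs, targets) + comp_ctrl.loss()
        loss.backward()
        optimizer.step()
```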

For more details about the framework architecture, refer to NNCFArchitecture.md.

Model Compression Samples

For a quicker start with NNCF-powered compression, you can also try the sample scripts, each of which provides a basic training pipeline for image classification, object detection, or semantic segmentation, respectively.

To run the samples please refer to the corresponding tutorials:

- Image Classification sample
- Object Detection sample
- Semantic Segmentation sample

Third-party repository integration

NNCF may be straightforwardly integrated into training/evaluation pipelines of third-party repositories. See third_party_integration for examples of code modifications (Git patches and base commit IDs are provided) that are necessary to integrate NNCF into select repositories.

System requirements

- Ubuntu* 16.04 or later (64-bit)
- Python* 3.6 or later
- NVidia CUDA* Toolkit 10.2 or later
- PyTorch* 1.5 or later

Installation

We suggest installing and using the package in a Python virtual environment.

As a package built from a checked-out repository:

Install the following system dependencies:

sudo apt-get install python3-dev

Install the package and its dependencies by running the following in the repository root directory:

- For CPU & GPU-powered execution: python setup.py install
- For CPU-only installation: python setup.py install --cpu-only
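
After installing the package, a quick import check (a hypothetical snippet, not part of the repository) helps confirm that the package and, for GPU builds, the CUDA backend are visible:

```python
# Post-installation sanity check (hypothetical snippet, not shipped with NNCF).
import torch
import nncf

print("NNCF version:", nncf.__version__)
# For a --cpu-only installation, False is the expected output here.
print("CUDA available:", torch.cuda.is_available())
```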

As a Docker image

Use one of the Dockerfiles in the docker directory to build an image with an environment already set up and ready for running NNCF sample scripts.

Contributing

Refer to the CONTRIBUTING.md file for guidelines on contributions to the NNCF repository.

NNCF compression results

The results below were achieved using the sample scripts and NNCF configuration files provided with this repository. See the sample scripts' README.md files for links to the exact configuration files and final PyTorch checkpoints.

| Model | Compression algorithm | Dataset | PyTorch FP32 baseline | PyTorch compressed accuracy |
|---|---|---|---|---|
| ResNet-50 | None | ImageNet | - | 76.13 |
| ResNet-50 | INT8 | ImageNet | 76.13 | 76.05 |
| ResNet-50 | Mixed, 44.8% INT8 / 55.2% INT4 | ImageNet | 76.13 | 76.3 |
| ResNet-50 | INT8 + Sparsity 61% (RB) | ImageNet | 76.13 | 75.28 |
| ResNet-50 | Filter pruning, 30%, magnitude criterion | ImageNet | 76.13 | 75.7 |
| ResNet-50 | Filter pruning, 30%, geometric median criterion | ImageNet | 76.13 | 75.7 |
| Inception V3 | None | ImageNet | - | 77.32 |
| Inception V3 | INT8 | ImageNet | 77.32 | 76.92 |
| Inception V3 | INT8 + Sparsity 61% (RB) | ImageNet | 77.32 | 76.98 |
| MobileNet V2 | None | ImageNet | - | 71.81 |
| MobileNet V2 | INT8 | ImageNet | 71.81 | 71.34 |
| MobileNet V2 | Mixed, 46.6% INT8 / 53.4% INT4 | ImageNet | 71.81 | 70.89 |
| MobileNet V2 | INT8 + Sparsity 52% (RB) | ImageNet | 71.81 | 70.99 |
| SqueezeNet V1.1 | None | ImageNet | - | 58.18 |
| SqueezeNet V1.1 | INT8 | ImageNet | 58.18 | 58.02 |
| SqueezeNet V1.1 | Mixed, 54.7% INT8 / 45.3% INT4 | ImageNet | 58.18 | 58.84 |
| ResNet-18 | None | ImageNet | - | 69.76 |
| ResNet-18 | XNOR (weights), scale/threshold (activations) | ImageNet | 69.76 | 61.61 |
| ResNet-18 | DoReFa (weights), scale/threshold (activations) | ImageNet | 69.76 | 61.59 |
| ResNet-18 | Filter pruning, 30%, magnitude criterion | ImageNet | 69.76 | 68.69 |
| ResNet-18 | Filter pruning, 30%, geometric median criterion | ImageNet | 69.76 | 68.97 |
| ResNet-34 | None | ImageNet | - | 73.31 |
| ResNet-34 | Filter pruning, 30%, magnitude criterion | ImageNet | 73.31 | 72.54 |
| ResNet-34 | Filter pruning, 30%, geometric median criterion | ImageNet | 73.31 | 72.60 |
| SSD300-BN | None | VOC12+07 | - | 78.28 |
| SSD300-BN | INT8 | VOC12+07 | 78.28 | 78.07 |
| SSD300-BN | INT8 + Sparsity 70% (Magnitude) | VOC12+07 | 78.28 | 78.01 |
| SSD512-BN | None | VOC12+07 | - | 80.26 |
| SSD512-BN | INT8 | VOC12+07 | 80.26 | 80.02 |
| SSD512-BN | INT8 + Sparsity 70% (Magnitude) | VOC12+07 | 80.26 | 79.98 |
| UNet | None | CamVid | - | 71.95 |
| UNet | INT8 | CamVid | 71.95 | 71.66 |
| UNet | INT8 + Sparsity 60% (Magnitude) | CamVid | 71.95 | 71.72 |
| ICNet | None | CamVid | - | 67.89 |
| ICNet | INT8 | CamVid | 67.89 | 67.87 |
| ICNet | INT8 + Sparsity 60% (Magnitude) | CamVid | 67.89 | 67.24 |
| UNet | None | Mapillary | - | 56.23 |
| UNet | INT8 | Mapillary | 56.23 | 56.12 |
| UNet | INT8 + Sparsity 60% (Magnitude) | Mapillary | 56.23 | 56.0 |

Legal Information

[*] Other names and brands may be claimed as the property of others.
