【EfﬁcientDet】《EfﬁcientDet：Scalable and Efﬁcient Object Detection》网站首页 其他

【EfﬁcientDet】《EfﬁcientDet：Scalable and Efﬁcient Object Detection》

bryant_meng 2024-06-17 10:32:09

简介【EfﬁcientDet】《EfﬁcientDet：Scalable and Efﬁcient Object Detection》

在这里插入图片描述

CVPR-2020

文章目录

1 Background and Motivation
2 Related Work
3 Advantages / Contributions
4 Method
- 4.1 BiFPN
- 4.2 EfﬁcientDet
5 Experiments
6 Conclusion（own）

1 Background and Motivation

现在的轻量级网络 only focus on a speciﬁc or a small range of resource requirements

Is it possible to build a scalable detection architecture with both higher accuracy and better efﬁciency across a wide spectrum of resource constraints.

本文在 efficientNet 的基础上，

提出 bi-directional feature pyramid network (BiFPN) ，双向特征金字塔
提出 compound scaling method，不仅 scale 主干，也同时 scale 特征金字塔，scale 头部检测器，

使得目标检测器（分割器）更快更准

在这里插入图片描述

2 Related Work

One-Stage Detectors
Multi-Scale Feature Representations
Model Scaling

3 Advantages / Contributions

BiFPN
compound scaling method（主干 / 金字塔 / 头）

4 Method

在这里插入图片描述
注意会 repeated

4.1 BiFPN

efﬁcient bidirectional cross-scale connections and weighted feature fusion.

在这里插入图片描述

$P_6^{td}$ 表示 intermediate feature at level 6

特征图融合的时候做了加权，有如下两种方式

（1）Softmax-based fusion

$sum_i frac{e^{w_i}}{varepsilon + sum_j e^{w_j}}$

缺点速度比较慢

（2）Fast normalized fusion

$sum_i frac{w_i}{varepsilon + sum_j w_j}$

效果和 Softmax-based fusion 差不多，速度快很多

金字塔采用的卷积都是 depthwise separable convolution

4.2 EfﬁcientDet

在这里插入图片描述
（1）EfﬁcientDet Architecture

one-stage 的框架

金字塔会重复堆叠

（2）Compound Scaling

uses a simple compound coefﬁcient φ to jointly scale up all dimensions of backbone network, BiFPN network, class/box network, and resolution.

在这里插入图片描述

2.1 主干网络的缩放采用的是 EfficientNet 的 B0~B6

2.2 BiFPN network 的缩放规则是

在这里插入图片描述
W 表示 width，也即通道数，D 表示 depth，也即重复的次数

2.3 Box/class prediction network 的缩放规则是

W 同 BiFPN，
在这里插入图片描述

2.4 Input image resolution 的缩放规则是

在这里插入图片描述

5 Experiments

5.1 Datasets

COCO

5.2 EfﬁcientDet for Object Detection

在这里插入图片描述
速度

5.3 EfﬁcientDet for Semantic Segmentation

use P2 for the ﬁnal per-pixel classiﬁcation
在这里插入图片描述

5.4 Ablation Study

COCO validation set

（1）Disentangling Backbone and BiFPN
在这里插入图片描述
设计的 BiFPN 还是特别的猛

（2）BiFPN Cross-Scale Connections
在这里插入图片描述
weighted + BiFPN 最猛

（3）Softmax vs Fast Normalized Fusion

在这里插入图片描述
效果仅差一点点，速度快了一些

横坐标应该是迭代次数，特征图融合的权重数值上（纵坐标）还是没有太大差异的

（4）Compound Scaling
在这里插入图片描述
一起 scale 效果最好

6 Conclusion（own）

金字塔也可以堆叠

scale 也可以包含金字塔和头部结构一起

风语者！平时喜欢研究各种技术，目前在从事后端开发工作，热爱生活、热爱工作。

上一篇
LSTM-理解 Part-2（RNN的局限性）

下一篇
大模型高效调参—PEFT库（ Parameter-Effi...

站长推荐

QT多线程的5种用法，通过使用线程解决UI主界面的耗时操作代码，防止界面卡死。
QT多线程的5种用法，通过使用线程解决UI主界面的耗时操作代码，防止界面卡死。...
U8W/U8W-Mini使用与常见问题解决
U8W/U8W-Mini使用与常见问题解决
stm32使用HAL库配置串口中断收发数据（保姆级教程）
stm32使用HAL库配置串口中断收发数据（保姆级教程）
分享几个国内免费的ChatGPT镜像网址(亲测有效)
分享几个国内免费的ChatGPT镜像网址(亲测有效)
Allegro16.6差分等长设置及走线总结
Allegro16.6差分等长设置及走线总结

您现在的位置是：首页 >其他 >【EfﬁcientDet】《EfﬁcientDet：Scalable and Efﬁcient Object Detection》网站首页其他

【EfﬁcientDet】《EfﬁcientDet：Scalable and Efﬁcient Object Detection》

文章目录

1 Background and Motivation

2 Related Work

3 Advantages / Contributions

4 Method

4.1 BiFPN

4.2 EfﬁcientDet

5 Experiments

5.1 Datasets

5.2 EfﬁcientDet for Object Detection

5.3 EfﬁcientDet for Semantic Segmentation

5.4 Ablation Study

6 Conclusion（own）

上一篇 LSTM-理解 Part-2（RNN的局限性）

下一篇 大模型高效调参—PEFT库（ Parameter-Effi...

站长推荐

上一篇
LSTM-理解 Part-2（RNN的局限性）

下一篇
大模型高效调参—PEFT库（ Parameter-Effi...