forked from amusi/daily-paper-computer-vision
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
2 changed files
with
29 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,27 @@ | ||
**2018-07-06** | ||
|
||
这篇文章有2篇论文速递,都是目标检测方向,一篇是RefineNet,其是SSD算法、RPN网络和FPN算法的结合,另一篇是DES,其是基于SSD网络进行了改进。注意,两篇都是CVPR 2018文章。 | ||
|
||
# Object Detection | ||
|
||
《Single-Shot Refinement Neural Network for Object Detection》 | ||
|
||
CVPR 2018 | ||
|
||
Abstract:对于目标检测,两阶段方法(例如,更快的R-CNN)已经实现了最高精度,而一阶段方法(例如,SSD)具有高效率的优点。为了继承两者的优点,同时克服它们的缺点,在本文中,我们提出了一种新的基于single-shot的检测器,称为RefineDet,它比两阶段方法获得更好的精度,并保持一阶段方法的检测效率。 RefineDet由两个相互连接的模块组成,即 anchor refinement 模块和目标检测模块。具体地,前者旨在(1)过滤掉negative anchor以减少分类器的搜索空间,以及(2)粗略地调整anchor的位置和大小以为随后的回归器提供更好的初始化。后一模块将精细anchor作为前者的输入,以进一步改进回归并预测多类别标签。同时,我们设计了一个传输连接块来传输锚点细化模块中的特征,以预测对象检测模块中对象的位置,大小和类别标签。多任务丢失功能使我们能够以端到端的方式训练整个网络。 PASCAL VOC 2007,PASCAL VOC 2012和MS COCO的大量实验证明,RefineDet可以高效地实现最先进的检测精度。 | ||
|
||
arXiv:https://arxiv.org/abs/1711.06897 | ||
|
||
github:https://github.com/sfzhang15/RefineDet | ||
|
||
注:之后会推出该论文的精读文章! | ||
|
||
《Single-Shot Object Detection with Enriched Semantics》 | ||
|
||
CVPR 2018 | ||
|
||
Abstract:我们提出了一种新颖的 single-shot 目标检测网络,名为“Detection with Enriched semantics”(DES)。我们的动机是通过语义分割分支和全局激活模块来丰富典型深度检测器内目标检测特征的语义。分割分支由弱分割ground-truth监督,即,不需要额外的注释。与此同时,我们采用全局激活模块,以自我监督的方式学习通道和对象类之间的关系。PASCAL VOC和MS COCO检测数据集的综合实验结果证明了该方法的有效性。特别是,使用基于VGG16的DES,我们在VOC2007测试中实现了81.7的mAP,在COCO测试开发上实现了32.8的mAP,在Titan Xp GPU上每个图像的推断速度为31.5毫秒。 使用较低分辨率的版本,我们在VOC2007上实现了79.7的mAP,每张图像的推断速度为13.0毫秒。 | ||
|
||
arXiv:https://arxiv.org/abs/1712.00433 | ||
|
||
注:之后会推出该论文的精读文章! |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters