R-FCN: Object Detection via Region-based Fully Convolutional Networks

September 2019

tl;dr: Seminal paper from MSRA that improves upon faster R-CNN.

Overall impression

Faster RCNN computation increases as ROI number grows, as each ROI has a fully connected layer. R-FCN improves the computation efficiency by moving the FCN to before ROI pooling by generating position sensitive score maps (feat maps). Each PS score map is responsible to fire at a particular region (top-left corner) of a particular class.

Note that usually R-FCN has slightly lower performance, especially compared to FPN-powered Faster RCNN.

R-FCN cannot leverage FPN directly as the number of channels are too large for large dataset such as COCO. This is improved in Light-head RCNN to reduce the number of score maps from #class x p x p to 10. Instead, the simple voting mechanism is replaced by a fully connected layer.

Key ideas

Summaries of the key ideas

Technical details

Summary of technical details

Notes

This medium blog post from Jonathan Hui explains the intuition very well.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rfcn.md

rfcn.md

R-FCN: Object Detection via Region-based Fully Convolutional Networks

Overall impression

Key ideas

Technical details

Notes

Files

rfcn.md

Latest commit

History

rfcn.md

File metadata and controls

R-FCN: Object Detection via Region-based Fully Convolutional Networks

Overall impression

Key ideas

Technical details

Notes