Comparison with Traditional CNN

1. No Fully Connected Layers

CNN: Use FC layers at the end
FCN: Replace FC layers with Conv layers throughout

2. Output Structure

CNN: Single output per image (classification)
FCN: One prediction per pixel location

3. Input Size Flexibility

CNN: Fixed input size because FC layers required fixed size
FCN: Can handle variable input sizes

Fully Convolutional Network

Implementation

The network is made up of a bunch of convolutional layers, with downsampling and upsampling inside the network

Upsampling

Why we need upsampling?

We do downsampling in the network. Hence, if we want to achieve “output per pixel”, we need to “upsample” in order to scale up the feature vector again

Upsampling

D-DL4CV-Lec16aa-Upsampling

Chilfox

目錄

D-DL4CV-Lec16a-Semantic_Segmentation_Fully_Convolutional_Network

Comparison with Traditional CNN

1. No Fully Connected Layers

2. Output Structure

3. Input Size Flexibility

Fully Convolutional Network

Implementation

Upsampling

Why we need upsampling?

Upsampling

關係圖譜

反向連結