What is Upsampling?

Upsampling is the reverse of downsampling. Instead of reducing a larger input to a smaller output while extracting features, we expand the input to produce a larger output.

In-Network Upsampling

Unpooling

Bed of Nails

Expand each input pixel to a $2 \times 2$ block: place original value in top-left, fill remaining positions with 0
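A minimal sketch of the rule above in plain Python, assuming the feature map is a 2D list; the function name `bed_of_nails` and the `scale` parameter are illustrative, not from the lecture:

```python
def bed_of_nails(x, scale=2):
    """Place each value in the top-left of a scale x scale block, zeros elsewhere."""
    h, w = len(x), len(x[0])
    out = [[0] * (w * scale) for _ in range(h * scale)]
    for i in range(h):
        for j in range(w):
            out[i * scale][j * scale] = x[i][j]
    return out


print(bed_of_nails([[1, 2], [3, 4]]))
# [[1, 0, 2, 0], [0, 0, 0, 0], [3, 0, 4, 0], [0, 0, 0, 0]]
```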

Nearest Neighbor

Expand each input pixel to a $2 \times 2$ block: duplicate original value to all positions
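The same idea can be sketched in plain Python, again assuming a 2D list as input; `nearest_neighbor` and `scale` are illustrative names:

```python
def nearest_neighbor(x, scale=2):
    """Copy each value into every position of its scale x scale block."""
    h, w = len(x), len(x[0])
    return [[x[i // scale][j // scale] for j in range(w * scale)]
            for i in range(h * scale)]


print(nearest_neighbor([[1, 2], [3, 4]]))
# [[1, 1, 2, 2], [1, 1, 2, 2], [3, 3, 4, 4], [3, 3, 4, 4]]
```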

Bilinear Interpolation

We fit a polynomial to the values at the 4 nearest neighboring pixel coordinates:

$$
p(x, y) = \sum_{i=0}^{1} \sum_{j=0}^{1} a_{ij} x^{i} y^{j}
$$

Then we plug the output coordinates into $p$ to get the pixel values.
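As a rough illustration, the fit has a closed form if we assume the four neighbors sit on the corners of the unit square, with values `f00` at $(0, 0)$, `f10` at $(1, 0)$, `f01` at $(0, 1)$ and `f11` at $(1, 1)$. These names and the unit-square normalization are assumptions made for this sketch:

```python
def bilinear_coeffs(f00, f10, f01, f11):
    """Solve p(x, y) = a00 + a10*x + a01*y + a11*x*y through the four corners."""
    a00 = f00
    a10 = f10 - f00
    a01 = f01 - f00
    a11 = f11 - f10 - f01 + f00
    return a00, a10, a01, a11


def bilinear_eval(coeffs, x, y):
    """Evaluate the fitted polynomial at an output coordinate in [0, 1] x [0, 1]."""
    a00, a10, a01, a11 = coeffs
    return a00 + a10 * x + a01 * y + a11 * x * y


coeffs = bilinear_coeffs(1, 2, 3, 4)
print(coeffs)                           # (1, 1, 2, 0)
print(bilinear_eval(coeffs, 0.5, 0.5))  # 2.5
```

At the corners the polynomial reproduces the original values, and the center of the square gets the average of the four neighbors.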

Bicubic Interpolation

We fit a polynomial to the values at the 16 nearest integer coordinates:

$$
p(x, y) = \sum_{i=0}^{3} \sum_{j=0}^{3} a_{ij} x^{i} y^{j}
$$

Then we plug the output coordinates into $p$ to get the pixel values.
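Taking this description literally, and assuming the 16 points form a $4 \times 4$ grid with coordinates $(x_m, y_n)$ and pixel values $f_{mn}$ (notation introduced here for illustration), the fit amounts to solving 16 linear equations for the 16 unknown coefficients $a_{ij}$:

$$
\sum_{i=0}^{3} \sum_{j=0}^{3} a_{ij} x_m^{i} y_n^{j} = f_{mn}, \qquad m, n \in \{0, 1, 2, 3\}
$$

The bilinear case has the same form with 4 points and 4 coefficients. Practical implementations usually evaluate a fixed cubic kernel instead of solving a system per pixel.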

Max Unpooling

Each max unpooling layer is paired with a max pooling layer. During pooling, we remember the position of the max value in each window. Then, during max unpooling, we place each input value at the corresponding remembered position and fill all other positions with zeros.
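The pairing can be sketched as follows in plain Python, assuming non-overlapping $k \times k$ windows (stride equal to `k`). The function names and the example values are illustrative:

```python
def max_pool_with_indices(x, k=2):
    """k x k max pooling with stride k; also returns the (row, col) of each max."""
    h, w = len(x) // k, len(x[0]) // k
    pooled = [[0] * w for _ in range(h)]
    indices = [[None] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            window = [(x[i * k + di][j * k + dj], (i * k + di, j * k + dj))
                      for di in range(k) for dj in range(k)]
            pooled[i][j], indices[i][j] = max(window, key=lambda t: t[0])
    return pooled, indices


def max_unpool(y, indices, out_h, out_w):
    """Place each value of y at its remembered position; zeros elsewhere."""
    out = [[0] * out_w for _ in range(out_h)]
    for i, row in enumerate(y):
        for j, value in enumerate(row):
            r, c = indices[i][j]
            out[r][c] = value
    return out


x = [[1, 2, 6, 3],
     [3, 5, 2, 1],
     [1, 2, 2, 1],
     [7, 3, 4, 8]]
pooled, indices = max_pool_with_indices(x)
print(pooled)  # [[5, 6], [7, 8]]
print(max_unpool([[1, 2], [3, 4]], indices, 4, 4))
# [[0, 0, 2, 0], [0, 1, 0, 0], [0, 0, 0, 0], [3, 0, 0, 4]]
```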

Learnable Upsampling: Transposed Convolution

Introduction

Transposed convolution is an operation that performs learnable upsampling by applying the mathematical transpose of a convolution operation (filter).

Core Concept

Input: Small feature map
Output: Larger feature map
Difference from regular convolution: Expands rather than compresses spatial dimensions

How does it work

Input

[1, 2]
[3, 4]

Step 1: Expand with zeros

We expand the number of rows and columns by a factor of $k$, where $k$ is the stride, filling the new positions with zeros

(stride 2)
[1, 0, 2, 0]
[0, 0, 0, 0]
[3, 0, 4, 0]
[0, 0, 0, 0]

Step 2: Apply Convolution

We then apply the filter to the expanded matrix as in an ordinary convolution. Here a $3 \times 3$ filter with stride 1 and padding 1 produces a $4 \times 4$ output

In transposed convolution, the stride is the factor by which we upsample, while in ordinary convolution it is the factor by which we downsample
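The two steps can be sketched together in plain Python. The all-ones filter is a hypothetical stand-in for learned weights, and "convolution" here follows the deep-learning convention of not flipping the kernel:

```python
def expand_with_zeros(x, stride):
    """Step 1: grow rows and columns by `stride`, filling new positions with 0."""
    h, w = len(x), len(x[0])
    out = [[0] * (w * stride) for _ in range(h * stride)]
    for i in range(h):
        for j in range(w):
            out[i * stride][j * stride] = x[i][j]
    return out


def conv2d_same(x, kernel):
    """Step 2: stride-1 convolution with zero padding that keeps the input size."""
    h, w = len(x), len(x[0])
    k = len(kernel)
    pad = k // 2
    out = [[0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            total = 0
            for di in range(k):
                for dj in range(k):
                    r, c = i + di - pad, j + dj - pad
                    if 0 <= r < h and 0 <= c < w:  # positions outside are zero padding
                        total += x[r][c] * kernel[di][dj]
            out[i][j] = total
    return out


expanded = expand_with_zeros([[1, 2], [3, 4]], stride=2)
kernel = [[1, 1, 1], [1, 1, 1], [1, 1, 1]]  # hypothetical filter, not learned
result = conv2d_same(expanded, kernel)
print(len(result), len(result[0]))  # 4 4
```

The $2 \times 2$ input becomes a $4 \times 4$ output, so a stride of 2 doubles each spatial dimension.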

Convolution as Matrix Multiplication

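We can express convolution in terms of matrix multiplication (1D example with kernel $\vec{x} = (x, y, z)$, input $\vec{a} = (a, b, c, d)$, stride 1, padding 1):

$$
\vec{x} \star \vec{a} = X \vec{a}
$$

$$
\begin{bmatrix}
x & y & z & 0 & 0 & 0 \\
0 & x & y & z & 0 & 0 \\
0 & 0 & x & y & z & 0 \\
0 & 0 & 0 & x & y & z
\end{bmatrix}
\begin{bmatrix} 0 \\ a \\ b \\ c \\ d \\ 0 \end{bmatrix}
=
\begin{bmatrix} ay + bz \\ ax + by + cz \\ bx + cy + dz \\ cx + dy \end{bmatrix}
$$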
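A small numeric check of this identity, as a sketch in plain Python. The helper names `conv_matrix` and `matvec`, and the values chosen for the kernel $(x, y, z) = (1, 2, 3)$ and input $(a, b, c, d) = (1, 2, 3, 4)$, are illustrative assumptions:

```python
def conv_matrix(kernel, padded_len, stride=1):
    """Build X: each row holds the kernel shifted right by `stride` positions."""
    k = len(kernel)
    rows = (padded_len - k) // stride + 1
    X = [[0] * padded_len for _ in range(rows)]
    for r in range(rows):
        for t in range(k):
            X[r][r * stride + t] = kernel[t]
    return X


def matvec(M, v):
    """Matrix-vector product on nested lists."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


kernel = [1, 2, 3]          # (x, y, z)
padded = [0, 1, 2, 3, 4, 0]  # (a, b, c, d) with padding 1 on each side
X = conv_matrix(kernel, len(padded))
print(matvec(X, padded))
# [8, 14, 20, 11], i.e. [ay + bz, ax + by + cz, bx + cy + dz, cx + dy]
```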

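We can also express a transposed convolution with stride 1 as a normal convolution:

$$
\vec{x} \star^{T} \vec{a} = X^{T} \vec{a}
$$

$$
\begin{bmatrix}
x & 0 & 0 & 0 \\
y & x & 0 & 0 \\
z & y & x & 0 \\
0 & z & y & x \\
0 & 0 & z & y \\
0 & 0 & 0 & z
\end{bmatrix}
\begin{bmatrix} a \\ b \\ c \\ d \end{bmatrix}
=
\begin{bmatrix} ax \\ ay + bx \\ az + by + cx \\ bz + cy + dx \\ cz + dy \\ dz \end{bmatrix}
$$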
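One way to see the stride-1 claim numerically: the result of $X^{T} a$ should match a normal convolution that uses the reversed kernel and padding 2. The sketch below assumes kernel $(x, y, z) = (1, 2, 3)$ and input $(1, 2, 3, 4)$; the function names are illustrative:

```python
def conv1d(signal, kernel, padding):
    """Stride-1 1D convolution in the deep-learning sense (no kernel flip)."""
    padded = [0] * padding + list(signal) + [0] * padding
    k = len(kernel)
    return [sum(kernel[t] * padded[i + t] for t in range(k))
            for i in range(len(padded) - k + 1)]


def transposed_conv1d(signal, kernel, stride=1):
    """Scatter form of X^T a: each input value adds a scaled copy of the kernel."""
    k = len(kernel)
    out = [0] * ((len(signal) - 1) * stride + k)
    for i, value in enumerate(signal):
        for t in range(k):
            out[i * stride + t] += value * kernel[t]
    return out


kernel = [1, 2, 3]  # (x, y, z)
a = [1, 2, 3, 4]    # (a, b, c, d)
via_transpose = transposed_conv1d(a, kernel)
via_conv = conv1d(a, kernel[::-1], padding=2)
print(via_transpose)  # [1, 4, 10, 16, 17, 12]
print(via_conv)       # [1, 4, 10, 16, 17, 12]
```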

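However, with stride > 1, a transposed convolution can no longer be expressed as a normal convolution (here stride 2):

$$
\begin{bmatrix}
x & 0 \\
y & 0 \\
z & x \\
0 & y \\
0 & z \\
0 & 0
\end{bmatrix}
\begin{bmatrix} a \\ b \end{bmatrix}
=
\begin{bmatrix} ax \\ ay \\ az + bx \\ by \\ bz \\ 0 \end{bmatrix}
$$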
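The stride-2 case can be checked the same way by building the forward matrix $X$, transposing it, and multiplying. This is a sketch with assumed values $(x, y, z) = (1, 2, 3)$ and $(a, b) = (1, 2)$; the helper names are illustrative:

```python
def conv_matrix(kernel, padded_len, stride=1):
    """Build X: each row holds the kernel shifted right by `stride` positions."""
    k = len(kernel)
    rows = (padded_len - k) // stride + 1
    X = [[0] * padded_len for _ in range(rows)]
    for r in range(rows):
        for t in range(k):
            X[r][r * stride + t] = kernel[t]
    return X


def transpose(M):
    """Swap rows and columns of a nested list."""
    return [list(col) for col in zip(*M)]


def matvec(M, v):
    """Matrix-vector product on nested lists."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


kernel = [1, 2, 3]                               # (x, y, z)
X = conv_matrix(kernel, padded_len=6, stride=2)  # 2 x 6 forward matrix
XT = transpose(X)                                # 6 x 2
print(XT)                  # [[1, 0], [2, 0], [3, 1], [0, 2], [0, 3], [0, 0]]
print(matvec(XT, [1, 2]))  # [1, 2, 5, 4, 6, 0]
```

The rows of `XT` are no longer shifted copies of a single pattern, which is one way to read why no single stride-1 kernel reproduces this product.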

Chilfox

D-DL4CV-Lec16aa-Upsampling
