Introduction
Sementation is to parse objects in an image. Generally, the type of objects are limited and pre-known. The task requires to output the fine-grain bounding of all objects.
The task is well-studied and own strong practical value in productions. Image segmentation could be classified into instance segmentation and semantic segmentation.
Semantic segmentation only cares about the category and need not to distinguish different objects belonging to the same category. On contrast, instance segmentation need to parse the bounding for every objects. Objects belonging to the same category should also be marked different.
Semantic Segmentation
FCN: fully convolutional networks for semantic segmentation
- Maybe the first one to employ full convolution network in segmentation.
- employ pretrain parameters(alexnet, vgg, googlenet) from imagenet classification (parameter initilzation is important)
- fuse deeper and coarse information with previous fine-grain, shadow layer
- other tricks: deconv instead bilinear upsample, dataset imbalance, dataset augmentation, use sbd dataset