본문 바로가기

Others/Information

Computer Vision 연구자들이 알아야 할 20개 이상의 techniques

MS의 Richard Szeliski 박사님과 옥스포드의 Andrew Zisserman 교수님이 이야기하는 "모든 컴퓨터 비젼 연구자들이 알아야 할 20개 이상의 techniques"

 

1. Image formation and optics

2. Image processing, filtering, Fourier analysis

3. Pyramids and wavelets

4. Feature extraction

5. Image matching

6. Bag of words

7. Optical flow

8. Structure from motion

9. Multi view stereo

10.Segmentation

11.Clustering

12.Viola-Jones

13.Bayesian techniques

14.Machine learning

15.RANSAC and robust techniques

16.Numerical methods

17.Optimization

18.Range finding, active illumination

19.Algorithms

20.Graph cuts

21.Dynamic programming

22.Complexity analysis

23.MATLAB and C++. and assembly (optional: GPU programming)

24.Communication and presentation skills

 

Image and features

• NCC

• Interest point operators

• Scale invariant and affine invariant detectors & descriptors

• Scale space

• Image processing, filtering, Fourier analysis

• Pyramids and wavelets

• Edge detection

• Restoration e.g. deblurring, super-resolution

– Linear, e.g. Wiener filter

– MRF

– Non-local means/BM3D/bilateral filter

 

Segmentation, grouping and tracking

• Segmentation

– Normalized cuts

• Grouping

– Hough transforms

• Clustering

– K-means

– Mean-shift

– Pedro-clustering

• Tracking

– Kalmanfilter

– Particle filter

 

Multi-view: stereo, SFM, flow

• RANSAC and other robust techniques

• Geometry:

– epipolar geometry (projective and affine)

– planar homographies

– Affine camera

• Geometry estimators

– 8 point algorithm for F

– 4 point algorithm for H

• Factorization

• Bundle-adjustment

• Flow

– Horn & Schunck L2

– Lucas-Kanade

– L1 regularized

 

Recognition

• Bag of visual words

• HOG, SIFT, GIST

• Spatial pyramid

• Spatial configurations/Pictorial structures

• Sliding window/jumping window

• Cascades

Others

Machine Learning

– Adaboost

– kNN

– SVM

– Random forest

– PCA, ICA, CCA

– EM

– MIL/Latent-SVM

– Regularization

– HMM

– Graphical & Bayesian models

Optimization

– Classical linear and non-linear

– Graph operations

– Dynamic programming/message passing for MAP, max-marginals

– Graph cuts for binary variable MAP

• Texture synthesis