In this course, we will explore this connection between vision and learning. We will cover topics in 1) image texture synthesis; 2) object detection and segmentation; 3) dynamic object tracking; 4) object and scene recognition; 5) human activity recognition and inference.
|
Date |
Topics | Papers | Discussion |
|
1/13 |
Texture: synthesis- a practical guide |
||
|
1/15 |
Texture: analysis-image statistics, similar measure |
Martin
&
Fowlkes & Malik, |
|
|
1/22 |
Texture: synthesis/analysis: probabilistic formulation |
||
|
1/27 |
Object Detection: face detection- statistical approaches |
||
2/3 |
Object Detection: more on boosting & bagging |
||
|
2/5 |
Object Detection: flexible object detection via Graphical Models |
||
|
2/10 |
Object Detection: efficient inference procedures for Graphical models(HMM, Tree, MRF): |
Ghahramani &
Jordan, |
|
|
2/12 |
Object Detection: Learning graphical models from examples |
Song & Goncalves &
Perona Meila & Jordan |
|
|
2/19 |
Object Tracking: Sampling, particle filtering |
||
|
2/24 |
Object Tracking: Markov Chain Monte Carlo(MCMC) methods |
||
|
2/26 |
Image Representation: PCA, ICA, Mixture Models |
Bell & Sejnowski
Roweis & Ghahramani |
|
|
3/3 |
Image Representation: Learning Image Features |
Lee & Seung Stauffer & Grimson |
|
|
3/17 |
Object Recognition: Digit/Face Recognition, Support Vector Machine(SVM), |
||
|
3/19 |
Object Recognition: Neutral Net, Digit Recognition with Shape Context, |
LeCun, Belongie, Malik, Puzicha | |
|
3/24 |
Object Recognition: Multi-class Object Recognition |
Mahamud, Hebert and Lafferty | |
|
3/26 |
Grouping: Object Segmentation: Graph cuts approaches |
Shi, Malik, sharon, Brandt, Basri |
|
|
3/31 |
Grouping: Stereophesis, Image labeling: Markov Random Field, and Graph Cuts |
Ishikawa Geiger, Boykov, Veksler, Zabih | |
|
4/9 |
Grouping: Learning to Segment, Learning with Partial labeling |
Barnard, et. al. | |
|
4/11 |
Grouping: Co-Training, knowledge transfer |
||
|
4/14 |
Action Recognition: Recognizing Human Movements |
||
|
4/16 |
Action Recognition: Learning Grammatical models of Human Actions |
||
|
4/21 |
Action Recognition: Vision and Sound: Lip Reading |
||
|
4/23 |
Action Recognition: Automatic Video Summarization |
||
|
|
Power-law in Large Dataset |
||
|
|
Scene Recognition with Large Dataset |
||
|
final |
Project presentation |
This course consists of three components: