ece4580:module_recognition
This is an old revision of the document!
Table of Contents
Object Recognition
/*
(1) object detector with boosting: http://people.csail.mit.edu/torralba/shortCourseRLOC/boosting/boosting.html
*/
Module #1
Clustering
- Study k-means clustering algorithm and the algorithmic steps for k-means clustering.
- Download (or clone) the clustering skeleton code here
- Implement k-means clustering algorithm working in RGB space by following the algorithmic steps. You are welcome to implement from scratch without skeleton code.
- Test your algorithm on segmenting the image segmentation.jpg using k=3
- Try different random initialization and show corresponding results.
- Comment on your different segmentation results.
Module #2
Object Recognition
- Study the bag-of-words approach for classification/Recognition task
- We begin with implementing a simple but powerful recognition system to classify faces and cars.
- Check here for skeleton code. First, follow the README to setup the dataset and vlfeat library.
- In our implementation, you will find vlfeat library very useful. One may use vl_sift, vl_kmeans and vl_kdtreebuild.
- Now, use first 40 images in both categories for training.
- Extract SIFT features from each image
- Derive k codewords with k-means clustering in module 1.
- Compute histogram of codewords using kd-tree algorithm using vlfeat.
- Use the rest of 50 images in both categories to test your implementation.
- Report the accuracy and computation time with different k
ece4580/module_recognition.1485252723.txt.gz · Last modified: 2024/08/20 21:38 (external edit)