Object Recognition

/*

(1) object detector with boosting: 
http://people.csail.mit.edu/torralba/shortCourseRLOC/boosting/boosting.html 

*/

Module #1

Clustering

  1. Study the k-means clustering algorithm and its algorithmic steps.
  2. Download (or clone) the clustering skeleton code here.
  3. Implement the k-means clustering algorithm in RGB space by following the algorithmic steps. You are welcome to implement it from scratch without the skeleton code.
  4. Test your algorithm by segmenting the image segmentation.jpg with k=3.
  5. Try different random initializations and show the corresponding results.
  6. Comment on how your segmentation results differ across initializations.
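
The steps above can be sketched roughly as follows. This is a minimal illustration in plain Python, not the course skeleton code: the function names `kmeans` and `sq_dist`, the `seed` parameter, and the six sample pixel values are all assumptions made for the example. Loading segmentation.jpg into a list of (R, G, B) tuples is left out, so a handful of hypothetical pixels stand in for the image.

```python
import random

def sq_dist(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kmeans(points, k, seed=0, max_iter=100):
    """Lloyd's k-means on a list of numeric tuples (e.g. RGB pixels).

    Returns (centers, labels). `seed` controls the random initialization.
    """
    rng = random.Random(seed)
    # Initialization: pick k of the input points as starting centers.
    centers = rng.sample(points, k)
    labels = [-1] * len(points)
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest center.
        new_labels = [
            min(range(k), key=lambda j: sq_dist(p, centers[j])) for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the run has converged
        labels = new_labels
        # Update step: each center moves to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous center
                centers[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return centers, labels

# Hypothetical stand-in for image pixels: two reddish, two greenish, two bluish.
pixels = [(250, 10, 10), (240, 20, 15), (10, 245, 12),
          (20, 250, 5), (8, 12, 240), (15, 5, 250)]

# Different seeds give different initial centers and possibly different results.
for seed in (0, 1, 2):
    centers, labels = kmeans(pixels, k=3, seed=seed)
    print("seed", seed, "labels", labels)
```

Because k-means only finds a local optimum, some seeds may group the pixels by color while others merge two color groups and split a third. Comparing the printed labels across seeds is one way to approach steps 5 and 6.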

Module #2

Object Recognition

  1. Study the bag-of-words approach to classification and recognition tasks.
  2. We begin by implementing a simple but powerful recognition system that classifies faces and cars.
  3. Check here for the skeleton code. First, follow the README to set up the dataset and the vlfeat library.
  4. You will find the vlfeat library very useful for this implementation, in particular vl_sift, vl_kmeans and vl_kdtreebuild.
  5. Use the first 40 images in each category for training.
  6. Extract SIFT features from each image.
  7. Derive k codewords using the k-means clustering from Module #1.
  8. Compute the histogram of codewords for each image, using the kd-tree algorithm in vlfeat to find the nearest codeword for each feature.
  9. Use the rest of 50 images in both categories to test your implementation.
  10. Report the accuracy and computation time for different values of k.
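
The histogram and evaluation steps above might look roughly like the sketch below. It is written in plain Python rather than with vlfeat, and it makes several assumptions that are not from the assignment: a brute-force nearest-codeword search replaces the kd-tree, toy 2-D descriptors replace 128-D SIFT descriptors, the codewords are fixed by hand instead of coming from k-means, and a 1-nearest-neighbour rule over histograms serves as the classifier. The names `bow_histogram`, `classify` and `accuracy` are hypothetical.

```python
import time

def sq_dist(p, q):
    """Squared Euclidean distance between two equal-length sequences."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def bow_histogram(descriptors, codewords):
    """Normalized histogram of nearest-codeword counts for one image."""
    counts = [0] * len(codewords)
    for d in descriptors:
        # Brute-force nearest codeword (stand-in for a kd-tree query).
        nearest = min(range(len(codewords)), key=lambda j: sq_dist(d, codewords[j]))
        counts[nearest] += 1
    total = sum(counts) or 1  # avoid dividing by zero for an empty image
    return [c / total for c in counts]

def classify(hist, train_hists, train_labels):
    """Label of the closest training histogram (assumed 1-NN classifier)."""
    best = min(range(len(train_hists)), key=lambda i: sq_dist(hist, train_hists[i]))
    return train_labels[best]

def accuracy(predictions, truths):
    """Fraction of predictions that match the ground-truth labels."""
    return sum(p == t for p, t in zip(predictions, truths)) / len(truths)

# Toy data: k=2 hand-picked codewords and one training image per category.
codewords = [(0.0, 0.0), (10.0, 10.0)]
train_images = [[(0, 1), (1, 0), (9, 9)], [(10, 9), (9, 10), (1, 1)]]
train_labels = ["face", "car"]
test_images = [[(0, 0), (1, 1), (0, 2), (10, 10)], [(9, 9), (10, 11), (0, 0)]]
truths = ["face", "car"]

start = time.perf_counter()
train_hists = [bow_histogram(img, codewords) for img in train_images]
predictions = [
    classify(bow_histogram(img, codewords), train_hists, train_labels)
    for img in test_images
]
elapsed = time.perf_counter() - start
print(f"accuracy={accuracy(predictions, truths):.2f}, time={elapsed:.6f}s")
```

Repeating the timed section for several values of k gives the accuracy and computation-time figures asked for in the last step. With real SIFT descriptors the brute-force search becomes expensive as k grows, which is the motivation for the kd-tree lookup mentioned above.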

ECE4580 Learning Modules

ece4580/module_recognition.1485252723.txt.gz · Last modified: 2024/08/20 21:38 (external edit)