Cascade Trainer GUI
Cascade Trainer GUI is a program that can be used to train, test and improve cascade classifier models. It uses a graphical interface to set the parameters and make it easy to use OpenCV tools for training and testing classifiers.
OpenCV is an open source library that can be used to develop image processing programs. (See http://opencv.org)
1.2. Cascade Classifiers
Cascade Classifiers are model files that are trained using OpenCV to detect different kind of objects in an image. Cascade Classifiers are XML files that OpenCV can produce and use efficiently.
Cascade Trainer GUI is pre-built to be used with 64-bit Windows and Ubuntu operating systems. If you are using Windows version there is nothing required but if you are on Ubuntu, you have to note the following. It includes all required libraries for GUI inside itself (built statically) but it requires OpenCV to be present on the system. Following command in Ubuntu can be used to install and satisfy Cascade Trainer GUI dependency on OpenCV libraries:
sudo apt-get install libopencv-dev
3. How to Use
In this section different parts and functionalities of Cascade Trainer GUI are described. Note that by moving your mouse over to each element on the graphical interface you will get a short description of it at the bottom status bar.
3.1. Training Cascade Classifiers
When Cascade Trainer GUI is first started you will be presented with the following screen. This is the starting screen and it can be used for training classifiers. To train classifiers usually you need to provide the utility with thousands of positive and negative image samples, but there are cases when you can achieve the same with less samples.
To start the training, you need to create a folder for your classifier. Create two folders inside it. One should be “p” (for positive images) and the other should be “n” (for negative images).
For example, you should have a folder named “Car” which has the mentioned folders inside it.
Positive image samples are the images of the object you want to train your classifier and detect.
For example, if you want to train and detect cars then you need to have many car images.
And you should also have many negative images. Negative images are anything but cars.
Important Note 1: Negative images must NEVER include any positive images. Not even partially.
Important Note 2: In theory, negative images can be any image that is not the positive image but in practice, negative images should be relevant to the positive images. For example, using sky images as negative images is a poor choice for training a good car classifier, even though it doesn’t have a bad effect on the overall accuracy:
Start by pressing the Browse button in Train tab. Select the folder you have created for the classifier.
Common, Cascade and Boost tabs can be used for setting numerous parameters for customizing the classifier training. Cascade Trainer GUI sets the most optimized and recommended settings for these parameters by default, still some parameters need to be modified for each training. Note that detailed description of all these parameters are beyond the scope of this help page and require deep knowledge about cascade classification techniques.
You can set pre-calculation buffer size to help with the speed of training process. You can assign as much memory as you can for these but be careful to not assign too much or too low. For example if you have 8 GB of RAM on your computer then you can safely set both of the buffer sizes below to 2048.
Next you need to set the sample width and height. Make sure not to set it to a very big size because it will make your detection very slow. Actually it is quite safe to always set a small value for this. Recommended settings for sample width and height is that you keep one aspect on 24 and set the other accordingly. For example, if you have sample images that are 320×240 then first calculate the aspect ratio which in this case is 1.33:1 then multiply the bigger number with 24. You’d get 32×24 for sample width and height.
You can also set the feature type to HAAR or LBP. Make sure to use HOG only if you have OpenCV 3.1 or later. HAAR classifiers are very accurate but require a lot more time to train so it is much wiser to use LBP if you can provide your classifiers with many sample images. LBP classifiers on the other hand are less accurate but train much quicker and detect almost 3 times faster.
As for the parameters in the Boost tab, it is recommended to keep the default values unless you are quite sure about what you’re doing.
After all the parameters are set, press Start button at the bottom to start training your cascade classifier.
Wait for the training to complete.
Now if you exit Cascade Trainer GUI and go to the classifier folder you will notice that there are new files and folders are created in this folder.
“n” and “p” are familiar but the rest are new. “classifier” folder contains XML files that are created during different stages of training. If you check inside “classifier” folder, you’ll notice something similar to the following.
“stage#.xml” files are temporary files that won’t be needed anymore.
“params.xml” contains the parameters you have used for the training. (Just a reminder file)
“cascade.xml” is the actual cascade classifier and if the training completed successfully then you should have this file inside classifier folder.
“neg.lst”, “pos.lst” and “pos_samples.vec” are temporary files created for training the classifier and they can also be removed without having any effect.
3.2. Testing Cascade Classifiers
To test your classifiers, go to Test tab from the tab bar at the top and set the options as described below and finally press Start.
Select your cascade classifier using the Browse button at the top. You can also manually enter the path to cascade XML file in the Cascade Classifier XML field. This cascade classifier will be used for detection in images and/or videos.
You can select one of the following for Input Settings and you need to set the path according to this option:
• Single Image: A single image will be used as the scene in which detection will be done. In this case Path should point to a single image file. (Only supported images can be selected.)
• Images in a Folder: A folder containing many images will be used for testing. In this case Path should be set to a folder that contains one or more scene images.
• Video: A video file will be used for testing the classifier. In this case Path should point to a video file that will be used as input.
After setting the input settings it is time for output settings. Output type and path should be set according to what was chosen in input settings. See below:
• Images in a Folder: Detected objects will be saved to multiple images inside a folder. In this case Path should be set to a folder. Note that this option can be used with all of the possible options in input.
• Video: A video file with the detected objects highlighted with a red rectangle will be created. In this case Path should point to a video file that will be created after the test completes. Note that this option will only work if Video is selected in the input settings.
After clicking on Start button training will start. Next section contains different examples of Test results.
3.3. Preparing and Training High Quality Classifiers
Use Cropper tool to crop images from the object you want to detect. It provides very simple tools to achieve this. Try to use relevant negative images and always separately train objects that are too much visually different.
Update: Version 3.1.10 includes an improved cropper and viewer tool that allows efficient zoom in/out and cropping functionality.
Currently only 32-bit (x86) and 64-bit (x64) Windows operating systems (Windows 7 and later) and Ubuntu x64 are supported but I’ll make sure to release other Linux and also Mac OS X versions very soon.
Updated: March 13, 2017 (Updated Windows versions to 220.127.116.11 which includes fixed cropper tool)
Updated: November 09, 2016 (Updated Ubuntu version to 2.1.10)
Updated: November 07, 2016 (Many enhancements to cropper, training and testing utilities. Version x64 for Windows is built using Intel TBB and works much faster now for training. It is also the recommended version as of now.)
Updated: October 27, 2016 (Added two more parameters to UI. Neg count and Pos usage percentage.)
Updated: October 19, 2016 (Fixed an issue with file names in positive and negative images folders.)
Make sure to let me know any issues you find in the application and I’ll try to fix it as soon as I can. Also send your comments and questions using Contact Me page.