Classification-using-KMeans-and-SimpleMKL

Very large scale classification based on K-Means clustering & Multi-Kernel SVM(SimpleMKL)

Here, we are going to implement the method proposed in this article, "Very large scale classification based on K-Means clustering & Multi-Kernel SVM(SimpleMKL)" at ACM Digital Library.

Modules:

The code has below modules:

KMeans Clustering
- Select nearest & furthest points of each cluster
Duplicate Removal
- Remove all duplicate data
Outlier Detection
- Remove the last ROT-data based on their outlier score
- Method proposed in this article, "Robust, Scalable Anomaly Detection for Large Collections of Images".
Human Labeling
- Do labeling for the new representative dataset
SimpleMKL
- Multi Kernel SVM
- Method proposed in this article, "Simplemkl".

The method is run on two diffrent types of datasets, large scale & very large scale satasets.

The large scale datasets are:

The very large scale datasets are:

Results can be seen at the end of presentation file uploaded in this repository.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
Articles		Articles
Datasets.ipynb		Datasets.ipynb
Datasets.rar		Datasets.rar
Datasets_final.py		Datasets_final.py
InstanceSelection.ipynb		InstanceSelection.ipynb
Presentation.pdf		Presentation.pdf
README.md		README.md
RandomInstanceSelection.ipynb		RandomInstanceSelection.ipynb
SimpleMKL.ipynb		SimpleMKL.ipynb
Stage3_LibSVM.ipynb		Stage3_LibSVM.ipynb