Genetic Clustering Algorithm

This Python script implements a genetic algorithm for clustering data. The algorithm evolves the cluster assignments of the data points to maximize the silhouette score, a measure of how well-defined the clusters are. The score ranges from -1 to 1, and higher values indicate tighter, better-separated clusters.
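To make the fitness measure concrete, here is a minimal sketch that scores two labelings of the Iris dataset with scikit-learn's silhouette_score. It does not use this repository's code; the variable names and the choice of three clusters are assumptions made for illustration.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.metrics import silhouette_score

iris = load_iris()
X = iris.data  # 150 samples, 4 features

rng = np.random.default_rng(0)
n_clusters = 3  # assumed number of clusters for this illustration

# A candidate solution is one cluster label per data point.
random_labels = rng.integers(0, n_clusters, size=len(X))

# Reference labeling for comparison: the known species of each flower.
species_labels = iris.target

random_fitness = silhouette_score(X, random_labels)
species_fitness = silhouette_score(X, species_labels)

print(f"random labels:  {random_fitness:.3f}")
print(f"species labels: {species_fitness:.3f}")
```

A random assignment scores close to zero, while a labeling that follows the structure of the data scores noticeably higher. That gap is what the genetic search tries to close.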

Getting Started

Prerequisites

Python 3
Required libraries: numpy, pandas, scikit-learn, matplotlib

Installation

Clone the repository:

git clone https://github.com/parvvaresh/clustering-with-genetic
cd clustering-with-genetic

Install the required dependencies:

pip install -r requirements.txt

Usage

Run the test_iris.py script to execute the genetic clustering algorithm on the default Iris dataset. To cluster your own data, update the script to load your dataset instead.

python3 test_iris.py

Algorithm Overview

The genetic clustering algorithm consists of the following components:

Genetic Class

Defines the genetic operations such as mutation, generation, and fitness calculation.
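The three operations named above could look roughly like the following. This is a sketch under stated assumptions, not the repository's Genetic class: the function names generate, mutate, and fitness, the per-label mutation rate, and the -1.0 penalty for single-cluster labelings are all hypothetical choices.

```python
import numpy as np
from sklearn.metrics import silhouette_score

rng = np.random.default_rng(42)

def generate(n_points, n_clusters):
    """Create one random individual: a cluster label for every data point."""
    return rng.integers(0, n_clusters, size=n_points)

def mutate(individual, n_clusters, rate=0.1):
    """Return a copy in which each label is re-drawn with probability `rate`."""
    mutant = individual.copy()
    mask = rng.random(len(mutant)) < rate
    mutant[mask] = rng.integers(0, n_clusters, size=mask.sum())
    return mutant

def fitness(X, individual):
    """Silhouette score of the labeling, or -1.0 if fewer than two clusters are used."""
    if len(np.unique(individual)) < 2:
        return -1.0  # silhouette_score is undefined for a single cluster
    return silhouette_score(X, individual)

# Synthetic data: two tight, well-separated blobs of 10 points each.
X = np.vstack([rng.normal(0.0, 0.1, (10, 2)), rng.normal(5.0, 0.1, (10, 2))])

parent = generate(len(X), n_clusters=2)
child = mutate(parent, n_clusters=2)
ideal = np.array([0] * 10 + [1] * 10)

print("parent fitness:", fitness(X, parent))
print("child fitness: ", fitness(X, child))
print("ideal fitness: ", fitness(X, ideal))
```

Guarding against single-cluster individuals matters because mutation can, in principle, collapse every label into one cluster, and the silhouette score is not defined in that case.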

Cluster Class

Manages the clustering process, including the initialization of populations, evolution, and convergence.

Main Script

Utilizes the genetic and clustering classes to run the algorithm on a given dataset.

Parameters

size_population: Number of individuals in the population.
goal: The desired fitness score to achieve.
repeat: Number of generations to run the algorithm.
is_mutation: Boolean flag to enable or disable mutation.
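One way these four parameters might interact in an evolution loop is sketched below. The evolve function, its elitist selection scheme, and the helper functions are assumptions made for illustration, not the repository's implementation. A toy fitness (the fraction of labels matching a fixed target) stands in for the silhouette score so that the sketch needs only the standard library.

```python
import random

def evolve(fitness, new_individual, mutate, size_population=20,
           goal=1.0, repeat=100, is_mutation=True, seed=0):
    """Run a simple elitist genetic loop; return (best individual, fitness history)."""
    random.seed(seed)
    population = [new_individual() for _ in range(size_population)]
    best, history = None, []
    for _ in range(repeat):
        # Rank individuals from fittest to least fit.
        population.sort(key=fitness, reverse=True)
        best = population[0]
        history.append(fitness(best))
        if history[-1] >= goal:  # stop early once the goal is reached
            break
        # Keep the better half and refill the rest with copies of survivors.
        survivors = population[: size_population // 2]
        children = [list(random.choice(survivors))
                    for _ in range(size_population - len(survivors))]
        if is_mutation:
            children = [mutate(child) for child in children]
        population = survivors + children
    return best, history

# Toy problem: recover a fixed labeling of 12 points into 2 clusters.
N = 12
target = [0] * 6 + [1] * 6

def toy_fitness(individual):
    return sum(a == b for a, b in zip(individual, target)) / N

def new_individual():
    return [random.randrange(2) for _ in range(N)]

def flip_one(individual):
    mutant = list(individual)
    i = random.randrange(N)
    mutant[i] = 1 - mutant[i]  # move one point to the other cluster
    return mutant

best, history = evolve(toy_fitness, new_individual, flip_one, goal=1.0, repeat=200)
print(f"stopped after {len(history)} generations, best fitness {history[-1]:.2f}")
```

Because the survivors are carried over unchanged, the best fitness never decreases from one generation to the next. With is_mutation=False the children are plain copies, so in this sketch the best fitness stays at its initial value.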

Results

The script outputs the progress of the algorithm, including the generation number and the fitness score achieved. Additionally, a plot of the fitness scores over generations is displayed at the end of the execution.
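The kind of output described above can be reproduced with a few lines of matplotlib. The fitness values below are made-up placeholders, not results from this project, and the file name fitness_history.png is likewise an assumption.

```python
import matplotlib
matplotlib.use("Agg")  # headless backend; remove this line to use an interactive window
import matplotlib.pyplot as plt

# Placeholder best-fitness values, one per generation (illustrative only).
history = [0.12, 0.21, 0.30, 0.34, 0.41, 0.41, 0.47]

# Progress report: generation number and fitness score.
for generation, score in enumerate(history, start=1):
    print(f"generation {generation}: fitness = {score:.2f}")

# Fitness over generations.
fig, ax = plt.subplots()
ax.plot(range(1, len(history) + 1), history, marker="o")
ax.set_xlabel("Generation")
ax.set_ylabel("Fitness (silhouette score)")
ax.set_title("Fitness over generations")
fig.savefig("fitness_history.png")
```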

License

This project is licensed under the MIT License - see the LICENSE.md file for details.

Acknowledgments

This implementation is inspired by genetic algorithms and clustering techniques.
Special thanks to the scikit-learn library for providing the silhouette score metric.
