dm-group1-project

Project Structure

The project consists of two parts:

Part A: Classification Problem

Our tasks include developing and optimizing predictive models based on various criteria, such as accuracy, interpretability, lift for the top 30% of cases, and cost of misclassification. We also explore clustering analysis to identify potential patterns among the students.

Tasks:

Task A1: Develop an accurate predictive model.
Task A2: Generate an explanatory predictive model, preferably a Decision Tree with 4-6 rules.
Task A3: Develop a predictive model focused on obtaining the highest lift for the top 30% of cases.
Task A4: Construct a model taking into account the different costs associated with misclassification.
Task A5: Perform clustering analysis to identify three natural groups of students based on the interval and ordinal input variables.

Part B: Regression Problem

In this part, we work with the DMABASE.CSV dataset, using LOGSALAR as the target variable. Our tasks involve generating predictive models, performing clustering segmentation, and constructing decision trees.

Tasks:

Task B1: Generate the 'best' predictive model using Average Square Error as the model assessment measure.
Task B2: Perform clustering segmentation and provide an explanation for the clusters.
Task B3: Partition the continuous variable LOGSALAR into three intervals and generate the best decision tree.

Usage

Please follow the Jupyter notebooks in order for each task. Each notebook is self-contained and includes the plan, execution, and results for the task.

Note: In case of any discrepancies, the Jupyter notebook takes precedence.

Contributions

All team members actively participated in developing and executing the project, contributing to different aspects such as data understanding, data preparation, model building, evaluation, and interpretation.

We appreciate any feedback or suggestions to improve our work.

Name		Name	Last commit message	Last commit date
Latest commit History 97 Commits
datasets		datasets
docs		docs
previous_codes		previous_codes
.gitignore		.gitignore
Final_Project.ipynb		Final_Project.ipynb
IST340_Final_Project_Report_Team1.pdf		IST340_Final_Project_Report_Team1.pdf
LICENSE		LICENSE
README.md		README.md
issue_sheet.txt		issue_sheet.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

dm-group1-project

Project Structure

Part A: Classification Problem

Tasks:

Part B: Regression Problem

Tasks:

Usage

Contributions

About

Releases

Packages

Contributors 3

Languages

License

MSWinds/dm-group1-project

Folders and files

Latest commit

History

Repository files navigation

dm-group1-project

Project Structure

Part A: Classification Problem

Tasks:

Part B: Regression Problem

Tasks:

Usage

Contributions

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages