Skip to content

A personal project to understand and implement basic data science algorithms

Notifications You must be signed in to change notification settings

malik-ust/data_science_algorithms

 
 

Repository files navigation

data_science_algorithms

A codebase to implement basic ML algorthims from scratch. Feel free to add your suggestions to the TODO lists.

TODO lists

Data processing

  • MinMax Scaler
  • Standard Scaler
  • PCA
  • Kernel PCA

ALgorithms

  • Linear regression
  • Logistic regression
  • Classification
  • Clustering
  • Feature Engineering
  • Random Forests
  • Decision Tree
  • Naive Bayes
  • Support Vector Machine
  • K-Nearest Neighbors
  • Gradient Boosting Machine​
  • Gaussian Process

Statistical techniques

  • Hypothesis testing
  • Confidence Interval
  • Regression Analysis
  • Dimensionality Reduction
  • ANOVA
  • f-test
  • chi-squre test
  • t-test

Data analysis libraries

  • pandas
  • numpy
  • scipy
  • scikit-learn
  • xgboost

Deep learning

  • Neural Networks
  • CNN
  • Tensorflow
  • Keras

Kaggle challenges

About

A personal project to understand and implement basic data science algorithms

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 97.9%
  • Python 2.0%
  • Other 0.1%