Mahaboob Sheik skmahaboob

👨‍💻 Mahaboob Sheik | Data Engineer

Welcome to my GitHub

I'm Mahaboob Sheik, a passionate Data Engineer with over 3 years of experience in transforming complex datasets into actionable insights. My expertise spans across big data technologies, cloud platforms, and advanced data processing tools, driving efficient data solutions that empower businesses to thrive.

🛠️ Technical Skills

🖥️ Languages

Python 🐍
PySpark ⚡
SQL 🗃️
Scala 🔍

💾 Databases

SQL Server 🛢️
Postgres 🐘
MongoDB 🍃

☁️ Big Data & Cloud Technologies

Apache Spark ✨
Hadoop 🐘
Databricks 🚀
Azure Data Factory 🏭
ADLS Gen 2 💾
Google Cloud Platform (GCP) ☁️

🛠️ DevOps & CI/CD

Azure DevOps 🚀
Docker 🐳
CI/CD Pipelines 🔄
Apache Airflow 🌬️

🧰 Other Tools & Technologies

Kafka 🔗
Snowflake ❄️
StreamSets 🌐
Linux 🐧
Data Modeling 📊

🚀 Key Projects & Achievements

1. Scalable Big Data Pipelines

Role: Lead Data Engineer
Technologies: Spark, Hadoop, Azure Databricks
Description: Designed and managed scalable big data pipelines handling over 50 terabytes monthly. Improved query performance by 20% using distributed computing technologies.

2. CI/CD Pipeline Automation

Role: DevOps Lead
Technologies: Azure DevOps, StreamSets, Docker
Description: Developed fully automated CI/CD pipelines, reducing deployment time by 50%. Facilitated seamless migration of 40+ pipelines from Development to QA with minimal downtime.

3. Resource Optimization on GCP

Role: Data Engineer
Technologies: Google Cloud Platform (GCP), Apache Spark
Description: Optimized GCP resources by implementing Storage Lifecycle Management, reducing costs by over 10% annually and boosting operational efficiency.

👥 Leadership & Team Management

Successfully led a cross-functional team of 10 members, achieving a 20% increase in on-time project completions.
Focus on collaboration, continuous learning, and collective success.

🎓 Education & Certifications

Bachelor of Technology in Electronics and Communication Engineering
- SRKR Engineering College, 2017-2021
Certifications
- Microsoft Certified Azure Data Engineer Associate (DP-203) 🎓
- Microsoft Certified Azure Fundamentals (AZ-900) 🎓
- StreamSets White Belt Certification 🥋

🌱 Current Learning & Interests

Current Projects: Working on a new Data Engineering project using Google Cloud Platform (GCP).
Learning: Kafka 🔗 and Snowflake ❄️.

🤝 Let's Connect!

LinkedIn: Mahaboob Sheik 🌐
Email: MahaboobSheik26@gmail.com 📧
Phone: +91-9182912647 📞

🌐 Website

Visit my live portfolio website at https://skmahaboob.github.io to learn more about me, my skills, and my work.

Feel free to explore the repository and contact me if you have any questions or would like to collaborate on a project!

Contact: MahaboobSheik26@gmail.com 📧

Provide feedback

Saved searches

Use saved searches to filter your results more quickly