This repository is a structured collection of my Data Science learning journey, covering fundamentals, hands-on practice, and projects across multiple tools and technologies.
It is intended for learning, practice, and reference purposes.
This repository includes learning materials and practical implementations related to:
- Excel-based data analysis
- Python programming for data science
- SQL for data querying and analysis
- Machine Learning algorithms and models
- End-to-end projects
- Datasets and learning resources used
The content reflects progressive learning, starting from basics and moving toward applied projects.
- Data cleaning and preprocessing
- Basic to advanced Excel functions
- Pivot tables and analytical summaries
- Exploratory analysis using spreadsheets
- Core Python concepts
- Data structures and control flow
- Practice programs for logic building
- Python for data analysis basics
Libraries commonly used:
- NumPy
- Pandas
- Matplotlib / Seaborn
- Basic SQL queries
- Filtering, sorting, and aggregation
- Joins and subqueries
- Real-world analytical queries
- SQL applied to data analysis use cases
- Supervised learning models
- Linear Regression
- Logistic Regression
- Naive Bayes
- KNN
- Decision Trees
- Support Vector Machines
- HyperParameter Tuning
- Unsupervised learning basics
- Model evaluation metrics
- Data preprocessing and feature scaling
- Train-test split and cross-validation
- ANN
- CNN
- RNN
- End-to-end projects including problem statement & datasets used
- Python Finder (Python)
- Blogging Website (SQL)
- Placement Prediction
- Publicly available datasets
- Practice datasets for ML and SQL
- Cleaned and raw versions
- Learning notes
- Reference materials(Textbooks)
- Useful links and documentation
- To document my data science learning journey
- To practice and revise concepts regularly
- To build a strong foundation in analytics and machine learning
- To serve as a reference repository for future projects and interviews
- Excel
- Python
- SQL
- Jupyter Notebook
- Machine Learning libraries (scikit-learn, pandas, numpy)
- Deep Learning (Tensorflow with Keras)
- Add advanced machine learning projects
- Include deep learning implementations
- Integrate real-time data analysis projects
- Improve documentation and explanations
For suggestions, improvements, or collaboration, feel free to connect via GitHub.