
Data Science (21-Day Program Outline)
Week 1: Core Python, Data Handling, and Statistics (Days 1–7)
Day 1 – Intro to Data Science
- What is Data Science?
- Lifecycle: Data Collection → Cleaning → Analysis → Modeling → Deployment
- Tools: Jupyter, Python, Git, Excel
Day 2 – Python for Data Science
- Data types, loops, functions
- List, Dict, Tuple, Set
- File handling
Day 3 – NumPy & Pandas Basics
- Arrays and matrix operations (NumPy)
- Series & DataFrame (Pandas)
- Importing CSV, Excel
Day 4 – Data Cleaning & Preprocessing
- Handling missing values
- Duplicates, nulls
- Renaming, replacing, mapping
Day 5 – Exploratory Data Analysis (EDA)
- Descriptive statistics
- Groupby, sorting, filtering
- Hands-on: Titanic dataset
Day 6 – Data Visualization
- Matplotlib, Seaborn
- Plot types: histogram, bar, scatter, box, heatmap
- Hands-on: Correlation analysis
Day 7 – Statistics for Data Science
- Mean, median, mode, std dev, variance
- Probability basics
- Distributions: normal, binomial
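The Day 7 topics can be tried out with Python's built-in statistics module before moving to NumPy. The sketch below is only an illustration: the scores list is made-up sample data, and binomial_pmf is a helper name introduced here, not part of any library.

```python
import statistics
from math import comb

# Hypothetical sample data: exam scores for a small class.
scores = [70, 80, 80, 90, 100]

mean = statistics.mean(scores)          # arithmetic average
median = statistics.median(scores)      # middle value of the sorted data
mode = statistics.mode(scores)          # most frequent value
variance = statistics.variance(scores)  # sample variance (divides by n - 1)
std_dev = statistics.stdev(scores)      # sample standard deviation


def binomial_pmf(k: int, n: int, p: float) -> float:
    """Probability of exactly k successes in n independent trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


# A normal distribution that uses the sample's mean and standard deviation.
normal = statistics.NormalDist(mu=mean, sigma=std_dev)

print(mean, median, mode, variance)  # 84 80 80 130
print(binomial_pmf(2, 4, 0.5))       # 0.375
print(normal.cdf(mean))              # 0.5 (half the mass lies below the mean)
```

Note that statistics.variance and statistics.stdev give the sample versions; statistics.pvariance and statistics.pstdev give the population versions.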
Week 2: Advanced Stats, ML Algorithms, and Model Building (Days 8–14)
Day 8 – Inferential Stats & Hypothesis Testing
- Confidence intervals
- t-test, chi-square, ANOVA
- p-value explained
Day 9 – Linear Regression
- Simple and multiple regression
- R², adjusted R²
- Hands-on: House price prediction
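As a rough illustration of the Day 9 formulas, here is a minimal least-squares sketch in plain Python. The function names and the tiny data set are assumptions made for this example; in the course itself a library such as Scikit-learn would normally do the fitting.

```python
def fit_simple_regression(x, y):
    """Least-squares slope and intercept for y = slope * x + intercept."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def r_squared(y, y_hat):
    """Share of the variance in y explained by the predictions y_hat."""
    mean_y = sum(y) / len(y)
    ss_res = sum((a - b) ** 2 for a, b in zip(y, y_hat))
    ss_tot = sum((a - mean_y) ** 2 for a in y)
    return 1 - ss_res / ss_tot


def adjusted_r_squared(r2, n, p):
    """Penalise R² for the number of predictors p, given n samples."""
    return 1 - (1 - r2) * (n - 1) / (n - p - 1)


# Hypothetical data: house size (100 m² units) vs. price (arbitrary units).
sizes = [1, 2, 3, 4]
prices = [2, 4, 5, 8]

slope, intercept = fit_simple_regression(sizes, prices)
predicted = [slope * s + intercept for s in sizes]
r2 = r_squared(prices, predicted)
print(slope, intercept, r2, adjusted_r_squared(r2, n=len(sizes), p=1))
```

Adjusted R² is always at most R², and the gap grows as more predictors are added relative to the number of samples.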
Day 10 – Classification: Logistic Regression
- Binary vs multi-class classification
- Sigmoid function
- Evaluation: Confusion matrix, ROC curve
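The sigmoid function and the confusion matrix from Day 10 can be sketched in a few lines. Everything below is illustrative: the raw scores and labels are invented, and the 0.5 threshold is an assumed default rather than a rule.

```python
import math


def sigmoid(z: float) -> float:
    """Map any real-valued score to a probability between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))


def confusion_matrix(y_true, y_pred):
    """Return (tn, fp, fn, tp) for binary labels encoded as 0 and 1."""
    tn = fp = fn = tp = 0
    for actual, predicted in zip(y_true, y_pred):
        if actual == 1 and predicted == 1:
            tp += 1
        elif actual == 1 and predicted == 0:
            fn += 1
        elif actual == 0 and predicted == 1:
            fp += 1
        else:
            tn += 1
    return tn, fp, fn, tp


# Hypothetical linear scores (w·x + b) from a model, with the true labels.
raw_scores = [-2.0, -0.5, 0.3, 1.5]
y_true = [0, 1, 0, 1]

probabilities = [sigmoid(z) for z in raw_scores]
y_pred = [1 if p >= 0.5 else 0 for p in probabilities]

tn, fp, fn, tp = confusion_matrix(y_true, y_pred)
tpr = tp / (tp + fn)  # true positive rate (recall)
fpr = fp / (fp + tn)  # false positive rate
print((tn, fp, fn, tp), tpr, fpr)  # (1, 1, 1, 1) 0.5 0.5
```

Each threshold gives one (FPR, TPR) pair; sweeping the threshold from 0 to 1 and plotting those pairs traces out the ROC curve.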
Day 11 – Decision Trees & Random Forest
- Splitting criteria: Gini, Entropy
- Overfitting, pruning
- Hands-on: Loan approval prediction
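The two splitting criteria listed for Day 11 can be computed directly from class counts. This is a small sketch with invented "approve"/"reject" labels; the helper names are assumptions for illustration only.

```python
from collections import Counter
from math import log2


def gini(labels):
    """Gini impurity: 1 minus the sum of squared class proportions."""
    n = len(labels)
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def entropy(labels):
    """Shannon entropy in bits of the class distribution."""
    n = len(labels)
    return -sum((count / n) * log2(count / n) for count in Counter(labels).values())


def weighted_gini(left, right):
    """Impurity of a split: child impurities weighted by child size."""
    n = len(left) + len(right)
    return (len(left) * gini(left) + len(right) * gini(right)) / n


# Hypothetical loan decisions at a tree node, before and after a split.
node = ["approve", "approve", "reject", "reject"]
left, right = ["approve", "approve"], ["reject", "reject"]

print(gini(node), entropy(node))   # 0.5 1.0
print(weighted_gini(left, right))  # 0.0 (both children are pure)
```

A tree-building algorithm would pick the split with the lowest weighted impurity; a perfectly pure split like the one above is also the kind of fit that pruning guards against on noisy data.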
Day 12 – KNN & Naive Bayes
- Distance metrics in KNN
- Bayes' theorem and Gaussian NB
- Hands-on: Email spam detection
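To make the role of distance metrics in KNN concrete, the sketch below classifies a point by majority vote among its nearest neighbours. The two-feature "ham"/"spam" points are made up for illustration, and knn_predict is a hypothetical helper, not a library function.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Straight-line distance between two points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    """Sum of absolute coordinate differences."""
    return sum(abs(x - y) for x, y in zip(a, b))


def knn_predict(train, query, k=3, distance=euclidean):
    """Return the majority label among the k training points nearest to query."""
    neighbours = sorted(train, key=lambda item: distance(item[0], query))[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]


# Hypothetical features per email: (number of links, number of "free" mentions).
train = [
    ((1, 1), "ham"), ((1, 2), "ham"), ((2, 1), "ham"),
    ((8, 8), "spam"), ((9, 8), "spam"), ((8, 9), "spam"),
]

print(euclidean((0, 0), (3, 4)), manhattan((0, 0), (3, 4)))  # 5.0 7
print(knn_predict(train, (2, 2)))                            # ham
print(knn_predict(train, (7, 9), distance=manhattan))        # spam
```

Because both metrics depend on feature scale, features are usually standardised before running KNN on real data.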
Day 13 – Unsupervised Learning
- K-means clustering
- Elbow method
- PCA for dimensionality reduction
Day 14 – Model Evaluation & Tuning
- Cross-validation
- GridSearchCV, RandomizedSearchCV
- Bias-variance tradeoff
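The idea behind k-fold cross-validation is simply to rotate which slice of the data is held out. The sketch below shows only the index bookkeeping, without shuffling; k_fold_indices is a name made up for this example. In practice Scikit-learn's cross-validation utilities handle this, and GridSearchCV and RandomizedSearchCV run the same loop once per candidate parameter setting.

```python
def k_fold_indices(n_samples: int, k: int):
    """Yield (train_idx, test_idx) pairs for k contiguous, unshuffled folds."""
    # Spread any remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        stop = start + size
        test_idx = list(range(start, stop))
        train_idx = [i for i in range(n_samples) if i < start or i >= stop]
        yield train_idx, test_idx
        start = stop


# Every sample lands in the test set exactly once across the folds.
for train_idx, test_idx in k_fold_indices(n_samples=5, k=2):
    print("train:", train_idx, "test:", test_idx)
```

Averaging the model's score over all folds gives a steadier estimate than a single train/test split.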
Week 3: Projects, Real-World Tools & Career Prep (Days 15–21)
Day 15 – Time Series Analysis
- Date/time handling
- Rolling mean, autocorrelation
- Forecasting with ARIMA (brief)
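Rolling means and autocorrelation are simple enough to write by hand, which helps in seeing what the Pandas equivalents compute. The daily sales figures below are invented, and both function names are assumptions for this sketch.

```python
from datetime import date, timedelta


def rolling_mean(values, window):
    """Mean of each consecutive run of `window` values."""
    return [
        sum(values[i - window + 1 : i + 1]) / window
        for i in range(window - 1, len(values))
    ]


def autocorrelation(values, lag):
    """Correlation of the series with a copy of itself shifted by `lag` steps."""
    n = len(values)
    mean = sum(values) / n
    denominator = sum((v - mean) ** 2 for v in values)
    numerator = sum(
        (values[t] - mean) * (values[t + lag] - mean) for t in range(n - lag)
    )
    return numerator / denominator


# Hypothetical daily sales, with a date attached to each observation.
start = date(2024, 1, 1)
sales = [1, 2, 3, 4, 5]
days = [start + timedelta(days=i) for i in range(len(sales))]

print(days[0].isoformat(), days[-1].isoformat())  # 2024-01-01 2024-01-05
print(rolling_mean(sales, window=3))              # [2.0, 3.0, 4.0]
print(autocorrelation(sales, lag=1))
```

The rolling mean has window - 1 fewer points than the input because the first full window only becomes available at that position.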
Day 16 – Natural Language Processing (NLP)
- Text cleaning (tokenize, stopwords, stemming)
- TF-IDF
- Sentiment analysis mini project
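A bare-bones version of the text-cleaning and TF-IDF steps might look like the following. It assumes a tiny hand-written stopword list and the plain idf = log(N / df) formula; Scikit-learn's TfidfVectorizer uses a smoothed variant by default, so its numbers will differ. Stemming is left out to keep the sketch short.

```python
import math
import re
from collections import Counter

# Tiny illustrative stopword list; real projects use a much longer one.
STOPWORDS = {"the", "a", "is", "and"}


def tokenize(text):
    """Lowercase the text, split it into words, and drop stopwords."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOPWORDS]


def tf_idf(docs):
    """Return one {term: weight} dict per document."""
    tokenized = [tokenize(doc) for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append(
            {
                term: (count / total) * math.log(n_docs / doc_freq[term])
                for term, count in counts.items()
            }
        )
    return weights


# Hypothetical two-review corpus.
reviews = ["The movie is great", "The movie is terrible"]
for weights in tf_idf(reviews):
    print(weights)
```

Words that appear in every document, such as "movie" here, get a weight of zero, while words that distinguish one review from another ("great", "terrible") get positive weights.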
Day 17 – SQL for Data Science
- SELECT, WHERE, JOIN, GROUP BY
- Subqueries
- Practice with sample database (e.g., SQLite or MySQL)
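The Day 17 queries can be practiced without installing a database server, since Python ships with the sqlite3 module. The customers and orders tables below are a made-up sample schema used only for this sketch.

```python
import sqlite3

# An in-memory SQLite database: nothing is written to disk.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL);
    INSERT INTO customers VALUES (1, 'Asha'), (2, 'Ben');
    INSERT INTO orders VALUES (1, 1, 50.0), (2, 1, 70.0), (3, 2, 20.0);
    """
)

# SELECT + JOIN + WHERE + GROUP BY: total spend per customer.
totals = conn.execute(
    """
    SELECT c.name, SUM(o.amount) AS total
    FROM customers AS c
    JOIN orders AS o ON o.customer_id = c.id
    WHERE o.amount > 10
    GROUP BY c.name
    ORDER BY total DESC
    """
).fetchall()

# Subquery: customers who placed at least one order above 60.
big_spenders = conn.execute(
    """
    SELECT name FROM customers
    WHERE id IN (SELECT customer_id FROM orders WHERE amount > 60)
    """
).fetchall()

conn.close()
print(totals)        # [('Asha', 120.0), ('Ben', 20.0)]
print(big_spenders)  # [('Asha',)]
```

The same SQL statements should work largely unchanged in MySQL; only the connection code differs.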
Day 18 – Working with Real Datasets
- Kaggle datasets
- End-to-end EDA + model
- Hands-on: Diabetes prediction / Customer churn
Day 19 – Mini Capstone Project
Choose 1:
- Sales prediction
- Fake news detection
- Movie recommendation system
- Smart city traffic analysis
Day 20 – Model Deployment
- Save model with Pickle/Joblib
- Flask/Streamlit web app
- Deploy to Heroku (or local server)
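Saving and reloading a model with Pickle follows the same pattern regardless of the model type. In this sketch a plain dict of parameters stands in for a trained model, and the file name model.pkl is an arbitrary choice; a Scikit-learn estimator would be dumped and loaded the same way, and Joblib offers similar dump and load functions.

```python
import pickle
import tempfile
from pathlib import Path

# Stand-in for a trained model: parameters of y = slope * x + intercept.
model = {"slope": 2.0, "intercept": 1.0}


def predict(params, x):
    """Apply the stored linear model to a single input value."""
    return params["slope"] * x + params["intercept"]


# A temporary directory keeps the example from leaving files behind.
with tempfile.TemporaryDirectory() as tmp_dir:
    model_path = Path(tmp_dir) / "model.pkl"

    # Serialize the model to disk (binary mode is required).
    with open(model_path, "wb") as f:
        pickle.dump(model, f)

    # Later, for example inside a Flask or Streamlit app, load it back.
    with open(model_path, "rb") as f:
        restored = pickle.load(f)

print(predict(restored, 3.0))  # 7.0
```

Only unpickle files from sources you trust, because loading a pickle can execute arbitrary code.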
Day 21 – Career in Data Science
- Resume tips, GitHub portfolio
- Data science roles: Analyst, ML engineer, DS
- Certifications, interview prep (case studies, SQL/ML Qs)
🧰 Tools & Libraries:
- Python (Jupyter Notebook)
- NumPy, Pandas, Matplotlib, Seaborn
- Scikit-learn
- SQL (SQLite / MySQL)
- Streamlit or Flask for deployment
- Kaggle for datasets
Schedule
