This repository contains my machine learning learning journey and will be continuously updated as I learn and implement new concepts.
- From API - Fetching data from REST APIs
- Web Scraping - Extracting data from websites
- From CSV - Working with CSV files
- Numerical Data Preprocessing - Preprocessing numerical datasets
- Handle Missing Values - Techniques for dealing with missing data
- Handle Imbalanced Data - Methods for handling imbalanced datasets
- Text Data Preprocessing - Text cleaning and feature extraction
- Text Data Preprocessing 2 - Advanced text processing techniques
- Standardization - Z-score normalization (mean=0, std=1)
- Normalization - Min-Max scaling
- One Hot Encoding - Converting categorical to binary columns
- Ordinal & Label Encoding - Converting categories to integers
- Binarization - Converting numerical data to binary
- Discretization - Binning continuous variables
- Column Transformer - Apply different transformations to different columns
- Function Transformer - Custom transformations using functions
- Power Transformer - Box-Cox and Yeo-Johnson transformations
- Titanic Without Pipeline - Manual preprocessing steps
- Titanic With Pipeline - Using sklearn Pipeline
- Predict Without Pipeline - Manual prediction workflow
- Predict With Pipeline - Prediction using Pipeline
- Diabetes Prediction - Binary classification for diabetes prediction
- Sleep Disorder Prediction - Multi-class classification for sleep disorders
- Fake News Detection - NLP classification for fake news
- Wine Quality Prediction - Multi-class classification for wine quality
- Sonar Rocks vs Mine - Binary classification using sonar data
- Loan Status Prediction - Predicting loan approval status
- House Price Prediction - Regression model for house prices
- House Price Prediction 2 - Advanced regression techniques
- Car Price Prediction - Regression model for car prices
- Gold Price Prediction - Time series prediction for gold prices
- Pokemon Data Analysis - Comprehensive EDA on Pokemon dataset
- Python - Primary programming language
- pandas - Data manipulation and analysis
- scikit-learn - Machine learning algorithms and preprocessing
- numpy - Numerical computations
- matplotlib/seaborn - Data visualization
- requests/BeautifulSoup - Web scraping and API calls
├── README.md
├── requirements.txt
├── data_gathering/
│ ├── from_api.ipynb
│ ├── web_scrapping.ipynb
│ └── with_csv.ipynb
├── data_preprocessing/
│ ├── numerical_ds_preprocessing.ipynb
│ ├── handle_missing_values.ipynb
│ ├── handle_imbalanced_dp.ipynb
│ ├── text_ds_preprocessing.ipynb
│ └── text_ds_preprocessing2.ipynb
├── feature engineering/
│ ├── feature scaling/
│ │ ├── standaization.ipynb
│ │ └── normalization.ipynb
│ ├── encode categorical data/
│ │ ├── one_hot_encoding.ipynb
│ │ └── ordinal_and_label_encoding.ipynb
│ ├── encoding numerical data/
│ │ ├── binarization.ipynb
│ │ └── discritization.ipynb
│ ├── transformer/
│ │ ├── column_transformer.ipynb
│ │ ├── function_transformer.ipynb
│ │ └── power_transformer.ipynb
│ └── pipelines/
│ ├── titanic_without_pipeline.ipynb
│ ├── titanic_with_pipeline.ipynb
│ ├── predict_without_pipeline.ipynb
│ └── predict_with_pipeline.ipynb
└── projects/
├── diabeties_prediction.ipynb
├── Sleep Disorder Prediction.ipynb
├── fake_news_prediction.ipynb
├── wine_quality_prediction.ipynb
├── sonar_rocks_vs_mine_predition.ipynb
├── loan_status_prediction.ipynb
├── house_price_prediction.ipynb
├── house_price_prediction2.ipynb
├── price_card_prediction.ipynb
├── gold_price_prediction.ipynb
└── pokemon.ipynb
- Clone the repository
- Install dependencies:
pip install -r requirements.txt - Open any notebook to explore the learning materials
This repository represents my ongoing journey in machine learning.