This project focuses on predicting an individual's income category using machine learning techniques. The model is trained on demographic and economic features to classify whether a person earns above or below a certain income threshold.
- Python β Core programming language
- Pandas & NumPy β Data preprocessing and manipulation
- Scikit-Learn β Machine learning model development
- Matplotlib & Seaborn β Data visualization
- Jupyter Notebook β Experimentation and analysis
This project includes the following steps:
- Data Cleaning & Preprocessing: Handling missing values, encoding categorical variables, and feature scaling
- Exploratory Data Analysis (EDA): Understanding feature relationships and distributions
- Model Training & Evaluation: Implementing various machine learning algorithms
- Hyperparameter Tuning: Optimizing model performance
- Prediction & Insights: Analyzing model outputs and key influencing factors
data/β Contains raw and processed datasetsnotebooks/β Jupyter Notebooks for analysis and experimentationmodels/β Saved trained modelsscripts/β Python scripts for data processing and trainingREADME.mdβ Project documentation and description
- Clone the Repository:
git clone https://github.com/Utkuersy/IncomePredictionMachineLearningProject.git
- Install Dependencies:
pip install -r requirements.txt
- Run Jupyter Notebook (for exploration and training):
jupyter notebook
- Execute Python Scripts (for automated processing and training):
python scripts/train_model.py