In this project, we built a machine learning model to predicts a patient's risk of lung cancer and asthma emergency department visits using air quality data.
Created a web application for physicians to predict a patient's risk of lung cancer, given the air quality in their region, and other risk factors (i.e., smoking, gender, age, diet). If the patient suffers from asthma, physicians can also predict their rate of emergency department visits for asthma attacks given the air quality of their region.
- California Department of Public Health.
- Open Data. Asthma ED Visit Rates
- US EPA Air Data. Annually Summary Data
- Ahmad, Ahmad S., and Ali M. Mayya. “A new tool to predict lung cancer based on risk factors.” Heliyon 6.2 (2020): e03402
- Air Now API for air quality index: https://docs.airnowapi.org/
- Historical air quality index data: https://aqs.epa.gov/aqsweb/airdata/download_files.html#AQI
Used StandardScaler to scale the training and testing sets.
Created a training set from the 2019 loans using pd.get_dummies() to convert the categorical data to numeric columns. Similarly, create a testing set from the 2020 loans, also using pd.get_dummies()
Created different classification models to predict patient’s risk of developing lung cancer. Different models are:
- Decision Tree Classifier
- Random Forest Classifier
- Extremely Random Trees
- Boosting- AdaBoostClassifier
Created different Regression models to predict patient’s emergency department visit rate for severe Asthma attacks. Different models are:
- Linear Regression
- Support Vector Machine
- Decision Tree
- Random Forest
- Extremely Random Trees
- Boosting- AdaBoostRegressor
Used Flask templating to create a website to predict risk of lung cancer and asthma emergency department visits using air quality data. The saved model was used for the prediction.
Healthcare tool is designed to assess a patient’s respiratory health based on their local air quality. To begin patient's zip code need to enter. The Air Quality Index category (i.e., Good, Moderate, Unhealthy) for the patient’s region will populate the display.
The “Model Prediction for Cancer” model is used to predict a patient's risk of developing lung cancer (Low, Medium, High).
The “Model Prediction for Asthma” model is used to predict the patient’s emergency department visit rate for severe asthma attacks.