Option C: Machine Learning Final Project
Benjamin Hislop: 900896194
The goal of this project is to predict student dropout risk based on various demographic, socioeconomic, and academic factors. By identifying students at high risk early, institutions can intervene and provide necessary support.
This application uses a Random Forest Classifier to sort students into distinct categories: Dropout, Enrolled, or Graduate.
Dataset: Predict Students' Dropout and Academic Success (UCI Machine Learning Repository)
Key features include Marital Status, Course, Qualifications (Mother/Father), GDP, Unemployment Rate, Tuition Fees status, and Age at enrollment.
We chose Random Forest because:
Performance: The model achieves ~81% accuracy on the test set, with a strong ability to distinguish between Dropouts and Graduates.
The web application provides two main modes of interaction:
Fill out a form with a student's details to get an immediate risk assessment. Input validation ensures data quality before processing.
Upload a CSV file containing data for multiple students. The app will: