Predicting Customer Booking Behaviour

Aug 28, 2023

Project Title: Predicting Customer Booking Behaviour

Tools and Libraries used: Python, jupyter notebook, pandas, ydata_profiling,matplotlib,seaborn,scikit-learn,imblearn,XGBoost

GitHub Repository: The full analysis code can be found here- Link

Approach:

  • Loaded customer booking dataset using pandas.

  • Generated data profiling report with ydata_profiling - Link.

  • Selected relevant features and split data into train and test sets.

  • Trained Logistic Regression model for feature importance.

  • Evaluated XGBoost classifier model for accuracy.

  • Visualized feature importances using bar plot.

  • Identified top 5 important features.

Results:

  • The XGBoost model attains an accuracy of 0.8512, which is 85%.

  • Features positively correlated with booking completion - Extra baggage, preferred seat, in-flight meals and flight hour.

  • Features negatively correlated with booking purchase - Purchase lead, length of stay and flight duration.

Visualization: Plot of Feature importance's

Conclusion:

  • Passenger preferences for baggage, seating, in-flight meals, flight timing, and purchase lead time significantly impact booking decisions.

If you want to reach out!