SMU Data Science Review
Abstract
Using the physicochemical properties of wine to predict quality has been done in numerous studies. Given the nature of these properties, the data is inherently skewed. Previous works have focused on handful of sampling techniques to balance the data. This research compares multiple sampling techniques in predicting the target with limited data. For this purpose, an ensemble model is used to evaluate the different techniques. There was no evidence found in this research to conclude that there are specific oversampling methods that improve random forest classifier for a multi-class problem.
Recommended Citation
Burigo, Robert; Frazier, Scott; Kravez, Eli; and Lohia, Nibhrat
(2023)
"Comparison of Sampling Methods for Predicting Wine Quality Based on Physicochemical Properties,"
SMU Data Science Review: Vol. 7:
No.
1, Article 8.
Available at:
https://scholar.smu.edu/datasciencereview/vol7/iss1/8
Creative Commons License
This work is licensed under a Creative Commons Attribution-Noncommercial 4.0 License