This project aims to classify the quality of wines using Self-Organizing Maps (SOM), an unsupervised machine learning algorithm. The dataset used for this analysis is the "Red Wine Quality" dataset sourced from the UCI Machine Learning Repository. The dataset contains various physicochemical properties of red wines, along with their quality ratings provided by experts.
Importance
Wine quality classification is essential for both wine producers and consumers. Accurately assessing wine quality based on its physicochemical attributes is crucial for production optimization, quality control, and informed consumer choices. Self-Organizing Maps offer an effective method for visualizing and clustering complex data, making them suitable for wine quality classification.
The "Red Wine Quality" dataset consists of 1,599 instances of red wines, each described by 11 physicochemical attributes. These attributes include fixed acidity, volatile acidity, citric acid, residual sugar, chlorides, free sulfur dioxide, total sulfur dioxide, density, pH, sulphates, and alcohol content. The quality rating ranges from 0 to 10, representing wine quality on a discrete scale.
- Fixed acidity
- Volatile acidity
- Citric acid
- Residual sugar
- Chlorides
- Free sulfur dioxide
- Total sulfur dioxide
- Density
- pH
- Sulphates
- Alcohol
- Quality (score between 0 and 10)