Wine Quality Datasets

These datasets are publicly available for research purposes only.
The details are described in [Cortez et al., 2009]: [©Elsevier] [Pre-press (pdf)] [bib].
Please include this citation if you plan to use these datasets:

P. Cortez, A. Cerdeira, F. Almeida, T. Matos and J. Reis.
Modeling wine preferences by data mining from physicochemical properties.
In Decision Support Systems, Elsevier, 47(4):547-553, 2009.

The data can be used to test (ordinal) regression or classification methods; in effect, this is a multi-class task
where the classes are ordered. Other research issues include feature selection and outlier detection.
The data includes two datasets:

  • winequality-red.csv – red wine preference samples;

  • winequality-white.csv – white wine preference samples;

The datasets are available here: winequality.zip
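As a rough illustration of how one of these files might be read and its quality score used either as a regression target or as an ordered class label, here is a minimal Python sketch. It rests on assumptions not stated on this page: that the CSV files are semicolon-delimited, that they have a header row, and that the score column is named "quality". The helper load_wine_rows and the SAMPLE rows are hypothetical, and the sample values are made up for illustration, not taken from the real files.

```python
import csv
import io


def load_wine_rows(text, delimiter=";"):
    """Parse delimited text with a header row into (features, quality) pairs.

    Assumes (not confirmed by the dataset page) a ';' delimiter and a
    column named "quality" holding an integer score.
    """
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    rows = []
    for record in reader:
        quality = int(record.pop("quality"))
        features = {name: float(value) for name, value in record.items()}
        rows.append((features, quality))
    return rows


# Made-up values for illustration only; NOT rows from the real files.
SAMPLE = '"alcohol";"pH";"quality"\n9.5;3.3;5\n11.0;3.1;7\n10.2;3.2;6\n'

rows = load_wine_rows(SAMPLE)

# Regression view: treat the score as a numeric target.
regression_targets = [float(quality) for _, quality in rows]

# Classification view: the distinct scores form ordered classes.
class_labels = sorted({quality for _, quality in rows})
```

To try this on a real file, the text could be read with something like open("winequality-red.csv", newline="").read() and passed to load_wine_rows, adjusting the delimiter or column name if the assumptions above do not hold.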

    Vinho verde is a unique product from the Minho (northwest) region of Portugal. Medium in alcohol, it is particularly appreciated for its freshness (especially in the summer). More details can be found at: http://www.vinhoverde.pt/en/
