machine learning

sklearn에 있는 연습용 dataset

민동기 2020. 7. 19. 21:53
728x90

scikit-learn.org/stable/datasets/index.html

7.2. Toy datasets

scikit-learn comes with a few small standard datasets that do not require to download any file from some external website.

They can be loaded using the following functions:

load_boston(\*[, return_X_y])

Load and return the boston house-prices dataset (regression).

load_iris(\*[, return_X_y, as_frame])

Load and return the iris dataset (classification).

load_diabetes(\*[, return_X_y, as_frame])

Load and return the diabetes dataset (regression).

load_digits(\*[, n_class, return_X_y, as_frame])

Load and return the digits dataset (classification).

load_linnerud(\*[, return_X_y, as_frame])

Load and return the physical excercise linnerud dataset.

load_wine(\*[, return_X_y, as_frame])

Load and return the wine dataset (classification).

load_breast_cancer(\*[, return_X_y, as_frame])

Load and return the breast cancer wisconsin dataset (classification).

These datasets are useful to quickly illustrate the behavior of the various algorithms implemented in scikit-learn. They are however often too small to be representative of real world machine learning tasks.