load_iris
- sklearn.datasets.load_iris(*, return_X_y=False, as_frame=False)
Load and return the iris dataset (classification).
The iris dataset is a classic and very easy multi-class classification dataset.
Classes: 3
Samples per class: 50
Samples total: 150
Dimensionality: 4
Features: real, positive
Read more in the User Guide.
Changed in version 0.20: Fixed two wrong data points according to Fisher’s paper. The new version is the same as in R, but not as in the UCI Machine Learning Repository.
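The summary figures above (3 classes, 50 samples per class, 150 samples, 4 real positive features) can be checked directly against the loaded arrays. The following is a minimal sketch, assuming scikit-learn and NumPy are installed; the variable names are illustrative only.

```python
import numpy as np
from sklearn.datasets import load_iris

iris = load_iris()

# Total samples and dimensionality come from the shape of the data matrix.
n_samples, n_features = iris.data.shape

# Targets are integer class indices, so bincount gives samples per class.
class_counts = np.bincount(iris.target)

# "real, positive" features: every measurement should be strictly above zero.
all_positive = bool((iris.data > 0).all())

print(n_samples, n_features)
print(class_counts)
print(all_positive)
```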
- Parameters:
- return_X_y : bool, default=False
If True, returns (data, target) instead of a Bunch object. See below for more information about the data and target object.
Added in version 0.18.
- as_frame : bool, default=False
If True, the data is a pandas DataFrame including columns with appropriate dtypes (numeric). The target is a pandas DataFrame or Series depending on the number of target columns. If return_X_y is True, then (data, target) will be pandas DataFrames or Series as described below.
Added in version 0.23.
- Returns:
- data : Bunch
Dictionary-like object, with the following attributes.
- data: {ndarray, DataFrame} of shape (150, 4)
The data matrix. If as_frame=True, data will be a pandas DataFrame.
- target: {ndarray, Series} of shape (150,)
The classification target. If as_frame=True, target will be a pandas Series.
- feature_names: list
The names of the dataset columns.
- target_names: list
The names of target classes.
- frame: DataFrame of shape (150, 5)
Only present when as_frame=True. DataFrame with data and target.
Added in version 0.23.
- DESCR: str
The full description of the dataset.
- filename: str
The path to the location of the data.
Added in version 0.20.
- (data, target) : tuple if return_X_y is True
A tuple of two ndarrays. The first is a 2D array of shape (n_samples, n_features), with each row representing one sample and each column representing one feature. The second is an array of shape (n_samples,) containing the target values.
Added in version 0.18.
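The two default return forms described above can be compared side by side. This is a short sketch, assuming scikit-learn and NumPy are installed; it only uses the attributes listed in this section, and the variable names are illustrative.

```python
import numpy as np
from sklearn.datasets import load_iris

# Default call: a dictionary-like Bunch with named attributes.
bunch = load_iris()
print(bunch.data.shape, bunch.target.shape)
print(bunch.feature_names)
print(list(bunch.target_names))

# Bunch attributes are also reachable as dictionary keys.
same_array = bunch["data"] is bunch.data

# return_X_y=True: a plain (data, target) tuple instead of a Bunch.
X, y = load_iris(return_X_y=True)
print(X.shape, y.shape)

# Both forms carry the same values.
arrays_match = np.array_equal(X, bunch.data) and np.array_equal(y, bunch.target)
```

The tuple form is convenient when only the arrays are needed, for example when passing them straight to an estimator; the Bunch form keeps the feature and class names alongside the data.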
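With as_frame=True the same data comes back as pandas objects. The sketch below assumes pandas is installed in addition to scikit-learn; the per-class mean at the end is only an illustration of what the combined frame makes easy, not part of the loader itself.

```python
from sklearn.datasets import load_iris

# as_frame=True: data is a DataFrame, target is a Series,
# and frame combines both into a single DataFrame.
iris = load_iris(as_frame=True)
df = iris.frame
print(df.shape)
print(list(df.columns))

# Combined with return_X_y=True, the tuple holds pandas objects as well.
X_df, y_ser = load_iris(return_X_y=True, as_frame=True)
print(type(X_df).__name__, type(y_ser).__name__)

# Illustrative use of the frame: mean of each feature per class index.
mean_by_class = df.groupby("target").mean()
print(mean_by_class)
```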
Examples
Let’s say you are interested in the samples 10, 25, and 50, and want to know their class name.
>>> from sklearn.datasets import load_iris
>>> data = load_iris()
>>> data.target[[10, 25, 50]]
array([0, 0, 1])
>>> list(data.target_names)
[np.str_('setosa'), np.str_('versicolor'), np.str_('virginica')]
See Principal Component Analysis (PCA) on Iris Dataset for a more detailed example of how to work with the iris dataset.
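The doctest above shows the class indices and the list of class names separately. To get the class name of each selected sample, the indices can be used to look up target_names directly. A minimal sketch, assuming target_names is a NumPy array as the output above suggests:

```python
from sklearn.datasets import load_iris

data = load_iris()

# Integer class indices for samples 10, 25, and 50.
sample_ids = [10, 25, 50]
class_ids = data.target[sample_ids]

# Index the array of names with the class indices to get one name per sample.
class_names = [str(name) for name in data.target_names[class_ids]]
print(class_names)  # ['setosa', 'setosa', 'versicolor']
```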
Gallery examples
- Plot the decision surface of decision trees trained on the iris dataset
- Comparison of LDA and PCA 2D projection of Iris dataset
- Factor Analysis (with rotation) to visualize patterns
- Principal Component Analysis (PCA) on Iris Dataset
- Plot the decision boundaries of a VotingClassifier
- Plot the decision surfaces of ensembles of trees on the iris dataset
- Gaussian process classification (GPC) on iris dataset
- Multiclass Receiver Operating Characteristic (ROC)
- Receiver Operating Characteristic (ROC) with cross validation
- Test with permutations the significance of a classification score
- Comparing Nearest Neighbors with and without Neighborhood Components Analysis
- Compare Stochastic learning strategies for MLPClassifier
- Decision boundary of semi-supervised classifiers versus SVM on the Iris dataset
- Plot different SVM classifiers in the iris dataset