example datasets in sklearn
Source: Internet    Editor: 程序博客网    Time: 2024/06/05 06:29
- sklearn.datasets: Datasets
- make_* ⇒ generators (synthesize a dataset on demand)
- load_* ⇒ loaders (return a dataset bundled with the library)
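The difference between the two families is easiest to see from what they return. The following is a minimal sketch, assuming a reasonably recent scikit-learn: a generator hands back a plain (X, y) pair of arrays, while a loader hands back a dictionary-like Bunch that carries the data together with its metadata.

```python
from sklearn.datasets import load_iris, make_moons

# Generator: synthesizes samples on demand and returns an (X, y) tuple.
X_gen, y_gen = make_moons(n_samples=200, random_state=123)
print(X_gen.shape, y_gen.shape)

# Loader: returns a dictionary-like Bunch holding data and metadata.
iris = load_iris()
print(sorted(iris.keys()))

# Loaders can also return just the arrays when the metadata is not needed.
X_iris, y_iris = load_iris(return_X_y=True)
print(X_iris.shape, y_iris.shape)
```

Passing random_state to a generator makes the synthetic data reproducible between runs.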
1. nonlinear example datasets
1.1 half_moon
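Generates a nonlinear dataset, for example to test how well kernel methods perform.
The ultimate job of a kernel method here is to unfold the half-moons.
    import matplotlib.pyplot as plt
    from sklearn.datasets import make_moons

    # two interleaving half circles, 200 points in total
    X, y = make_moons(n_samples=200, shuffle=True, random_state=123)
    plt.scatter(X[y==0, 0], X[y==0, 1], color='r', marker='^', alpha=.4)
    plt.scatter(X[y==1, 0], X[y==1, 1], color='b', marker='o', alpha=.4)
    plt.show()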
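One way to use the half-moons as a kernel test is to compare a linear and a nonlinear kernel on the same points. This is only a sketch: the choice of SVC, the value gamma=2.0, and scoring on the training data itself are illustrative assumptions, so the numbers say nothing about generalization.

```python
from sklearn.datasets import make_moons
from sklearn.svm import SVC

X, y = make_moons(n_samples=200, shuffle=True, random_state=123)

# A linear kernel can only draw a straight boundary through the moons.
linear_acc = SVC(kernel="linear").fit(X, y).score(X, y)

# An RBF kernel can bend the boundary; gamma=2.0 is an illustrative choice.
rbf_acc = SVC(kernel="rbf", gamma=2.0).fit(X, y).score(X, y)

print(f"linear kernel training accuracy: {linear_acc:.3f}")
print(f"RBF kernel training accuracy:    {rbf_acc:.3f}")
```

Because the tip of each moon sits inside the arc of the other, no straight line separates the two classes, which is why the linear kernel cannot reach a perfect score here.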
1.2 concentric circles
    import matplotlib.pyplot as plt
    from sklearn.datasets import make_circles

    # factor is the ratio of the inner circle's radius to the outer one's
    X, y = make_circles(n_samples=1000, noise=.1, factor=.2, random_state=123)
    plt.scatter(X[y==0, 0], X[y==0, 1], color='r', marker='^', alpha=.4)
    plt.scatter(X[y==1, 0], X[y==1, 1], color='b', marker='o', alpha=.4)
    plt.show()
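The concentric circles make the idea behind a nonlinear mapping easy to see by hand. The sketch below does not use a kernel at all: it adds one hand-written feature, the squared distance from the origin, and cuts on it. The threshold radius of 0.6 is an assumed midpoint between the two rings, not a fitted value.

```python
import numpy as np
from sklearn.datasets import make_circles

X, y = make_circles(n_samples=1000, noise=.1, factor=.2, random_state=123)

# Squared distance from the origin: a hand-written nonlinear feature.
r2 = (X ** 2).sum(axis=1)

# The inner circle (label 1) has radius about 0.2 and the outer circle
# (label 0) about 1.0, so a cut at radius 0.6 splits them.
pred = (r2 < 0.6 ** 2).astype(int)
accuracy = (pred == y).mean()
print(f"accuracy of the radius threshold: {accuracy:.3f}")
```

In the original two coordinates no straight line separates the rings, but along this single extra feature a simple threshold does.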
2. datasets in sklearn
from sklearn import datasets
- iris
>>> iris = datasets.load_iris()
>>> dir(iris)
>>> iris.feature_names
['sepal length (cm)', 'sepal width (cm)', 'petal length (cm)', 'petal width (cm)']
>>> iris.target_names
array(['setosa', 'versicolor', 'virginica'], dtype='<U10')
>>> iris.data.shape
(150, 4)    # 150 samples, 4 features each
>>> iris.target.shape
(150,)      # 1-D array, one class label per sample
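The attributes above fit together as follows: target holds integer labels that index into target_names, and each column of data corresponds to one entry of feature_names. A small sketch of that relationship:

```python
import numpy as np
from sklearn import datasets

iris = datasets.load_iris()

# Count samples per class and map the integer labels back to names.
counts = np.bincount(iris.target)
for name, count in zip(iris.target_names, counts):
    print(f"{name}: {count} samples")

# The first row of the data matrix, labeled with its feature names.
first_row = dict(zip(iris.feature_names, iris.data[0]))
print(first_row)
```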
- digits
>>> digits = datasets.load_digits()
>>> dir(digits)
>>> digits.target_names
...
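Unlike iris, the digits Bunch exposes the same pixels in two layouts. The sketch below checks that relationship: images keeps each sample as an 8x8 grid, and data is the same grid flattened to 64 values per row.

```python
import numpy as np
from sklearn import datasets

digits = datasets.load_digits()

# Each sample is an 8x8 grayscale image; `data` holds the same pixels flattened.
print(digits.images.shape)
print(digits.data.shape)

# Flattening the images by hand reproduces the data matrix.
flattened = digits.images.reshape(len(digits.images), -1)
same = np.array_equal(flattened, digits.data)
print(same)

# The class names are simply the digits themselves.
print(digits.target_names)
```

The flattened data matrix is what most estimators expect, while images is convenient for plotting a sample with plt.imshow.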
3. UCI data
Breast Cancer Wisconsin dataset
The dataset contains 569 samples of malignant and benign tumor cells.
The first two columns store the unique ID numbers of the samples and the corresponding diagnoses (M = malignant, B = benign), respectively.
Columns 3-32 contain 30 real-valued features computed from digitized images of the cell nuclei, which can be used to build a model that predicts whether a tumor is benign or malignant.
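    import pandas as pd

    # wdbc.data has no header row
    df = pd.read_csv('https://archive.ics.uci.edu/ml/machine-learning-databases/'
                     'breast-cancer-wisconsin/wdbc.data', header=None)
    # columns 2 onward are the 30 features; column 1 is the diagnosis (M/B)
    X, y = df.iloc[:, 2:].values, df.iloc[:, 1].values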
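When downloading is not an option, scikit-learn also bundles a breast cancer dataset with the same 569 samples and 30 features, and the string diagnoses from the CSV can be turned into integers with LabelEncoder. The sketch below shows both; the y_raw array is a made-up stand-in for the diagnosis column, not real data from the file.

```python
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.preprocessing import LabelEncoder

# Bundled copy: no download needed.
cancer = load_breast_cancer()
print(cancer.data.shape)
print(cancer.target_names)

# Encoding string diagnoses as integers. `y_raw` is a hypothetical sample
# standing in for the M/B column read from wdbc.data.
y_raw = np.array(["M", "B", "B", "M"])
encoder = LabelEncoder()
y_encoded = encoder.fit_transform(y_raw)
print(encoder.classes_, y_encoded)
```

Note that the two encodings differ: LabelEncoder sorts the labels, so B becomes 0 and M becomes 1, whereas the bundled dataset uses 0 for malignant and 1 for benign.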