Seaborn 的示例数据集（load_dataset）

相信大家在学习GroupBy，或者数据透视表时，都有可能会碰到类似下面的一行代码：

import seaborn as sns
planets = sns.load_dataset('planets')

然后就可以发现planets已经存储了数据了，那么这些数据到底是从哪里来的呢？

我们查看一下load_dataset的docstring：

In [54]: sns.load_dataset??
Signature: sns.load_dataset(name, cache=True, data_home=None, **kws)
Source:
def load_dataset(name, cache=True, data_home=None, **kws):
    """Load a dataset from the online repository (requires internet).
    Parameters
    ----------
    name : str
        Name of the dataset (`name`.csv on
        https://github.com/mwaskom/seaborn-data).  You can obtain list of
        available datasets using :func:`get_dataset_names`
    cache : boolean, optional
        If True, then cache data locally and use the cache on subsequent calls
    data_home : string, optional
        The directory in which to cache data. By default, uses ~/seaborn-data/
    kws : dict, optional
        Passed to pandas.read_csv
    """

可以看到docstring的第一行就说明了这个函数是从在线存储库加载数据集的（需要互联网）。

网址：我是GitHub

下面就是可以在线或取得数据集啦（可以用来做练习哦）

相关阅读:
Python中Random随机数返回值方式
SQL跨库查询
正则表达式基本语法
excel VBA使用教程
使用某些Widows API时，明明包含了该头文件，却报错“error C2065: undeclared identifier”
电脑开机后数字键盘为关闭状态
编译Boost 详细步骤适用 VC6 VS2003 VS2005 VS2008 VS2010
变量作用域，不能理解，先记下
解决MySQL 在 Java 检索遇到timestamp空值时报异常的问题
Annotation

原文地址：https://www.cnblogs.com/lskreno/p/10844263.html