Python: is pandas a good way to query a database?

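Start the IPython notebook and load the pylab environment (this is the legacy IPython command; newer installations launch the notebook with jupyter notebook and enable inline plots with %matplotlib inline):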

ipython notebook --pylab=inline

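Pandas provides IO tools that can read large files in chunks. In a quick performance test, fully loading 98 million rows took only about 263 seconds, which is quite good.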

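import pandas as pd

# iterator=True returns a reader object instead of loading the whole file at once
reader = pd.read_csv('data/servicelogs', iterator=True)
try:
    # Request up to 100,000,000 rows; fewer come back if the file is shorter
    df = reader.get_chunk(100000000)
except StopIteration:
    # Raised only when there are no rows left to read
    print("Iteration is stopped.")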
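The snippet above asks for all rows in a single get_chunk call. As a rough sketch of the chunked reading mentioned earlier, the chunksize parameter of read_csv can be used to iterate over the file piece by piece. The in-memory CSV, its column names, and the helper load_in_chunks are hypothetical stand-ins for illustration, not part of the original test.

```python
import io
import pandas as pd

# Hypothetical stand-in for 'data/servicelogs': a tiny in-memory CSV
csv_text = "service,latency_ms\n" + "\n".join(
    f"svc{i % 3},{i * 10}" for i in range(10)
)

def load_in_chunks(source, chunk_size):
    """Read a CSV piece by piece and concatenate the pieces into one DataFrame."""
    chunks = []
    # With chunksize set, read_csv returns an iterator of DataFrames
    for chunk in pd.read_csv(source, chunksize=chunk_size):
        chunks.append(chunk)
    return pd.concat(chunks, ignore_index=True)

df = load_in_chunks(io.StringIO(csv_text), chunk_size=4)
print(df.shape)
```

When the full dataset does not fit in memory, each chunk can be filtered or aggregated inside the loop instead of being appended to a list.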

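Since the question in the title is about querying a database, here is a minimal sketch of the same idea with pandas.read_sql_query. It assumes an in-memory SQLite database; the table name servicelogs and its columns are made up for illustration, and a real setup would connect to its own database server instead.

```python
import sqlite3
import pandas as pd

# Hypothetical in-memory database standing in for a real server
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE servicelogs (service TEXT, latency_ms INTEGER)")
conn.executemany(
    "INSERT INTO servicelogs VALUES (?, ?)",
    [("api", 120), ("api", 80), ("db", 300), ("db", 100), ("cache", 5)],
)
conn.commit()

# Let the database do the filtering; pandas receives only the matching rows
slow = pd.read_sql_query(
    "SELECT service, latency_ms FROM servicelogs WHERE latency_ms >= ?",
    conn,
    params=(100,),
)

# chunksize turns the result into an iterator of DataFrames
total_rows = 0
for chunk in pd.read_sql_query("SELECT * FROM servicelogs", conn, chunksize=2):
    total_rows += len(chunk)

conn.close()
print(len(slow), total_rows)
```

Pushing the WHERE clause into the query keeps the transferred data small, while chunksize bounds how many rows are held in a single DataFrame at a time.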

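Pandas [1] is a data analysis package for Python. It was originally developed at AQR Capital Management starting in April 2008 and was open-sourced at the end of 2009. It is now developed and maintained by the PyData development team, which focuses on Python data packages, and it is part of the PyData project. Pandas was originally built as a tool for financial data analysis, which is why it offers strong support for time series analysis.

The name pandas comes from panel data and Python data analysis. Panel data is a term from economics for multidimensional datasets; pandas also provided a Panel data type (deprecated in version 0.20 and since removed).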

Feel free to share; when reposting, please credit the source: 内存溢出

Original URL: http://outofmemory.cn/sjk/6699660.html
