python 爬取代理IP网站_python

概述#爬取代理IP数据importrequestsfromlxmlimportetree#url='https://www.xicidaili.comn/'url='http://ip.yqie.com/proxygaoni/'headers={'User-Agent':'Mozilla/5.0(WindowsNT10.0;WOW64)AppleWebKit/537.36

#爬取代理IP数据import requestsfrom lxml import etree# url = 'https://www.xicIDaili.com/nn/'url = 'http://ip.yqIE.com/proxygaoni/'headers = {    'User-Agent':'Mozilla / 5.0(windows NT 10.0; WOW64) AppleWebKit / 537.36(KHTML, like Gecko) Chrome / 72.0.3626.81 Safari / 537.36 SE 2.X MetaSr 1.0'}res = requests.get(url= url,headers =headers)if res.status_code ==200 :    response = res.content.decode('utf-8')    res_HTML = etree.HTML(response)    ips = res_HTML.xpath('//table[@ID="GrIDVIEwOrder"]//tr/td[2]/text()')    ports = res_HTML.xpath('//table[@ID="GrIDVIEwOrder"]//tr/td[3]/text()')    data = List(zip(ips,ports))    for i in data:        print(i)    print(len(data))

总结

以上是内存溢出为你收集整理的python 爬取代理IP网站全部内容，希望文章能够帮你解决python 爬取代理IP网站所遇到的程序开发问题。

如果觉得内存溢出网站内容还不错，欢迎将内存溢出网站推荐给程序员好友。

欢迎分享，转载请注明来源：内存溢出

原文地址: http://outofmemory.cn/langs/1185555.html