http://blog.csdn.net/yueguanghaidao/article/details/26449911
2014
I remember that back when I wrote crawlers, I edited /etc/hosts directly to avoid repeated DNS lookups. Recently I came across an elegant solution; my notes after adapting it are below (the code is Python 2):
import socket

_dnscache = {}

def _setDNSCache():
    """
    Makes a cached version of socket.getaddrinfo to avoid subsequent DNS requests.
    """
    def _getaddrinfo(*args, **kwargs):
        global _dnscache
        if args in _dnscache:
            print str(args) + " in cache"
            return _dnscache[args]
        else:
            print str(args) + " not in cache"
            # fall back to the real resolver, then memoize the result
            _dnscache[args] = socket._getaddrinfo(*args, **kwargs)
            return _dnscache[args]

    # patch only once: stash the original resolver, then replace it
    if not hasattr(socket, '_getaddrinfo'):
        socket._getaddrinfo = socket.getaddrinfo
        socket.getaddrinfo = _getaddrinfo

def test():
    _setDNSCache()
    import urllib
    urllib.urlopen('http://www.baidu.com')
    urllib.urlopen('http://www.baidu.com')

test()
The output:
('www.baidu.com', 80, 0, 1) not in cache
('www.baidu.com', 80, 0, 1) in cache
This approach is neat, but it has drawbacks:
1. It only patches socket.getaddrinfo; socket.gethostbyname and socket.gethostbyname_ex still resolve through the normal path.
2. It only affects the current process, whereas editing /etc/hosts affects every program, including ping.
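Drawback 1 can be worked around by patching the other resolver entry points the same way. Below is my own sketch of that idea in Python 3 (the function name set_dns_cache and the _orig_* backup attributes are illustrative, not from the post), using functools.lru_cache in place of the hand-rolled dict:

```python
import functools
import socket

# Sketch (Python 3): extend the post's idea to all three resolver
# entry points, memoizing each with functools.lru_cache.
# Names here are illustrative assumptions, not the post's code.
def set_dns_cache(maxsize=1024):
    for name in ("getaddrinfo", "gethostbyname", "gethostbyname_ex"):
        backup = "_orig_" + name
        if not hasattr(socket, backup):           # patch only once
            original = getattr(socket, name)
            setattr(socket, backup, original)     # keep the real resolver
            setattr(socket, name,
                    functools.lru_cache(maxsize=maxsize)(original))
```

After calling set_dns_cache(), repeated lookups of the same host are served from the in-process cache. Note that lru_cache entries never expire, so a long-running crawler may also want a TTL, which this sketch does not implement.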