I tried:
curl http://www.windowsphone.com/en-US/apps?list=free
The result is:
<html><head><title>Object moved</title></head><body>
<h2>Object moved to <a href="https://login.live.com/login.srf?wa=wsignin1.0&rpsnv=11&checkda=1&ct=1320735308&rver=6.1.6195.0&wp=MBI&wreply=http:%2F%2Fwww.windowsphone.com%2Fen-US%2Fapps%3Flist%3Dfree&lc=1033&id=268289">here</a>.</h2>
</body></html>
Or, in Python:
import random
import socket
import urllib2  # Python 2 standard library

def download(source_url):
    try:
        # Apply a 10-second timeout to all new sockets.
        socket.setdefaulttimeout(10)
        # Pick a random User-Agent so the request looks like a browser.
        agents = [
            'Mozilla/4.0 (compatible; MSIE 5.5; Windows NT 5.0)',
            'Mozilla/4.0 (compatible; MSIE 7.0b; Windows NT 5.1)',
            'Microsoft Internet Explorer/4.0b1 (Windows 95)',
            'Opera/8.00 (Windows NT 5.1; U; en)',
        ]
        ree = urllib2.Request(source_url)
        ree.add_header('User-Agent', random.choice(agents))
        resp = urllib2.urlopen(ree)
        htmlSource = resp.read()
        return htmlSource
    except Exception, e:
        print e
        return ""

download('http://www.windowsphone.com/en-US/apps?list=free')
The result is:
<html><head><meta http-equiv="REFRESH" content="0; URL=http://www.windowsphone.com/en-US/apps?list=free"><script type="text/javascript">function OnBack(){}</script></head></html>
I want to download the actual source of the web page.
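
Both responses above are redirects rather than the page itself: an HTTP redirect to a login URL in the first case, and a meta refresh back to the same URL in the second. One possible explanation, which is an assumption and not confirmed here, is that the site expects cookies set during the redirect chain to be sent back. The following is a minimal sketch in Python 3 (not the Python 2 used above) that keeps cookies in a jar and follows meta refresh tags by hand. The names `find_meta_refresh`, `build_cookie_opener`, and `download_following_refresh` are hypothetical helpers introduced for illustration, and whether this actually returns the app list for this site is untested.

```python
import re
import urllib.parse
import urllib.request
from html.parser import HTMLParser
from http.cookiejar import CookieJar


class MetaRefreshParser(HTMLParser):
    """Records the target URL of the first <meta http-equiv="refresh"> tag."""

    def __init__(self):
        super().__init__()
        self.refresh_url = None

    def handle_starttag(self, tag, attrs):
        if tag != "meta" or self.refresh_url is not None:
            return
        attrs = dict(attrs)
        if (attrs.get("http-equiv") or "").lower() != "refresh":
            return
        # The content attribute looks like "0; URL=http://example.com/".
        match = re.search(r"url\s*=\s*(.+)", attrs.get("content") or "", re.IGNORECASE)
        if match:
            self.refresh_url = match.group(1).strip().strip("'\"")


def find_meta_refresh(html):
    """Return the meta refresh target in html, or None if there is none."""
    parser = MetaRefreshParser()
    parser.feed(html)
    return parser.refresh_url


def build_cookie_opener(user_agent):
    """Build an opener that stores cookies and sends them on later requests."""
    jar = CookieJar()
    opener = urllib.request.build_opener(urllib.request.HTTPCookieProcessor(jar))
    opener.addheaders = [("User-Agent", user_agent)]
    return opener


def download_following_refresh(url, opener, max_hops=5, timeout=10):
    """Fetch url, following up to max_hops meta refresh redirects."""
    html = ""
    for _ in range(max_hops):
        with opener.open(url, timeout=timeout) as resp:
            html = resp.read().decode("utf-8", errors="replace")
        target = find_meta_refresh(html)
        if target is None:
            return html
        # Resolve relative targets against the current URL.
        url = urllib.parse.urljoin(url, target)
    return html
```

A call might look like `download_following_refresh('http://www.windowsphone.com/en-US/apps?list=free', build_cookie_opener('Mozilla/4.0 (compatible; MSIE 7.0b; Windows NT 5.1)'))`. The `max_hops` limit is there so that a page that keeps refreshing to itself, as in the output above, cannot loop forever.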