抓取网页时,乱码问题

2020-12-13 02:26

阅读:611

标签:style   blog   class   code   java   color   

soscw.com,搜素材
 1 def get_content():
 2     user_agent="Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/34.0.1847.131 Safari/537.36"
 3     headers = { User-Agent : user_agent }
 4     url = "http://bj.58.com/"
 5     req = urllib2.Request(url, headers = headers)
 6     response = urllib2.urlopen(req)
 7     the_page = response.read()
 8     type = sys.getfilesystemencoding()
 9     the_page = the_page.decode("UTF-8").encode(type)
10     print the_page
soscw.com,搜素材

 

抓取网页时,乱码问题,搜素材,soscw.com

抓取网页时,乱码问题

标签:style   blog   class   code   java   color   

原文地址:http://www.cnblogs.com/isharer/p/3718396.html


评论


亲,登录后才可以留言!