python3 报错:UnicodeDecodeError: 'utf-8' codec can't decode byte 0xd6 in position 201: invalid continuation byte

时间:2018-04-18 16:00:59   收藏:0   阅读:1285

代码:

# -*- coding:utf-8 -*-

from urllib import request

resp = request.urlopen(http://www.xxx.com)

print(resp.read().decode(utf-8))

报错:

Traceback (most recent call last):
  File "F:/workspace/python/py3/test_urllib.py", line 7, in <module>
    print(resp.read().decode(utf-8))
UnicodeDecodeError: utf-8 codec cant decode byte 0xd6 in position 201: invalid continuation byte

原因:

  确定要抓取的页面的编码,并不是所有网站的编码都是utf-8的,resp.read().decode()应传入与要抓取的网页一致的编码。

评论(0
© 2014 mamicode.com 版权所有 京ICP备13008772号-2  联系我们:gaon5@hotmail.com
迷上了代码!