Python库beautifulsoup的安装与使用

在Python Extension Packages for Windows 上找到相应的库，解压后把bs4目录复制到Python安装目录下的lib目录下。参考：Python爬虫利器二之Beautiful Soup的用法觅

#coding:utf-8

from bs4 import BeautifulSoup

html = ...
soup = BeautifulSoup(html,"lxml")
print soup.title

要指定解释器，否则会报错：

<title>The Dormouse's story</title>
D:\Python27\lib\bs4\__init__.py:166: UserWarning: No parser was explicitly specified, so I'm using the best available HTML parser for this system ("lxml"). This usually isn't a problem, but if you run this code on another system, or in a different virtual environment, it may use a different parser and behave differently.

To get rid of this warning, change this:

 BeautifulSoup([your markup])

to this:

 BeautifulSoup([your markup], "lxml")

  markup_type=markup_type))

Process finished with exit code 0

文档信息

本文作者：zhupite
本文链接：https://zhupite.com/python/Python%E5%BA%93beautifulsoup%E7%9A%84%E5%AE%89%E8%A3%85%E4%B8%8E%E4%BD%BF%E7%94%A8.html
版权声明：自由转载-非商用-非衍生-保持署名（创意共享3.0许可证）

朱皮特的烂笔头

Python库beautifulsoup的安装与使用

文档信息

Search

Table of Contents