Python爬虫基础 - VoidKing(2)_H5之家 - 中国HTML5教程资源分享第一站

出现了警告，根据提示，我们在创建BeautifulSoup对象时，指定解析器即可。

soup = BeautifulSoup(html_doc,'html.parser')

5、从文档中获取所有文字内容

print(soup.get_text())

6、正则匹配

link_node = soup.find('a',href=re.compile(r"til")) print(link_node) 后记

python爬虫基础知识，至此足够，接下来，在实战中学习更高级的知识。

书签

Python开发简单爬虫

The Python Standard Library
https://docs.python.org/3/library/index.html

Beautiful Soup 4.2.0 文档
https://www.crummy.com/software/BeautifulSoup/bs4/doc/index.zh.html

为什么python适合写爬虫？

如何学习Python爬虫[入门篇]？
https://zhuanlan.zhihu.com/p/21479334?refer=passer

你需要这些：Python3.x爬虫学习资料整理
https://zhuanlan.zhihu.com/p/24358829?refer=passer

如何入门 Python 爬虫？
https://www.zhihu.com/question/20899988

Python3.X 抓取网络资源

python网络请求和"HTTP Error 504:Fiddler - Receive Failure"

怎么使用Fiddler抓取自己写的爬虫的包？
https://www.zhihu.com/question/52614615

fiddler对python脚本抓取https包时发生了错误?
https://www.zhihu.com/question/42104344?sort=created

HTTPS和HTTP的区别

posted @