Python 爬虫/抓取框架

BeautifulSoup

BeautifulSoup (美丽的汤)是一个纯 Python 的html(xml)解释器,为许多 python 开发者所钟爱,其官方网站已有详尽的文档可作参考。

Scrapy

解析支持 HTML,XML,CSV ,Javascript。Scrapy is a fast high-level screen scraping and web crawling framework, used to crawl websites and extract structured data from their pages. It can be used for a wide ran

Ruya

It is targeted solely towards developers who want crawling functionality in their code.

mechanize

非常高层次的浏览功能(超简单的表格填写和提交)

FMiner

FMiner是一个使用PySide和webkit开发的网站内容提取软件

pykoala

简单,小巧,快速的开源爬虫

分享:Python 爬虫/抓取框架

Copyright© Python4cn(news, jobs) simple-is-better.com, 技术驱动:powered by web.py 空间主机:Webfaction

版权申明:文章转载已注明出处,如有疑问请来信咨询。本站为 python 语言推广公益网站,与 python 官方没有任何关系。

联系/投搞/留言: en.simple.is.better@gmail.com 向本站捐赠