Project author: dumbengineer

Project description:
HTTP request in Python that returns a link from a web page using BeautifulSoup
Language: Python
Project URL: git://github.com/dumbengineer/Web_crawl.git
Created: 2017-10-02T13:48:44Z
Project community: https://github.com/dumbengineer/Web_crawl

License:



Web_crawl

Returns the first link that appears in a paragraph on any Wikipedia page,
ignoring links in the help box and pronunciation links.
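
The link selection described above could be sketched roughly as follows. This is not the project's code: it uses the standard library's html.parser instead of BeautifulSoup so that it runs without extra dependencies, and the names FirstParagraphLink and first_link are hypothetical. It also assumes that article links look like /wiki/Title, while help and pronunciation links sit in namespaced paths such as /wiki/Help:... or /wiki/File:...

```python
from html.parser import HTMLParser


class FirstParagraphLink(HTMLParser):
    """Record the first article link found inside a <p> element."""

    def __init__(self):
        super().__init__()
        self.p_depth = 0        # greater than 0 while inside a <p>
        self.first_link = None  # href of the first accepted link

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self.p_depth += 1
        elif tag == "a" and self.p_depth > 0 and self.first_link is None:
            href = dict(attrs).get("href") or ""
            # Assumption: a ":" in the path marks a namespace page
            # (Help:, File:, ...), which covers help and pronunciation links.
            if href.startswith("/wiki/") and ":" not in href:
                self.first_link = href

    def handle_endtag(self, tag):
        if tag == "p" and self.p_depth > 0:
            self.p_depth -= 1


def first_link(html):
    """Return the first article href inside a paragraph, or None."""
    parser = FirstParagraphLink()
    parser.feed(html)
    return parser.first_link
```

In a real run, the HTML string would come from an HTTP request for the article page; here the function only takes markup that has already been downloaded.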

CONDITIONS:

It will terminate if:

- the crawl reaches the Philosophy article

- the number of URLs exceeds 25

- the crawl is trapped in a loop, i.e., an article that was already visited appears again
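
The stopping conditions above might be combined into a loop like the one below. This is a minimal sketch, not the repository's implementation: crawl, next_link, MAX_URLS and TARGET are hypothetical names, and next_link stands in for whatever function fetches a page and returns its first link. The extra "no link found" exit is an assumption added so the loop cannot fail on a page without links.

```python
MAX_URLS = 25                 # limit taken from the conditions above
TARGET = "/wiki/Philosophy"   # assumed path of the Philosophy article


def crawl(start, next_link, max_urls=MAX_URLS, target=TARGET):
    """Follow first links from `start`; return (visited list, stop reason)."""
    visited = [start]
    current = start
    while True:
        if current == target:
            return visited, "reached Philosophy"
        if len(visited) > max_urls:
            return visited, "limit exceeded"
        nxt = next_link(current)
        if nxt is None:
            # Assumption: stop quietly when a page has no usable link.
            return visited, "no link found"
        if nxt in visited:
            return visited, "loop detected"
        visited.append(nxt)
        current = nxt
```

Passing next_link as a parameter keeps the loop testable without network access, for example with a dictionary lookup such as crawl("/wiki/A", {"/wiki/A": "/wiki/Philosophy"}.get).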