项目作者: romanrosh

项目描述 :
Airbnb scraper and hotel api for data compare
高级语言: Python
项目地址: git://github.com/romanrosh/airbnb-web-scaper.git
创建时间: 2018-12-15T16:05:08Z
项目社区:https://github.com/romanrosh/airbnb-web-scaper

开源协议:MIT License

下载


Airbnb_mining and hotels data API for comfortable comparison between 2 sleeping arrangements

Created by Tom Cohen and Roman Rosh

A data scraping program for airbnb and hotel API for data retrieval

Note that the name of the classes used change regularly on airbnb , so you should change them according to the latest airbnb website update.

Airbnb has invested a significant amount of time to make sure you cant scrape their website , but this scraper can handle it , it will show you the errors while running but will take care of them.

This program will uses by default all the cores of your computer , this can be changed in the class initialization.

You will need to include the user and password of mysql installed on your computer since this program also creates a new database called airbnb.

Requires the chromedriver file , can be downloaded at : http://chromedriver.chromium.org/downloads
should be put in the same directory as the scraper to avoid unnecessary config

Hotels API connects to hotellookapi and retrieves hotel data, for comparing with AirBnb scraper or any other use. The data is connected separately, the aim is to offer all the options for sleeping arrangements in once database.

Hotels API does not require a key but is limited to 100 records per request per location if more records are required perform the request several times as much as needed.