We’ll create Google maps depicting rental prices in Minsk, Belarus. First, we’ll scrape data from a website, clean the data, and then generate three maps of different kinds.
All the code from this tutorial is available on GitHub.
Prerequisites
To create a custom Google Map with Python, we’ll need the following:
- Python installed on your machine
- Pip: package management system (it comes with Python)
- Jupyter Notebook: an online editor for data visualization
- Pandas: a library to prepare data for plotting
- BeautifulSoup: a library for scraping websites
- Gmaps: a Jupyter plugin for embedding Google Maps in Jupyter notebooks
- Google API key: an individual key starting with “AI” we’ll need to create a custom Google Map
You can download the latest version of Python for Windows on the official website.
To get Jupyter Notebook and Pandas, you’ll need to install recommended Scientific Python Distributions. Type this in your terminal:
To install BeautifulSoup:
And to install gmaps, type this:
Finally, follow instructions to create a Google API key. Don’t share it with anyone!
Note that you might need to turn on billing in Google Console. Otherwise, your map will have the “For development purposes only” watermark.
Data Preparation
First of all, we need to prepare data for mapping. We’ll do this in 4 steps.
1. Scraping website
For this tutorial, I scraped the Realt.by website, namely about 4,000 rental listings in Minsk, Belarus.
First, I scraped listings’ IDs (scrape-ids.py in my GitHub repository) and then individual listings corresponding to each ID (scrape-listings.py). The result is a data set (realt-listings.csv) that includes coordinates, address, district, metro station, number of rooms, total area, living area, and rental price (BYN per month) for each listing.
To create maps, we’ll need only coordinates and rental prices. We scraped the coordinates (latitudes and longitudes or x and y) from the website’s HTML map element:
Here’s how we access the coordinates while scraping:
2. Cleaning data
Next, create a folder that will contain your notebook (e.g. “maps-gmaps”) and open Jupyter Notebook by typing this command in your terminal (don’t forget to change the path):
This will automatically open the Jupyter home page at http://localhost:8888/tree. Click on the “New” button in the top right corner, select the Python version installed on your machine, and a notebook will open in a new browser window.
In the new notebook, we clean the data: select columns, drop NaN values, convert values to numeric, and delete values that are too small or too large:
We save this result as data.csv:
3. Binning
To create maps of some types, it’s also useful to know the categories of data. We can use the binning method to divide our data into categories and count values for each of them:
This shows us that the majority of values fall into the BYN 15–20 category and that BYN 60–80 is almost non-existent.
Find more tips on data manipulation with the Pandas library in our article 50+ Pandas Tricks.
4. Editing datasets.py
Next, we upload the data.csv file on GitHub, open it as a raw file, and get the link (https://raw.githubusercontent.com/jukuznets/jupyter-notebooks/main/data.csv). If you upload your .csv file on GitHub, be sure to click on the “Raw” button to get the link to the raw downloadable .csv file. Also, your repository should be public.
After that, find and edit the file called datasets.py in gmaps package. It should be located at an address like this:
Once found, add the following code in the METADATA object of datasets.py:
I edited my file like this:
headers correspond to the data set’s columns, and types correspond to each column’s data type.
Once done, save this file and open your Jupyter Notebook.
Creating Maps
Open a new notebook and import all the necessary libraries in the first line:
We’ll need the last line to export an HTML file with the finished map.
After importing all the necessary libraries and plugins, insert your Google API key:
Next, we’ll create 3 maps:
- Heatmap depicting data density
- Map with multicolored dots depicting rental prices from the lowest ones to the highest ones
- Map with icons representing the most expensive rentals
Here are the lines of codes that are common for each of the three maps:
1. Heatmap
The first map we’ll create is a weighted heatmap to show data density, with the locations painted in gradient colors ranging from green to red.
For this type of map, we’ll need 3 variables: latitude, longitude, and weights. The weight represents how frequent or important the event is in one place.
center=coordinates should come together with zoom_level. Increase zoom_level to zoom in the map.
map_type is the type of map: 'ROADMAP' (default; the classic Google Maps style), 'SATELLITE' (satellite tiles with no overlay), 'HYBRID' (satellite base tiles with roads and cities overlaid), or 'TERRAIN' (map showing terrain features).
point_radius is the number of pixels for each point passed in the data. This determines the “radius of influence” of each data point.
max_intensity is a strictly positive floating-point number indicating the numeric value that corresponds to the hottest color in the heatmap gradient. Any density of points greater than that value will get mapped to the hottest color.
Open the map in an interactive mode in a new window.
2. Map with dots
Our second map, a map width dots, will show data variation or, in our example, the variety of rental prices ranging from BYN 5 to BYN 80 per sqm per month. Locations will be marked with multicolored dots (blue, green, yellow, orange, and red) depending on the price.
Our selection of price ranges is based on categories selected in the Data Preparation section (Step 3 — Binning).
We’ll create this map in several steps:
1. Divide the data set in several categories:
2. Use the symbol_layer function to draw dots. Symbols represent each latitude, longitude pair with a circle whose color and size you can customize. You can have several layers of markers.
gmaps documentation states that Google Maps may become very slow if you try to represent more than a few thousand symbols or markers. If you have a larger data set, you should either consider subsampling or use heatmaps.
3. Create a map figure.
4. Add layers.
Open the map in an interactive mode in a new window.
3. Map with icons
For our third map, we’ll add a layer of markers/icons to a Google map. Each marker will represent an individual data point. Markers are currently limited to the Google maps style drop icon.
1. Create a dictionary with coordinates presented as tuples and add any other key/value pairs you need.
2. Create a template. We can attach a pop-up box to each marker. Clicking on the marker will bring up the infobox. The content of the box can be either plain text or HTML.
The <dl> tag defines a description list, <dt> defines terms/names, and the <dd> tag is used to describe a term/name in a description list.
3. Use the marker_layer function to place icons.
4. Create a map figure.
5. Add a layer.
Open the map in an interactive mode in a new window.
Exporting Maps
You can save maps to PNG by clicking the Download button in the toolbar. This will download a static copy of the map. The low quality of the rendering is still an issue, though.
Alternatively, you can export interactive maps to HTML using the infrastructure provided by ipywidgets. Here’s how we export a map to HTML:
Next, we can edit the export.html file the way we like. For example, we can add a title, change the text color, background, etc.
For our map with multicolored dots, we can also add a legend:
Once again, all the code from this tutorial is available on GitHub.
You can also read other tutorials on visualizing data with maps: