Chapter 6. Estimating Unknown Values

In this chapter, we will use interpolation methods to estimate the unknown values at one location based on the known values at other locations.

Interpolation is a technique to estimate unknown values entirely on their geographic relationship with known location values. As space can be measured with infinite precision, data measurement is always limited by the data collector's finite resources. Interpolation and other more sophisticated spatial estimation techniques are useful to estimate the values at the locations that have not been measured. In this chapter, you will learn how to interpolate the values in weather station data, which will be scored and used in a model of vulnerability to a particular agricultural condition: mildew. We've made the weather data a subset to provide a month in the year during which vulnerability is usually historically high. An end user could use this application to do a ground truthing of the model, which is, matching high or low predicted vulnerability with the presence or absence of mildew. If the model were to be extended historically or to near real time, the application could be used to see the trends in vulnerability over time or to indicate that a grower needs to take action to prevent mildew. The parameters, including precipitation, relative humidity, and temperature, have been selected for use in the real models that predict the vulnerability of fields and crops to mildew.

In this chapter, we will cover the following topics:

Adding data from MySQL
Using the NetCDF multidimensional data format
Interpolating the unknown values for visualization and reporting
Applying a simple algebraic risk model
Python GDAL wrappers to filter and update through SQLite queries
Interpolation
Map algebra modeling
Sampling a raster grid with a layer of gridded points
Python CGI Hosting
Testing and debugging during the CGI development
The Python SpatiaLite/SQLite3 wrapper
Generating an OpenLayers3 (OL3) map with the QGIS plugin
Adding AJAX Interactivity to an OL3 map
Dynamic response in the OL3 pixel popup

Importing the data

Often, the data to be used in a highly interactive, dynamic web application is stored in an existing enterprise database. Although these are not the usual spatial databases, they contain coordinate locations, which can be easily leveraged in a spatial application.

Connecting and importing from MySQL in QGIS

The following section is provided as an illustration only—database installation and setup are needlessly time consuming for a short demonstration of their use.

Note

If you do wish to install and set up MySQL, you can download it from http://dev.mysql.com/downloads/. MySQL Community Server is freely available under the open source GPL license. You will want to install MySQL Workbench and MySQL Utilities, which are also available at this location, for interaction with your new MySQL Community Server instance. You can then restore the database used in this demonstration using the Data Import/Restore command with the provided backup file (c6/original/packt.sql) from MySQL Workbench.

To connect to and add data from your MySQL database to your QGIS project, you need to do the following (again, as this is for demonstration only, it does not require database installation and setup):

Navigate to Layer | Add Layer | Add vector layer.
- Source type: Database
- Type: MySQL, as shown in the following screenshot:
Once you've indicated that you wish to add a MySQL Database layer, you will have the option to create a new connection. In Connections, click on New. In the dialog that opens, enter the following parameters, which we would have initially set up when we created our MySQL Database and imported the .sql backup of the packt schema:
- Name: packt
- Host: localhost
- Database: packt
- Port: 3306
- Username: packt
- Password: packt, as shown in the following screenshot:
Click on Test Connect.
Click on OK.
Click on Open, and the Select vector layers to add dialog will appear.
From the Select vector layers dialog, click on Select All. This includes the following layers:
- fields
- precipitation
- relative_humidity
- temperature
Click on OK.

The layers (actually just the data tables) from the MySQL Database will now appear in the QGIS Layers panel of your project.

Converting to spatial format

The fields layer (table) is only one of the four tables we added to our project with latitude and longitude fields. We want this table to be recognized by QGIS as geospatial data and these coordinate pairs to be plotted in QGIS. Perform the following steps:

Export the fields layer as CSV by right–clicking on the layer under the Layers panel and then clicking on Save as.
In the Save vector layer as… dialog, perform the following steps:
1. Click on Browse to choose a filesystem path to store the new .csv file. This file is included in the data under c6/data/output/fields.csv.
2. For GEOMETRY, select <Default>.
3. All the other default fields can remain as they are given.
4. Click on OK to save the new CSV, as shown in the following screenshot:

Now, to import the CSV with the coordinate fields that are recognized as geospatial data and to plot the locations, perform the following steps:

From the Layer menu, navigate to Add Layer | Add Delimited Text Layer.
In Create a Layer from the Delimited Text File dialog, perform the following steps:
1. Click on the Browse… button to browse the location where you previously saved your fields.csv file (for example, c6/data/output/fields.csv).
2. All the other parameters should be correctly populated by default. Take a look at the following image.
3. Click on OK to create the new layer in your QGIS project.

You will receive a notification that as no coordinate system was detected in this file, WGS 1984 was assigned. This is the correct coordinate system in our case, so no further intervention is necessary. After you dismiss this message, you will see the fields locations plotted on your map. If you don't, right–click on the new layer and select Zoom to Layer.

Note that this new layer is not reflected in a new file on the filesystem but is only stored with this QGIS project. This would be a good time to save your project.

Finally, join the other the other tables (precipitation, relative_humidity, and temperature) to the new plotted layer (fields) using the field_id field from each table one at a time. For a refresher on how to do this, refer to the Table join section of Chapter 1, Exploring Places – from Concept to Interface. To export each layer as separate shapefiles, right-click on each (precipitation, relative_humidity, and temperature), click on Save as, populate the path on which you want to save, and then save them.

The layer/table relations

The newer versions of QGIS support layer/table relations, which would allow us to model the one-to-many relationship between our locations, and an abstract measurement class that would include all the parameters. However, the use of table relationships is limited to a preliminary exploration of the relationships between layer objects and tables. The layer/table relationships are not recognized by any processing functions. Perform the following steps to explore the many-to-many layer/table relationships:

Add a relation by navigating to Project | Project Properties | Relations. The following image is what you will see once the relationships to the three tables are established:
To add a relation, select a nonlayer table (for example, precipitation) in the Referencing Layer (Child) field and a location table (for example, fields) in the Referenced Layer (Parent) field. Use the common Id field (for example, field_id), which references the layer, to relate the tables. The name field can be filled arbitrarily, as shown in the following screenshot:
Now, to use the relation, click on a geographic object in the parent layer using the identify tool (you need to check Auto open form in the identify tool options panel). You'll see all the child entities (rows) connected to this object.

NetCDF

Network Common Data Form (NetCDF) is a standard—and powerful—format for environmental data, such as meteorological data. NetCDF's strong suit is holding multidimensional data. With its abstract concept of dimension, NetCDF can handle the dimensions of latitude, longitude, and time in the same way that it handles other often physical, continuous, and ordinal data scales, such as air pressure levels.

For this project, we used the monthly global gridded high-resolution station (land) data for air temperature and precipitation from 1901-2010, which the NetCDF University of Delaware maintains as part of a collaboration with NOAA. You can download further data from this source at http://www.esrl.noaa.gov/psd/data/gridded/data.UDel_AirT_Precip.html.

Viewing NetCDF in QGIS

While there is a plugin available, NetCDF can be viewed directly in QGIS, in GDAL via the command line, and in the QGIS Python Console. Perform the following steps:

Navigate to Layer | Add Raster Layer.
Browse to c6/data/original/air.mon.mean.v301.nc and add this layer.
Use the path Raster | Miscellaneous > Information to find the range of the values in a band. In the initial dialog, click on OK to go to the information dialog and then look for air_valid_range. You can see this information highlighted in the following image. Although QGIS's classifier will calculate the range for you, it is often thrown off by a numeric nodata value, which will typically skew the range to the lower end.
Enter the range information (-90 to 50) into the Style tab of the Layer Properties tab.
Click on Invert to show cool to hot colors from less to more, just as you would expect with temperature.
Click on Classify to create the new bins based on the number and color range. The following screenshot shows what an ideal selection of bins and colors would look like:
Click on OK. The end result will look similar to the following image:

To render the gridded NetCDF data accessible to certain models, databases, and to web interaction, you could write a workflow program similar to the following after sampling the gridded values and attaching them to the points for each time period.

Previous Chapter

Summary

Next Chapter

Interpolated model values

Table of Contents for QGIS: Becoming a GIS Power User