Merge hydrography from different sources
Sometimes the NHD datasets are not complete if the study area across bounders. In this post I will show you how we can merge different datasets into a predefined format for hydrology research.
When we merge stream networks data from different sources, the first question is what do we need to generate in the final product.
- Stream segment
- Stream order
- Stream confluence
And what format we want the data to be stored?
- Shapefile
- NetCDF
Considering that the current NHD is using Shapefile and the hexagon mesh is also shapefile, one direction we can take is shapefile.
GDAL supports API level intersection, so it is also possible to use another format. GDAL Geometry
This process can be coded into the Python package as well.
As a start, we can rely on Shapefile for testing purpose.
In the shapefile attribute table, we can add two fields to define the stream segment information.
- Stream order
- Stream segment index
Ideally, we can break the whole line feature into parts then re-build the topology.
The follow steps may work:
- Remove erros in line feature, mainly loops
- loop
-
short path
2. Merge all flowline together 3. Merge features into one single feature 4. Break into parts
- Define outlet (this step might need DEM)
- Define segment
- Define order
- Write into a new shapefile
We might be able to add other attributes later if needed.
If possbile, we can implement all the steps within Python without using ArcGIS API. However, step 1 requires manual examination, so we can try ArcGIS first. Then the following steps can be carried out using Python.