Publication Date

8-1-2025

Document Type

Article

Publication Title

International Journal of Applied Earth Observation and Geoinformation

Volume

142

DOI

10.1016/j.jag.2025.104742

Abstract

Sufficient abundance and variety of field site sampling are crucial for obtaining an accurate reach-scale river classification of a regional stream network in support of scientific research and river management. However, many studies still randomly select field sites or only visit accessible streams. This leads to an inadequate exploration of stream characteristics, resulting in incomplete or inaccurate classification. Machine learning has been recognized for discovering and extracting streams’ geomorphic patterns efficiently and accurately from data, but its application in field site sampling design is still in its infancy. This study developed a general and practical field site selection framework by incorporating machine learning in a human-in-the-loop manner. This framework includes three steps: (1) initial field site selection via machine learning from prior datasets, (2) selected field site accessibility evaluation and observation, and (3) additional field site decision and selection via an iterative learning process. In an example application to the San Francisco Bay Area (California, USA), our framework extracted representative geomorphic characteristics of (i) previous known stream types from prior labeled and geospatial datasets and (ii) previously unrecognized stream types based on uncertainty information obtained by machine learning. Moreover, we propose methods for replacing inaccessible sites to ensure sufficient information is retained in the selected field sites. Results revealed clear differences in variable distributions between the 148 high‐certainty sites and the 51 high‐uncertainty sites, a pattern that was validated by our field surveys. Furthermore, the 41 newly identified high‐uncertainty sites were found under-represented in the initial surveyed sites and thus their selection for the next round of field surveys will help fill the important feature gaps left by the initial survey. The feasibility of this framework allows river scientists and land use decision-makers to better understand river patterns and manage spatial planning.

Funding Number

R02CP6967

Funding Sponsor

Arts and Science Council

Keywords

Field site selection, Machine learning, Prior datasets, River classification, Uncertainty information

Creative Commons License

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.

Share

COinS