A platform for research: civil engineering, architecture and urbanism
Regional regression models of percentile flows for the contiguous United States: Expert versus data-driven independent variable selection
Study region: 918 basins in the contiguous United States. Study focus: Regional regression models were developed to predict 13 percentile flows for groups of basins clustered based on physical and climatic characteristics. The research question investigated how the number and information content of independent variables affected model performance, and compared data-driven versus expert assessment approaches for variable selection. New hydrological insights for the region: A set of three variables selected based on an expert assessment of factors that influence percentile flows performed similarly to larger sets of variables selected using a data-driven method. Expert assessment variables included mean annual precipitation, potential evapotranspiration, and baseflow index. Larger sets of up to 37 variables contributed little, if any, additional predictive information. Variables used to describe the distribution of basin data (e.g. standard deviation) were not useful, and average values were sufficient to characterize physical and climatic basin conditions. Effectiveness of the expert assessment variables may be due to the high degree of multicollinearity (i.e. cross-correlation) among additional variables. A tool is provided in the Supplementary material to predict percentile flows based on the three expert assessment variables. Future work should develop new variables with a strong understanding of the processes related to percentile flows.
Regional regression models of percentile flows for the contiguous United States: Expert versus data-driven independent variable selection
Study region: 918 basins in the contiguous United States. Study focus: Regional regression models were developed to predict 13 percentile flows for groups of basins clustered based on physical and climatic characteristics. The research question investigated how the number and information content of independent variables affected model performance, and compared data-driven versus expert assessment approaches for variable selection. New hydrological insights for the region: A set of three variables selected based on an expert assessment of factors that influence percentile flows performed similarly to larger sets of variables selected using a data-driven method. Expert assessment variables included mean annual precipitation, potential evapotranspiration, and baseflow index. Larger sets of up to 37 variables contributed little, if any, additional predictive information. Variables used to describe the distribution of basin data (e.g. standard deviation) were not useful, and average values were sufficient to characterize physical and climatic basin conditions. Effectiveness of the expert assessment variables may be due to the high degree of multicollinearity (i.e. cross-correlation) among additional variables. A tool is provided in the Supplementary material to predict percentile flows based on the three expert assessment variables. Future work should develop new variables with a strong understanding of the processes related to percentile flows.
Regional regression models of percentile flows for the contiguous United States: Expert versus data-driven independent variable selection
Geoffrey Fouad (author) / André Skupin (author) / Christina L. Tague (author)
2018
Article (Journal)
Electronic Resource
Unknown
Metadata by DOAJ is licensed under CC BY-SA 1.0
Mapping the 95^t^h Percentile Daily Rainfall in the Contiguous U.S
British Library Conference Proceedings | 2013
|Regional Regression Models of Annual Streamflow for the United States
British Library Online Contents | 1999
|Taylor & Francis Verlag | 2012
|An examination of the stability of regional lumber demand in the contiguous United States, 1950–1985
Springer Verlag | 1991
|