Modeling estimation of river suspended sediment amount using support vector regression and data control group method

Author

Soil Conservation and Watershed Management Research Department, Markazi Agricultural and Natural Resources Research and Education Center, AREEO, Arak, Iran.

Abstract

Introduction
Erosion and deposition cause irreparable damage to water construction projects. Among these damages, we can mention the accumulation of sediment behind the dams and the reduction of their useful volume, the destruction of structures, the reduction of capacity and the increase in the maintenance cost of irrigation canals. On the other hand, sediment transport has an effect on water quality indicators in terms of drinking and agriculture. Accurate estimation of river suspended sediment load from various aspects of water resources engineering, environmental issues and water quality is of particular importance. In this regard, the hydrological models of the basin have not shown proper efficiency in estimating the amount of suspended sediment, due to the many influencing factors. Most of the simulation studies of suspended sediment estimation are based only on the flow rate of the basin outlet, and the obtained results are proof of their inefficiency. In the meantime, the development and use of new sediment estimation methods that are easy to use in addition to sufficient accuracy will play an important role. Nowadays, the fuzzy and neural intelligent conductor system has found many applications in various water engineering problems, including sedimentation, due to its ability to solve complex and nonlinear phenomena. Due to the great importance of sediment transport in the optimal use of water resources and the design of dams, it is very necessary to obtain an accurate method for estimating the amount of suspended sediment load in rivers.
Materials and Methods
The purpose of this research is to evaluate and compare the two methods of support vector machine models (SVM) and group data control method (GMDH) in estimating the suspended sediment load of the Pol Doab station of Qarachai River in Markazi Province and comparing it with the results of the measuring curve. is for this purpose, the daily data of discharge, sediment, temperature and rainfall parameters of Shazand Pol Doab station were used. For this work, 13 scenarios and different combinations of parameters were defined. Then the results of the two methods were compared with each other and with the results of the measuring curve. Finally, the best method was suggested. For this purpose, library and field studies and review of related sources, statistics and information were collected. The statistics of temperature, rainfall, daily average discharge of stream and sediment measured daily during a long-term statistical period of 40 years at Pol Doab Hydrometry Station were received from the Meteorology and Regional Water Department of Markazi Province. The received data were categorized and converted into the input format of the models. According to the discharge and corresponding sediment, the curve of the sediment gauge was drawn and its equation was obtained. Appropriate patterns of input variables were selected based on trial and error. Considering that the mentioned parameters have a historical course, therefore, the design of the input patterns of soft computing models should be done based on time delays (like what is discussed in the analysis and forecasting of time series). Then the model was taken for each input and output pattern. In the next step, the most appropriate time delay of the input parameters in the modeling, which had a higher R2 determination coefficient and a lower root mean square error (RMSE) was selected. In this research, 70% of research data was used as training and 30% for validation and testing. Finally, two data-mining methods were compared with each other and also with gauge curve and observational data.
Results and Discussion
The results obtained from this research indicated the acceptable performance of the methods used in predicting suspended sediment amounts. Comparing the results of GMDH, SVR and SRC models shows the superiority of GMDH and SVR models in predicting suspended sediment values compared to Verdi model number 6. The results showed the acceptable performance of the GMDH model with the highest R2 determination coefficient of 0.99 and the lowest root mean square error of 83 tons per day. According to the obtained results, it can be said that the GMDH model as a powerful and high-speed model can be used to model suspended sediment. The results of the research showed that both data-mining methods have far better efficiency and accuracy in estimating the suspended load of river sediment than the sediment gauge curve. Data mining-based methods can be used as an alternative to estimate the river's suspended load. It should be noted that due to climate change and droughts, industrial development, colonization of land use and changes in the morphology of watersheds, the obtained results cannot be used forever at any time, but should be used whenever the models need to be updated. Another weakness of the models is that with the increase in the number of developed layers, the accuracy of the produced answers increases, but the produced relationships between the input and output variables become very complicated.
Conclusions
The results of modeling with (SVM) showed that scenario number 6, which includes discharge at the current time and discharge and sediment with a time delay step, with the highest determination coefficient R2 with a value of 0.98 and the lowest root mean square error RMSE equal to 185 kg per day. It performs better in other scenarios. In the next step, the best model selection model (SVM) was used as the input of the GMDH model. The results were compared with the model (SVM) and sediment gauge curve. The results show the acceptable performance of the GMDH model with the highest R2 determination coefficient of 0.99 and the lowest root mean square error of 83 kg per day compared to the other two methods. The obtained results showed that both investigated data mining methods provide much better results than the sediment gauge curve. The coefficients and functions used to calibrate the intelligent models used in this research can be very useful for estimating the suspended sediments of nearby stations without statistics having the same geological and hydrological conditions at the regional level.

Keywords


Adnan RM, Yaseen ZM, Heddam S, Shahid S, Sadeghi-Niaraki A and Kisi O, 2022. Predictability
performance enhancement for suspended sediment in rivers: Inspection of newly developed hybrid
adaptive neuro-fuzzy system model. International Journal of Sediment Research 37(10): 383–398.
https://doi.org/10.1016/j.ijsrc.2021.10.001
Alp M and Cigizoglu HK, 2007. Suspended sediment load simulation by two artificial neural
network methods using hydrometeorological data. Environmental Modelling and Software 22: 2-13.
Aytek A and Kisi O, 2007. A genetic programming approach to suspended sediment modeling. Journal of
Hydrology 351: 288-298.
Biranvand N, Sepahvand A and Haqizadeh A, 2022. Modeling of suspended sediment using machine
learning algorithms in periods of low and high water (Case study: Kashkan watershed). Soil and Water
Modeling and Management (3)2: DOI: 10.22098/mmws.2022.11262.1115.(In Persian with English
Abstract).
Chen XY and Chau KW, 2016. A Hybrid Double Feedforward Neural Network for Suspended Sediment
Load Estimation, Water Resource Management. 30(7): 2179-2194. https://doi.org/10.1007/s11269-016-
1281-2
Cigizoglu HK and Kisi O, 2006. Methods to improve the neural network performance in suspended sediment
estimation. Journal of Hydrology 317: 221-238.
Eivani1 Z, Ahmadi MM and Qaderi K, 2016. Estimation of suspended sediment load concentration in river
system using Group Method of Data Handling (GMDH). Journal of Watershed Management Research
7(13):218-229. (In Persian with English Abstract).
Hesavi M and Shafaee-Bajestan M, 2010. Estimate of Sediment Bed Load in Karoon River Ahwaz Station.
8th International River Engineering Conference, Date 26 June 2010, Ahwaz, Iran. (In Persian with
English Abstract).
Ivakhnenko AG, 1968. The Group Method of Data Handling-a Rival of the Method of Stochastic
Approximation, Soviet Automatic Control c/c of Avtomatika 1: 43-55.
Jie LC and Yu ST, 2011. Suspended sediment load estimate using support vector machines in Kaoping River
basin. International Conference on Consumer Electronics, Communications and Networks (CECNet).
16-18 April .doi: 10.1109/CECNET.2011.5769267
Keshtegar B, Piri J, Hussan WU, Ikram K, Yaseen M, Kisi O, Adnan RM, Adnan M and Waseem M, 2023.
Prediction of sediment yields using a Data-Driven Radial M5 Tree Model. Journal of Water, 15: 1437.
https://doi.org/10.3390/w15071437
Kisi O and Shiri J, 2012. River Suspended sediment estimation by climate variables implication:
Comparative study among soft computing techniques. Computer and Geosciences 43: 73-82.
Kisi O, Ozkan C and Bahriye A, 2012. Modeling discharge-sediment relationship using neural networks with
Artificial Bee Colony Algorithm. Journal of Hydrology: 94-103.
Mehrizi Haeri AA, 2012. Data mining: concepts, methods and applications. Master's thesis in economic and
social statistics, Faculty of Economics, Allameh Tabatabai University. (In Persian with English
Abstract).
222مرادی نژادنشریه دانش آب و خاک / جلد34شماره2/ سال1403
Moradi-Nejad A, Davoudmaqami D and Moradi M, 2018. Investigating the efficiency of methods for
estimating the suspended sediment load of Qarachai river. Environment and Water Engineering (4)5:
https://doi.org/10.22034/jewe.2020.211925.1341.(In Persian with English Abstract).
Moradi-Nejad A, Khosrobigi S, Akbari M and Hosseini SA, 2023. Evaluation of soft calculation methods in
estimation of river suspended sediment (Hassanabad station of Tirah River). Soil and Water Modeling
and Management. https://doi.10.22098/mmws.2023.12620.1258. (In Persian with English Abstract).
Nikpour MR and Sani Khani H, 2016. Modeling of river suspended sediments using soft calculations
(Darah-Rood River). Irrigation and Water Engineer's Scientific and Research Quarterly. 8(30):29-44.
https://civilica.com/doc/888603. (In Persian with English Abstract).
Qobadian R and Shokri H, 2018. Numerical investigation of factors affecting the distribution of unbalanced
sediment concentration in natural rivers (Case study: Qarasu River, Kermanshah). Journal of Water and
Soil 2(34): 241-253. DOI: 10.22067/jsw.v34i2.7632. (In Persian with English Abstract).
Safavi H, 1390 Engineering Hydrology, Arkan Danesh Publications, 3rd edition, Isfahan. 699P.
Satari MT, Rezazadeh Jodi A, Safdari F and Kahramanzadeh F, 2016. Performance evaluation of M5 tree
model and support vector regression methods in river suspended sediment modeling. Protection of
Water and Soil Resources 6(1):109-124. https://civilica.com/doc/1297623. (In Persian with English
Abstract).
Sheikh Alipour Z, Hasanpour F and Bigi M, 2014. Comparison of artificial intelligence methods in
estimating suspended sediment load (Case study: Sistan River). Water and Soil Conservation Research
7(2): 41-60. (In Persian with English abstract).
Zakaria N, Azamathulla HMD, Chang CHK and Ghani AAB, 2010. Gene expression programming for total
bed material load estimation-a case study. Science of the Total Environment, 408: 5078-5085.
Zhou Y, Lu XX, Huang Y and Zhu YM, 2007. Suspended sediment flux modeling with artificial neural
network: An example of the Longchuanjiang River in the upper Yangtze catchment, China.
Geomorphology 84: 111-125.