1. Random forest for the detection of unauthorized consumption in water supply systems: a case study in Southern Brazil.
- Author
-
Stramari, Marcos Roberto, Kalbusch, Andreza, and Henning, Elisa
- Subjects
RANDOM forest algorithms ,WATER supply ,MACHINE learning ,WATER consumption ,FRAUD ,TECHNICAL information - Abstract
Clandestine connections represent a significant portion of the apparent water losses. The objective of this research work is to propose a systematic approach to inspect sites with possible illegal water consumption more precisely. A case study was conducted in a District Metered Area (DMA) located in the city of Joinville, Southern Brazil, using technical information and consumption history from 1400 consumer units located in this area. The proposed methodology uses the bagging technique called Random Forest, through a machine learning algorithm. With success rates of 88.10% when classifying frauds in the model training phase and 88.20% in the test phase, the obtained model shows great ability to properly classify frauds in the water supply system in the studied DMA. The use of the random forest classification model, combined with the SMOTE technique for data balancing, proves to be a viable technical alternative for the detection of unauthorized water consumption. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF