Will Indonesia's Forests Survive Development Pressure? Machine Learning Predictions for Energy-Critical Tropical Watersheds

Widyanti Utami A ORCID
Irlan Irlan ORCID
Nur Hilal A Syahrir ORCID
Rosmaeni Rosmaeni ORCID

Abstract

Land Use and Land Cover (LULC) changes play an important role in influencing the hydrological conditions of a watershed. The conversion of land such as forests, shrubs and grasslands into agricultural land can disrupt the hydrological balance of the watershed. The availability of information related to LULC dynamics in the future is needed to assist sustainable watershed management planning. Machine learning technology, such as Cellular Automata, can provide accurate predicting. The objective of this research is to simulate LULC based on machine learning in the Mamasa Sub-watershed. Two model combinations were employed to simulate LULC: Artificial Neural Network-Cellular Automata (ANN-CA) and Logistic Regression-Cellular Automata (LR-CA). The research results found that the ANN-CA model achieved percent of correctness and overall kappa of 83.6745 and 0.75412, respectively, which were higher than those of the LR-CA model (82.3498 and 0.73361). The prediction results of both model combinations still fall below the actual LULC values, especially in the case of large LULC classes such as forests, range-shrub, rice, and pasture. Conversely, higher accuracy is observed for smaller classes such as wetlands-forested, orchard, residential, and oak. However, it should be noted that this research did not include several socio-economic variables, such as population and income level, which are considered to influence changes in LULC. Future research is expected to analyse the influence of each variable and include some socio-economic variables that may have a significant influence on LULC change.

1.   Introduction

Land use and land cover (LULC) changes play an important role in influencing the hydrological conditions of a watershed. The conversion of forest land, shrubs, and grasslands into agricultural land can disrupt the hydrological balance of a watershed (Regasa, Nones, & Adeba, 2021). One of the impacts of such land conversion is the increased rate of erosion, which causes siltation of rivers and water bodies (Isra, Arsyad, & Chairuddin, 2023). This condition alters the hydrological balance, including increased surface runoff and changes in river discharge, thereby reducing the benefits provided by watershed areas, such as the Mamasa Sub-watershed (Sugianto et al., 2022). Mamasa Sub watershed plays an important role because it provides water discharge for the Bakaru Hydroelectric Power Plant, where water discharge is the main input in the process of producing electrical energy (Singh & Singal, 2017; Stickler et al., 2013). If the hydrological condition of the Mamasa Sub watershed declines, it can disrupt the electricity production process, which has an impact on the sustainability of electrical energy supply.

Regular monitoring is of paramount importance to assess changes and the current status of LULC (Kim, 2016; Reddy et al., 2016; Vaggela, Sanapala, & Mokka, 2022). In Indonesia, the Ministry of Environment and Forestry (MoEF) has taken steps to conduct periodic monitoring of LULC, particularly with regard to forests, on an annual basis. However, the acquisition of LULC data has not been complemented by efforts to predict future LULC patterns based on the ongoing dynamics of changes. Precise predictions of LULC data would prove immensely valuable to stakeholders involved in decisions-making and policy formulation, as they can shape future LULC conditions (Hamad, Balzter, & Kolo, 2018; Liu et al., 2017; Singh et al., 2018). The availability of methods and technology for predicting LULC can offer solutions to these challenges.

The rapid advancement of Geographic Information System (GIS) technology has enabled researchers to engage in diverse spatial modeling task. This progress is further facilitated by the introduction of machine learning as a novel technology that enables the prediction of dynamic spatial changes, including changes in LULC (Chaturvedi & de Vries, 2021). Numerous studies have been undertaken to assess the accuracy of machine learning-based LULC prediction models. Cellular automata stand out as the machine learning algorithm widely employed in contemporary LULC simulations (Koko et al., 2020; Rimal et al., 2017; Salakory & Rakuasa, 2022). Land use and land cover simulations based on cellular automata have consistently demonstrated high levels of accuracy (Rimal et al., 2017).

The cellular automata algorithm operates by identifying spatial patterns based on the transition potentials of cells (pixels), which are derived from initial conditions, environmental factors, and predefined transition rules (Hu, Li, & Lu, 2018). Consequently, LULC simulations that rely on cellular automata are frequently integrated with various models to predict potential land transitions. These models encompass Markov chains (Salakory & Rakuasa, 2022; Ghosh et al., 2017), Artificial Neural Network (ANN) (Saputra & Lee, 2019; Zeshan, Mustafa, & Baig, 2022), logistic regression (Koko et al., 2020), random forest (Gounaridis et al., 2019), as well as several metaheuristic approaches, including particle swarm optimization (PSO), generalized simulated annealing (GSA) and genetic algorithm (GA) (Feng, Liu, & Tong, 2018). The selection of the appropriate transition model plays a crucial role in enhancing the accuracy of cellular automata-based simulations (Zhuang et al., 2017). Land potential transition models, such as artificial neural network and logistic regression, represent viable option as they consistently yield satisfactory results.

Research related to LULC simulation based on machine learning is to support watershed management planning (Li & Yeh, 2002). As reported by previous studies, machine learning can provide accurate prediction. The primary objective of this study is to simulate a LULC based on machine learning for monitoring LULC change in the Mamasa Sub watershed. This research involves the integration of the cellular automata algorithm with two additional algorithms, namely artificial neural network and logistic regression, to create land transition models. Subsequently, the simulation results will be rigorously evaluated to identify the most effective approach. Furthermore, this combined modeling approach will be utilized to forecast LULC patterns for the years 2026 and 2031 within the Mamasa Sub watershed. The anticipated outcome of this research extends to the provision of valuable data that can inform the development of Mamasa sub watershed management plans in the future.

 

2.   Method

1)   Study Area

This research was conducted within the Mamasa Sub watershed, as depicted in Figure 1. The Mamasa Sub watershed is an integral part of the Saddang Watershed, falling under the category of a large watershed. It is situated between two provinces, namely West Sulawesi (upstream) and South Sulawesi (downstream) provinces. The total area covered by the Mamasa Sub watershed encompasses approximately 116,108.37 hectares. The Mamasa Sub watershed plays a significant role in supplying water discharge, which is vital for the operation of the Bakaru Hydroelectric Power Plant. This hydroelectric facility is equipped with two turbines, each boasting a power capacity of 63 Megawatts. It serves as the primary source of electricity for meeting the substantial power requirements of the South Sulawesi Province. It is essential to note that the LULC condition within this area has a direct impact on the water discharge generated by the Mamasa Sub watershed. These conditions, in turn, influence the electricity production of the Bakaru hydroelectric power plant.

 

A map of the southern region

Description automatically generated with medium confidence

Figure 1. Research Location Map

2)   Materials

Land use and land cover data were procured from the Ministry of Environment and Forestry, encompassing LULC information for the years 2011, 2016, and 2021. The LULC data for 2011 and 2016 were employed in land change analysis and for modeling potential transitions. In contrast, the validation of the models was conducted using the LULC data from 2021. This research leveraged five spatial variables as driving factors for predicting LULC changes. These variables include the Digital Elevation Model (DEM), slope, distance from roads, distance from residential areas, and distance from rivers. A detailed description of the data sources and characteristics is provided in Table 1.

Table 1. Data used in the study

Table 2 Preview

Data type

Format

Data Source

LULC 2011

Shapefile

Ministry of Environment and Forestry (MoEF)

LULC 2016

Shapefile

Ministry of Environment and Forestry (MoEF)

LULC 2021

Shapefile

Ministry of Environment and Forestry (MoEF)

Digital Elevation Model (DEM)

Raster

https://tanahair.indonesia.go.id/

Slope

Raster

Calculated from DEM

Road

Shapefile

Indonesian Geospatial Information Agency

Residential

Shapefile

Calculated from LC2016

River

Shapefile

Calculated from DEMNas


3)   Data Analysis

Preprocessing data

This stage involves data preparation to make it suitable for processing within the GIS application utilized for this modeling. This includes the conversion of spatial variables such as slope and river network into the appropriate format. The modeling of LULC necessitates input data in raster format. Therefore, spatial data in alternative formats, such as LULC, road network, residential areas, and rivers (typically stored as shapefile), must first undergo conversion into raster data, as outlined by Saputra and Lee (2019). Furthermore, it is imperative that all datasets share a consistent pixel size, as highlighted by Liu et al. (2017). In this study, a pixel size of 30 meters was adopted as the standard for all datasets, ensuring uniformity and compatibility in the analysis.

Spatial variable correlation analysis

Changes in LULC can be influenced by a multitude of factors, encompassing both biophysical and socio-economic variables, as highlighted in studies by Salakory and Rakuasa (2022) and Singh et al. (2018). The selection of these spatial variables is primarily contingent upon data availability, as emphasized by Ghosh et al. (2017). Within this study, the spatial variables employed comprise DEM, slope, distance from roads, distance from rivers, and distance from residentials. To ensure the absence of multicollinearity among these spatial variables, a correlation analysis was conducted. This step is vital to confirm that there are no strong relationships among the variables, as recommended by Feng et al. (2018) and Mustafa et al. (2018). Pearson correlation is employed to assess the relationship, with coefficients falling between -0.7 and 0.7 indicating the absence of a strong correlation, in line with the criteria established by Avtar et al. (2022) and Marshall et al. (2021).

LULC Change Analysis

Land use and land cover data from 2011 and 2016 were utilized to generate LULC change maps as well as transition matrices. The Modules for Land-Use Change Simulation (Molusce) plugin version 3.0.13 in QGIS software version 2.18.15 was employed for the analysis of LULC changes. These land use and land cover change maps, along with the corresponding transition matrices, are essential components in the development of potential transition models.

Transition potential modelling

The potential transition model of land use and cover uses the Artificial Neural Network-Multilayer Perceptron (ANN-MLP) algorithm and logistic regression which is available in the Molusce plugin in QGIS software [19, 26, 28–30]. Multilayer Perceptron is an ANN algorithm consisting of one input layer, one or more hidden layer, and one output layer. Each layer has neurons, and the neurons in MLP process the input, apply weights to the input and use an activation function to process input and weights to produce output or target variable. Logistic regression is a statistical model used to describe the relationship between one or more explanatory variables and a binary target variable, which typically two possible values, such as 0 and 1.

The model parameters employed in ANN-MLP include 1000 samples, neighborhood 1, learning rate 0.001, maximum iteration 100, hidden layer 10, and momentum 0.05. Meanwhile, the model parameters used in logistic regression include sample size of 1000, neighborhood 1, and a maximum of 100 iterations. The sample selection is accomplished using a random method, where 80 percent of the sample is designated for model construction, and the remaining 20 percent is allocated for validation purposes.

Simulation Model CA

Simulations of LULC for the years 2021, 2026, and 2031 were conducted using the cellular automata (CA) algorithm. Cellular automata have demonstrated their ability to accurately predict LULC changes (Gounaridis et al., 2019; Gomes et al., 2019; Kura & Beyene, 2020). The CA model simulation was executed using the Molusce plugin within the QGIS software environment (Zeshan, Mustafa, & Baig, 2021; Avtar et al., 2022; Adam, Masupha, & Xulu, 2023; Baig et al., 2022; Sajan et al., 2022).

Model validation

Model validation is conducted to assess the accuracy of the model in predicting LULC changes. The reference data for the simulation is the LULC information for the year 2021. The model’s accuracy will be evaluated based on several metrics, including the kappa statistic, which encompasses percent of correctness, kappa overall, kappa location, and kappa histogram (Koko et al., 2020; Adam, Masupha, & Xulu, 2023; Baig et al., 2022; Sajan et al., 2022). In addition, evaluation will also be carried out on each LULC class by comparing the actual 2021 LULC and simulated 2021 LULC using precision and recall values. Precision refers to the proportion of correctly predicted instances of a land use class out of all instances predicted as that class. Meanwhile, the proportion of correctly predicted instances of a land use class out of all actual instances of that class. The precision and recall values are calculated using the equation (Sajan et al., 2022; Gündüz, 2025; Mirzakhani, Behzadfar, & Azizi Habashi, 2025):

 

Precision and Recall

\[ \text{Precision} = \frac{\text{True Positive (TP)}}{\text{True Positive (TP)} + \text{False Positive (FP)}} \]

\[ \text{Recall} = \frac{\text{True Positive (TP)}}{\text{True Positive (TP)} + \text{False Negative (FN)}} \]

where TP refers to the correct simulated LULC; FP indicates the incorrect simulated LULC as correct; and FN indicates the correct simulated LULC as incorrect (Gündüz, 2025).

3.   Result and Discussion

1)   Spatial Variable Correlation

Modeling of LULC using various spatial variables often leads to multicollinearity, as noted by Gomes et al. (2019). Multicollinearity should be considered, as it may have a detrimental impact on the model's outcomes, as indicated by Feng et al. (2018). The results of the correlation test conducted among the spatial variables reveal that there is no strong correlation among these spatial variables, as depicted in Table 2. This observation is supported by the correlation coefficients obtained, which only range from -0.028 to 0.507, with the highest coefficient occurring between the DEM variable and the distance from the road variable. Based on these findings, all spatial variables can be considered as factors contributing to LULC changes, as suggested by Marshall et al. (2021). Nevertheless, it is worth noting that this study has not yet analyzed the specific impact of each spatial variable on the transition of individual LULC classes.


Table 2. Correlation matrix

Table 3 Preview

 

Distance from residential

DEM

Distance from rivers

Slope

Distance from roads

Distance from residential

-

0.413

-0.028

0.285

0.334

DEM

 

-

-0.465

0.260

0.507

Distance from rivers

 

 

-

-0.095

0.093

Slope

 

 

 

-

0.271

Distance from roads

 

 

 

 

-

Source: Research finding, 2023


2)   LULC change 2011 to 2016

The changes in LULC that transpired between 2011 and 2016 hold substantial significance in shaping the potential transition model. An analysis of the LULC changes reveals that all LULC classes underwent modifications within this 5-year period. The most significant variations were observed in the range-brush category, which decreased by -3.08%, while agriculture increased by 2.51%. Additionally, there were marginal increments in the orchard, rice, residential, and oak forest categories, with changes of 0.10%, 0.47%, 0.14%, and 0.09%, respectively. In contrast, forests and water exhibited the lowest changes, with a minor decrease of -0.05% from the preceding period. Similar results were obtained by Asrianto et al. (2023) using Landsat satellite imagery from 2011–2020, where LULC dynamics of the Mamasa Sub-watershed reported a decrease in forest cover and an increase in agricultural land. For a comprehensive overview of the land use and land cover conditions in the Mamasa Sub-basin from 2011 to 2016, please refer to Table 3.


Table 3. LULC conditions from year 2011 to 2016

Table 4 Preview

LULC

2011 (ha)

2016 (ha)

Delta ∆ (ha)

2011 (%)

2016 (%)

Delta ∆ (%)

Forest

118,868.67

118,683.09

-185.58

29.78

29.73

-0.05

Wetlands-forested

1,059.12

707.31

-351.81

0.27

0.18

-0.09

Orchard

26.73

440.82

414.09

0.01

0.11

0.10

Range-brush

47,731.86

35,533.17

-12,198.7

11.96

8.90

-3.06

Agriculture

194,449.95

204,457.86

10,007.91

48.71

51.22

2.51

Rice

12,513.42

14,378.40

1,864.98

3.13

3.60

0.47

Pasture

19,500.93

19,208.61

-292.32

4.89

4.81

-0.07

Residential

162.00

740.79

578.79

0.04

0.19

0.14

Water

4,074.21

3,892.23

-181.98

1.02

0.98

-0.05

Oak

806.22

1,150.83

344.61

0.20

0.29

0.09

Source: Research finding, 2023


3)   LULC transition matrix

The transition of LULC from 2011 to 2016 is quantified by assessing the percentage of cells or pixels that either underwent changes or remained unchanged across all LULC classes, as emphasized by Singh et al. (2018). This matrix serves as an indicator of potential transitions for each LULC class within the research site, in line with the insights of Rimal et al. (2017) and Singh et al. (2018). For instance, the range-brush class exhibits a higher rate of change compared to other LULC classes. Only approximately 57.9213% of pixels in this class remained unchanged from 2011 to 2016, with the remainder transitioning into other categories, such as forests (0.0334%), agriculture (41.8973%), rice (0.0645%), water (0.0115%), and oak (0.072%). This condition provides insight into the potential for range-brush to transition into other LULC classes in the future, notably with a 41.8973% potential to become agriculture (Hamad, Balzter, & Kolo, 2018). Conversely, in low transition classes like forests and orchard, a high percentage is projected to remain in the same LULC category, specifically 99.8305% and 100%, respectively. For a comprehensive overview of the potential transition matrix for all LULC classes, please refer to Table 4.

Table 4. Conservation status of birds

Table 5 Preview

 

LULC 2016

Forest

Wetlands-forested

Orchard

Range-brush

Agriculture

Rice

Pasture

Residential

Water

Oak

LULC 2011

 

 

 

Forest

0.998305

0

0

0.001127

0.000109

0

0

0

0

0.00046

Wetlands-forested

0

0.66171

0.009432

0

0

0.287305

0

0

0

0.041553

Orchard

0

0

1

0

0

0

0

0

0

0

Range-brush

0.000334

0

0

0.579213

0.418973

0.000645

0

0

0.000115

0.00072

Agriculture

0

0

0

0.03962

0.944621

0.011109

0

0.002689

0.000086

0.001875

Rice

0

0

0.032293

0.000698

0.027093

0.93404

0

0.005876

0

0

Pasture

0

0

0

0.000125

0.015022

0

0.984719

0

0.000134

0

Residential

0

0

0

0.196667

0.021667

0.033333

0.035

0.713333

0

0

Water

0

0.00159

0

0

0

0.046566

0

0.002607

0.949237

0

Oak

0

0

0

0.006475

0.16075

0

0

0.022661

0

0.810114

Source: Research finding, 2023


4)   LULC transition matrix

The transition of LULC from 2011 to 2016 is quantified by assessing the percentage of cells or pixels that either underwent changes or remained unchanged across all LULC classes, as emphasized by Singh et al. (2018). This matrix serves as an indicator of potential transitions for each LULC class within the research site, in line with the insights of Rimal et al. (2017) and Singh et al. (2018). For instance, the range-brush class exhibits a higher rate of change compared to other LULC classes. Only approximately 57.9213% of pixels in this class remained unchanged from 2011 to 2016, with the remainder transitioning into other categories, such as forests (0.0334%), agriculture (41.8973%), rice (0.0645%), water (0.0115%), and oak (0.072%). This condition provides insight into the potential for range-brush to transition into other LULC classes in the future, notably with a 41.8973% potential to become agriculture (Hamad, Balzter, & Kolo, 2018). Conversely, in low transition classes like forests and orchard, a high percentage is projected to remain in the same LULC category, specifically 99.8305% and 100%, respectively. For a comprehensive overview of the potential transition matrix for all LULC classes, please refer to Table 4.

Table 5. LULC transition matrix

Table 6 Preview

 

LULC 2016

Forest

Wetlands-forested

Orchard

Range-brush

Agriculture

Rice

Pasture

Residential

Water

Oak

LULC 2011

 

 

 

Forest

0.998305

0

0

0.001127

0.000109

0

0

0

0

0.00046

Wetlands-forested

0

0.66171

0.009432

0

0

0.287305

0

0

0

0.041553

Orchard

0

0

1

0

0

0

0

0

0

0

Range-brush

0.000334

0

0

0.579213

0.418973

0.000645

0

0

0.000115

0.00072

Agriculture

0

0

0

0.03962

0.944621

0.011109

0

0.002689

0.000086

0.001875

Rice

0

0

0.032293

0.000698

0.027093

0.93404

0

0.005876

0

0

Pasture

0

0

0

0.000125

0.015022

0

0.984719

0

0.000134

0

Residential

0

0

0

0.196667

0.021667

0.033333

0.035

0.713333

0

0

Water

0

0.00159

0

0

0

0.046566

0

0.002607

0.949237

0

Oak

0

0

0

0.006475

0.16075

0

0

0.022661

0

0.810114

Source: Research finding, 2023


5)   Validation of LULC simulated

The results of the validation comparing the actual of LULC of 2021 with the simulated LULC of 2021 demonstrate that both model combinations exhibit high values (see Table 5). This suggests that the model is suitable for predicting future LULC patterns (Koko et al., 2020; Adam, Masupha, & Xulu, 2023). The accuracy of the simulation is quantified by the percentage of correctness values for ANN-CA and LR-CA, which are 83.6745 and 82.3498, respectively (Saputra & Lee, 2019). The overall model performance is evaluated based on the overall kappa values of 0.75412 (ANN-CA) and 0.73361 (LR-CA), respectively (Koko et al., 2020; Sajan et al., 2022). The histogram kappa values, representing the quantitative suitability of the model, are 0.88901 and 0.88136, respectively (Lauf et al., 2012). The model's ability to detect location or spatial suitability is assessed based on location kappa values of 0.84827 (ANN-CA) and 0.83236 (LR-CA), respectively (Koko et al., 2020; Sajan et al., 2022). These results also indicate that the ANN-CA model exhibits greater accuracy compared to the LR-CA model.

Table 6.Validation value of ANN–CA and LR–CA model

Table 7 Preview

Validation value

ANN–CA

LR–CA

Percent of correctness

83.6745

82.3498

Kappa (overall)

0.75412

0.73361

Kappa (histogram)

0.88901

0.88136

  Kappa (location)

0.84827

0.83236

Source: Research finding, 2023

The model's performance can also be assessed based on the prediction results for each LULC class area, precision and recall. The two combination models used have predicted results that fall below the actual LULC area, particularly in the case of large classes such as forests, range-brush, rice, water, and pasture. In contrast, higher results were obtained for small LULC classes such as wetlands-forested, orchard, residential, and oak. Similar findings have been reported in previous studies on Cellular Automaton (CA)-based LULC simulations, particularly regarding overestimated conditions in small-sized classes (Hamad, Balzter, & Kolo, 2018; Gomes et al., 2019).


Table 7. Actual LULC of 2021 and the simulated LULC of 2021

Table 8 Preview

LULC

Actual LULC 2021

Simulated ANN–CA 2021

Simulated LR–CA 2021

Area (ha)

Area (ha)

Precision

Recall

Area (ha)

Precision

Recall

Forest

128,152.98

118,681.83

0.72

0.91

118,545.80

0.72

0.91

Wetlands-forested

343.26

707.31

0.96

0.98

402.75

0.98

0.79

Orchard

90.27

440.91

0.98

1.00

832.59

0.93

0.92

Range-brush

39,947.13

35,529.30

0.68

0.47

33,150.96

0.65

0.40

Agriculture

176,809.23

204,461.91

0.30

0.93

206,262.80

0.29

0.92

Rice

23,831.19

14,378.13

0.59

0.50

15,113.97

0.42

0.50

Pasture

21,261.87

19,208.61

0.80

0.78

19,128.15

0.80

0.78

Residential

4,433.58

742.05

1.00

0.16

940.59

0.96

0.14

Water

4,223.52

3,892.23

0.96

0.84

3,591.27

0.96

0.78

Oak

100.08

1,150.83

0.00

0.00

1,224.27

0.00

0.00

Source: Research finding, 2023


The cellular automata model employed in this research exhibits good accuracy in simulating of LULC. However, it is essential to consider several factors that influence the simulation results, such as land potential transitions and spatial variables. In this study, it is noteworthy that the LULC transition from 2011 to 2016 differs significantly from that observed from 2016 to 2021. Meanwhile, the selection of appropriate spatial variables can significantly enhance simulation results. Notably, this study has omitted socio-economic variables, such as population and income levels, as spatial variables, despite their known influence on LULC. A population increase of 1% will increase 0.104% residential as shown by Li et al. (2016). Furthermore, future research is expected to conduct in-depth analyses of the impact of each spatial variable on LULC transitions, as recommended by Gomes et al. (2019). Such analyses would provide valuable insights into the spatial variables that exert the most influence on future LULC change events, including phenomena such as deforestation and land degradation, as discussed by Gounaridis et al. (2019).

6)   Future LULC of Mamasa Subwatershed

The simulation results from the two combinations of cellular automata models indicate that LULC in the Mamasa Subwatershed for 2026 and 2031 will continue to experience deforestation and land degradation. This is evident in the declining trend of the forest class, accompanied by a significant increase in rice cultivation (refer to Table 7). A decrease in forest cover could have adverse effects on the hydrological conditions of the Mamasa Subwatershed, as discussed by Anila et al. (2020) and Isra et al. (2023). In response to this issue, there is an urgent need for the rehabilitation of degraded and highly degraded lands to optimize the hydrological function of the Mamasa Subwatershed, as highlighted by Hendraswari et al. (2023). To address this concern, the recommended optimal LULC scenario for the Mamasa subwatershed area, which includes agroforestry, is approximately 63.26 percent, as suggested by Zarghaami et al., 2007. The implementation of agroforestry schemes has proven to be highly effective in soil and water conservation efforts within the Mamasa Subwatershed, as documented by Murtilaksono et al. (2017)

Table 8. LULC simulated for 2026 and 2031

Table 9 Preview

LULC

ANN-CA (ha)

LR-CA (ha)

2026

2031

2026

2031

Forest

118,650.42

118,646.73

118,513.71

118,501.65

Wetlands-forested

133.38

129.96

378.81

370.80

Orchard

522.54

464.31

1,110.78

1,307.61

Range-brush

29,052.00

28,951.47

32,573.70

32,431.50

Agriculture

211,102.20

210,860.73

206,176.86

205,753.50

Rice

15,425.37

15,909.39

15,501.24

15,852.42

Pasture

19,092.06

19,067.40

19,106.46

19,096.29

Residential

646.74

643.14

1,138.86

1,294.92

Water

3,533.49

3,502.98

3,423.60

3,308.76

Oak

1,034.91

1,017.00

1,269.09

1,275.66

Source: Research finding, 2023


Figure 2. Simulated LULC: a) ANN–CA 2026; b) ANN–CA 2031; c) LR–CA 2026; d) LR–CA 2031

When considering the changes in LULC area from 2026 to 2031, it's evident that the LR-CA model exhibits more significant changes compared to the simulated results of the ANN-CA model. For instance, the reduction in forest area in the ANN-CA model from 2026 to 2031 is 3.69 hectares, while in the LR-CA model, it amounts to 12.06 hectares. This pattern is consistent across all LULC classes. This distinction can be instrumental in selecting a model based on the desired planning scenario, whether it involves a pessimistic or optimistic outlook, such as scenarios related to deforestation and land degradation (Fonseca et al., 2022). The LR-CA model can be employed as a pessimistic scenario to simulate future LULC trends, whereas the ANN-CA model can serve as an alternative optimistic scenario (Shabani et al., 2022; Niya et al., 2020). Simulations of LULC under various scenarios are imperative for decision-makers, as emphasized by Liu et al. (2017). A visual representation of the LULC simulation results for 2026 and 2031 is provided in Figure 2.

 

4.   Conclusion

We simulated of LULC in the Mamasa Sub watershed based on machine learning algorithms using two model combination. The ANN-CA model exhibits % of correctness and an overall kappa of 83.6745 and 0.75412, respectively, surpassing the percent of correctness and overall kappa of the LR-CA model, which are 82.3498 and 0.73361, respectively. However, it's important to note that the prediction results of both model combinations still fall below the actual LULC values, especially in the case of large LULC classes such as forests, range-shrub, rice, and pasture. Conversely, higher accuracy is observed for smaller classes such as wetlands-forested, orchard, residential, and oak. The simulation results show that there will still be a decrease in forest cover and a significant increase in agricultural land in the Mamasa Sub watershed in 2026 and 2031. The decrease in forest cover and increase in agricultural land in the future will further impact hydrological conditions. This could reduce the benefits provided by the Mamasa Sub watershed, such as the disruption of the Bakaru Hydroelectric Power Plant's water supply. Therefore, policy makers must take appropriate and timely management actions to protect the Mamasa Sub watershed from the impacts of these land use and cover changes. It's worth highlighting that our study has not incorporated several socio-economic variables, such as population size and income levels. Future research is expected to analyse the influence of each variable and include some socio-economic variables that may have a significant influence on LULC change.

 

5.   Author Contributions

The first author (WUA) contributed as compiler and data analyst based on the research results. The second author (II) contributed methods to help interpret data and research results, the third author (NHAS) contributed methods to help interpret data, and the fourth author (RR) contributed to article preparation.

 

6.   Competing Interests

All authors declare no conflicts of interest regarding this publication.

 

7.   Acknowledgements

We would like to thank LPPM Universitas Sulawesi Barat, for funding this research We would also like to thank PLTA Bakaru for supporting this research.

 

8.   References

Adam, E., Masupha, N. E., & Xulu, S. (2023). Spatial assessment and prediction of urbanization in Maseru using earth observation data. Applied Sciences, 13, 1–15. https://doi.org/10.3390/app13105854

Anila, C., Barkey, R. A., Arsyad, U., & others. (2020). Land use planning for mitigation of erosion, sediments, and stabilization of water discharge in the Mamasa Watershed. In IOP Conference Series: Earth and Environmental Science (pp. 1–16). IOP Publishing. 0.1088/1755-1315/575/1/012131

Asrianto, Samsuar, Daniel Useng, Nazif Ichwan, & Febriana Intan Permata Hati. (2023). Effects of Land Cover Change on River Discharge Conditions in the Mamasa Watershed Using the SWAT Model. Salaga Journal, 1(2), 44–54. https://doi.org/10.70124/salaga.v1i2.1356

Avtar, R., Rinamalo, A. V., Umarhadi, D. A., & others. (2022). Land use change and prediction for valuating carbon sequestration in Viti Levu island, Fiji. Land, 11, 1. https://doi.org/10.3390/land11081274

Baig, M. F., Mustafa, M. R. U., Baig, I., & others. (2022). Assessment of land use land cover changes and future predictions using CA-ANN simulation for Selangor, Malaysia. Water, 14, 1–17. 10.3390/w14030402

Chaturvedi, V., & de Vries, W. T. (2021). Machine learning algorithms for urban land use planning: A review. Urban Science, 5, 1–16. https://doi.org/10.3390/urbansci5030068

Feng, Y., Liu, Y., & Tong, X. (2018). Comparison of metaheuristic cellular automata models: A case study of dynamic land use simulation in the Yangtze River Delta. Computers, Environment and Urban Systems, 70, 138–150. https://doi.org/10.1016/j.compenvurbsys.2018.03.003

Feng, Y., Yang, Q., Hong, Z., & others. (2018). Modelling coastal land use change by incorporating spatial autocorrelation into cellular automata models. Geocarto International, 33, 470–488. 10.1080/10106049.2016.1265597

Fonseca, C. A. B. da, Al-Ansari, N., Silva, R. M. da, & others. (2022). Investigating relationships between runoff–erosion processes and land use and land cover using remote sensing multiple gridded datasets. ISPRS International Journal of Geo-Information, 11, 1–27. https://doi.org/10.3390/ijgi11050272

Ghosh, P., Mukhopadhyay, A., Chanda, A., & others. (2017). Application of Cellular automata and Markov-chain model in geospatial environmental modeling: A review. Remote Sensing Applications, 5, 64–77. https://doi.org/10.1016/j.rsase.2017.01.005

Gomes, E., Abrantes, P., Banos, A., & others. (2019). Modelling future land use scenarios based on farmers’ intentions and a cellular automata approach. Land Use Policy, 85, 142–154. https://doi.org/10.1016/j.landusepol.2019.03.027

Gounaridis, D., Chorianopoulos, I., Symeonakis, E., & others. (2019). A Random Forest-Cellular Automata modelling approach to explore future land use/cover change in Attica (Greece), under different socio-economic realities and scales. Science of the Total Environment, 646, 320–335. https://doi.org/10.1016/j.scitotenv.2018.07.302

Gündüz, H. İ. (2025). Land-use land-cover dynamics and future projections using GEE, ML, and QGIS-MOLUSCE: A case study in Manisa. Sustainability, 17, 1–20. https://doi.org/10.3390/su17041363

Hamad, R., Balzter, H., & Kolo, K. (2018). Predicting land use/land cover changes using a CA-Markov model under two different scenarios. Sustainability, 10, 1–23. https://doi.org/10.3390/su10103421

Hendraswari, I., Putri, I., & Ansari, F. (2023). Forest management units’ role in conserving and maintaining watershed sustainability to support electrical energy. In IOP Conference Series: Earth and Environmental Science (pp. 1–8). IOP Publishing. https://doi.org/0.1088/1755-1315/1181/1/012002 

Hu, X., Li, X., & Lu, L. (2018). Modeling the land use change in an arid oasis constrained by water resources and environmental policy change using cellular automata models. Sustainability, 10, 1–14. https://doi.org/10.3390/su10082878

Isra, N., Arsyad, U., & Chairuddin, Z. (2023). Sedimentation analysis using SWAT model (soil and water assessment tool) in Mamasa Sub-Watershed. In IOP Conference Series: Earth and Environmental Science (pp. 1–9). IOP Publishing. 0.1088/1755-1315/1230/1/012027

Kim, C. (2016). Land use classification and land use change analysis using satellite images in Lombok Island, Indonesia. Forest Science and Technology, 12, 183–191. https://doi.org/10.1080/21580103.2016.1147498

Koko, A. F., Yue, W., Abubakar, G. A., & others. (2020). Monitoring and predicting spatio-temporal land use/land cover changes in Zaria City, Nigeria, through an integrated cellular automata and Markov chain model (CA-Markov). Sustainability, 12, 1–21. https://doi.org/10.3390/su122410452

Kura, A. L., & Beyene, D. L. (2020). Cellular automata Markov chain model-based deforestation modelling in the pastoral and agro-pastoral areas of southern Ethiopia. Remote Sensing Applications Society and Environment, 18, 1–10. 10.1016/j.rsase.2020.100321

Lauf, S., Haase, D., Hostert, P., & others. (2012). Uncovering land-use dynamics driven by human decision-making: A combined model approach using cellular automata and system dynamics. Environmental Modelling & Software, 27, 71–82. https://doi.org/10.1016/j.envsoft.2011.09.005

Li, X., & Yeh, A. G.-O. (2002). Neural-network-based cellular automata for simulating multiple land use changes using GIS. International Journal of Geographical Information Science, 16, 323–343. https://doi.org/10.1080/13658810210137004

Li, X., Wang, Y., Li, J., & others. (2016). Physical and socioeconomic driving forces of landuse and landcover changes: A case study of Wuhan City, China. Discrete Dynamics in Nature and Society, 2016, 111. https://doi.org/10.1155/2016/8061069

Liu, X., Liang, X., Li, X., Chen, Y., Xu, X., Li, S., & Wu, J. (2017). A future land use simulation model (FLUS) for simulating multiple land use scenarios by coupling human and natural effects. Landscape and Urban Planning, 168, 94–116. https://doi.org/10.1016/j.landurbplan.2017.09.019 

Marshall, L., Beckers, V., Vray, S., & others. (2021). High thematic resolution land use change models refine biodiversity scenarios: A case study with Belgian bumblebees. Journal of Biogeography, 48, 345–358. https://doi.org/10.1111/jbi.14000

Mirzakhani, A., Behzadfar, M., & Azizi Habashi, S. (2025). Simulating urban expansion dynamics in Tehran through satellite imagery and cellular automata Markov chain modelling. Modeling Earth Systems and Environment, 11, 1–24. 10.1007/s40808-025-02325-y

Murtilaksono, K., Yusufg, S. M., Astuti, R. K., & others. (2017). Model of soil and water conservation measures application based on district spatial planning in Mamasa Watershed, South Sulawesi. Geoplanning: Journal of Geomatics and Planning, 4, 263–272. http://creativecommons.org/licenses/by-nc-sa/4.0.

Mustafa, A., Rienow, A., Saadi, I., & others. (2018). Comparing support vector machines with logistic regression for calibrating cellular automata land use change models. European Journal of Remote Sensing, 51, 391–401. https://doi.org/10.1080/22797254.2018.1442179

Niya, A. K., Huang, J., Kazemzadeh-Zow, A., & others. (2020). Comparison of three hybrid models to simulate land use changes: A case study in Qeshm Island, Iran. Environmental Monitoring and Assessment, 192, 1–19. https://doi.org/10.1007/s10661-020-08274-6  

Reddy, C. S., Pasha, S. V., Jha, C. S., & others. (2016). Development of national database on long-term deforestation (1930–2014) in Bangladesh. Global and Planetary Change, 139, 173–182. 10.1016/j.gloplacha.2016.02.003

Regasa, M. S., Nones, M., & Adeba, D. (2021). A review on land use and land cover change in Ethiopian basins. Land, 10, 1–18. https://doi.org/10.3390/land10060585

Rimal, B., Zhang, L., Keshtkar, H., & others. (2017). Monitoring and modeling of spatiotemporal urban expansion and land-use/land-cover change using integrated Markov chain cellular automata model. ISPRS International Journal of Geo-Information, 6, 1–21. https://doi.org/10.3390/ijgi6090288

Salakory, M., & Rakuasa, H. (2022). Modeling of Cellular Automata Markov Chain for predicting the carrying capacity of Ambon City. Jurnal Pengelolaan Sumberdaya Alam dan Lingkungan, 12, 372–387. https://doi.org/10.29244/jpsl.12.2.372-387

Saputra, M. H., & Lee, H. S. (2019). Prediction of land use and land cover changes for North Sumatra, Indonesia, using an artificial-neural-network-based cellular automaton. Sustainability, 11, 1–16. https://doi.org/10.3390/su11113024

Sajan, B., Mishra, V. N., Kanga, S., & others. (2022). Cellular automata-based artificial neural network model for assessing past, present, and future land use/land cover dynamics. Agronomy, 12, 1–18. https://doi.org/10.3390/agronomy12112772

Shabani, M., Darvishi, S., Rabiei-Dastjerdi, H., & others. (2022). An integrated approach for simulation and prediction of land use and land cover changes and urban growth (case study: Sanandaj city in Iran). Journal of the Geographical Institute “Jovan Cvijic” SASA, 72, 273–289. 10.2298/IJGI2203273S

Singh, S. K., Laari, P. B., Mustak, S., & others. (2018). Modelling of land use land cover change using earth observation data-sets of Tons River Basin, Madhya Pradesh, India. Geocarto International, 33, 1202–1222. 10.1080/10106049.2017.1343390

Singh, V. K., & Singal, S. K. (2017). Operation of hydro power plants: A review. Renewable and Sustainable Energy Reviews, 69, 610–619. https://doi.org/10.1016/j.rser.2016.11.169

Stickler, C. M., Coe, M. T., Costa, M. H., & others. (2013). Dependence of hydropower energy generation on forests in the Amazon Basin at local and regional scales. Proceedings of the National Academy of Sciences, 110, 9601–9606. https://doi.org/10.1073/pnas.1215331110

Sugianto, S., Deli, A., Miswar, E., & others. (2022). The effect of land use and land cover changes on flood occurrence in Teunom Watershed, Aceh Jaya. Land, 11, 1–18. https://doi.org/10.3390/land11081271

Vaggela, A., Sanapala, H., & Mokka, J. R. (2022). Monitoring land use and land cover changes prospects using remote sensing and GIS for Mahanadi River Delta, Orissa, India. Geoplanning: Journal of Geomatics and Planning, 9, 47–60. https://doi.org/10.14710/geoplanning.9.1.47-60

Zarghaami, M., Ardakanian, R., Memariani, A. 2007. Fuzzy Multiple Attribute Decision Making on Water Resources Projects Case Study: Ranking Water Transfers to Zayanderud Basin in Iran. Water International.32(2):280-293). 10.1080/02508060708692207

Zeshan, M. T., Mustafa, M. R. U., & Baig, M. F. (2021). Monitoring land use changes and their future prospects using GIS and ANN-CA for Perak River Basin, Malaysia. Water, 13, 1–20. https://doi.org/10.3390/w13162286

Zhuang, Y., Li, W., Wang, H., & others. (2017). A bibliographic review of Cellular Automaton publications in the last 50 years. Journal of Cellular Automata, 12, 475–492. https://www.researchgate.net/publication/322161519


 

 

 

Articles