Putting together a clean database, dealing with issues such as duplicate and missing data, are among the first tasks when developing a numerical model. Then, the data are analyzed to understand the nature of different variables, their statistical distributions, their spatial distributions, statistical problems linked to spatial clustering, preferential sampling, censored distributions, relationships between variables, and stationarity.
The main goals of this stage are to create a clean consolidated database to work with, and to perform the basic exploratory analyses required to understand the data and the possible problems we will face during the subsequent modeling stages. This stage is the preamble to the definition of the do mains for modeling.
Author: Julian Ortiz
Download link: https://drive.google.com/file/d/1iDWYkKFbrr7nvBok893pDYZ_P3MOvXO6/view?usp=sharing