
3.3 Logistic Regression Coefficients
Table 4 displays the logistic regression model's
coefficients, which provide vital information about
how each predictor variable relates to the desired
result. When all other variables are held constant,
these coefficients, which are given in log-odds units,
measure how the log-odds of the target variable (such
as the incidence of precipitation) change with a one-
unit rise in the related predictor variable. Whereas a
negative coefficient implies an inverse association, a
positive coefficient shows that an increase in the
predictor variable raises the chance of the event
happening. For continuous variables, the coefficient
reflects the change in log-odds per unit increment; for
categorical variables, it represents the difference in
log-odds compared to the reference category. The
magnitude of each coefficient corresponds to the
strength of association, with larger absolute values
indicating more substantial influences on the outcome.
Importantly, these log-odds coefficients can be
transformed into odds ratios through exponentiation,
which offers a more intuitive interpretation of the
effect sizes. The statistical significance of these
coefficients, typically assessed through Wald tests or
likelihood ratio tests, determines whether the
observed relationships are likely to exist in the
population rather than occurring by random chance.
This parametric output of logistic regression proves
particularly valuable for understanding the
directional effects and relative importance of
different meteorological factors in precipitation
forecasting.
Table 4: Logistic Regression Coefficients.
All coefficients were statistically significant (p <
0.05), indicating that each feature contributes
meaningfully to the prediction. Humidity and
precipitation showed the strongest positive
associations with the target variable, while pressure
had a negative effect.
4 CONCLUSION
Numerical weather prediction (NWP) has evolved
significantly with the integration of machine learning
techniques, particularly logistic regression (LR) and
random forest (RF) models. This study compared the
performance of these two approaches in predicting
binary weather events (e.g., precipitation) using
historical meteorological data from NOAA and
ECMWF. The results demonstrate that both models
offer valuable insights, but RF exhibits superior
predictive accuracy and robustness in handling
complex atmospheric interactions.
The logistic regression model achieved an
accuracy of 87.2%, with humidity (OR = 3.329, p <
0.001) and precipitation (OR = 7.296, p < 0.001)
emerging as statistically significant predictors. While
LR provides interpretable coefficients-valuable for
understanding linear relationships-its performance is
constrained by inherent assumptions of linearity and
additivity. In contrast, the random forest model
outperformed LR with an accuracy of 90.1%, higher
precision (0.887), and better recall (0.892). RF's
ensemble approach effectively captured nonlinear
patterns and variable interactions, with temperature
(Gini importance = 0.245), humidity (0.198), and
pressure (0.176) identified as the most influential
features. This aligns with meteorological theory,
where these variables drive convective processes and
weather system dynamics.
The practical implications of these findings are
substantial. For operational meteorology, RF's higher
accuracy supports its use in short-term forecasting,
particularly for extreme weather warnings. Its ability
to rank feature importance also aids in optimizing
data collection-for instance, prioritizing temperature
and humidity measurements over less critical
variables like solar radiation. However, LR retains
utility for scenarios requiring model interpretability,
such as communicating forecast uncertainty to
stakeholders.
In conclusion, this study underscores machine
learning's transformative potential in NWP. RF's
superior performance highlights its suitability for
operational forecasting, while LR offers a simpler,
interpretable alternative. By integrating these tools
with traditional physical models, meteorologists can
achieve more accurate, actionable forecasts-
ultimately benefiting agriculture, transportation, and
disaster preparedness. Future research should focus
on hybrid modeling approaches and real-time system
integration to further advance weather prediction
capabilities.
The Research of Key Roles of Logistic Regression Model and Random Forest Model in Numerical Weather Prediction
647