
3.3  Logistic Regression Coefficients 
Table  4  displays  the  logistic  regression  model's 
coefficients,  which  provide  vital  information  about 
how  each  predictor  variable  relates  to  the  desired 
result.    When  all  other  variables  are  held  constant, 
these coefficients, which are given in log-odds units, 
measure how the log-odds of the target variable (such 
as the incidence of precipitation) change with a one-
unit rise in the related predictor variable.  Whereas a 
negative coefficient implies an inverse association, a 
positive  coefficient  shows  that  an  increase  in  the 
predictor  variable  raises  the  chance  of  the  event 
happening. For continuous variables,  the coefficient 
reflects the change in log-odds per unit increment; for 
categorical  variables,  it  represents  the  difference  in 
log-odds  compared  to  the  reference  category.  The 
magnitude  of  each  coefficient  corresponds  to  the 
strength  of  association,  with  larger  absolute  values 
indicating more substantial influences on the outcome. 
Importantly,  these  log-odds  coefficients  can  be 
transformed into odds ratios through exponentiation, 
which  offers  a  more  intuitive  interpretation  of  the 
effect  sizes.  The  statistical  significance  of  these 
coefficients, typically assessed through Wald tests or 
likelihood  ratio  tests,  determines  whether  the 
observed  relationships  are  likely  to  exist  in  the 
population rather than occurring by random chance. 
This parametric output of logistic regression  proves 
particularly  valuable  for  understanding  the 
directional  effects  and  relative  importance  of 
different  meteorological  factors  in  precipitation 
forecasting. 
Table 4: Logistic Regression Coefficients. 
 
All  coefficients  were  statistically  significant  (p  < 
0.05),  indicating  that  each  feature  contributes 
meaningfully  to  the  prediction.  Humidity  and 
precipitation  showed  the  strongest  positive 
associations  with the  target  variable, while pressure 
had a negative effect. 
4    CONCLUSION 
Numerical  weather  prediction  (NWP)  has  evolved 
significantly with the integration of machine learning 
techniques, particularly logistic regression (LR) and 
random forest (RF) models. This study compared the 
performance  of  these  two  approaches  in  predicting 
binary  weather  events  (e.g.,  precipitation)  using 
historical  meteorological  data  from  NOAA  and 
ECMWF.  The results demonstrate  that both models 
offer  valuable  insights,  but  RF  exhibits  superior 
predictive  accuracy  and  robustness  in  handling 
complex atmospheric interactions. 
The  logistic  regression  model  achieved  an 
accuracy of 87.2%, with humidity (OR = 3.329, p < 
0.001)  and  precipitation  (OR  =  7.296, p <  0.001) 
emerging as statistically significant predictors. While 
LR  provides  interpretable  coefficients-valuable  for 
understanding linear relationships-its performance is 
constrained by inherent assumptions of linearity and 
additivity.  In  contrast,  the  random  forest  model 
outperformed LR with an accuracy of 90.1%, higher 
precision  (0.887),  and  better  recall  (0.892).  RF's 
ensemble  approach  effectively  captured  nonlinear 
patterns  and  variable  interactions,  with  temperature 
(Gini  importance  =  0.245),  humidity  (0.198),  and 
pressure  (0.176)  identified  as  the  most  influential 
features.  This  aligns  with  meteorological  theory, 
where these variables drive convective processes and 
weather system dynamics. 
The  practical  implications  of  these  findings  are 
substantial. For operational meteorology, RF's higher 
accuracy  supports  its  use  in  short-term  forecasting, 
particularly for extreme weather warnings. Its ability 
to  rank  feature  importance  also  aids  in  optimizing 
data collection-for instance, prioritizing temperature 
and  humidity  measurements  over  less  critical 
variables  like  solar  radiation.  However,  LR  retains 
utility for scenarios requiring model interpretability, 
such  as  communicating  forecast  uncertainty  to 
stakeholders. 
In  conclusion,  this  study  underscores  machine 
learning's  transformative  potential  in  NWP.  RF's 
superior  performance  highlights  its  suitability  for 
operational  forecasting,  while  LR  offers  a  simpler, 
interpretable  alternative.  By  integrating  these  tools 
with traditional physical models, meteorologists can 
achieve  more  accurate,  actionable  forecasts-
ultimately benefiting agriculture, transportation, and 
disaster  preparedness.  Future  research  should  focus 
on hybrid modeling approaches and real-time system 
integration  to  further  advance  weather  prediction 
capabilities. 
The Research of Key Roles of Logistic Regression Model and Random Forest Model in Numerical Weather Prediction
647