temperature, current, and power parameters.
The proposal for accurate PV string diagnosis
suggests a stacking classifier (MLSC) model to
accomplish accurate classification automatically and
remotely. The MLSC for PV string accurate fault
diagnosis is inspired and based on the physical fault
detection methods known as artificial intelligence.
PV fault diagnosis techniques incorporate physical
detection methods, threshold methodologies and
machine learning methods of detection. Additionally,
physical techniques tackle the identification and
position of the fault through the use of measuring
instruments to capture and scrutinize the performance
features of the PV defect. Threshold techniques apply
the I- V technique based on the position of inverter
current scans to generate I-V curves under normal and
faulty conditions.
3 RESEARCH METHODOLOGY
It describes strategies for analyzing data with the
intent of anticipating hotspot faults in PV systems. By
Performing Empirical analysis, we can identify
appropriate ML and DL algorithms for fault
detection. Algorithms like Support Vector Machine
(SVM), Logistic Regression, Decision Tree,
XGBoost, Naive Bayes, and Convolutional Neural
Networks (CNN) were analyzed in this work.
This work employs quantitative research
methodology to analyze the use of ML and DL
techniques for hotspot fault detection in PV systems.
The methodology includes data capture, feature
engineering, data preparation, model choice, and
analysis of results with the purpose of determining the
practicality of hotspot fault methods within the
context of solar energy system reliability and
performance optimization.
Data Collection: The analysis is based on
operational data acquired from a real-world solar PV
system in Location. Data on temperature patterns, in
the form of infrared (IR) thermography imaging, are
collected from different equipment and saved in
cloud- based systems for real-time analysis of hotspot
faults in solar panels.
Data Processing: The use of higher-order statistics
(HOS) such as mean, variance, skewness, and kurtosis
in data processing to reveal subsurface defects is an
essential part of machine learning. These parameters
compress the entire thermographic sequence into one,
or very few, images that contain detailed information
about the defects.
Model Development: ML and DL techniques are
used to develop fault detection models that can
identify deterioration patterns in solar panel
infrastructure. For classification problems, logistic
regression, tree-based methods, XGBoost, Naive
Bayes, and Support Vector Machines are
examples of supervised learning methods which are
adopted.
Model Training and Validation: To predict hotspot
faults in solar Panels, training and validation data
were evaluated using a 60-40 split. Cross- validation
techniques were applied to validate the data.
Optimising model performance and improving
predicted accuracy is achieved by hyperparameter
tuning. Model validation is performed with the use of
Accuracy, Precision, Recall, and F1-score. The figure
2 represents a basic architecture diagram for AI based
hotspot fault detection in PV panels.
Figure 2: The proposed AI model to detect hotspot.
Support Vector Machine (SVM): Support Vector
Machines (SVMs) are learning algorithms that use
supervised models to tackle tricky problems in
classification, regression, and spotting outliers. Two
distinct classes- health (non-faulty) and faulty
(hotspot) are created from the thermal pictures of PV
panels using SVM to detect faults. SVM's primary
goal is to find a function for training data that is as
smooth as possible with no deviations from the real
vectors. This method splits thermographic images of
PV systems into separate parts, and creates color
descriptors for each area. These color descriptors then
serve as features to train various learning algorithms.
These algorithms group PV panels into three types:
normal, hotspot, and faulty. After rigorous testing and
in-depth analysis, the results show that the learning
system has an accuracy of 92%.
Decision Trees: Decision trees are great approach
for hotspot analysis in solar PV system. Very simple
to deploy with good performance for both numerical
and categorical data. This method helps narrow down
the conditions that can predict hotspots considering