Knowledge Discovery in the Smart Grid
A Machine Learning Approach
Aldo Dagnino
ABB Corporate Research, ABB Inc., 940 Main Campus Drive, Raleigh, NC, U.S.A.
Keywords: Machine Learning, Smart Grid, Knowledge Discovery, Fault Prognostics.
Abstract: The increased availability of cheaper sensing technologies, the implementation of fibre-optic networks, the
availability of cheaper data storage repositories, and development of powerful machine learning models are
fundamental components that provide a new facet to the concept of the Smart Power Grid. An important
element in the Smart Grid concept is predicting potential fault events in the Smart Power Grid, or better
known as fault prognostics. This paper discusses an approach that uses machine learning methods to
discover fault event-related knowledge from historical data and helps in the prognostics of fault events in
power grids and critical and expensive components such as power transformers circuit breakers, and others.
1 INTRODUCTION
Recent technological advances in sensor
technologies, fibre-optic networks, cheaper data
storage capabilities, powerful data mining
techniques, and faster computing power coupled
with the need of improving the efficiency of
electrical power utilization have contributed to the
development of smarter power grids in the
transmission and distribution industries. Utilities are
increasingly interested in incorporating sensor
technologies to expensive assets such as power
transformers, circuit breakers, and back-up batteries,
in overhead and underground transmission lines and
connecting equipment. Many utilities are developing
fibre-optic networks that allow the transmission of
data from sensors to central data repositories.
2 THE SMART GRID
The existing power grids consist of multiple power
networks that coordinate their operations using
various levels of communication and control
mechanisms, which are primarily manually
controlled. The primary elements of the Smart Grid
include: (a) data; (b) information; (c) intelligence;
(d) communications (Mousavi, 2009). Data elements
are supplied by sensors embedded in different
components of the grid. The information element is
delivered by processors that perform certain
operations on data. The intelligence element is
generated by processing data and information via
analytics models. The communications element is
required to deliver data, information, and
intelligence to the right decision making agent in the
right format at the right time. The IEEE – Power and
Energy Society (IEEE-PES) and the National
Institute of Standards (NIST) have developed a
conceptual model for the Smart Grid that defines
seven important domains: Bulk Generation,
Transmission, Distribution, Customers, Operations,
Markets, and Service Providers.
3 FAULTS IN THE SMART GRID
Machine learning approaches have been utilized to
forecast fault events in the power distribution grid
and in critical equipment. This section discusses
how machine learning models were utilized
determine; (a) fault vulnerability profiles in power
distribution grids; (b) equipment fault forecasting.
3.1 Power Grid Fault Prognostics
Power distribution is typically managed by power
substations that receive power from the transmission
lines and distribute electrical power through feeders
to consumers. In addition of the equipment within
the substation, the typical distribution grid is
composed of equipment such as distribution
366
Dagnino A..
Knowledge Discovery in the Smart Grid - A Machine Learning Approach.
DOI: 10.5220/0004144303660369
In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (KDIR-2012), pages 366-369
ISBN: 978-989-8565-29-7
Copyright
c
2012 SCITEPRESS (Science and Technology Publications, Lda.)
transformers, switches, fuses, power lines
(underground or overhead), and relays. Utilities are
very interested in fault prognostics in the power
distribution grid to minimize power disruptions to
customers. Faults in the distribution grid are
typically related with power line fatigue, burned
fuses, lightning falling on equipment (such as
distribution transformers, etc.), short circuits, animal
contacts, trees and tree branches falling on assets,
weather related faults to overhead or underground
equipment, faults in splicing, power lines touching
each other, and many more. Currently, the vast
majority of utilities are reactive to faults and they
manually deal with a contingency. There are many
factors identified in the literature that can cause fault
events in a power distribution grid (Lu, 2010). These
factors can be broadly classified into (a) physical
properties of the distribution grid; (b) electrical
values of grid; (c) weather conditions; (d) assets or
components degradation in the grid; and (e) type of
grid infrastructure (see Figure 1). The work
described in this paper has been focused on the
prognostics of faults in two primary areas: (a) the
forecast of fault events in a distribution grid; (b)
forecasting potential faults to expensive assets such
as power transformers in either the transmission or
distribution network. Forecasting fault events in the
distribution grid has been conducted by utilizing
historical data on weather conditions, grid electric
value readings at the time of a fault event, and the
type of grid infrastructure. Forecasting faults on
expensive assets has been conducted by analyzing
the condition of power transformers utilizing
historical data collected while performing dissolved
gas analyses and other tests. This investigation was
conducted with an Investor Own Utility (IOU)
partner in the US. Several types of historical datasets
associated with the IOU were collected and utilized
during this investigation. The historical dataset types
utilized include: (a) fault data and electrical values
from the IOU; (b) weather data; (c) infrastructure
type of the IOU. The fault data from the IOU was
collected utilizing an automated system of intelligent
electronic devices (IED’s) with sensing and analytic
capabilities located at power feeders. These IED’s
monitor electrical values from the distribution lines
and are able to detect a fault event in the grid after it
occurred. The fault data includes these electrical
values, and was also corroborated with data entries
documented by IOU engineers after restoring
service. The weather data was collected from the US
National Weather Service (NWS) and from the
WeatherBug (WBUG) weather services. The NWS
data was collected by their weather station every
five minutes in METAR format. The WeatherBug
data was collected from small weather stations
located in various locations close to the different
substations of the IOU.
Figure 1: Fault factors in Smart Grid.
3.2 Machine Learning for Forecasting
Power Distribution Grid Fault
Events
Supervised classification machine learning
techniques were utilized to forecast the occurrence
of faults in the distribution power grid of the IOU.
Four supervised classification machine learning
algorithms were utilized to conduct the analyses:
Neural Networks (NN), kernel support vector
machines (KSVM), decision-tree based
classification (recursive partitioning; RPART), and
Naïve Bayes (NB). Five analyses were conducted
utilizing these four algorithms: (a) fault event
prediction; (b) grid zone prediction; (c) substation
prediction; (d) type of grid infrastructure; (e) feeder
number prediction.
3.2.1 Fault Prediction Models
Four models were created to identify weather
patterns that are most likely to result in a fault event
using the NN, KSVM, RPART, and NM algorithms.
The models were constructed by taking weather data
points joined to fault events, as well as random
weather data samples when no fault events were
recorded in the selected IOU substations. The
dataset contained a total of 3471 records (1725 with
faults and 1746 without faults), of which 2430 were
used for training each of the four models and 1041
for testing the models. The output of these models
shows a prediction of the weather conditions for
which a fault event may or may not occur. The best-
Weather
Component
Degradation
Physical
Properties of
Distribution
Grid
Volume
precipitation
Snow precipitation
Storms
Temperature
(outdoor, indoor)
Humidity
Ice formation
Insulator integrity
levels
Zinc coating
degradation
Cable (OH)
sagging
Cable (OH)
strand fatigue
Rate of cable
(OH) breaks
Infrastructure
Type
Overhead
(OH)
Underground
(UG)
Type of UG cable
connectors
Temperature at (UG)
cable joints
Material of cables
Transmission
temperature profiles
Maximum conductor
tension limits
Cable operation
temperature (OH)
Type of cables (OH)
Materials of poles
Pole foundation
characteristics
Voltage profiles of
network
Temperatures of circuit
breakers
Temperatures of line
connectors
Temperatures of
transformers
Age of grid assets
Wind (speed,
direction)
Urban or
rural grid
Asset degradation
Grid Electrical
Values
Pre-Fault
Power
Values
Pre-Fault
Current
Values
Pre-Fault
Voltage
Values
Lightning
parameters
Pressure
Factors that can
cause fault events in
distribution grids
Temperatures of
disconnectors
KnowledgeDiscoveryintheSmartGrid-AMachineLearningApproach
367
performing model was the one created with the feed-
forward trained by a multi-layer perceptron back-
propagation Neural Network algorithm with an f-
measure of 75%.
3.2.2 Zone Prediction Models
The four zone prediction models were trained by
considering fault historical data from the IOU grid
and weather data. Of the 1725 records with faults
and weather data, 70% were used for training and
30% for testing the trained models. The output of
these models predict in what zone (AMZ, UMZ,
PMZ) on the IOU grid the fault occurred. The best-
performing model was the one created training a
Neural Network algorithm. The model contains one
hidden layer with 20 nodes, and produces an
accuracy of 66%, an average precision of 69%, an
average recall of 68%, and an f-measure of 68%.
3.2.3 Substation Prediction Models
The four substation prediction models were trained
by considering fault historical data from the IOU
grid and weather data. Of the 1725 records with
faults and weather data, 70% were used for training
and 30% for testing the trained models. The output
of these models predicts the IOU substation ID
where the fault occurred. The best performing model
was the one created with the recursive partitioning
algorithm and produces an accuracy of 59%, an
average precision of 66%, an average recall of 54%,
and an f-measure of 59%.
3.2.4 Infrastructure Prediction Models
The four infrastructure prediction models were
trained by considering fault historical data from the
IOU grid and weather data. Of the 1725 records with
faults and weather data, 70% were used for training
and 30% for testing the trained models. The output
of these models predicts the type of infrastructure
(overhead or underground) on the section of the IOU
grid where the fault occurred. The best-performing
model was the one created training a Neural
Network algorithm with an f-measure of 57%.
3.2.5 Feeder Prediction Models
The four feeder prediction models were trained by
considering fault historical data from the IOU grid
and weather data. Of the 1725 records with faults
and weather data, 70% were used for training and
30% for testing the trained models. The output of
these models predicts the IOU Feeder where the
fault occurred. The best-performing model is the one
created with the recursive partitioning algorithm
with an f-measure of 74%.
3.3 Machine Learning for Forecasting
Fault Events in Assets
Many utilities have deployed diverse types of
sensors in their mission critical and expensive assets
such as power transformers. When monitoring
power transformers two types of on-line
measurements can be collected: (a) operational
information such as voltage, load, current, oil
temperature, winding temperatures, pump status, fan
status, cooling system status, etc; (b) condition
information, such as oil quality, gassing, dielectric
properties, aging, etc. Utilities use a variety of
sensors in their transformers and such sensors have
different monitoring capabilities, especially in terms
of the types and concentrations of gasses in the oil of
the transformers. Some time utilities supplement the
monitored concentration of gasses by conducting a
dissolved gas analysis (DGA) test periodically. A
study has been completed with the objective of
developing analytical models based on data mining
to identify patterns in gas concentrations, to identify
trends of gas concentrations that may lead to
catastrophic failures of equipment, and in general to
identify correlations between observations that
would result in new knowledge or confirm existing
heuristic knowledge about power transformers. The
example presented below does not identify the name
of the utility with which this study was conducted.
Hence, we refer our customer as Utility A. In our
example, Utility A had a fleet of over 300 power
transformers and had historical data collected for a
period of ten years. The historical data collected
included DGA analysis tests for all transformers
(concentration of H2, CH4, C2H6, C2H4, C2H2,
CO, CO2, O2, N2, and moisture), ID transformer
data (transformer name, type, age, pump type,
construction type, and conservator type), oil
temperature, winding temperature, and fluid quality
(metal particles present in oil). Utility A installed
sensors in its transformers fleet and recently
installed a fibre-optic network that helped to
transmit the monitored data into a central repository.
The objective was to develop a profile of
potential “hot spots” in power transformers where
the concentration of CO, CO2, and O2 are high (CO
> 571 parts per million, CO2 > 4001 ppm, and O2 >
10,000 ppm) and oil temperatures need to be
monitored so they do not exceed values > 150 C.
These conditions can show deterioration of the
KDIR2012-InternationalConferenceonKnowledgeDiscoveryandInformationRetrieval
368
insulation of the windings. The following data sets
were employed to conduct the data mining analyses:
(a) Transformer Description Database: that includes
the following data attributes: SERIAL_NUMBER,
AGE, CONSTRUCTION_TYPE; (b) Gasses
Concentration Database: concentrations in parts per
million (ppm) of the following gasses: H2, CH4,
C2H6, C2H4, C2H2, CO, CO2, O2, N2, and
MOISTURE. The data availability studied for this
case includes 335 power transformers for which
gasses data concentrations have been collected
during 10 years. The total number of gasses
concentration observations is 3100. The working
dataset analyzed includes the fusion of both the
transformer description and combustible gasses
concentration databases. Entries with missing data
points were removed from the analysis. Figure 2
shows the sequence of machine learning approaches
utilized for this analysis. First, the data was
classified based on the CONSTRUCTION _ TYPE
of the transformers (where transformers can be
Core_Form and Shell_Form).
Figure 2: CO, CO2, and O2 Concentration Analysis.
With this classification completed, CO, CO2, and
O2 gasses concentrations were identified for each of
the construction types. Also, a cluster analysis was
conducted with all data using the SimpleKMeans
algorithm and the clusters shown in Table 1 were
obtained. Sixty percent of the data was utilized to
train the algorithm and forty percent of the data was
utilized to test the algorithm. The data mining open
source tool utilized for this analysis was Weka.
Table 1: Cluster Analysis of CO, CO2, and O2
Results from these analyses suggest that the number
of CORE_FORM transformers that has high
concentrations of O2 is larger than SELL_FORM or
TPN-V CORE_FORM. Similarly, Fig. 8 shows that
CORE_FORM transformers have the largest number
of CO2 gas concentration. Results also suggest that
CORE_FORM transformers have the highest
concentration of CO gas. The results of the analyses
above show that a utility with a fleet of transformers
that have CORE_FORM construction type should
pay to the temperature of these transformers if the
concentrations of CO, CO2, and O2 are in dangerous
concentration neighbourhoods (CO > 571 parts per
million, CO2 > 4001 ppm, and O2 > 10,000 ppm).
After running the cluster algorithm, Table 1 shows
that a cluster of 481 transformers (cluster 2) have
high concentrations of CO and CO2 with moderate
concentration of O2. For this cluster of transformers,
it is important to monitor temperatures and also
concentration of O2.
4 CONCLUSIONS
The development of smart sensors, fibre-optic
networks, large data storage repositories, powerful
hardware, and robust machine learning algorithms
are becoming important elements that bring the
concept of the Smart Grid to the forefront. The
objective of the work presented in this paper is to
demonstrate that machine learning models can be a
powerful element of the Smart Grid concept.
Machine learning can be utilized for diagnostics and
forecasting of faults in a power transmission and
distribution grid. This is an important element of the
concept of the Smart Grid.
REFERENCES
Heine, P., Turunen, J., Lehtonen, M., Oikarinen, A., 2005,
Measured Faults During Lightning Storms , Proc.
IEEE Power Tech 2005, Russia, pp.1- 5.
Lu, N., Taylor, T., Jiang, W., Jin, C., Correia, J., Leung,
L., and Wong, P. C., 2010, Climate Change Impacts
on Residential and Commercial Loads in the Western
U.S. Grid, IEEE Transactions on Power Systems, Vol.
25, No. 1, pp. 480 – 488.
Mousavi, M., Donde, V., Stoupis, J., McGowan, J., Tang,
L., 2009, Information, not Data: Real-time automated
distribution event detection and notification for grid
control, ABB Review Journal, The Corporate
Technical Journal of the ABB Group, no. 3, pp. 38-44.
Wei, C., 2010, A Conceptual Framework for Smart Grid,
IEEE, 978-1-4244-4813-5.
KnowledgeDiscoveryintheSmartGrid-AMachineLearningApproach
369