Vous êtes sur la page 1sur 9

IEEE CANADIAN JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, VOL. 44, NO.

1, WINTER 2021 41

Evaluation of Dimensionality Reduction


Techniques for Load Profiling Application
in Smart Grid Environment
Évaluation des techniques de réduction de la
dimensionnalité pour l’application de profilage de
charge dans l’environnement d’un réseau intelligent
Ahmed Aleshinloye, Muhammad Asif Manzoor, and Abdul Bais , Senior Member, IEEE
Abstract— The traditional power grid system is evolving toward a smart grid system, which will improve the
user experience. However, this system is capable of generating high-dimensional data at a very high sample
rate. A common technique for reducing high-dimensional data is dimensionality reduction. With this technique,
we are able to reduce the data to a lower dimension, making it suitable for smart grid applications including
transmission, storage, and visualization. Linear dimensionality reduction techniques are mostly explored in
the context of smart grid applications. Due to the nonlinear nature of data generation in the smart grid,
we anticipate that the nonlinear dimensionality reduction techniques can perform better. This work evaluates
different nonlinear dimensionality reduction techniques and compares them with principal component analysis,
which is a widely used linear dimensionality reduction technique in the smart grid environment. We use the
visualization of load profile data and adjusted rank index (ARI) for comparison of dimensionality reduction
techniques. Load profiling is an important task to complement the demand-side management and tariff selection.
The visual depiction of the load profiles and ARI suggest that the nonlinear dimensionality reduction techniques
perform better compared with linear dimensionality reduction techniques.
Résumé— Le système de réseau électrique traditionnel évolue vers un système de réseau intelligent, qui
améliorera l’expérience utilisateur. Cependant, ce système est capable de générer des données de grande
dimension à une fréquence d’échantillonnage très élevée. Une technique courante pour réduire les données de
grande dimension est la réduction de la dimensionnalité. Avec cette technique, nous sommes en mesure de réduire
les données à une dimension inférieure, ce qui la rend adaptée aux applications pour des réseaux intelligents
incluant la transmission, le stockage et la visualisation. Les techniques de réduction de la dimensionnalité
linéaire sont principalement explorées dans le contexte des applications au réseau intelligent. En raison de la
nature non-linéaire de la génération de données dans le réseau intelligent, nous prévoyons que les techniques de
réduction de la dimensionnalité non-linéaire peuvent mieux fonctionner. Ce travail évalue différentes techniques
de réduction de la dimensionnalité non-linéaire et les compare avec l’analyse en composantes principales, qui
est une technique de réduction de dimensionnalité linéaire largement utilisée dans l’environnement d’un réseau
intelligent. Nous utilisons la visualisation des données de profil de charge et l’indice de rang ajusté (ARI) pour
comparer les techniques de réduction de la dimensionnalité. Le profil de charge est une tâche importante pour
compléter la gestion de la demande et la sélection tarifaire. La représentation visuelle des profils de charge et
ARI suggère que les techniques de réduction de la dimensionnalité non-linéaire fonctionnent mieux par rapport
aux techniques de réduction de la dimensionnalité linéaire.
Index Terms— Clustering, dimension reduction, load profiling, smart grid, visualization.

I. I NTRODUCTION control, optimization, and simulation, and are becoming com-


plex and computationally expensive. The intelligent devices
W ITH the emergence of the smart grid, the number of
intelligent devices (sensors, meters, and so on) are
increasing and the models, which are used to perform analysis,
are expected to generate a high volume of data at a high
rate, which must be collected and analyzed in real time to
utilize the resources more efficiently and for better demand-
Manuscript received March 22, 2019; revised November 4, 2019,
January 10, 2020, and May 19, 2020; accepted August 16, 2020. Date of side management. It is estimated that the smart meter readings
current version February 17, 2021. This work was supported by the University will increase from 24 to 220 million per day for a large utility
of Regina’s SaskPower Clean Energy Research Grant entitled An Application company when advanced metering infrastructure (AMI) is
Framework for Real-time Streaming Big Data Analysis. (Corresponding
author: Abdul Bais.) fully integrated into the smart grid [1]. There were 70.8 million
The authors are with the Electronic Systems Engineering Program, AMI installations in the U.S. at the end of 2016 [2]. The data
Faculty of Engineering and Applied Science, University of Regina, Regina, volume, for 70.8 million metering devices in a year, is shown
SK S4S 0A2, Canada (e-mail: abdul.bais@uregina.ca).
Associate Editor managing this article’s review: Bhavesh Bhalja. in Table I. The AMI data are assumed to use a 128-bit data
Digital Object Identifier 10.1109/ICJECE.2020.3018433 size representing 8 bytes of numerical data and 8 bytes of
2694-1783 © 2021 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.

Authorized licensed use limited to: University of New Orleans. Downloaded on February 08,2023 at 22:07:26 UTC from IEEE Xplore. Restrictions apply.
42 IEEE CANADIAN JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, VOL. 44, NO. 1, WINTER 2021

TABLE I Linear dimensionality reduction techniques, such as PCA,


D ATA G ROWTH C OLLECTED FOR O NE M ILLION are mostly used for smart grid applications. The samples
M ETERING D EVICES IN A Y EAR are eigen decomposed resulting in the principal components.
These components are obtained by projecting the samples
on the basis of orthogonality. Due to linearly independent
principal components and removal of eigenvectors associated
with low eigenvalues, accurate reconstruction is inefficient.
These dimensionality reduction techniques place dissimilar
metadata [3]. The table shows the increase in AMI data with linear data points far apart in lower dimension representation
an increasing sampling frequency. At a sampling frequency but fail at retaining the intrinsic shape of nonlinear samples
of 15 min, the total data recorded is 2.48 trillion with a when mapped to a lower dimension. Moreover, due to their
data size of 39.69 TB in a single year. The number of smart linear nature, these techniques do not model the underlying
meters is also increasing in addition to increasing the sampling nonlinear complex relations. We have compared linear dimen-
rate. Moreover, distributed energy storage and generation, sionality reduction techniques for demand-side management
renewable energy sources, and other new technologies are in [17] where we concluded that PCA performs better in
also being added to the energy grid at a rapid pace. This comparison to other linear techniques. Due to the nonlinear
phenomenon increases the complexity of an already complex nature of the underlying process, we anticipate that nonlinear
framework, requiring the need for intelligent management, approaches can be more effective to represent the data in
real-time processing, analysis, and visualization system. lower dimensions. The objective of this work is to analyze
In today’s electricity market, every energy supplier wants and evaluate nonlinear dimensionality reduction techniques.
to understand their customer’s consumption behavior cor- As PCA outperforms other linear dimensionality reduction
rectly. This allows for the provision of affordable supply techniques [17], we compare the results of our tested nonlinear
to consumers and to earn a fair share of profit. Similarly, techniques with PCA to get a fair perspective. Clustering is
customers need to know their consumption behavior in order applied to dimensionally reduced data to generate load profiles.
to purchase a suitable tariff plan or to successfully implement Finally, graphs are created to visually depict the results of the
energy efficiency measures. The classification of end-users process for qualitative evaluation, and the adjusted rank index
based on consumption behavior is fundamental to the success (ARI) is computed to quantitatively evaluate the dimension
of demand-side management. This process is a part of load reduction techniques. Data visualization plays a key role in
profiling and can be achieved through direct and indirect smart grid data analytics and may also provide assistance to
clustering [4]. operators in decision-making process. The proposed load pro-
There are implementations of direct clustering using self- filing method is based on nonlinear dimensionality reduction,
organizing map (SOM), k-means, hierarchical clustering, and which helps in modeling the underlying nonlinear relations.
fuzzy k-means [5]–[7]. Aside from the traditional clustering The proposed solution has better ARI and generates good
methods, Chicco et al. [8] proposed an ant colony-based results in terms of human interpretation (visualization). The
algorithm to determine the users’ behavior. The algorithm is proposed method (with t-SNE) clearly divides the different
evaluated using the fitness functions called clustering disper- users into different load profiles based on their usage.
sion indicator and the scatter index. The results are shown to The rest of this article is organized as follows. The data
outperform k-means clustering. Other methods of evaluating sets used in this work are explained in Section II. Section III
clusters are the silhouette index, the Davies–Bouldin indicator, discusses the methodology of this work. Section IV presents
and the similarity matrix indicator [9]. the experimental results and discussion. This section also
In indirect clustering, features for independent identically incorporates the comparison of clustering techniques. This
distributed samples are extracted as input to the clustering article is concluded in Section V.
methods. The feature extraction is achieved using dimensional-
ity reduction and time-based techniques [4]. In dimensionality II. DATA
reduction, the dimensions of the observation are reduced using One of the major contributions of this work is to reduce
principal component analysis (PCA) [10], the Sammon map, the dimensionality of smart meter data sets so that it can
curvilinear component analysis [7], and autoencoders. In [11], be efficiently used in later stages. We use commercial and
autoencoders are used to extract input features before applying residential hourly load profile for the TMY3 locations in the
fuzzy k-means for clustering to the reduced representation of United States to evaluate the performance of PCA and other
the data. Considering the nature of smart meter data, time nonlinear dimensionality reduction techniques [18]. The res-
series methods are also used to extract features. Zhong and idential data contain energy consumption for 3045 buildings,
Tam [12] transformed smart meter data from the time domain while the commercial data contain energy consumption for
to the frequency domain before using a classification and 14976 buildings. For both data sets, the measurement is polled
regression tree for load profiling. Verdu et al. [13] applied at 1 sample/h for a year’s period. There are ten features
SOM clustering to the frequency component of the smart recorded in the case of commercial buildings (excluding
meter data. Altogether, an improvement of results is observed timestamp), whereas 13 parameters are recorded in the case of
when the data are in the frequency domain. Other time-based residential buildings (excluding timestamps). The features of
techniques are discrete wavelet transform [14], hidden Markov the residential data are electricity (facility), gas (facility), heat-
model [15], and singular value decomposition (SVD) [16]. ing (electricity), heating (gas), cooling (electricity), HVac fan

Authorized licensed use limited to: University of New Orleans. Downloaded on February 08,2023 at 22:07:26 UTC from IEEE Xplore. Restrictions apply.
ALESHINLOYE et al.: EVALUATION OF DIMENSIONALITY REDUCTION TECHNIQUES FOR LOAD PROFILING APPLICATION 43

Fig. 2. Energy consumption of commercial building for a two-day period


sampled at 1 sample/h.

Fig. 1. Energy consumption of residential building for a two-day period including timing information. They are global active power,
sampled at 1 sample/h.
global reactive power, voltage, global intensity, submetering 1
(electrical equipment in the kitchen), submetering 2 (electrical
(fans/electricity), HVac (electricity), fans (electricity), general equipment in the laundry room), and submetering 3 (electric
interior lights (electricity), general exterior lights (electricity), water heater and air conditioner). The observations are accu-
appliance interior equipment (electricity), miscellaneous inte- mulated on a weekly basis, and these observations are used
rior equipment (electricity), and water heater (water systems in load profiling. The data set contains some missing values
and gas). Similarly, the ten features of the commercial data (approximately 1.25%) that are filled by the average values of
are electricity (facility), fans (electricity), heating (electric- the respective column. The data set is divided into four load
ity), cooling (electricity), general interior lights (electricity), profiles based on seasons, i.e., spring, summer, autumn, and
interior equipment (electricity), gas (facility), heating (gas), winter.
interior equipment (gas), and water heater (water systems and
gas). All measurements are aggregated on an hourly basis. III. M ETHODOLOGY
We use all the parameters except timestamps in the process Visualization of data is important in data analysis as it
of dimensionality reduction. The output of each observation allows for the identification of outliers, observing clustered
is the timestamps and 2-D vector (reduced). The timestamps data points, and determining the underlying structure of the
are not considered for reduction because it is required in other data. However, the increasing amount of metering and sensing
operations, e.g., billing. devices in the grid makes it difficult for industry profes-
The TMY3 data set has defined three categories for res- sionals to visualize data in an intuitive manner. With the
idential load profile data and 16 categories for the com- aid of dimensionality reduction tools, data from the phasor
mercial load profile data. Each observation in the data set measurement unit (PMU), smart meters, and sensors can be
is assigned to one of these categories. We use the same reduced to a 2-D or 3-D representation in real time for quick
categorization and divide buildings into the residential data visualization to aid the identification of anomalies or outliers
set into three classes based on their energy consumption, in the data. We have converted the multidimensional input
i.e., low, base, or high [19]. A two-day energy consumption vector into a 2-D feature vector. The 2-D vector can easily
profile is visualized in Fig. 1. The vertical axis shows the be plotted and visualized. Furthermore, it is easy to interpret
hourly electricity utilization (kW), and the horizontal axis a 2-D plot and identify abnormal usage behaviors and out-
shows the time. Similarly, each commercial building falls in liers. These are the reasons to reduce the multidimensional
one of 16 classes representing midrise apartment, small hotel, input vector into a 2-D vector. Hyndman et al. [22] applied
large hotel, stand-alone retail, outpatient health care, hospital, dimensionality reduction techniques to visualize smart meter
secondary school, primary school, strip mall, supermarket, and PMU data for detecting users with abnormal consumption
quick service restaurant, full-service restaurant, warehouse, patterns, identifying islanding event, postdisturbance voltage,
large office, medium office, and small office [20]. The energy and visualizing probability forecast for renewable energy.
consumption for a building in each of the 16 classes is shown In this article, both linear and nonlinear dimensionality
in Fig. 2. Due to differences in the utilization of different reduction techniques are used to transform the input data
commercial buildings, we plot the hourly electricity utilization of dimension m to a reduced dimension k. Mathematically,
as natural logarithmic values. We also use different line the input X ∈ Rn×m is reduced to Y ∈ Rn×k . In this article,
styles for plotting electricity consumption for each category in PCA is used for the linear reduction since it is shown to
addition to using a logarithmic scale for better interpretation. outperform other linear reduction techniques in the smart
In addition to the TMY3 data set, we also use the “Indi- grid environment [17], while other nonlinear dimensionality
vidual household electric power consumption Data Set” [21] reduction techniques tested are Isometric Feature Mapping
to create load profiles for a single user based on seasonal (Isomap), kernel PCA (KPCA), locally linear embedding
patterns as another use case. The data set contains smart meter (LLE), t-distributed stochastic neighbor embedding (t-SNE),
data for a single customer with the granularity of one minute. autoencoders, and Laplacian eigenmaps.
The data are collected over a period of around four years. Each technique is selected considering its performance in
Each observation in the data set consists of nine attributes, other domains, e.g., Isomap is successfully applied to various

Authorized licensed use limited to: University of New Orleans. Downloaded on February 08,2023 at 22:07:26 UTC from IEEE Xplore. Restrictions apply.
44 IEEE CANADIAN JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, VOL. 44, NO. 1, WINTER 2021

to a lower dimension in either parametric or nonpara-


metric way. The input multidimensional feature vectors
are reduced to 2-D feature vectors. We use a 2-D feature
vector to represent the data set since more than 90% of
the variance can be retained by a 2-D feature vector,
further discussed in Section IV.
2) Clustering: K-means clustering algorithm is applied to
the 2-D data set. The number of clusters is three in
the case of residential buildings, 16 in the case of
commercial buildings, and four in the case of individual
household data set. The number of clusters is selected
based on the available categories in the data set.
3) Visualization: Finally, the graphs are created for the
clustered data by plotting the first feature against the sec-
ond feature. The graphs are provided and discussed
in IV. The observations are color-coded based on the
associated clusters.
The visualization results can be used for tariff selection
by customer or tariff recommendation by the supplier, better
demand-side management, and load forecasting.

IV. R ESULTS AND D ISCUSSION


For the purpose of visualization, we reduce the dimension
for each building to two using PCA, K-PCA, Isomap, LLE,
the Laplacian eigenmaps, t-SNE, and autoencoders. With this,
it is possible to see how each reduction technique cluster data
samples with similar consumption patterns. Fig. 4 shows the
cumulative variance explained for each dimension using PCA.
It is observed that more than 90% of the variance is retained
for all three data sets with two dimensions. This suggests that
more than 90% of the information contents are retained in the
case of these data sets if the dimensions are reduced from
the initial respective higher value (13 for residential build-
ings, ten for commercial buildings, and nine for individual
household data set) down to two. In general, many parameters
are recorded for different processes. These recorded para-
meters are correlated with other parameters in many cases.
Xie et al. [40] retained more than 90% variance in two
dimensions. Dimensionality reduction techniques transform
Fig. 3. Flow diagram of the visualization process. the original data set into higher dimensional space and reduce
the correlation between the features. The amount of variance
retained (more than 90% in our case) is directly linked with
applications for example wood inspection [23], identification the correlation of parameters.
of hepatic tumors [24], visualization of biomedical data [25], Fig. 5 shows the first component that is plotted against
and head pose estimation [26]. KPCA is applied to face the second dimension clustered data for both residential, com-
recognition [27], speech recognition [28], novelty detec- mercial, and individual household data, and dimensionality
tion [29], and smoke detection [30]. LLE is applied to super- reduction is performed using PCA. It is seen that the low, base,
resolution [31] and sound source localization [32]. The Lapla- and high samples are separated but with some overlaps, e.g.,
cian eigenmaps are applied to clustering [33]–[35] and also some samples with high consumption overlap with the base
face recognition [36]. t-SNE is applied for the visualization of samples. Similarly, for the commercial data with 16 classes,
handwritten digits and face data [37]. Autoencoder is applied some of the classes are clearly separated when projected to a
to image compression [38] and for the retrieval of medical lower dimension using PCA. There is an overlap between the
images using random projections as input features [39]. observations in both cases. Moreover, a few of the classes are
The flowchart of the proposed methodology is shown quite close, and any outlier can easily be labeled incorrectly.
in Fig. 3. The main steps are the following. Similarly, summer and winter load profiles are in the opposite
1) Dimension Reduction: One of the dimension reduction location for individual household consumers, while spring and
technique is selected, and the input samples are mapped autumn data points are in between and overlapped with others.

Authorized licensed use limited to: University of New Orleans. Downloaded on February 08,2023 at 22:07:26 UTC from IEEE Xplore. Restrictions apply.
ALESHINLOYE et al.: EVALUATION OF DIMENSIONALITY REDUCTION TECHNIQUES FOR LOAD PROFILING APPLICATION 45

Fig. 5. First feature plot against second feature of the PCA reduced data.
(a) Residential. (b) Commercial. (c) Individual household data set.

in nearest neighbors of 8 and 5, respectively, for both the


Fig. 4. Cumulative variance explained for PCA. (a) Residential. residential and commercial data. The plot is shown in Fig. 8.
(b) Commercial. (c) Individual household data set. The LLE-based dimensionality reduction technique is unable
to accurately cluster and identify the load profiles.
With KPCA, we choose a Gaussian-based kernel to map The Laplacian eigenmaps preserve the local structure by
the samples lying near or on a nonlinear manifold in the weighing the nearest neighbors. The nearest neighbor is cho-
intrinsic dimension to a sample lying on a linear subspace in a sen as (number of samples/10). The reduced 2-D data are
higher dimension. Using a gamma of 1/sample size, we obtain shown in Fig. 9. The Laplacian eigenmap-based reduced data
the lower data projection of KPCA on the residential and visualize the seasonal load profiles in a better way. The data
commercial data, as shown in Fig. 6. All four seasons’ load points for winter and summer are in completely opposite
profiles are overlapped with each other. It is very hard to directions; autumn’s load profile is in the middle and can
visually distinguish them. also be distinguished. Spring data points are mostly overlapped
Similarly, the first and second reduced features using Isomap with winter.
for both residential, commercial, and individual household In t-SNE, the performance is affected by perplexity, learning
data are plotted in Fig. 7. It is required to do a nearest rate, and the number of iterations. For the purpose of this
neighbor parameter search since it impacts the performance work, we select a perplexity of 30, a learning rate of 200, and
of Isomap. We iteratively search for the best parameter. The the number of iterations of 1000. Considering the algorithm
selected nearest-neighbor parameters for both residential and outputs different embedding for each run, a total of ten runs
commercial data are 4 and 5, respectively. are executed, and the final embedding is the average, as shown
Similar to Isomap, the standard LLE algorithm uses the in Fig. 10. t-SNE performs better for individual household
nearest neighbor to preserve the local structure. We use the data. All the four clusters can be viewed separately with very
same technique to select the nearest neighbor. This results little overlap.

Authorized licensed use limited to: University of New Orleans. Downloaded on February 08,2023 at 22:07:26 UTC from IEEE Xplore. Restrictions apply.
46 IEEE CANADIAN JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, VOL. 44, NO. 1, WINTER 2021

Fig. 6. First feature plot against second feature of the KPCA reduced data. Fig. 7. First feature plot against second feature of the Isomap reduced data.
(a) Residential. (b) Commercial. (c) Individual household data set.
(a) Residential. (b) Commercial. (c) Individual household data set.

Autoencoder aims to reduce the data dimensionality by computes a similarity measure between the two clustering
learning to ignore the signal noise. We train our architecture techniques by considering all pairs of samples and counting
using 100 epochs with a batch size of 64 and a learning rate pairs that are assigned in the same or different clusters in
of 0.01 using Adam optimizer [41]. The reduced dimensions the predicted and true clusters and provides a quantitative
are activated using ReLu activation to model the nonlinearity measurement. ARI is computed using the following equation:
in the output. Fig. 11 shows the first and second component
RI − Expected RI
plots after dimensionality reduction. ARI = . (1)
The visualization provides a quantitative assessment of max(RI) − Expected RI
dimension reduction techniques for load profiling applications. The resulting scores range from zero (minimum) to one
The objective of the proposed method is to create a load (maximum). The result for each reduction technique using
profiling visualization that can accurately cluster different two components of the data sets is shown in Table II. From
users. Clusters, being far apart, can also handle the outliers the result, it is seen that some of the nonlinear dimension-
more effectively. Based on the visualization results, we clearly ality reduction techniques outperform PCA for all data sets
see that t-SNE is better suited for the smart meter data and can and, thus, confirms that for the application of load profiling,
accurately separate various classes in all data sets. However, an indirect clustering technique using nonlinear dimensionality
few of the techniques can only visualize one of the data set reduction techniques would perform better. However, the result
properly, and few of the techniques cannot handle any of the of each nonlinear technique is inconsistent with a different
data set. We can also conclude that nonlinear techniques can number of runs. This is solved by averaging the results of the
produce better results compared with linear methods. For the projection over multiple runs.
purpose of comparing the reduction techniques, the resulting Furthermore, all nonlinear techniques require tuning para-
clusters are then evaluated using the ARI [42]. Rand index (RI) meters to obtain the best result. This is a major challenge

Authorized licensed use limited to: University of New Orleans. Downloaded on February 08,2023 at 22:07:26 UTC from IEEE Xplore. Restrictions apply.
ALESHINLOYE et al.: EVALUATION OF DIMENSIONALITY REDUCTION TECHNIQUES FOR LOAD PROFILING APPLICATION 47

Fig. 8. First feature plot against second feature of the LLE reduced data.
(a) Residential. (b) Commercial. (c) Individual household data set.
Fig. 9. First feature plot against second feature of the Laplacian eigenmaps
reduced data. (a) Residential. (b) Commercial. (c) Individual household
TABLE II data set.

C OMPARISON OF D IMENSIONALITY R EDUCTION T ECHNIQUES


FOR L OAD L OSS E VENT D ETECTION U SING ARI S CORE 2) t-SNE is a probabilistic approach; and 3) due to the
nonlinear and probabilistic nature, t-SNE represents the similar
points close together, whereas PCA reduces the dimensionality
by doing the opposite, i.e., placing dissimilar data points far
apart in lower dimension representation. The performance,
in terms of ARI score and visual representation, suggests that
the t-SNE performs better for the load profile visualization in
a smart grid environment.
Table III lists the computational complexity and memory
in selecting the best technique for each task and the requirement of the dimensionality reduction techniques. The
corresponding parameters. Also, the performance of each computational complexity of the methods is determined by
technique is dependent on the data as well; from the result, the number of features m, the window size n, the ratio of
KPCA performs well in the residential data but fails to capture nonzero elements in a sparse matrix to the total number of
the nonlinearity in the commercial data. Similarly, t-SNE has elements n z , the number of iterations i , and the weight w. For a
average performance on the residential data but produces an moderate number of features, the computational complexity is
impressive score on the commercial data. The t-SNE shows not a major issue for PCA, LLE, and the Laplacian eigenmaps.
superiority in terms of visualization for both residential and A moderate window size would reduce the computational cost
commercial data set compared with PCA (linear technique). of Isomap, KPCA, t-SNE, and autoencoder. The computational
The reasons for the superiority of t-SNE technique are: cost of the encoder–decoder network is dependent on the
1) t-SNE is a nonlinear dimensionality reduction technique; number of iterations and the sample size for training. The

Authorized licensed use limited to: University of New Orleans. Downloaded on February 08,2023 at 22:07:26 UTC from IEEE Xplore. Restrictions apply.
48 IEEE CANADIAN JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, VOL. 44, NO. 1, WINTER 2021

Fig. 11. First feature plot against second feature of the autoencoders reduced
Fig. 10. First feature plot against second feature of the t-SNE reduced data. data. (a) Residential. (b) Commercial. (c) Individual household data set.
(a) Residential. (b) Commercial. (c) Individual household data set.

TABLE III grids, the need for information has also evolved into new
C OMPUTATIONAL C OMPLEXITY AND M EMORY R EQUIREMENT dynamic visualization to incorporate the fuel-based energy
OF D IMENSIONALITY R EDUCTION T ECHNIQUES [43] sources along with renewable energy sources. The visual-
izations can empower decision-makers to better understand
the incoming information and, consequently, make a better
decision in response. This work investigated the dimension-
ality reduction for load profile data visualization. We have
employed both linear and nonlinear dimension reduction tech-
niques. The chosen linear dimensionality reduction technique
has no hyperparameters to be tuned aside from choosing the
number of eigenvectors corresponding to high eigenvalues.
It is evident from the experimentation that reducing the
effect of the weight is negligible because we use only
feature vectors to two features retains more than 90% of the
the first two components of each reduced data. This reduces
original information contents. In this article, we have provided
the computation time and memory requirement relative to
two test cases: residential data and commercial data. Our
using the data samples without dimensionality reduction. results show that some of the nonlinear dimensionality reduc-
tion techniques outperform PCA (widely used dimensionality
V. C ONCLUSION reduction technique), which emphasizes that the intrinsic
The traditional power grid usually required voltage, avail- dimension of the energy consumption data for both residential
able power, and simple demand data. With the advent of smart and commercial use-case is nonlinear. In the residential data,

Authorized licensed use limited to: University of New Orleans. Downloaded on February 08,2023 at 22:07:26 UTC from IEEE Xplore. Restrictions apply.
ALESHINLOYE et al.: EVALUATION OF DIMENSIONALITY REDUCTION TECHNIQUES FOR LOAD PROFILING APPLICATION 49

we observed KPCA outperforms other techniques, and in the [19] R. Hendron and C. Engebrecht, “Building America house simulation
case of commercial data, we observed t-SNE outperforms protocols,” Office of energy efficiency and renewable energy (EERE),
Washington, DC, USA, Tech. Rep. TP-550-49426, 2010. Accessed:
other techniques. This shows that the three data sets have a dif- Aug. 28, 2020. [Online]. Available: https://www.nrel.gov/docs/fy11osti/
ferent distribution, and not just a single dimensionality reduc- 49246.pdf
tion technique would work for all data sets. Selecting a suitable [20] M. Deru et al., “Us department of energy commercial reference building
models of the national building stock,” Nat. Renew. Energy Lab.,
dimensionality reduction technique is very important to retain Golden, CO, USA, Tech. Rep. NREL/TP-5500-46861, 2011. Accessed:
the relationship between the data points in a lower dimensional Aug. 28, 2020. [Online]. Available: https://www.nrel.gov/docs/fy11osti/
space and allow for proper load profiling of the consumers. 46861.pdf
[21] G. Hebrail and A. Berard. UCI Machine Learning Repository: Individual
It can also be concluded from the experimentation that the Household Electric Power Consumption Data Set. Accessed: Aug. 28,
nonlinear techniques can outperform the linear techniques. 2020. [Online]. Available: https://archive.ics.uci.edu/ml/datasets/
Individual+household+electric+power+consumption
R EFERENCES [22] R. J. Hyndman, X. Liu, and P. Pinson, “Visualizing big energy data:
Solutions for this crucial component of data analysis,” IEEE Power
[1] C. Tu, X. He, Z. Shuai, and F. Jiang, “Big data issues in smart grid–a Energy Mag., vol. 16, no. 3, pp. 18–25, May 2018.
review,” Renew. Sustain. Energy Rev., vol. 79, pp. 1099–1107, 2017. [23] M. Niskanen and O. Silvén, “Comparison of dimensionality reduction
[2] U. EIA. (2016). International Energy Statistics. US Energy and Infor- methods for wood surface inspection,” emphProc. SPIE, vol. 5132,
mation Administration, Washington, DC, USA. Accessed: Sep. 6, 2018. pp. 178–189, May 2003.
[Online]. Available: http://www.eia.gov [24] H. Liu et al., “Dimensionality reduction for identification of hepatic
[3] K. Balachandran, R. L. Olsen, and J. M. Pedersen, “Bandwidth analysis
tumor samples based on terahertz time-domain spectroscopy,” IEEE
of smart meter network infrastructure,” in Proc. 16th Int. Conf. Adv.
Trans. THz Sci. Technol., vol. 8, no. 3, pp. 271–277, May 2018.
Commun. Technol., Feb. 2014, pp. 928–933.
[4] Y. Wang, Q. Chen, C. Kang, M. Zhang, K. Wang, and Y. Zhao, “Load [25] I. Soo Lim, P. de Heras Ciechomski, S. Sarni, and D. Thalmann,
profiling and its application to demand response: A review,” Tsinghua “Planar arrangement of high-dimensional biomedical data sets by isomap
Sci. Technol., vol. 20, no. 2, pp. 117–129, Apr. 2015. coordinates,” in Proc. 16th IEEE Symp. Comput.-Based Med. Syst.,
[5] K.-L. Zhou, S.-L. Yang, and C. Shen, “A review of electric load Jun. 2003, pp. 50–55.
classification in smart grid environment,” Renew. Sustain. Energy Rev., [26] H. Dadkhahi, M. F. Duarte, and B. M. Marlin, “Out-of-sample extension
vol. 24, pp. 103–110, Aug. 2013. for dimensionality reduction of noisy time series,” IEEE Trans. Image
[6] G. J. Tsekouras, P. B. Kotoulas, C. D. Tsirekis, E. N. Dialynas, and Process., vol. 26, no. 11, pp. 5435–5446, Nov. 2017.
N. D. Hatziargyriou, “A pattern recognition methodology for evaluation [27] K. In Kim, K. Jung, and H. Joon Kim, “Face recognition using kernel
of load profiles and typical days of large electricity customers,” Electr. principal component analysis,” IEEE Signal Process. Lett., vol. 9, no. 2,
Power Syst. Res., vol. 78, no. 9, pp. 1494–1510, Sep. 2008. pp. 40–42, Feb. 2002.
[7] G. Chicco, R. Napoli, and F. Piglione, “Comparisons among clustering [28] A. Lima, H. Zen, Y. Nankaku, C. Miyajima, K. Tokuda, and T. Kitamura,
techniques for electricity customer classification,” IEEE Trans. Power “On the use of kernel PCA for feature extraction in speech recognition,”
Syst., vol. 21, no. 2, pp. 933–940, May 2006. IEICE Trans. Inf. Syst., vol. 87, no. 12, pp. 2802–2811, 2004.
[8] G. Chicco, O.-M. Ionel, and R. Porumb, “Electrical load pattern group- [29] H. Hoffmann, “Kernel PCA for novelty detection,” Pattern Recognit.,
ing based on centroid model with ant colony clustering,” IEEE Trans. vol. 40, no. 3, pp. 863–874, Mar. 2007.
Power Syst., vol. 28, no. 2, pp. 1706–1715, May 2013. [30] F. Yuan, X. Xia, J. Shi, H. Li, and G. Li, “Non-linear dimensionality
[9] T. Zhang, G. Zhang, J. Lu, X. Feng, and W. Yang, “A new index reduction and Gaussian process based classification method for smoke
and classification approach for load pattern analysis of large electricity detection,” IEEE Access, vol. 5, pp. 6833–6841, 2017.
customers,” IEEE Trans. Power Syst., vol. 27, no. 1, pp. 153–160, [31] H. Chang, D.-Y. Yeung, and Y. Xiong, “Super-resolution through neigh-
Feb. 2012. bor embedding,” in Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern
[10] M. Koivisto, P. Heine, I. Mellin, and M. Lehtonen, “Clustering of Recognit. CVPR, vol. 1, Jul. 2004, p. I.
connection points and load modeling in distribution systems,” IEEE [32] R. Duraiswami and V. C. Raykar, “The manifolds of spatial hearing,”
Trans. Power Syst., vol. 28, no. 2, pp. 1255–1265, May 2013. in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., ICASSP,
[11] E. D. Varga, S. F. Beretka, C. Noce, and G. Sapienza, “Robust real- Mar. 2005, p. 285.
time load profile encoding and classification framework for efficient [33] A. Y. Ng, M. I. Jordan, and Y. Weiss, “On spectral clustering: Analysis
power systems operation,” IEEE Trans. Power Syst., vol. 30, no. 4, and an algorithm,” in Proc. Adv. Neural Inf. Process. Syst., 2002,
pp. 1897–1904, Jul. 2015. pp. 849–856.
[12] S. Zhong and K.-S. Tam, “Hierarchical classification of load profiles [34] J. Shi and J. Malik, “Normalized cuts and image segmentation,”
based on their characteristic attributes in frequency domain,” IEEE IEEE Trans. Pattern Anal. Mach. Intell., vol. 22, no. 8, pp. 888–905,
Trans. Power Syst., vol. 30, no. 5, pp. 2434–2441, Sep. 2015. Aug. 2000.
[13] S. V. Verdu, M. O. Garcia, C. Senabre, A. G. Marin, and F. J. G. Franco,
[35] M. A. Oikawa, Z. Dias, A. de Rezende Rocha, and S. Goldenstein,
“Classification, filtering, and identification of electrical customer load
“Manifold learning and spectral clustering for image phylogeny forests,”
patterns through the use of self-organizing maps,” IEEE Trans. Power
IEEE Trans. Inf. Forensics Security, vol. 11, no. 1, pp. 5–18, Jan. 2016.
Syst., vol. 21, no. 4, pp. 1672–1682, Nov. 2006.
[14] Y. Xiao, J. Yang, H. Que, M. Junjie Li, and Q. Gao, “Application of [36] X. He, S. Yan, Y. Hu, P. Niyogi, and H.-J. Zhang, “Face recognition
wavelet-based clustering approach to load profiling on AMI measure- using laplacianfaces,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 27,
ments,” in Proc. China Int. Conf. Electr. Distrib. (CICED), Sep. 2014, no. 3, pp. 328–340, Mar. 2005.
pp. 1537–1540. [37] L. van der Maaten and G. Hinton, “Visualizing data using t-SNE,”
[15] A. Albert and R. Rajagopal, “Smart meter driven segmentation: What J. Mach. Learn. Res., vol. 9, pp. 2579–2605, Nov. 2008.
your consumption says about you,” IEEE Trans. Power Syst., vol. 28, [38] L. Theis, W. Shi, A. A. Cunningham, and F. Huszár, “Lossy image
no. 4, pp. 4019–4030, Nov. 2013. compression with compressive autoencoders,” 2017, arXiv:1703.00395.
[16] S. Lhermitte, J. Verbesselt, W. W. Verstraeten, and P. Coppin, “A com- [Online]. Available: https://arxiv.org/abs/1703.00395
parison of time series similarity measures for classification and change [39] A. Sriram, S. Kalra, H. R. Tizhoosh, and S. Rahnamayan, “Learning
detection of ecosystem dynamics,” Remote Sens. Environ., vol. 115, autoencoded radon projections,” in Proc. IEEE Symp. Ser. Comput.
no. 12, pp. 3129–3152, Dec. 2011. Intell. (SSCI), Nov. 2017, pp. 1–5.
[17] A. Aleshinloye, A. Bais, and I. Al-Anbagi, “Performance analysis of [40] L. Xie, Y. Chen, and P. R. Kumar, “Dimensionality reduction
dimensionality reduction techniques for demand side management,” in of synchrophasor data for early event detection: Linearized analy-
Proc. IEEE Electr. Power Energy Conf. (EPEC), Oct. 2017, pp. 1–6. sis,” IEEE Trans. Power Syst., vol. 29, no. 6, pp. 2784–2794,
[18] E. Wilson. (2014). Commercial and Residential Hourly Load Profiles Nov. 2014.
for all TMY3 Locations in the United States. U.S. Department of Energy [41] D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,”
Open Data Catalog: U.S. Department of Energ, Office of Energy Effi- in Proc. Int. Conf. Learn. Represent. (ICLR), 2015, pp. 1–15.
ciency and Renewable Energy (EERE), National Renewable Energy [42] L. Hubert and P. Arabie, “Comparing partitions,” J. Classification, vol. 2,
Laboratory, Golden, CO, USA. Accessed: Aug. 28, 2020. [Online]. no. 1, pp. 193–218, Dec. 1985.
Available: https://openei.org/doe-opendata/dataset/commercial-and- [43] L. Van Der Maaten, E. Postma, and J. Van den Herik, “Dimensionality
residential-hourly-load-profiles-for-all-tmy3-locations-in-the-united- reduction: A comparative,” J. Mach. Learn. Res., vol. 10, pp. 66–71,
states Oct. 2009.

Authorized licensed use limited to: University of New Orleans. Downloaded on February 08,2023 at 22:07:26 UTC from IEEE Xplore. Restrictions apply.

Vous aimerez peut-être aussi