Dataset Ninja LogoDataset Ninja:

Substation Equipment Dataset

1660153080
Tagenergy-and-utilities
Taskinstance segmentation
Release YearMade in 2023
LicenseCC BY 4.0
Download2 GB

Introduction #

Released 2023-05-01 ·Andreas Gomes, Fabiano Magrin, Leonardo Fernandeset al.

The authors present A Semantically Annotated 15-Class Ground Truth Dataset for Substation Equipment with 1660 images annotated with 15 classes, including insulators, disconnect_switches, transformers and other equipment commonly found in substation environments. The images were captured using a combination of human, fixed and AGV-mounted cameras at different times of the day, providing a diverse set of training and testing data for algorithm development. In total, 50,705 annotations were created by a team of experienced annotators, using a standardized process to ensure accuracy across the dataset. The resulting dataset provides a valuable resource for researchers and practitioners working in the fields of substation automation, substation monitoring and computer vision. Its availability has the potential to advance the state of the art in this important area.

Table 1 provides an overview of the distribution of the images based on source. The unspecified smartphone and digital cameras collected by humans on the field are referred to as miscellaneous. The images in the dataset come in a variety of resolutions, as shown in Table 2.

Source Camera Quantity
Human (Morning) Misc, T540 682
AGV (Morning and afternoon) A700 230
AGV (Night, light) A700 360
AGV (Night, no light) A700 348
Fixed (Morning and afternoon) A700 32
Fixed (Night, light) A700 8

Table 1. Dataset distribution based on source.

Resolution Occurrences
1280 × 960 887
2592 × 1944 268
2880 × 2160 266
4032 × 3024 98
4000 × 3000 36
640 × 480 31
4624 × 3468 28
2048 × 1536 27
704 × 480 10
1156 × 867 4
1280 × 720 2
2324 × 1440 1
4624 × 2604 1
4672 × 3504 1

Table 2. Resolutions and occurrences of the 1660 images in this dataset.

There are 15 classes of substation equipment. Table 3 lists them, along with how many times each object appears in the “Instances” column. An example of each object class is presented in Figure 1.

Class Instances RGB
Background - (000, 000, 000)
Open blade disconnect switch 310 (162, 000, 255)
Closed blade disconnect switch 5243 (097, 016, 162)
Open tandem disconnect switch 1599 (081, 162, 000)
Closed tandem disconnect switch 966 (048, 097, 165)
Breaker 980 (121, 121, 121)
Fuse disconnect switch 355 (255, 097, 178)
Glass disc insulator 3185 (154, 032, 121)
Porcelain pin insulator 26,499 (255, 255, 125)
Muffle 1354 (162, 243, 162)
Lightning arrester 1976 (143, 211, 255)
Recloser 2331 (040, 000, 186)
Power transformer 768 (255, 182, 000)
Current transformer 2136 (138, 138, 000)
Potential transformer 654 (162, 048, 000)
Tripolar disconnect switch 2349 (162, 000, 096)

Table 3. Object classes, number of instances in the dataset and their colours in RGB values used for the .png segmentation masks.

Fig 1

Figure 1. An example for each of the 15 classes present in the dataset. (a) Open blade disconnect switch. (b) Closed blade disconnect switch. © Open tandem disconnect switch. (d) Closed tandem disconnect switch. (e) Breaker. (f) Fuse disconnect switch. (g) Glass disc insulator. (h) Porcelain pin insulator. (i) Muffle. (j) Lightning arrester. (k) Recloser. (l) Power transformer. (m) Current transformer. (n) Potential transformer. (o) Tripolar disconnect switch.

The images in this dataset were captured from a single electrical distribution substation in Brazil over a period of two years, at different times of day and under varying weather and seasonal conditions, ensuring a diverse range of lighting conditions for the depicted objects. All the images underwent a curation process by experts in Electrical Engineering to ensure that the angles and distances depicted in the images were suitable for automating inspections in a substation.

According to the provided information from the electrical company overseeing the substation, the energy consumption profile exhibits a consistent pattern throughout the year. The profile shows a global minimum at 6h00, a local maximum at 13h00, a local minimum at 17h00 and a global maximum at 20h00, with slight amplitude variations. Based on this insight, a collection schedule was devised for automated inspections, encompassing time slots at 8h00, 13h00, 17h00 and 20h00. Human-led collections were conducted exclusively during the morning period to align with the availability of on-field company technicians assisting our researchers.

The human-collected images were captured by various individuals using different camera models, including smartphone cameras, unspecified digital cameras and the FLIR T540, as shown in Figure 3. There was no standardization in terms of camera angles or distances, although the maximum distance for image capture was limited to 30 m. Those images were taken during the morning period, with the time of capture ranging from 8h00 to 12h00.

Fig.3

Figure 3. A technician capturing images using the FLIR T540 camera.

The Autonomous Ground Vehicle (AGV) shown in Figure 4 used the FLIR A700 camera to collect the majority of the images in this dataset. This AGV followed a predetermined path through 60 possible scenes, capturing images at fixed angles and distances ranging from 3 to 5 m. Those collections were conducted three times per day, with the specific times being in the morning (between 8h00 and 10h00), in the afternoon (between 13h00 and 17h00) and at night (between 20h00 and 21h00).

Fig.4

Figure 4. A photo of our AGV equipped with the FLIR A700 (marked with the red box) and the FLIR A310 (the white one, with two separate lenses) cameras on a substation floor. The A310 was not used to take any photos for this dataset but was used for pan-tilt purposes for other mounted cameras.

The FLIR A700 camera fixed in the substation, shown in Figure 5, collected photos also three times per day: in the morning (between 6h00 and 11h00), in the afternoon (between 13h00 and 18h00) and at night (between 21h00 and 00h00).

Fig.5

Figure 5. Photos of the A700 fixed in the substation. The camera is encircled in red in both subimages. (a) Front view of the A700. (b) Back view of the A700 showing objects in its field of view.

Authors annotated the semantic dataset using the software LabelMe, as shown in Figure 6. The annotation process took approximately 1100 man-hours over the course of 4 months by 9 people.

Fig.6

Figure 6. Examples of manual annotation for semantic segmentation using LabelMe. The colors used by LabelMe have no relation to the colors used in the masks from Table 3. (a) Insulators (red) and breakers (green). (b) A recloser (purple).

Please note that DatasetNinja upload Instance version of Substation Equipment. The dataset contains objects separated by objects located in the foreground, which can lead to incorrect interpretation of data.

ExpandExpand
Dataset LinkHomepageDataset LinkResearch Paper

Summary #

A Semantically Annotated 15-Class Ground Truth Dataset for Substation Equipment is a dataset for instance segmentation, object detection, semantic segmentation, and weakly supervised learning tasks. It is used in the energy industry.

The dataset consists of 1660 images with 50604 labeled objects belonging to 15 different classes including porcelain_pin_insulator, closed_blade_disconnect_switch, recloser, and other: glass_disc_insulator, current_transformer, lightning_arrester, power_transformer, breaker, potential_transformer, closed_tandem_disconnect_switch, open_tandem_disconnect_switch, tripolar_disconnect_switch, muffle, fuse_disconnect_switch, and open_blade_disconnect_switch.

Images in the Substation Equipment dataset have pixel-level instance segmentation annotations. Due to the nature of the instance segmentation task, it can be automatically transformed into a semantic segmentation (only one mask for every class) or object detection (bounding boxes for every object) tasks. There are 6 (0% of the total) unlabeled images (i.e. without annotations). There are no pre-defined train/val/test splits in the dataset. Note, that background class has been removed due to possible misinterpretation of data. The dataset was released in 2023 by the Universidade Tecnológica Federal do Paraná, Pontifícia Universidade Católica do Rio de Janeiro, and Paranaense de Energia SA (Copel).

Here are the visualized examples for the classes:

Explore #

Substation Equipment dataset has 1660 images. Click on one of the examples below or open "Explore" tool anytime you need to view dataset images with annotations. This tool has extended visualization capabilities like zoom, translation, objects table, custom filters and more. Hover the mouse over the images to hide or show annotations.

OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
OpenSample annotation mask from Substation EquipmentSample image from Substation Equipment
👀
Have a look at 1660 images
View images along with annotations and tags, search and filter by various parameters

Class balance #

There are 15 annotation classes in the dataset. Find the general statistics and balances for every class in the table below. Click any row to preview images that have labels of the selected class. Sort by column to find the most rare or prevalent classes.

Search
Rows 1-10 of 15
Class
Images
Objects
Count on image
average
Area on image
average
porcelain_pin_insulator
polygon
1413
26450
18.72
1.58%
closed_blade_disconnect_switch
polygon
1066
5225
4.9
1.77%
recloser
polygon
861
2330
2.71
4.17%
glass_disc_insulator
polygon
739
3180
4.3
0.25%
current_transformer
polygon
707
2128
3.01
0.61%
lightning_arrester
polygon
537
1974
3.68
0.25%
power_transformer
polygon
406
768
1.89
9.21%
breaker
polygon
367
980
2.67
2.92%
potential_transformer
polygon
275
651
2.37
0.33%
closed_tandem_disconnect_switch
polygon
274
957
3.49
1.39%

Co-occurrence matrix #

Co-occurrence matrix is an extremely valuable tool that shows you the images for every pair of classes: how many images have objects of both classes at the same time. If you click any cell, you will see those images. We added the tooltip with an explanation for every cell for your convenience, just hover the mouse over a cell to preview the description.

Images #

Explore every single image in the dataset with respect to the number of annotations of each class it has. Click a row to preview selected image. Sort by any column to find anomalies and edge cases. Use horizontal scroll if the table has many columns for a large number of classes in the dataset.

Class sizes #

The table below gives various size properties of objects for every class. Click a row to see the image with annotations of the selected class. Sort columns to find classes with the smallest or largest objects or understand the size differences between classes.

Search
Rows 1-10 of 15
Class
Object count
Avg area
Max area
Min area
Min height
Min height
Max height
Max height
Avg height
Avg height
Min width
Min width
Max width
Max width
porcelain_pin_insulator
polygon
26450
0.08%
16.16%
0%
4px
0.17%
2291px
53.66%
64px
3.28%
2px
0.09%
2244px
64.71%
closed_blade_disconnect_switch
polygon
5225
0.35%
16.22%
0%
9px
0.38%
2003px
92.73%
216px
11.44%
3px
0.1%
1107px
38.44%
glass_disc_insulator
polygon
3180
0.05%
0.78%
0%
6px
0.24%
399px
16.35%
62px
3.37%
3px
0.2%
283px
16.95%
tripolar_disconnect_switch
polygon
2348
0.23%
4.9%
0%
5px
0.19%
754px
40.1%
124px
7.7%
3px
0.15%
839px
38.33%
recloser
polygon
2330
1.52%
25.44%
0%
3px
0.14%
1355px
62.73%
230px
12.73%
4px
0.15%
1486px
81.04%
current_transformer
polygon
2128
0.2%
2.68%
0%
6px
0.23%
761px
26.35%
125px
6.42%
4px
0.14%
1232px
30.56%
lightning_arrester
polygon
1974
0.06%
1.78%
0%
4px
0.19%
963px
31.85%
84px
4.19%
4px
0.17%
397px
9.85%
open_tandem_disconnect_switch
polygon
1596
0.18%
4.31%
0%
4px
0.21%
1114px
51.57%
108px
5.86%
4px
0.22%
659px
22.88%
muffle
polygon
1354
0.09%
0.92%
0%
4px
0.31%
956px
36.26%
165px
8.88%
5px
0.17%
280px
9.26%
breaker
polygon
980
1.08%
15.14%
0%
6px
0.36%
1468px
60.74%
298px
15.63%
7px
0.2%
1487px
53.8%

Spatial Heatmap #

The heatmaps below give the spatial distributions of all objects for every class. These visualizations provide insights into the most probable and rare object locations on the image. It helps analyze objects' placements in a dataset.

Spatial Heatmap

Objects #

Table contains all 50604 objects. Click a row to preview an image with annotations, and use search or pagination to navigate. Sort columns to find outliers in the dataset.

Search
Rows 1-10 of 50604
Object ID
Class
Image name
click row to open
Image size
height x width
Height
Height
Width
Width
Area
1
porcelain_pin_insulator
polygon
FLIR7238_rgb_wQvPtR5.jpg
960 x 1280
47px
4.9%
24px
1.88%
0.07%
2
porcelain_pin_insulator
polygon
FLIR7238_rgb_wQvPtR5.jpg
960 x 1280
40px
4.17%
42px
3.28%
0.08%
3
porcelain_pin_insulator
polygon
FLIR7238_rgb_wQvPtR5.jpg
960 x 1280
43px
4.48%
33px
2.58%
0.07%
4
porcelain_pin_insulator
polygon
FLIR7238_rgb_wQvPtR5.jpg
960 x 1280
46px
4.79%
42px
3.28%
0.09%
5
porcelain_pin_insulator
polygon
FLIR7238_rgb_wQvPtR5.jpg
960 x 1280
42px
4.38%
42px
3.28%
0.09%
6
porcelain_pin_insulator
polygon
FLIR7238_rgb_wQvPtR5.jpg
960 x 1280
51px
5.31%
52px
4.06%
0.15%
7
porcelain_pin_insulator
polygon
FLIR7238_rgb_wQvPtR5.jpg
960 x 1280
28px
2.92%
46px
3.59%
0.06%
8
porcelain_pin_insulator
polygon
FLIR7238_rgb_wQvPtR5.jpg
960 x 1280
43px
4.48%
50px
3.91%
0.11%
9
porcelain_pin_insulator
polygon
FLIR7238_rgb_wQvPtR5.jpg
960 x 1280
49px
5.1%
50px
3.91%
0.14%
10
porcelain_pin_insulator
polygon
FLIR7238_rgb_wQvPtR5.jpg
960 x 1280
49px
5.1%
49px
3.83%
0.13%

License #

A Semantically Annotated 15-Class Ground Truth Dataset for Substation Equipment is under CC BY 4.0 license.

Source

Citation #

If you make use of the Substation Equipment data, please cite the following reference:

@dataset{gomes_andreas_2023_7884270,
  author       = {Gomes, Andreas},
  title        = {{A Semantically Annotated 15-Class Ground Truth 
                   Dataset for Substation Equipment}},
  month        = may,
  year         = 2023,
  publisher    = {Zenodo},
  version      = {1.0},
  doi          = {10.5281/zenodo.7884270},
  url          = {https://doi.org/10.5281/zenodo.7884270}
}

Source

If you are happy with Dataset Ninja and use provided visualizations and tools in your work, please cite us:

@misc{ visualization-tools-for-substation-equipment-dataset,
  title = { Visualization Tools for Substation Equipment Dataset },
  type = { Computer Vision Tools },
  author = { Dataset Ninja },
  howpublished = { \url{ https://datasetninja.com/substation-equipment } },
  url = { https://datasetninja.com/substation-equipment },
  journal = { Dataset Ninja },
  publisher = { Dataset Ninja },
  year = { 2025 },
  month = { jan },
  note = { visited on 2025-01-22 },
}

Download #

Dataset Substation Equipment can be downloaded in Supervisely format:

As an alternative, it can be downloaded with dataset-tools package:

pip install --upgrade dataset-tools

… using following python code:

import dataset_tools as dtools

dtools.download(dataset='Substation Equipment', dst_dir='~/dataset-ninja/')

Make sure not to overlook the python code example available on the Supervisely Developer Portal. It will give you a clear idea of how to effortlessly work with the downloaded dataset.

The data in original format can be downloaded here.

. . .

Disclaimer #

Our gal from the legal dep told us we need to post this:

Dataset Ninja provides visualizations and statistics for some datasets that can be found online and can be downloaded by general audience. Dataset Ninja is not a dataset hosting platform and can only be used for informational purposes. The platform does not claim any rights for the original content, including images, videos, annotations and descriptions. Joint publishing is prohibited.

You take full responsibility when you use datasets presented at Dataset Ninja, as well as other information, including visualizations and statistics we provide. You are in charge of compliance with any dataset license and all other permissions. You are required to navigate datasets homepage and make sure that you can use it. In case of any questions, get in touch with us at hello@datasetninja.com.