Dataset Ninja LogoDataset Ninja:

MFVT Dataset

1350732090
Tagsurveillance, safety
Taskobject detection
Release YearMade in 2022
Licenseunknown

Introduction #

Trong-Vu Hoang, Huu-Thien Tran, Trong-Le Do

The MFVT Dataset for Real-time Masked Facial Detection dataset is a groundbreaking resource designed to rectify the limitations of existing datasets in the realm of Real-time Masked Facial Detection. Comprising 13,507 meticulously annotated images, it integrates authors’ unique mask-wearing conventions to classify individuals into three distinct categories: ok, none, and wrong, offering a comprehensive and invaluable tool for advancing the field.

Although there have been a number of Masked Facial Detection datasets published, they all have their own limitations, from having different mask-wearing conventions to careless labelling and classes imbalanced. Therefore, a new dataset to overcome those problems, with correct, clear conventions on wearing masks and carefully annotated for research purposes is very essential. That motivates authors to build the MFVT dataset.

The MFVT dataset built from two datasets PWMFD and FMLD, along with some images collected from the Internet.

The reason authors chose those two datasets is that their images have been carefully selected by the authors from previous datasets such as MAFA and Wider Face. However, the mask-wearing conventions of those two datasets were different and they also differed from authors’, so they re-annotated the entire dataset.

Authors follow WHO’s advice on how to wear masks and the types of masks that can be worn during the COVID-19 epidemic to annotate the MFVT dataset. All the faces, or objects, in authors’ dataset belong to one of three classes: properly wearing (ok), incorrectly wearing (wrong) and not wearing mask (none). Details on these classes are given below:

  • Class ok: Firstly, about the type of mask that can be worn, authors consider cases that not only be medical masks but also can be something like fabric masks, scarves, buffs, or hijabs. Authors will call them “masks”. According to WHO’s advice, authors should wear a mask that could cover nose, mouth and chin. Therefore, if a person uses masks with “proper thickness” and wears them in that way, authors will annotate them as OK. For image data, they consider a mask with proper thickness when they can not see the part of the face occluded by that mask.
image

Example images for class ok. (a) - (d) Faces with medical masks. (e) - (f) Using scarves to cover nose, mouth and chin. (g) - (j) Wearing fabric masks properly. (k) - (l) Illustrations for hijabs.

  • Class wrong: Authors regard that a presented mask that does not cover the nose, mouth, and chin is the wrong way to wear a face mask. In addition, a mask worn without being pulled high enough is marked as an improperly wearing way (also, in a perspective, these cases are difficult for models to learn because it is easy to confuse between wrong and ok). Especially, masks with small holes (such as wool masks) or wearing the mask on one side do not provide good protection for health during the pandemic, so authors also consider these as incorrect.
image

Example images for class wrong. (a) - © Not intending to use masks to protect from the virus. (d) - (f) The nose, mouth or chin is not covered. (g) - (h) The masks are not pulled high enough. (i) - (j) Small holes on wool masks. (k) - (l) The masks are worn just in one side.

  • Class none: In the COVID-19 pandemic, wearing too thin masks (such as mesh face veils, or satin face veils) or using hands, arms to cover the face are quite useless in preventing the virus from spreading. Therefore, these cases will be considered as not wearing masks. Authors also suppose that people using a mask to conceal the face but not wearing it on, or using kinds of stuff such as phones, papers, cups, books, placards, etc. to hide their face are not acceptable and will be annotated as none. Specially, when people use a scarf but do not intend to pull it up to cover the face (the scarf does not hide the mouth), authors assume that they do not intend to use it as a way to protect from viruses. Furthermore, authors attempt to consider the scene that people fake the face mask by pulling up a collar to cover the face. These cases are deemed as not wearing masks.
image

Example for class none. (a) Normal face. (b) - © Using masks to conceal the face but not wearing them on. (d) - (h) Using kinds of stuff such as phones, papers, cups, books, placards to hide the face. (i) Pulling up a collar to cover the face. (j) Wearing too thin masks. (k) - (l) Using hands or arms to cover the face.

The PWMFD dataset primarily focuses on medical masks, and the authors collected numerous medical mask-related images from this dataset. Additionally, they included some images of fabric masks or scarves in their selection. From the original 6340 training images and 1633 test images in the PWMFD dataset, the authors chose 2473 images for the training set and 500 images for the test set of the MFVT dataset.

Regarding the FMLD dataset, it contains a wide variety of images covering different cases. While the FMLD dataset includes many images labeled as “NONE,” which were derived from the Wider Face dataset, the authors decided that these “NONE” images didn’t significantly contribute to the Masked Facial Detection problem. Therefore, they selected only a few of these “NONE” images for the MFVT dataset. The cases mentioned in the mask conventions align with the FMLD dataset, albeit with different annotations. This makes the FMLD dataset a reliable source of images. From the original 34,784 training images in the FMLD dataset, the authors selected 6,182 images, and from the 7,152 test images, they selected 850 images for the MFVT dataset.

It’s important to note that during the process of image selection from the two datasets, the authors may come across images that are present in both datasets. In some cases, an image in the training set of one dataset may appear in the testing set of the other dataset. The authors carefully reviewed and removed such duplicate images.

With two training sets selected from the PWMFD and FMLD datasets, the authors randomly selected a small portion from each set to create two validation sets, which they then merged to obtain the overall validation set for the MFVT dataset. The training and testing sets for the authors’ dataset are also composed of merged selections from the two respective sets. Furthermore, the authors augmented the training set by adding background images (images without objects) to enhance the robustness of the models trained on the MFVT dataset. They also included a few “Extra” images representing unusual cases in both the training and testing sets.

ExpandExpand
Dataset LinkHomepageDataset LinkResearch Paper

Summary #

MFVT Dataset for Real-time Masked Facial Detection is a dataset for an object detection task. It is used in the surveillance industry.

The dataset consists of 13507 images with 26201 labeled objects belonging to 3 different classes including ok, none, and wrong.

Images in the MFVT dataset have bounding box annotations. There are 220 (2% of the total) unlabeled images (i.e. without annotations). There are 3 splits in the dataset: train (10445 images), valid (1706 images), and test (1356 images). The dataset was released in 2022 by the University of Science - VNUHCM, Vietnam and Vietnam National University.

Dataset Poster

Explore #

MFVT dataset has 13507 images. Click on one of the examples below or open "Explore" tool anytime you need to view dataset images with annotations. This tool has extended visualization capabilities like zoom, translation, objects table, custom filters and more. Hover the mouse over the images to hide or show annotations.

OpenSample annotation mask from MFVTSample image from MFVT
OpenSample annotation mask from MFVTSample image from MFVT
OpenSample annotation mask from MFVTSample image from MFVT
OpenSample annotation mask from MFVTSample image from MFVT
OpenSample annotation mask from MFVTSample image from MFVT
OpenSample annotation mask from MFVTSample image from MFVT
OpenSample annotation mask from MFVTSample image from MFVT
OpenSample annotation mask from MFVTSample image from MFVT
OpenSample annotation mask from MFVTSample image from MFVT
OpenSample annotation mask from MFVTSample image from MFVT
OpenSample annotation mask from MFVTSample image from MFVT
OpenSample annotation mask from MFVTSample image from MFVT
👀
Have a look at 13507 images
Because of dataset's license preview is limited to 12 images
View images along with annotations and tags, search and filter by various parameters

Class balance #

There are 3 annotation classes in the dataset. Find the general statistics and balances for every class in the table below. Click any row to preview images that have labels of the selected class. Sort by column to find the most rare or prevalent classes.

Search
Rows 1-3 of 3
Class
Images
Objects
Count on image
average
Area on image
average
ok
rectangle
6610
12180
1.84
13.11%
none
rectangle
4982
8521
1.71
9.33%
wrong
rectangle
4803
5500
1.15
13.58%

Co-occurrence matrix #

Co-occurrence matrix is an extremely valuable tool that shows you the images for every pair of classes: how many images have objects of both classes at the same time. If you click any cell, you will see those images. We added the tooltip with an explanation for every cell for your convenience, just hover the mouse over a cell to preview the description.

Images #

Explore every single image in the dataset with respect to the number of annotations of each class it has. Click a row to preview selected image. Sort by any column to find anomalies and edge cases. Use horizontal scroll if the table has many columns for a large number of classes in the dataset.

Object distribution #

Interactive heatmap chart for every class with object distribution shows how many images are in the dataset with a certain number of objects of a specific class. Users can click cell and see the list of all corresponding images.

Class sizes #

The table below gives various size properties of objects for every class. Click a row to see the image with annotations of the selected class. Sort columns to find classes with the smallest or largest objects or understand the size differences between classes.

Search
Rows 1-3 of 3
Class
Object count
Avg area
Max area
Min area
Min height
Min height
Max height
Max height
Avg height
Avg height
Min width
Min width
Max width
Max width
ok
rectangle
12180
7.13%
100%
0.02%
2px
0.89%
2179px
100%
115px
23.66%
5px
0.88%
1954px
100%
none
rectangle
8521
5.46%
87.29%
0.01%
3px
1.17%
1821px
100%
107px
20.57%
2px
0.36%
1821px
99.44%
wrong
rectangle
5500
11.86%
98.89%
0.05%
9px
2.3%
2554px
99.44%
177px
31.02%
6px
1.6%
1743px
100%

Spatial Heatmap #

The heatmaps below give the spatial distributions of all objects for every class. These visualizations provide insights into the most probable and rare object locations on the image. It helps analyze objects' placements in a dataset.

Spatial Heatmap

Objects #

Table contains all 26201 objects. Click a row to preview an image with annotations, and use search or pagination to navigate. Sort columns to find outliers in the dataset.

Search
Rows 1-10 of 26201
Object ID
Class
Image name
click row to open
Image size
height x width
Height
Height
Width
Width
Area
1
ok
rectangle
vt_test_00048.jpg
567 x 480
316px
55.73%
260px
54.17%
30.19%
2
none
rectangle
vt_test_00628.jpg
461 x 333
83px
18%
83px
24.92%
4.49%
3
ok
rectangle
vt_test_00980.jpg
176 x 200
68px
38.64%
69px
34.5%
13.33%
4
none
rectangle
vt_test_00713.jpg
310 x 310
100px
32.26%
83px
26.77%
8.64%
5
none
rectangle
vt_test_00817.jpg
253 x 550
57px
22.53%
52px
9.45%
2.13%
6
ok
rectangle
vt_test_00136.jpg
220 x 220
115px
52.27%
100px
45.45%
23.76%
7
ok
rectangle
vt_test_01335.jpg
326 x 580
125px
38.34%
125px
21.55%
8.26%
8
none
rectangle
vt_test_00594.jpg
826 x 690
207px
25.06%
161px
23.33%
5.85%
9
none
rectangle
vt_test_00386.jpg
268 x 202
121px
45.15%
84px
41.58%
18.77%
10
ok
rectangle
vt_test_01161.jpg
800 x 800
297px
37.12%
296px
37%
13.74%

License #

License is unknown for the MFVT Dataset for Real-time Masked Facial Detection dataset.

Source

Citation #

If you make use of the MFVT data, please cite the following reference:

@dataset{MFVT,
  author={Trong-Vu Hoang and Huu-Thien Tran and Trong-Le Do},
  title={MFVT Dataset for Real-time Masked Facial Detection},
  year={2022},
  url={https://www.kaggle.com/datasets/trngvhong/mfvt-dataset}
}

Source

If you are happy with Dataset Ninja and use provided visualizations and tools in your work, please cite us:

@misc{ visualization-tools-for-mfvt-dataset,
  title = { Visualization Tools for MFVT Dataset },
  type = { Computer Vision Tools },
  author = { Dataset Ninja },
  howpublished = { \url{ https://datasetninja.com/mfvt } },
  url = { https://datasetninja.com/mfvt },
  journal = { Dataset Ninja },
  publisher = { Dataset Ninja },
  year = { 2024 },
  month = { nov },
  note = { visited on 2024-11-21 },
}

Download #

Please visit dataset homepage to download the data.

. . .

Disclaimer #

Our gal from the legal dep told us we need to post this:

Dataset Ninja provides visualizations and statistics for some datasets that can be found online and can be downloaded by general audience. Dataset Ninja is not a dataset hosting platform and can only be used for informational purposes. The platform does not claim any rights for the original content, including images, videos, annotations and descriptions. Joint publishing is prohibited.

You take full responsibility when you use datasets presented at Dataset Ninja, as well as other information, including visualizations and statistics we provide. You are in charge of compliance with any dataset license and all other permissions. You are required to navigate datasets homepage and make sure that you can use it. In case of any questions, get in touch with us at hello@datasetninja.com.