data, the images can be classified and the disease
stage predicted. With this data, it is not possible to
run a CNN, as the CNN is overfitted for the data.
There are many methods like CNN that cannot
perform well with very small data sets and when
limited computational power issues exist. However,
it was demonstrated in this study using a confusion
matrix that MCM works even with a very small data
set. MCM can be customized or adapted more
easily to the specific characteristics of small
datasets. MCM also fits smaller and bigger data sets
to find patterns in the data easily and can understand
the patterns of the data with a much smaller data
sample size like 10, whereas CNN needs bigger data
sets to find a pattern. Although the CNN works with
smaller datasets, it does not work as effectively for
data as small as the one used in this study.
Another important property of MCM is that even
if a small data set is used and the classification
thresholds are calculated, it can be generalized to a
big data set, which is an important data
augmentation property. That is, the thresholds of
classification obtained by small data could be
applied to a large data set to classify images. While
CNN can also do this, it cannot do so with smaller
data of as few images as 10 or 20. Moreover, MCM
can successfully find patterns in very small data sets
and work effectively with larger data sets, that is
when the same thresholds of classification obtained
using a small data set are applied to larger data sets
Domain expertise is needed to work with CNN.
For example, this data set is related to the
classification of the stages of lung cancer. If CNN is
used, then in most cases, the user must know what is
stage 1, stage 2, stage 3, and stage 4, because CNN
needs manually classified data, so the user has to
first classify the data manually, whereas MCM uses
a normal image to classify images with
abnormalities.
A quantitative comparison of MCM and CNN
showed significant differences in performance
between MCM and CNN, such that MCM showed
better performance than CNN.
The results of this study demonstrate that the
MCM was more effective than the MFA for all
kinds of data, and for small data sets, the MCM
worked better than the CNN. The main limitation of
MCM is that many kinds of standard or normal
images have to be used. In addition, noise in the
images should be removed before using MCM,
which was done in this study by cropping the lung
images.
6 Conclusion
To conclude, MCM is a generalization of the MFA
method, showing that MCM more accurately
classifies images. Specifically, MCM is 21% more
accurate than MFA_c and 9.5% more accurate than
MCA_l. Both the MCM and MFA methods are
successful in quantifying the abnormality in an
image, such as a cancer tumor. However, the MCM
is very sensitive and can catch small changes in the
abnormality when compared to the MFA. On the
other hand, the MFA method is subjective to the
standard image. Both work with a very small data
set, so they are useful for studying rare diseases or
abnormalities in the form of images. The main
problem with the MFA method is that it is based on
comparing a single normal image with an abnormal
image. That is, if a single normal image is replaced
with another, then the classification thresholds will
be altered. However, this problem was rectified in
this study using multiple normal images for
comparison with one abnormal image. Thus, the SSI
score and the classification determined using the
MCM method were made more robust than with the
MFA. Thus, physicians and scientists could use the
MCM with confidence to obtain an accurate initial
overview of an abnormality or disease in a patient.
Furthermore, the MCM method can be used to make
accurate predictions for rare diseases or problems
with very little data.
When comparing MCM with the gold standard,
CNN, for a small data set, the results of all the
statistical tools used show that the MCM performed
better than CNN. Moreover, MCM accepted
DICOM images and conversion to PNG. Hence,
some of the ‘fuzziness’ was avoided. The MCM can
also be used to detect the abnormality in a small
data set with two images. Rare diseases typically do
not have a lot of data to train, test, or validate the
CNN process. Thus, MCM can be used to detect
rare diseases using a limited number of diagnostic
images, as the minimum data needed to run the
MCM is more than one normal image and one
image for each group or stage of abnormality.
Additionally, although the MCM method was
applied to cancer images in the current study, it
could be applied to any image type, like other
medical images, or images from any other field of
science, such as astronomy and geography.
One of the research gaps that needs more
detailed study is rare events, such as rare diseases.
These rare diseases have limited data and using
traditional tools, it is not possible to study these
diseases. However, MCM is designed for both
smaller and larger data sets, and the comparison
performed between CNN and MCM in the current
WSEAS TRANSACTIONS on COMPUTERS
DOI: 10.37394/23205.2024.23.16
Kishore Kumar Akula, Maura Marcucci,
Romain Jouffroy, Farzad Arabikhan,
Raheleh Jafari, Monica Akula, Alexander Gegov