9 Conclusion
In this paper, we displayed TDA techniques in
hypothesis testing. It is proposed that this test is
based on the nearest neighbour distance function. In
addition, a suggested modification for presented
tests depending on TDA is proposed. At different
patterns, a comparison depending on persistent
homology is performed among tests. These tests are
based on a distance function, and other tests depend
on a persistent landscape with two requirements: the
test's size and power. According to our observations,
tests that depend on persistent homology are much
more appropriate. However, tests based on
persistent landscape or distance function have
higher power than the remaining. Generally, all
TDA-based tests have fulfilled properties at
dimension one; if the sample size for the point cloud
data increases, it will positively impact the overall
number of tests. We demonstrated the efficacy of
the above tests on Statlog's heart disease dataset and
Wisconsin breast cancer dataset. There is still much
work to be conducted in future studies. For instance,
in generalizing the preceding tests to more than two
groups, comparing the various methods, [2], in-
depth clustering analysis depends on TDA and
evaluating it to another statistically existing method.
As TDA techniques improve, we expect many
researchers to apply topological analysis in their
studies.
Acknowledgments:
The authors would like to thank the team of TDA,
namely Brittany Terese Fasy, Jisu Kim, and
Clement Maria, for their help, advice, vast expertise,
and willingness to share their time freely. Moreover,
a lot of gratitude to Dr. Fabrizio Lecci for her help
in sending her thesis.
References:
[1] Bubenik, Peter, and Nikola Milićević,
"Homological algebra for persistence modules",
Foundations of Computational Mathematics
21.5 (2021): 1233-1278.
https://doi.org/10.1007/s10208-020-09482-9.
[2] Fasy, Brittany Terese, et al. "Confidence sets for
persistence diagrams", The Annals of Statistics
(2014): 2301-2339. https://doi.org/10.1214/14-
AOS1252.
[3] Alaa, H. N., and S. A. Mohamed. "On the
topological data analysis extensions and
comparisons", Journal of the Egyptian
Mathematical Society 25.4 (2017): 406-413.
https://doi.org/10.1016/j.joems.2017.07.001.
[4] Fasy, Brittany Terese, et al. "Introduction to the
R package TDA", arXiv preprint
arXiv:1411.1830 (2014).
https://doi.org/10.48550/arXiv.1411.1830.
[5] Edelsbrunner, Herbert, David Letscher, and
Afra Zomorodian. "Topological persistence and
simplification", Proceedings 41st annual
symposium on foundations of computer science.
IEEE, 2000. DOI:10.1109/SFCS.2000.892133.
[6] Bubenik, Peter. "Statistical topological data
analysis using persistence landscapes", J.
Mach. Learn. Res.16.1 (2015): 77-102.
https://doi.org/10.48550/arXiv.1207.6437
[7] Balchin, Scott, and Etienne Pillin. "Comparing
Metrics on Arbitrary Spaces using Topological
Data Analysis", arXiv preprint
arXiv:1503.04619 (2015).
https://doi.org/10.48550/arXiv.1503.04619
[8] Carlsson, Gunnar. "Topology and data",
Bulletin of the American Mathematical Society
46.2 (2009): 255-308. DOI:10.1090/S0273-
0979-09-01249-X.
[9] Emmett, Kevin, et al. "Parametric inference
using persistence diagrams: A case study in
population genetics", arXiv preprint
arXiv:1406.4582 (2014).
https://doi.org/10.48550/arXiv.1406.4582
[10] Gamble, Jennifer, and Giseon Heo.
"Exploring uses of persistent homology for
statistical analysis of landmark-based shape
data", Journal of Multivariate Analysis 101.9
(2010): 2184-2199.
https://doi.org/10.1016/j.jmva.2010.04.016.
[11] Kim, Wonse, et al. "Investigation of flash
crash via topological data analysis", Topology
and its Applications 301 (2021): 107523.
https://doi.org/10.1016/j.topol.2020.107523.
[12] Dłotko, Paweł, and Thomas Wanner.
"Topological microstructure analysis using
persistence landscapes", Physica D: Nonlinear
Phenomena 334 (2016): 60-81.
https://doi.org/10.1016/j.physd.2016.04.015.
[13] Sizemore, Ann E., et al. "The importance of
the whole: topological data analysis for the
network neuroscientist", Network
Neuroscience 3.3 (2019): 656-673.
https://doi.org/10.1162/netn_a_00073.
[14] Artamonov, Oleg. "Topological methods for
the representation and analysis of exploration
data in oil industry", Diss. Technische
Universität Kaiserslautern, 2010.
urn:nbn:de:hbz:386-kluedo-25456.
WSEAS TRANSACTIONS on MATHEMATICS
DOI: 10.37394/23206.2023.22.3
S. Z. Rida, Alaa Hassan Noreldeen, Faten R. Karar