restricted behavior, DSM-5 classified ASD into
three severity levels. Level one - Requiring Support,
Lack of social communication causes obvious
impairments if there are no supports in place. Level
two - Requiring Substantial Support, Social
impairments are noticeable even with assistance,
and Significant weaknesses in verbal and nonverbal
social communication abilities level three -
Requiring Very Substantial Support - Ability to
initiate social contacts is severely hampered by
severe inadequacies and there is little reaction to
social advances made by others.
The ADI-R and ADOS are two clinical instruments
and diagnostic approaches that are unfortunately
time-consuming and necessary for the medical
identification of people with autism (ADOS), [4].
Parent interviews are a part of the rigorous,
organized, and structured ADI-R test, which looks
at the child's early life, [5]. ADOS advises the
parent about the child's current social,
communication, and play abilities during a
prearranged play session.
2 Literature Survey
Several kinds of research with the help of machine
learning was done to speed up and enhance the
diagnosis of ASD. Utilizing machine learning,
which had an accuracy of 84%, to categorize the
retinal obsessions of kids with ASD and TD. These
investigations showed that when compared to
established diagnostic measures, machine learning is
more efficient and objective, [6]. The integrating
multidimensional features gives the best degree of
accuracy for depicting the correlation coefficients of
the brain when compared to examining individual
heterogeneous variables, [7]. With an accuracy of
97.6%, The Support Vector Machine technique to
screen for ASD. The limitation of this paper is the
small sample containing 612 autistic cases and 11
non autistic cases, [8]. Sixty-five Social
Impartiality individuals from two thousand nine
hundred and twenty-five individuals with ASD or
ADHD with the help of six machine learning
models. They employed forward feature selection
with minimum sampling, [9]. Five out of the sixty-
five tests, with an efficiency of 96.4%, were
sufficient to differentiate ASD from ADHD. There
was a considerable imbalance favoring the ASD
class in the dataset, which was mostly based on
collections about autism, [10]. There were 367
variables in the 95,577 children's data, 256 of which
were deemed sufficient, and the accuracy was
87.1% in SVM, RFC, and NB. The innovative
intelligence-sharing structure was designed to hide
responsive and simultaneously unstable individual
data. The study also recommended linear variance
analysis as a simple way for keyword separation
(LVA), [11]. SVM was the technique applied in this
filtration procedure. The study's findings offer hope
for employing text mining to safeguard private
health information transmitted over the Internet,
[12]. ASD is predicted using parameters based on
brain activity, 95.9% accuracy was achieved using
SVM with 2 groups and 19 features. The amount of
data was relatively small. It used a cross-validation
approach to extract 6 personality variables from the
data of 851 individuals to train and evaluate their
ML models. This was utilized to categorise patients
into those with and those without ASD, [13]. Facial
expressions from photo to identify psychological
stress is not possible. Using haar cascade algorithm
is used to evaluate the stress via logarithmic
regression, [14], [15].
3 Working Principle
Data processing converts raw data into a more
comprehensible format from a preliminary step.
Figure 1 illustrates the workflow of our proposed
system. The dataset will be preprocessed for missing
data, duplicate data, and noisy data. Pre-processing
of the data can be done by dimensionality reduction
technique. Autism can be predicted using
classification approaches after the dataset is
preprocessed. The efficiency of each classification
may contrasted. The training efficiency will be
greater than the test efficiency if the classification
performs properly. Then, this classification
algorithm may be used for the next training and
classification as the best model.
3.1 Dataset Pre-processing
One of the most essential components in Machine
Learning is the dataset. Once the dataset is
collected, researchers should employ various dataset
types that may depend on their prediction model.
Preprocessing can convert the data into a pattern
that is more easily and productively refined in data
mining, and machine learning through
dimensionality reduction techniques using selection
and extraction.
3.2 Feature Selection
It is the process of contracting the proposal to your
model by using only proper data so the noise in data
can be cleared. The most essential task for building
an efficient classification system. There are three
methods of feature selection are available.
WSEAS TRANSACTIONS on COMPUTERS
DOI: 10.37394/23205.2023.22.42
Sindhu Veeramani, S. M. Ramesh, B. Gomathy