economics and finance, [11], [12], [13], [14]. and
they rely on fundamental and technical analysis. The
fundamental analysis technique, [15], 16], [17].
investigates external elements that affect the stock’s
intrinsic value, including interest rates, currency
rates, inflation, industrial policy, listed company
finance, international relations, and other economic
and political aspects. On the other hand, the
technical analysis method primarily focuses on the
direction of stock price, trading volume, and
investors’ psychological expectations. It focuses on
using Kline charts and other tools to analyze the
stock index trajectory of individual stocks or the
entire market. The number of information sources
employed in stock market prediction methods can be
classified into two categories: single-source
techniques and dual-source approaches, [18], [19],
[20]. Single-source techniques depend entirely on
numerical data to predict the price of a stock.
According to the Efficient Market Hypothesis
(EMH), [21], the stock price will reflect all relevant
information. The stock price may be influenced by
different sources of information that complement
one another. Thus, the dual-source approaches focus
on developing appropriate news representations
while capturing the data’s temporal relationship,
[22], [23], [24]. In recent years, however, both the
rate of publication and the number of daily news
providers have risen dramatically, considerably
outstripping investors’ ability to sift through
massive amounts of data. As a result, an automated
decision-making system is essential to analyze and
forecast future stock movements. One of the most
challenging tasks for both traders and academics
/researchers is stock market forecasting, [25].
Because of its enormous earning potential, the stock
market has always attracted a large number of
investors. Researchers believe stock market
prediction is challenging due to the difficulty in
obtaining nonlinear and non-stationary variance in
data, [26]. Thus, Stock market forecasting has long
relied on machine learning and deep learning
approaches, [27], [28]. The development of
Recurrent Neural Networks (RNNs) with Long
Short-Term Memory (LSTM), [29]. Attention
Mechanisms [30], particularly Self-Attention and
Transformers, [31], are examples of recent
developments in deep learning. These
methodologies have significantly increased the
accuracy of word-based tasks such as sentiment
analysis prediction, [32]. As a result, this paper
proposes a successful model for sentiment analysis
of stock market-related news that uses BERT
(Bidirectional Encoder Representations from
Transformers), [33]. word-embedding and LSTMs.
After stating the motivations for analyzing stock
market-related news, the utilization of LSTMs, and
highlighting the difference between single-source
and dual-source techniques, the remaining part of
the paper is organized as follows. Selected relevant
approaches for dual-source stock market prediction
(mostly using deep learning approaches) are
reviewed in Section 2. Section 3 presents the
proposed methodology, including the used word
embedding and the planned pipeline. In Section 4, a
description of the constructed dataset is given. The
results are presented and discussed in Sections 5 and
6, respectively. Finally, the conclusion and further
work are found in Section 7
2 Literature Review
A Numerical-based Attention (NBA) approach for
dual-source stock market prediction was proposed
by, [34]. First, they proposed an attention-based
stock price prediction strategy that effectively
harnesses the complementarity of news and
numerical data. The stock trend information hidden
in the news is captured by the crucial distribution of
numerical data. As a result, the information is
encoded to make numerical data selection easier.
Their approach effectively filters out noise while
boosting the usefulness of news trend information.
Then, three datasets were created using a news
corpus and numerical data from two sources to
evaluate the NBA model. And with the advancement
of text mining techniques, [35]. proposed a modern
autoregressive neural network architecture that
incorporated sentiment predictors. They suggested
that using predictors based on counts of news
articles/stories and Twitter posts will considerably
improve the accuracy of stock price predictions.
Also, in, [36]. projected the stock price movements
by exploring a stock price prediction based on news
sentiment analysis. In addition, the authors proposed
using sentiment analysis to rate articles using single
combined strings and a positive, negative, or neutral
rating string. The performance of the sentiment
analysis is incorporated into any machine learning
models that predict the stock market. Instead of
utilizing complete news articles, [37]. concentrated
on the economic news headlines. They employed
several approaches to analyzing the sentiment of the
headlines. They employed BERT as a baseline and
then used other tools (namely, VADER, Text Blob,
and a Recurrent Neural Network) to compare the
sentiment analysis findings to stock changes over
WSEAS TRANSACTIONS on COMPUTERS
DOI: 10.37394/23205.2022.21.36
Mohsen A. Hassan, Aliaa Aa Youssif,
Osama Imam, Amr S. Ghoneim