
and phrases commonly used in traditional financial sentiment
research. However, recent advancements in NLP, particularly
BERT, have highlighted the importance of comprehending
language within its context. [9]
This study holds great significance as it aims to bridge
the gap between historical sentiment analysis and financial
prediction modeling. Our goal is to equip investors, analysts,
and financial institutions with a comprehensive and precise
toolkit for decision-making. We achieve this by combining the
strengths of established financial sentiment research method-
ologies with the advanced language comprehension abilities
of BERT.
In this study, we delve into the realm of financial sentiment
analysis, a field that intersects finance and natural language
processing (NLP). [2]
The research presented in this study has significant im-
plications for both the academic and business communities.
It introduces a comprehensive framework for examining the
financial sentiment expressed in various types of financial
texts, including news articles and earnings reports. Further-
more, it lays the foundation for developing automated tools
and systems that can offer real-time sentiment analysis in
the dynamic financial landscape. These advancements will
ultimately assist stakeholders in making more informed and
effective decisions.
In the realm of analyzing financial texts, a comprehensive
evaluation was conducted on the tone, utilizing a specialized
lexicon known as the Loughran-McDonald dictionary. This
dictionary consists of an extensive collection of financial
terminology and sentiment scores, allowing for a deep un-
derstanding of the emotional connotations associated with
specific financial terms and expressions. By leveraging this
lexicon, precise sentiment classification becomes achievable.
Each financial term or expression within the database is as-
signed sentiment scores indicating whether it carries positive,
negative, or neutral meanings. Consequently, financial docu-
ments can be accurately categorized based on their underlying
attitudes. Among various models evaluated for performance
and accuracy, it was found that the Bidirectional Encoder
Representations from Transformers (BERT) model exhibited
exceptional results with 90 percent accuracy in predicting
sentiments. However, it should be noted that this model does
face challenges in terms of loading time and prediction speed
due to its intricate nature.[8]
In the fascinating study on sentiment analysis of news
articles, they employed a methodology based on the Lexicon-
based approach. When it comes to sentiment analysis, there are
generally two main approaches: supervised and unsupervised.
The supervised approach involves training a classification
model using labeled data to classify new data without labels.
On the other hand, unsupervised or Lexicon-based approaches
do not require any training data. Instead, they rely on inferring
the sentiments of words based on their polarity.[9]
In the case of a sentence or document, the collective
polarities of individual words determine the overall sentiment
conveyed. This is achieved by summing up the polarities of
each word or phrase within the sentence. To facilitate this
approach, predefined lists of words are used, with each word
associated with a specific sentiment. Additionally, there are
various methods that can be utilized within this approach.
Overall, research sheds light on an intriguing methodology
for sentiment analysis in news articles using the Lexicon-
based approach. Their findings provide valuable insights into
understanding and interpreting sentiments in textual content
without relying on labeled training data.
In the realm of financial news analysis, the text found stands
out for its authoritative nature and distinct characteristics. To
enhance its quality, we have developed a novel workflow
framework that incorporates customized text cleanup, fine-
tuning of the Bert model, segmentation techniques, and a
Chinese enterprise name database. This framework enables us
to classify the emotions conveyed in news articles, identify
negative financial events, and recognize relevant entities within
Chinese financial news texts.
The accuracy of these classification models aligns with their
application requirements. Building upon this foundation, we
have designed and implemented a comprehensive system for
analyzing Chinese financial news texts throughout their entire
lifecycle. This system comprises three main modules: the
financial news collection module, the financial news analysis
module, and the financial news standardization and persistence
module.
To ensure scalability and sustainable optimization of our
system, we have employed an asynchronous design approach
between the financial news collection module and the fi-
nancial news analysis module. This strategic decision allows
for seamless integration while accommodating future growth
opportunities.[10]
Kim et al. delved into the fascinating realm of corporate
bankruptcy prediction. The aim was to explore whether em-
ploying context-specific textual sentiment analysis, specifically
BERT, could enhance the accuracy of these predictions. To
conduct their study, they meticulously gathered and analyzed
data from various sources, including five financial variables
derived from stock market data and annual reports, which have
been identified as precursors to impending insolvencies.[3]
Additionally, we embarked on a comprehensive exami-
nation of a vast collection of MDA narrative disclosures
spanning from 1995 to 2020. The objective was to investigate
whether incorporating textual sentiment analysis could offer
valuable insights into predicting financial distress. The findings
were remarkable: textual sentiment analysis demonstrated an
augmented predictive capability beyond the well-established
financial variables commonly used in such analyses.
Moreover, the study revealed that BERT-based analysis
outperformed the dictionary-based approach proposed by
Loughran and McDonald (2011), as well as the Word2Vec-
based analysis combined with convolution neural network.
2. Literature Review
Financial Engineering
DOI: 10.37394/232032.2024.2.15
Sheetal R., Prakash K. Aithal