Concept-based Extension of SKOS Defense Controlled Vocabulary:
Techniques and Implications
PERICLES S. GIANNARIS1, NIKOLAOS DOUKAS2, NIKOS MASTORAKIS1
1Sector of Electrical Engineering and Computer Science,
ASEI (Military Institutes of University Education), Hellenic Naval Academy,
Hadjikyriakou Avenue, Piraeus, P.C. 18539,
GREECE
2Division of Mathematics and Engineering,
ASEI (Military Institutes of University Education), Hellenic Army Academy,
Leoforos Eyelpidon (Varis – Koropiou) Avenue Vari P.O. 16673,
GREECE
Abstract: - A controlled vocabulary is a set of terms that are utilized to represent knowledge in a domain. In the
domain of defense, the use of terms such as “command, control, communications, computers, intelligence,
surveillance and reconnaissance (C4ISR)”, “armored personnel carriers (APC)”, or “biological and bioinspired
structures for multispectral surveillance” denote the core competencies of domain experts and the depth of
diverse knowledge. This paper describes the second phase of the pilot project to create a defense-related
controlled vocabulary with a focus on the Russo-Ukraine conflict. Applications for data annotation, SKOS
hierarchical vocabulary development, and vocabulary quality analysis are used to identify terms in text, express
identified terms in military press releases as SKOS vocabulary, and assess its structure. The preliminary
vocabulary is extended by 173 concepts. The quality of the vocabulary is validated against a SKOS checker of
twenty-four criteria.
Key-Words: - Controlled Vocabulary, Knowledge Representation, SKOS, Semantic Web, Semantic Web,
Named Entity Recognition.
Received: June 19, 2023. Revised: February 8, 2024. Accepted: March 15, 2024. Published: May 7, 2024.
1 Introduction
A controlled vocabulary is a set of terms that are
utilized to represent knowledge in a domain. For
example, in the domain of defense are used the
following terms: “command, control,
communications, computers, intelligence,
surveillance and reconnaissance (C4ISR)”,
“armored personnel carriers (APC)”, or “biological
and bioinspired structures for multispectral
surveillance”. The structure that a controlled
vocabulary can acquire is that of a simple list of
terms or a complex graph structure. The World
Wide Web Consortium (W3C) considers controlled
vocabularies as the basic element of the Semantic
Web, a set of technologies to link disparate data
sources. To standardize the creation of a controlled
vocabulary in a machine-readable format and to
assist with its assessment, W3C recommends the
simple knowledge organization (SKOS) schema.
SKOS is based on the machine-readable resource
description framework (RDF) data model therefore,
it can be defined as an OWL ontology, [1], [2].
In the initial phase of our pilot study for the
creation of a Russo-Ukraine conflict controlled
vocabulary, we extracted terms from a sample of
North Atlantic Treaty Organization (NATO) press
releases. The terms in the vocabulary are
hierarchically structured according to the SKOS
data model. In this paper, we describe three steps to
extend our initial controlled vocabulary with new
concepts. First, we analyze a new batch of press
releases to identify terms to express in SKOS.
Second, we utilize the friend-of-a-friend (FOAF)
language to describe persons and their social links
within the context of NATO. FOAF depends on
W3C's standards, specifically on extensible markup
language (XML), XML Namespaces, RDF, and web
ontology language (OWL), [3]. Lastly, we conduct a
preliminary assessment of our extended controlled
vocabulary.
A couple of published research is noted to
demonstrate the utility of controlled vocabularies in
WSEAS TRANSACTIONS on INFORMATION SCIENCE and APPLICATIONS
DOI: 10.37394/23209.2024.21.22
Pericles S. Giannaris,
Nikolaos Doukas, Nikos Mastorakis
E-ISSN: 2224-3402
229
Volume 21, 2024
different disciplines. First, [4], emphasize that Gene
Ontology (GO) is a controlled vocabulary with three
hierarchies of terms for (a) gene functions, (b)
larger-scale biological processes, and (c) cellular
components to discover information about gene
products, [4]. Second, Based on a methodology to
semantically encode text data to machine
understandable format, [5], propose a controlled
vocabulary as the foundation of a thematic thesaurus
modelled according to SKOS to streamline the
interoperability of knowledge bases at knowledge-
intensive institutions such as libraries and
universities, [5].
The rest of the paper is organized as follows,
section two discusses the methodology for
extending our Russo-Ukraine conflict controlled
vocabulary based on text analysis and the
employment of tools to leverage SKOS and FOAF;
section three discusses pilot results, preliminary
evaluation of the controlled vocabulary and,
limitations; and section four concludes the paper.
2 Methodology
Figure 1 displays the tasks involved in creating a
controlled vocabulary for the Russo-Ukraine
conflict using terms from NATO press releases.
2.1 Data Acquisition
Similarly to the initial phase, we obtain press
releases from the NATO Press Office News site, [6].
Our query, “Ukraine AND Russia AND Conflict”,
covers the period from 27 July 2023 to 14
December, 2023. The search engine returns twenty-
eight press releases, between 23 August, 2023 and
14 December, 2023, in batches of ten per results
page. We manually store and organize the results in
a spreadsheet. The data are organized in three
columns: date, image, headline, and opening
paragraph.
2.2 Data Wrangling
This task has two steps. Firstly, we utilize the
"select object", "filter", "sort", "macro option for
Visual Basic for Application (VBA) code", and
"trim" functions in our spreadsheet editor to remove
images, standardize word spacing, convert hypertext
to plain text, and sort our sample of text data by
date. Our text data sample is now organized into
three columns: the date, the headline, and the press
release. Second, we measure the frequency of each
word in the data set and count data according to
unique dates, the number of press releases per date,
words per headline, and press release, [7], [8] using
R base functions, [9]. Text quantification provides a
preliminary view of the potential distribution of
terms considering our data set.
2.3 Text Data Annotation
Text data annotation is the process of tagging
information to text to circumstantiate its meaning.
As in the first phase of our research, we annotate
terms in our sample of press releases to be added
toour controlled vocabulary. Specifically, we
annotate text similarly to preparing data for named
entity recognition (NER), the computational
technique to recognize predefined categories of
entities in a body of text, [10] for example,
“[Secretary General]designation [Jens
Stoltenberg]proper_name visited [Kyiv]city”. To annotate
text, we employ Label Studio [11], a data annotation
tool for NER projects that is offered as a feature in
DagsHub [12], a platform that supports data
analytics projects. Additionally, we leverage the
code configuration interface of the tool to customize
the annotation tags to our previous manual
annotation codebook, [13]. The annotated data are
saved in a comma-separated values (CSV) file and
supplied in a JavaScript Object Notation (JSON)
like structure. Next, we programmatically parse the
data file to extract terms and their corresponding
annotations, [14], [15], in a data frame, a type of a
two-dimensional array, Table 1.
Table 1. Randomly selected terms and
corresponding annotation tags
Term
Annotation tag
nato deputy
secretary general
“designation”
vilnius summit
“defense_or_civil_event”
new york
“city”
intensive care units
in hospitals
“infrastructure”
italy
“country”
pressing security
challenges
“challenges_situation_problem”
artillery systems
“aegis_arms_research_instruments”
ministry of the
interior
“government_authority”
medical services to
civilians
“defense_process_measures”
WSEAS TRANSACTIONS on INFORMATION SCIENCE and APPLICATIONS
DOI: 10.37394/23209.2024.21.22
Pericles S. Giannaris,
Nikolaos Doukas, Nikos Mastorakis
E-ISSN: 2224-3402
230
Volume 21, 2024
Fig. 1: Flowchart of the process to create a Russo-Ukraine conflict SKOS-based controlled vocabulary
The data frame is stored in a spreadsheet for
further analysis. For each annotated term and
corresponding tag, we provide definitions and
related sources, which are used in the next task.
2.4 Creation of Controlled Vocabulary
SKOS has three basic elements that are defined as
classes, properties, and relations. Every element
begins with the prefix “skos”. The elements are
distinguished according to the uppercase or
lowercase letter after the prefix for example, a class
can be encoded as “skos:Concept”; a property can
be encoded as skos:prefLabel, [16]. The property
skos:related” can be used to express a relation
between two concepts. Importantly, SKOS models
data according to a concept-centered approach
compared to a term-centered approach, [17].
In this phase, we employ the Controlled
Vocabulary Editor (CoVEd), which uses the SKOS
schema and the Terse RDF Triple Language (TTL)
syntax to extend our controlled vocabulary. CoVEd
was developed by the Narralive research team of the
Athena Research and Innovation Center at the
University of Athens, Greece, [18], [19] in [20]. We
begin by creating a concept scheme that
incorporates the sets of concepts for our vocabulary.
Each concept is identified by a uniform resource
identifier (URI). The following SKOS concepts and
properties are the backbone of our vocabulary:
“skos:ConceptScheme”, “skos:Concept”,
“skos:inScheme”, “skos:topConceptOf”,
“skos:hasTopConcept”. The “skos:prefLabel”
encodes the preferred lexical label for our resource.
The use of the “skos:note” and “skos:definition”
properties helps us to provide information about our
SKOS concepts. Furthermore, the “skos:broader”
and “skos:narrower” properties are used to assert a
hierarchical connection between two concepts; the
“skos:related” property is used to declare semantic
relations between concepts in our concept scheme,
[21].
To encode data related to persons and their social
network in a semantic way, we have the opportunity
to use named properties and classes from the friend-
of-friend (FOAF) schema that are provided by
CoVEd. FOAF is based on RDF and the Web
Ontology Language (OWL) to link persons and
information using the Web, [22]. In this analysis, we
use the properties “foaf:person”, “foaf:name”,
“foaf:knows”.
3 Results and Discussion
Our search query returns 28 press releases published
by the NATO Press Office about the Russo-Ukraine
conflict between 27July, 2023, and 14December,
2023. The six most frequent words, as in unigram,
in our sample of press releases are the following:
“nato” 23 times, “secretary” 14 times, “Ukraine” 14
times, “general” 13 times, “support” 9 times, and
“defense” 6 times. We annotate 173 unique terms in
the text using twenty different tags, Table 2. The
annotation tags correspond to the hierarchy of
concepts in the SKOS vocabulary that we developed
during the first phase of this study.
WSEAS TRANSACTIONS on INFORMATION SCIENCE and APPLICATIONS
DOI: 10.37394/23209.2024.21.22
Pericles S. Giannaris,
Nikolaos Doukas, Nikos Mastorakis
E-ISSN: 2224-3402
231
Volume 21, 2024
Fig. 2: Snippet of the hierarchy of concepts of the
SKOS Russo-Ukraine Conflict vocabulary
As mentioned in the methodology, the result of
the NER-like annotation process is available in
JSON-based structure:
[{"end": 22, "text": "NATO Secretary
General", "start": 0, "labels":
["DESIGNATION"]}, [...], {"end": 413,
"text": "US-led Ukraine Defence Contact
Group meeting", "start": 369, "labels":
["DEFENCE_OR_CIVIL_EVENT"]}, [...],
{"end": 136, "text": "Brussels", "start":
128, "labels": ["CITY"]}, [...], {"end": 39,
"text": "Jens Stoltenberg", "start": 23,
"labels": ["PROPER_NAME"]}]
The above excerpt shows that JSON data are
structured as name-value pairs e.g., "text":
"Brussels" or "labels": ["CITY"]. Here, “text” is
paired with the term we identify in the press release;
“labels” is paired with the annotation tag we select
for that term.
Table 2. Excerpt of tags that are used to annotate
terms in press releases
Annotation tags
designation
proper_name
continent_or_state_or_province
defence_or_civil_event
nato_engagements
defense_project
biomedical
SKOS models data in concept schemes. Our SKOS
data structure has a five-level hierarchy of concept
specificity. The preferred label for our concept
scheme is expressed as ‘skos:prefLabel "russo-
ukrainian-defence-diplomacy-vocabulary"@en’.
SKOS Play, [23], prints the controlled vocabulary in
HTML format that allows to computationally
quantify the generated information. We count 224
concepts. Concerning terms, SKOS Play defines
“nato” as the top term (TT). Here, TT corresponds
to the above-mentioned first level of hierarchy. The
second level comprises twelve concepts; the third
level has eighteen concepts, and the fourth level has
twenty concepts. The above concepts are defined as
broad terms (BTs). The rest 173 concepts
correspond to terms identified in the sample of press
releases. The later are defined as narrow terms
(NTs). Figure 2 shows part of the tree structure of
our SKOS model. Furthermore, there are 180
relations expressed as “skos:related” between
concepts.
Below, excerpts of our SKOS model, in TTL
serialization, that models poly-hierarchy about the
concept “russell-c-leffingwell-lecture”:
ex4:nato rdf:type skos:Concept ;
dcterms:identifier
"http://example.com/periclesRepo/ex4/nato"^^xsd:anyURI ;
skos:inScheme ex4: ;
skos:topConceptOf ex4: ;
skos:narrower ex4:[. . .] ;
skos:narrower ex4:democratic_action ;
skos:narrower ex4:[. . .] ;
skos:prefLabel "nato"@en .
ex4:democratic_action rdf:type skos:Concept ;
dcterms:identifier
"http://example.com/periclesRepo/ex4/democratic_action"^
^xsd:anyURI ;
skos:broader ex4:nato ;
skos:inScheme ex4: ;
skos:narrower ex4:peace_building_event ;
skos:prefLabel "democratic_action"@en .
ex4:peace_building_event rdf:type skos:Concept ;
dcterms:identifier
"http://example.com/periclesRepo/ex4/peace_building_even
t"^^xsd:anyURI ;
skos:broader ex4:democratic_action ;
skos:inScheme ex4: ;
skos:narrower ex4:defence_or_civil_event ;
skos:prefLabel "peace_building_event"@en .
ex4:defence_or_civil_event rdf:type skos:Concept ;
WSEAS TRANSACTIONS on INFORMATION SCIENCE and APPLICATIONS
DOI: 10.37394/23209.2024.21.22
Pericles S. Giannaris,
Nikolaos Doukas, Nikos Mastorakis
E-ISSN: 2224-3402
232
Volume 21, 2024
dcterms:identifier
"http://example.com/periclesRepo/ex4/defence_or_civil_ev
ent"^^xsd:anyURI ;
skos:broader ex4:peace_building_event ;
skos:definition "Event: a thing that happens or takes place,
especially one of importance; a planned public or social
occasion"@en ;
skos:inScheme ex4: ;
skos:narrower ex4:[. . .] ;
skos:narrower ex4:russell-c-leffingwell-lecture ;
skos:narrower ex4:[]. . . ;
skos:note "source: https://languages.oup.com/google-
dictionary-en"@en ;
skos:prefLabel "defence_or_civil_event"@en .
ex4:russell-c-leffingwell-lecture rdf:type skos:Concept ;
dcterms:identifier
"http://example.com/periclesRepo/ex4/russell-c-leffingwell-
lecture"^^xsd:anyURI ;
skos:broader ex4:defence_or_civil_event ;
skos:definition "Russell C. Leffingwell Russell C.
Leffingwell Century Society Archives The Russell C.
Leffingwell Lecture, inaugurated in 1969, was named for a
charter member of CFR who served as its president from
1944 to 1946 and as its chairman from 1946 to 1953. This
lecture is given by a distinguished foreign official who is
invited to address CFR members on a topic of major
international significance. The lectureship was originally
endowed by the Morgan Guaranty Trust Company and by
Edward and Lucy Leffingwell Pulling, and more recently
through the generosity of Thomas Leffingwell Pulling and
his son Edward Leffingwell Pulling"@en ;
skos:inScheme ex4: ;
skos:note "source: https://www.cfr.org/project/russell-c-
leffingwell-lecture-series"@en ;
skos:prefLabel "russell c. leffingwell lecture"@en .
Below, example of relations between concepts
based on SKOS and FOAF in TTL serialization:
ex4:proper_name rdf:type skos:Concept ;
dcterms:identifier
"http://example.com/periclesRepo/ex4/proper_name"^^xsd:
anyURI ;
skos:broader ex4:person ;
skos:definition "A proper noun is the name of a particular
person, place, organization, or thing. Proper nouns begin
with a capital letter"@en ;
skos:inScheme ex4: ;
skos:narrower ex4:[. . .] ;
skos:narrower ex4:jens-stoltenberg ;
skos:narrower ex4:[. . .] ;
skos:note "source:
https://www.collinsdictionary.com/dictionary/english/prope
r-
noun#:~:text=A%20proper%20noun%20is%20the,Compare
%20common%20noun."@en ;
skos:prefLabel "proper_name"@en .
ex4:jens-stoltenberg rdf:type skos:Concept ;
dcterms:identifier
"http://example.com/periclesRepo/ex4/jens-
stoltenberg"^^xsd:anyURI ;
skos:broader ex4:proper_name ;
skos:definition "Jens Stoltenberg is a Norwegian politician
who has served as the 13th secretary general of NATO
since 2014"@en ;
skos:inScheme ex4: ;
skos:note "source:
https://en.wikipedia.org/wiki/Jens_Stoltenberg"@en ;
skos:prefLabel "jens stoltenberg"@en ;
skos:related ex4:[. . .] ;
skos:related ex4:joe-biden ;
skos:related ex4:[. . .] ;
foaf:knows ex4:[. . .] ;
foaf:knows ex4:joe-biden ;
foaf:knows ex4:[. . .] ;
foaf:name ex4:jens-stoltenberg ;
foaf:person ex4:jens-stoltenberg .
Interestingly, the Narralive research team
extends the CoVEd’s graph functionality with a
social network-like visualization for relations
between concepts expressed as “skos:related”,
Figure 3.
Fig. 3: The matrix represents extracts of relations between concepts. Relations are expressed as “skos:related”
WSEAS TRANSACTIONS on INFORMATION SCIENCE and APPLICATIONS
DOI: 10.37394/23209.2024.21.22
Pericles S. Giannaris,
Nikolaos Doukas, Nikos Mastorakis
E-ISSN: 2224-3402
233
Volume 21, 2024
The “graph” feature in CoVEd provides users
with an additional functionality to visualize skos
encoded relations in a social network-like depiction,
Figure 3. Here, “graph” is leveraged to illustrate
relations between concepts that are hierarchically
below “proper_name”. The nodes represent
concepts. The dotted threads link two concepts. (A)
Node that illustrates the higher level concept
“proper_name”; (B) and (C) Nodes that illustrate
narrower concepts, here, the proper names of NATO
leaders; (D) Link that illustrates relation between
two concepts. The network representation is used to
demonstrate potential professional relations between
individuals within the organizational context of the
Alliance.
3.1 Controlled Vocabulary Assessment
To assess the quality of our SKOS vocabulary, we
use the SKOS Play Testing Tool, which is frontend
for qSKOS, a tool for finding quality issues in
SKOS vocabularies, [24], [25], [26]. By default, are
selected twenty four rules that correspond to quality
checking functions to assess the vocabulary, [27].
The output is a report that states that 224 concepts
are processed. Of the 24 rules that are checked,
three failed to be verified. Two of the three failed
rules are warnings such as “uc - Undocumented
Concepts” and “urc - Unidirectionally Related
Concepts”. The warnings indicate that there are
thirty one concepts that are found not to use any
SKOS documentation properties, and two concepts
that do not include reciprocal relations. The one fail
found is the “var - Valueless Associative Relations”
rule. It refers to finding sibling concept pairs that are
also connected by an associative relation. The
following are excerpts from warnings:
uc:http://example.com/periclesRepo/ex4/democratic_action
urc:http://example.com/periclesRepo/ex4/farid-safarov
var:http://example.com/periclesRepo/ex4/jane-harman
This assessment of the vocabulary quality is an
initial step to determine its suitability for reuse and
extensibility in the development processes.
Furthermore, the checking functions flag potential
quality problems that can be interpreted as structural
errors. Or, they can hamper vocabulary integration
due to concept incompatibility and inconsistencies
3.2 Discussion
The preliminary text analysis of our sample of
NATO press releases provides an overview of the
volume of information pertaining to civil and
military affairs. Yet, text analysis is the groundwork
for capturing and structuring knowledge that could
be leveraged by computational models. Moreover,
text analysis aids in emphasizing the grammatical
form of the terms that we are interested in selecting.
For example, the vocabulary should contain terms in
one of the following grammatical forms: noun or
noun phrase for instance, “alliance” or “council on
foreign relations”; verbal noun for instance,
“hosting a multinational nato battlegroup”; adjective
or pre-modified phrase for instance, “humanitarian
aid”; and, post-modified noun phrase for instance,
“lessons learned on countering hybrid tactics” [28].
There are five levels in the hierarchical structure
of this SKOS vocabulary. Conceptually, the last
level corresponds to terms annotated during the
NER-like procedure. Our polyhierarchical system
for classifying terms in press releases is the source
of these levels. The report on the analysis of our
vocabulary that is generated by SKOS Play
distinguishes between broad terms (BT) and narrow
terms (NT) with “nato” being the top term (TT).
Generally, polyhierarchy denotes that concepts can
potentially belong to more than one category.
For relations between concepts, there are two
main types of SKOS properties: associative and
hierarchical. For example, the property
“skos:broader” asserts that a concept has a general
meaning and “skos:narrower” is the inverse
property. The following excerpt describes
hierarchical relations:
ex4: rdf:type skos:ConceptScheme ;
dcterms:identifier
"http://example.com/periclesRepo/ex4/"^^xsd:anyURI ;
skos:hasTopConcept ex4:nato ;
skos:prefLabel "russo-ukrainian-defence-diplomacy-
vocabulary"@en .
ex4:nato rdf:type skos:Concept ;
dcterms:identifier
"http://example.com/periclesRepo/ex4/nato"^^xsd:anyURI ;
skos:inScheme ex4: ;
skos:topConceptOf ex4: ;
skos:narrower ex4:[. . .] ;
skos:narrower ex4:democratic_action ;
skos:narrower ex4:[. . .] ;
skos:prefLabel "nato"@en .
ex4:democratic_action rdf:type skos:Concept ;
dcterms:identifier
"http://example.com/periclesRepo/ex4/democratic_action"^
^xsd:anyURI ;
skos:broader ex4:nato ;
skos:inScheme ex4: ;
skos:narrower ex4:peace_building_event ;
skos:prefLabel "democratic_action"@en .
ex4:peace_building_event rdf:type skos:Concept ;
dcterms:identifier
"http://example.com/periclesRepo/ex4/peace_building_even
t"^^xsd:anyURI ;
skos:broader ex4:democratic_action ;
skos:inScheme ex4: ;
skos:narrower ex4:defence_or_civil_event ;
skos:prefLabel "peace_building_event"@en .
ex4:defence_or_civil_event rdf:type skos:Concept ;
dcterms:identifier
"http://example.com/periclesRepo/ex4/defence_or_civil_ev
ent"^^xsd:anyURI ;
skos:broader ex4:peace_building_event ;
WSEAS TRANSACTIONS on INFORMATION SCIENCE and APPLICATIONS
DOI: 10.37394/23209.2024.21.22
Pericles S. Giannaris,
Nikolaos Doukas, Nikos Mastorakis
E-ISSN: 2224-3402
234
Volume 21, 2024
skos:definition "Event: a thing that happens or takes place,
especially one of importance; a planned public or social
occasion"@en ;
skos:inScheme ex4: ;
skos:narrower ex4:[. . .] ;
skos:narrower ex4:russell-c-leffingwell-lecture ;
skos:narrower ex4:[. . .] ;
skos:note "source: https://languages.oup.com/google-
dictionary-en"@en ;
skos:prefLabel "defence_or_civil_event"@en .
ex4:russell-c-leffingwell-lecture rdf:type skos:Concept ;
dcterms:identifier
"http://example.com/periclesRepo/ex4/russell-c-leffingwell-
lecture"^^xsd:anyURI ;
skos:broader ex4:defence_or_civil_event ;
skos:definition "Russell C. Leffingwell Russell C.
Leffingwell Century Society Archives The Russell C.
Leffingwell Lecture, inaugurated in 1969, was named for a
charter member of CFR who served as its president from
1944 to 1946 and as its chairman from 1946 to 1953. This
lecture is given by a distinguished foreign official who is
invited to address CFR members on a topic of major
international significance. The lectureship was originally
endowed by the Morgan Guaranty Trust Company and by
Edward and Lucy Leffingwell Pulling, and more recently
through the generosity of Thomas Leffingwell Pulling and
his son Edward Leffingwell Pulling"@en ;
skos:inScheme ex4: ;
skos:note "source: https://www.cfr.org/project/russell-c-
leffingwell-lecture-series"@en ;
skos:prefLabel "russell c. leffingwell lecture"@en .
The property “skos:related” expresses associative
connection between concepts. For example, we use
associative relations for concepts that are under the
proper_name category. However, according to the
W3C recommendation for SKOS vocabularies, it is
not required to encode associative relations for pairs
of concepts that share the same broader or narrower
concept categories, [28], [29]. Nevertheless, in this
phase, we intend to express links between
individuals that are associated with the alliance
under various professional capacities. Similarly, we
use the “foaf:name” and “foaf:knows” properties to
express relations between persons in the vocabulary.
The utilization of the properties “skos:definition
and “skos:note” aims to provide a better
understanding of the meaning of a concept through
general documentation and the corresponding
source, [30]. Concerning limitations to the pilot
phase of our research, we underline two top issues.
First, the size of our data sample should be
increased to full press releases. Potentially, a larger
data set will provide additional terms for the
vocabulary. Consequently, it provides an increased
knowledge organization system (KOS). Second, the
polyhierarchy of categories of the SKOS vocabulary
is a structure that is under revaluation. The goal is to
address potential complicatedness regarding
broader-narrower concept relations. The next step is
to analyze entire press releases to expand the current
vocabulary.
Professional networks within the Treaty
connected by FOAF relationships can present
insights into characteristics and motifs of social
networks in the Semantic Web. Here, the use of
FOAF has the potential capacity to derive
organizational relations from contextual information
or domain knowledge through data mining
techniques such as classification clustering, or
logical inference.
4 Conclusion
This paper describes the second phase of the pilot
project to create a Russo-Ukraine conflict controlled
vocabulary. At this phase, CoVED and SKOS Play
Test are utilized to express 224 terms as SKOS
vocabulary and assess its structure. Of the twenty-
four rules that test the vocabulary, one rule is
marked as failed in the assessment report due to
Valueless Associative Relations. The analysis of full
press releases will be used as the next phase in this
research to potentially increase the number of
concepts in the vocabulary.The RDF-based
foundation of the proposed vocabulary provides a
nascent glimpse of an artificial intelligence (AI)
system to define classifications for defense
initiatives supporting peace-building civil and
diplomatic actions.
Acknowledgements:
We thank Dr. N. Mastorakis at the Hellenic Naval
Academy and Dr. N. Bardis at the Hellenic Army
Academy for facilitating this research.
Appreciations are also extended to Myrto Koukouli
and the NARRALIVE Research Team for providing
access to CoVEd and creating a repository for this
project.
References:
[1] A. Miles and S. Bechhofer, “SKOS Simple
Knowledge Organization System Reference,”
W3C Recommendation, [Online].
https://www.w3.org/TR/skos-reference/
(Accessed Date: February 15, 2024).
[2] I. Yoo and X. Hu, “Clustering Large
Collection of Biomedical Literature Based on
Ontology-Enriched Bipartite Graph
Representation and Mutual Refinement
Strategy,” in Advances in Knowledge
Discovery and Data Mining, in Lecture
Notes in Computer Science. Berlin,
WSEAS TRANSACTIONS on INFORMATION SCIENCE and APPLICATIONS
DOI: 10.37394/23209.2024.21.22
Pericles S. Giannaris,
Nikolaos Doukas, Nikos Mastorakis
E-ISSN: 2224-3402
235
Volume 21, 2024
Heidelberg: Springer, 2006, pp. 303–312,
https://doi.org/10.1007/11731139_36.
[3] D. Brickley and L. Miller, “FOAF
Vocabulary Specification,” FOAF
Vocabulary Specification, [Online].
http://xmlns.com/foaf/spec/ (Accessed Date:
February 15, 2024).
[4] J. H. Gennari, A. Silberfein, and J. C. Wiley,
“Integrating genomic knowledge sources
through an anatomy ontology,” Pac. Symp.
Biocomput. Pac. Symp. Biocomput.,
Fairmont Orchid on the Big Island of
Hawaii, USA, vol. 10, pp. 115–126, 2005,
ISBN 981-256-046-7.
[5] G. D. Solomou, D. A. Koutsomitropoulos, A.
K. Kalou, and S. D. Botsios, “Semantify
Educational Resources using SKOS and
Learning Object Ontologies,” 19th
International Conference on Circuits,
Systems, Communications and Computers
(CSCC 2015), Zakynthos Island, Greece, pp.
360–365, 2015.
[6] NATO Press Office, “NATO - News:
News,” NATO’s Newsroom, [Online].
https://www.nato.int/cps/en/natohq/news.htm
(Accessed Date: February 15, 2024).
[7] C. Khancome, “String Matching Algorithm
Using Multi-Characters Inverted Lists,”
WSEAS Transactions on Computers, Vol. 22,
pp. 151-158, 2023,
https://doi.org/10.37394/23205.2023.22.18.
[8] S. Oh, J. Jeong, C.-G. Lee, J. Yoo, and G.
Nam, “Synergistic Training: Harnessing
Active Learning and Pseudo-Labeling for
Enhanced Model Performance in Deep
Learning,” WSEAS Transactions on
Computers, Vol. 22, pp. 114-119, 2023,
https://doi.org/10.37394/23205.2023.22.14.
[9] The R Foundation, “R version 4.2.2 beta,” R:
The R Project for Statistical Computing,
[Online]. https://www.r-project.org/
(Accessed Date: February 15, 2024).
[10] IBM.com, “What is named entity
recognition?, IBM,” IBM, [Online].
https://www.ibm.com/topics/named-entity-
recognition (Accessed Date: February 15,
2024).
[11] Label Studio, “Label Studio Documentation
Overview of Label Studio,” Label Studio,
[Online].
https://labelstud.io/guide/get_started.html
(Accessed Date: February 15, 2024).
[12] DagsHub, “DagsHub: The Home for
Machine Learning Collaboration,” DagsHub,
[Online]. https://dagshub.com/ (Accessed
Date: February 15, 2024).
[13] P. S. Giannaris, J. Borges, and N. Doukas,
“Design of a Defense Controlled
Vocabulary, SKOS-based Encoding of
Terms from Military Press Releases,” in
2023 13th International Conference on
Dependable Systems, Services and
Technologies (DESSERT), Athens, Greece,
Oct. 2023, pp. 1–6, doi:
10.1109/DESSERT61349.2023.10416475.
[14] S. Sivkov, “Residue Number Systems
Quantization for Deep Learning Inference,”
WSEAS Transactions on Computers, Vol. 22,
pp. 296-301, 2023,
https://doi.org/10.37394/23205.2023.22.33.
[15] S. Saranya and G. Usha, “Emotion
Classification on Social Media Comments
Using Categorical Feature Extraction Along
With the Bidirectional Encoder-based
Recurrent Neural Network Classification,”
WSEAS Transactions on Computers, Vol. 22,
pp. 284-295, 2023,
https://doi.org/10.37394/23205.2023.22.32.
[16] Linked Heritage, “Terminology,” Linked
Heritage, [Online].
http://linkedheritage.cab.unipd.it/training/LO
-07/en/02.html (Accessed Date: February 15,
2024).
[17] T. Baker, S. Bechhofer, A. Isaac, A. Miles,
G. Schreiber, and E. Summers, “Key choices
in the design of Simple Knowledge
Organization System (SKOS),” J. Web
Semant., vol. 20, pp. 35–49, May 2013, doi:
10.1016/j.websem.2013.05.001.
[18] Koukouli, M., Katifori, A., Roussou, M., &
Ioannidis, Y. (2024). Controlled Vocabulary
Editor. SKOS vocabularies without SKOS
training. In K. Karpouzis, Y. Skarpelos
(eds.), Computational Methods for the
Digital Humanities. Springer.
https://doi.org/10.5281/zenodo.11080329.
[19] M. Koukouli, A. Katifori, and Y. Ioannidis,
“SKOS vocabulary without SKOS training.
A tool for everyone,” presented at the
Symbiosis of Tradition and Digital
Technology, Tallinn, Estonia, May 22, 2022.
[20] CIDOC, “Symbiosis of Tradition and Digital
Technology CIDOC conference abstracts
MAY 23-26, 2022 Tallinn, Estonia.”
CIDOC, May 23, 2022. Accessed: Feb. 15,
2024, [Online].
https://vlaamsekunstcollectie.be/volumes/im
ported/Esinejate-abstraktid_0505.pdf
(Accessed Date: February 15, 2024).
WSEAS TRANSACTIONS on INFORMATION SCIENCE and APPLICATIONS
DOI: 10.37394/23209.2024.21.22
Pericles S. Giannaris,
Nikolaos Doukas, Nikos Mastorakis
E-ISSN: 2224-3402
236
Volume 21, 2024
[21] A. Miles and D. Brickley, “SKOS Core
Vocabulary Specification,” SKOS Core
Vocabulary Specification W3C Working
Draft 2 November 2005, [Online].
https://www.w3.org/TR/swbp-skos-core-
spec/#note (Accessed Date: November 19,
2024).
[22] D. Brickley and L. Miller, “FOAF
Vocabulary Specification,” FOAF
Vocabulary Specification 0.99, [Online].
http://xmlns.com/foaf/spec/ (Accessed Date:
February 15, 2024).
[23] T. Francart, “SKOS Play! - Thesaurus &
Taxonomies,” SKOS Play, [Online].
https://skos-play.sparna.fr/play/home
(Accessed Date: February 15, 2024).
[24] T. Francart and C. Maden, “SKOS testing
tool,” [Online]. https://skos-
play.sparna.fr/skos-testing-tool/ (Accessed
Date: February 15, 2024).
[25] C. Mader, “cmader/qSKOS.” Aug. 15, 2023,
[Online]. Available:
https://github.com/cmader/qSKOS (Accessed
Date: February 15, 2024).
[26] C. Mader, B. Haslhofer, and A. Isaac,
“Finding Quality Issues in SKOS
Vocabularies.” arXiv, Jun. 06, 2012.
[27] O. Suominen and C. Mader, “Assessing and
Improving the Quality of SKOS
Vocabularies,” J. Data Semant., vol. 3, no. 1,
pp. 47–73, Mar. 2014, 2024, doi:
10.1007/s13740-013-0026-0.
[28] I. Spasić, D. Schober, S. A. Sansone, D.
Rebholz-Schuhmann, D. B. Kell, N. W.
Paton, "Facilitating the development of
controlled vocabularies for metabolomics
technologies with text mining." BMC
Bioinformatics, 9 (Suppl 5), S5 (2008), pp.1-
16, https://doi.org/10.1186/1471-2105-9-S5-
S5.
[29] P. P. F. Barcelos, “Quality Issues,” GitHub,
[Online].
https://github.com/cmader/qSKOS/wiki/Qual
ity-Issues (Accessed Date: February 15,
2024).
[30] E. Summers, A. Isaac, C. Redding, D. Krech,
"LCSH, SKOS and Linked Data", Metadata
for Semantic and Social Applications, Dublin
Core: Innovation and Moving Forward,
DCMI '08: Proceedings of the 2008
International Conference on Dublin Core
and Metadata Applications, Berlin Germany
September 22-28, 2008, pp.25-33, [Online].
https://library.oapen.org/bitstream/handle/20.
500.12657/37029/1/DC_proceedings.pdf#pa
ge=41 (Accessed Date: April 26, 2024).
Contribution of Individual Authors to the
Creation of a Scientific Article (Ghostwriting
Policy):
The authors equally contributed tothe present
research, at all stages from the formulation of the
problem to the final findings and solution.
Sources of Funding for Research Presented in a
Scientific Article or Scientific Article Itself:
No funding was received for conducting this study.
Conflict of Interest:
The authors have no conflicts of interest to declare.
Creative Commons Attribution License 4.0
(Attribution 4.0 International, CC BY 4.0)
This article is published under the terms of the
Creative Commons Attribution License 4.0
https://creativecommons.org/licenses/by/4.0/deed.
en_US
WSEAS TRANSACTIONS on INFORMATION SCIENCE and APPLICATIONS
DOI: 10.37394/23209.2024.21.22
Pericles S. Giannaris,
Nikolaos Doukas, Nikos Mastorakis
E-ISSN: 2224-3402
237
Volume 21, 2024