calculate the BCG signals peak group characteristics, and then
calculate the heartbeat interval. Clustering algorithm has strong
dependencies on BCG signals with standard morphology and
single mode, and the accuracy of extracting J peaks tends to be
somewhat lower than supervised learning. However, due to less
training steps, simple process and fast output results of
clustering algorithm, clustering algorithm can efficiently label
BCG signal peak classes, thereby providing a role for
supervised learning.
In the actual acquisition process, BCG signals exhibit
different morphologies due to noise interference, individual
variations, acquisition modalities and acquisition equipment.
The morphological changes are manifested in the relative
changes of the amplitude and distance of each peak, and the
peak group model is no longer obvious. In summary, the
instability of BCG morphology needs to be taken into account
when using BCG signals to extract heart rate. Detection based
on the intrinsic temporal features of BCG complex is the key to
solving such problems. Long Short-Term Memory network
(LSTM) excels at extracting features from sequences and can
make predictions for each data in sequences. Since heart rate
information is rich in temporal characterization, LSTM network
have been widely used in heart rate information research
[13-15]. There have been studies applying LSTM on BCG
signals to calculate heart rate [16], but this method simply
calculates the approximate heart rate of a certain segment of the
signal and cannot accurately localize the heartbeat cycle.
Therefore, this paper proposes a heartbeat detection method
based on BiLSTM for J peak localization of BCG signals. By
dividing BCG sequence segments and extracting peak feature
parameters as input, the BiLSTM advanced semantic
recognition model is used to classify BCG peaks in the test set.
The heart rate is calculated according to the classification
results of the J peaks, and the accuracy is compared with the
labels in the data set to verify the effectiveness of the method in
this paper.
2. Methods
2.1 Dataset
Since there is no uniform standard for the sensors,
measurement positions, and methods used to collect BCG
signals, there is no recognized standard BCG database. The
BCG dataset used in this study need to be collected and labeled
independently. A total of 28 volunteers were recruited in the
experiment, ranging in age from 18 to 50 years old, with 16
males and 12 females. The 28 subjects were defined as P1~P28.
The BCG signals acquisition equipment uses DEEBCG
ballistocardiography (DeE Software Co., Ltd. Zhejiang, China).
As shown in Fig. 2a, after the signal is collected by the device, it
is denoised and uploaded to the cloud, and the data is obtained
at the client. The data acquisition process is shown in Figure 2b.
The subject lies flat on the bed, and the sensor is placed under
the pillow. The data is collected in the four sleeping positions of
supine, prone, left and right for 10min respectively.
Figure 2. (a) Acquisition equipment (b) Signal acquisition process
In order to obtain the ECG signals as the reference standard
synchronously, each subject used the three-lead ECG
acquisition system designed by the research group. The R peaks
are detected using the Pan-Tumpkins algorithm [17], and the
RR interval are calculated as the reference (“ground truth”)
data.
Experimental data need to be labeled for each peak and
entered as a sequence fragment. Clustering algorithm can divide
unlabeled data into several disjoint subsets according to certain
rules [18]. The similarity within the same subset is as large as
possible, the similarity between subsets is as small as possible.
According to the characteristics of BCG complex showing
regularity and periodicity in a continuous period of time,
K-means clustering algorithm is used to distinguish various
types of BCG peaks when labeling each BCG J peaks. The
algorithm steps are as follows:
(1) Search all peaks and troughs of the signal in the interval
and the corresponding index positions,
(2) Use all the peak and trough data for parameter calculation,
as shown in Fig. 3, 4 parameters are calculated for each peak,
which are the amplitude Pa(n) of the peak P(n), the amplitude
Ta(n) of the trough T(n) adjacent to the peak P(n), the distance
Pd(n) between the peak P(n) and the trough T(n), and the
distance Td(n) between the trough T(n) and the next peak
P(n+1). Take the parameters of 5 consecutive peaks as the
eigenvector of the first peak to construct a feature set, each
eigenvector has a total of 20 parameters, the eigenvector fn is:
, , , , ,
4 , 4 , 4 , 4
n
Pa n Ta n Pd n Td n
fPa n Ta n Pd n Td n
(1)
(3) The K-means algorithm is used to cluster the feature set.
The number of clusters is 5 and the distance is Euclidean
distance. The best arrangement is selected in the five
initializations, and the peak group index after clustering is
confirmed.
Figure 3. Characteristic parameters of peaks and troughs
WSEAS TRANSACTIONS on BIOLOGY and BIOMEDICINE
DOI: 10.37394/23208.2022.19.16