Linear Mixed Model Approach to Protein Significance Analysis
JONGSOO JUN, TAESUNG PARK*
Department of Statistics
Seoul National University
1 Gwanak-ro, Gwanak-gu, Seoul 08826
KOREA
Abstract: Discovering protein biomarkers is one of the important issues in biomedical researches. The enzyme-
linked immunosorbent assay (ELISA) is one of the traditional techniques for protein quantitation. Recently, the
multiple reaction monitoring (MRM) mass spectrometry has been proposed as a new method for protein
quantification and has been popular as an alternative to ELISA. However, not many analysis methods are
available yet to analyse MRM data. Linear mixed models (LMMs) are effective in analysing MRM data.
MSstats is one of the most widely used tools for MRM data analysis which is based on the LMMs. MSstats is
well implemented on Skyline program and R programming language. However, LMMs often provide various
significance results depending on model specification. Thus, sometimes it would be difficult to specify a right
LMM for the analysis of MRM data. In this paper, we systematically investigated the effect of model
specification on significance of proteins through simulation studies. Our results provide a practical guideline of
using LMMs for MRM data analysis.
Key-Words: Linear mixed model, MRM, MSstats, Power, Protein significance analysis, Skyline
1 Introduction
Discovering protein biomarkers is one of the
primary interest in biomedical researches [1]. The
enzyme-linked immunosorbent assay (ELISA) is
one of the traditional protein quantitation techniques
that provide high sensitivity [2]. The result of
ELISA is treated as the “gold standard” for targeted
protein quantification [3]. However, recent studies
discovered many novel proteins and the availability
of highly qualified ELISAs for those proteins is
limited [4], which created a need for a different
technique of protein quantitation. The multiple
reaction monitoring (MRM) mass spectrometry is a
new method used in tandem mass spectrometry for a
systematic development of targeted protein assays
[1]. As an alternative to ELISA, MRM assays
become gradually used in systems biology and in
clinical investigations [6]. For the MRM data
analysis, two sample t-test or paired t-test was
applied to identify proteins that would change in
abundance between two groups [8]. To test for
multiple groups, one-way analysis of variance
(ANOVA) was employed [9]. Recently, a linear
mixed model (LMM) approach was proposed for
MRM data analysis and implemented in MSstats
[10] and has been popularly used [11]. The
proposed LMM approach treats either or both
subject and run effect as random or fixed. However,
we observed that the proposed LMM approach often
provides various p-values for the same data
depending on which effects are treated as random or
fixed. Moreover, the data structure also affects the
performance of LMM approach. If intensity patterns
of peptides from a protein are heterogeneous, there
is a loss of power in LMM approach. If intensity
patterns are homogeneous, there is a power gain in
LMM approach.
In this paper, we systematically
investigated the effect of model specification on
significance of proteins through simulation studies.
Our results provide a practical guideline of using
LMMs for MRM data analysis.
2 Methods
For a given protein, the LMM used in MSstats is
given as follows:
DESIGN, CONSTRUCTION, MAINTENANCE
DOI: 10.37394/232022.2022.2.1
Jongsoo Jun, Taesung Park
E-ISSN: 2732-9984
1
Volume 2, 2022
,(),,=++()()+
,(),,++(×),
,(),,+(×),+,(),,
(1)
where ,(),, denotes log2() value of
the j-th subject nested in the i-th group of the k-th
peptide and the l-th run; is a global mean,
stands for the i-th group effect; ()() stands for
the j-th subject effect nested in i-th group;
stands for the k-th peptide effect; stands for the
l-th run effect, (×) stands for interaction
effect of the i-th group and the k-th peptide;
(×) stands for interaction effect of the k-th
peptide and the l-th run. When all effects are treated
as fixed, these parameters have the following
restrictions:
2
=0 = 0 , ()()
=0 = 0 ,
=1 = 0 ,
=1 = 0 , (×),
2
=0 = 0 ,
(×),
=1 = 0 , (×),
=1 = 0 and
(×),
=1 = 0, and ,(),,~(0, 2). Here,
0 stands for the effect of reference group of MRM
data. When the subject effects and the run effects
are treated as random, the restrictions of ()(),
and (×) are replaced by
()()~(0, 2), ~(0, 2) and (×
)~(0, ×
2), respectively.
The Model (1) can be equivalently written
as follows:
=0+++
++(×)×
+(×)×+
Here, is a log2() value of the i-th
sample; is a (× 1) group indicator variable;
stands for the number of groups except the
reference group; is a (× 1) subject indicator
variable, where stands for the number of
subjects except the reference sample; is a
(1 × 1) peptide indicator variable, where
stands for the number of peptides; is a
(1 × 1) run indicator variable, where
stands for the number of MS runs; (×) is a
interaction of group and peptide indicator variable;
(×) is a interaction of run and peptide
indicator variable; is an error term that follows
normal distribution with mean 0 and variance 2.
, and × are coefficients of subject, run
and interaction of run and peptide, respectively.
These coefficients can be treated either as fixed or
random. In most MRM data analyses, the interest
lies in determining proteins that differ from groups.
Thus, the hypothesis of interest is given below for
comparing two groups:
0:(1)(2)
(2)
0:+(1)×()(2)×()
=2
= 0
where () is the coefficient of the group and
()×() is the interaction coefficient of group
and peptide . Here, () is equal to 0+
(×)(,1)(×)(0,1) and ()×() to
(×)(,)(×)(,1). Thus, the hypothesis
(2) is equivalent to 0: 1=2
3 Simulations
3.1 Simulation Settings
We performed simulation studies to investigate the
performance of LMMs. There are four LMMs
depending on how to specify the random or fixed
effect: (i) LMM(FF) with fixed subject effect and
fixed run effect, (ii) LMM(FR) with fixed subject
effect and random run effect, (iii) LMM(RF) with
random subject effect and fixed run effect, and (iv)
LMM(RR) with random subject effect and random
run effect. For each simulated data set, the best
LMM, LMM(best), was selected among four LMMs
which had the smallest Akaike Information
Criterion (AIC) value. We generated parameters of
model (1) either as random or fixed effects. We
generated random effects from the identical normal
distribution independently with mean 0 and a
specific variance.
On the other hand, for the fixed effect, we
generated equally spaced sequence such that the
average of the sequence is 0 and its squared
average is the same with the value of the variance
that we specified to generate random effect. In
model (1), the global mean, , was set to 15 and
2, the variance of ,(),,, was set to 0.5
throughout all simulations. ()(), , and
(×) are nuisance parameters to test
hypothesis (2). Therefore, we fixed their effects
throughout simulations; their variances for random
effect or squared averages for fixed effect were set
to 0.25, 0.1, 0.25 and 0.1 for ()(), ,
and (×) , respectively. The number of
peptides was assumed to vary from 2 to 5 in order to
investigate the effect of the number of peptides on
type I error and power. Various sample sizes, 20, 50
and 100, were considered with the ratio of case and
control fixed to 1:1.
3.1.1 Settings for Generating Random Effects
We considered four scenarios for generating the
group effect and the interaction effect as follows.
Scenario 1: 21= 0 and (×) = 0 for all i, k
DESIGN, CONSTRUCTION, MAINTENANCE
DOI: 10.37394/232022.2022.2.1
Jongsoo Jun, Taesung Park
E-ISSN: 2732-9984
2
Volume 2, 2022
Scenario 2: 21= 1 and (×) = 0 for all i, k
Scenario 3: 21= 0 and Var{(×)}= 0.2
Scenario 4: 21= 0.5 and Var{(×)}= 0.1
The first scenario was to investigate the type I error
rate of LMMs for testing group difference; the
second to the fourth scenarios were considered to
evaluate empirical power of LMMs when only
group effect was present (Scenario 2), only the
interaction effect was present (Scenario 3), and both
group and interaction effects were present (Scenario
4). All simulation data sets were generated 1,000
times from model (1) and the significance level was
set to 0.05 through simulations.
3.1.2 Settings for Generating Fixed Effects
Basically, the settings for generating fixed effects
were identical to those of random effects explained
in section 3.1.1 except that the fixed sequences were
used as previously described. That is as follows.
Scenario 5: 21= 0 and (×) = 0 for all i, k
Scenario 6: 21= 1 and (×) = 0 for all i, k
Scenario 7: 21= 0 and Var{(×)}= 0.2
Scenario 8: 21= 0.5 and Var{(×)}= 0.1
Here, Var{(×)} stands for the squared
average of a fixed effect. Since the effects were
fixed, we could control the interaction effect of
groups and peptides more easily. We considered
three types of interaction model (IM). (i) The first
model IM1 was when the peptide effects between
two groups are the same, that is, (×)2,1
(×)1,1 ==(×)2,(×)1,> 0 .
The number of peptides was varied from 2 to 5. (ii)
The second model IM2 was when the number of
peptides was assumed to be four among which two
peptides have positive effects andthe other two have
negative effects. The effect sizes of peptides
between two groups were assumed to be the same.
Thus, the interaction effect becomes (×)2,1
(×)1,1 =(×)2,2 (×)1,2 =
(×)1,3 (×)2,3 =(×)1,4 (×)2,4 .
(iii) The third model IM3 was the case when the
number of positive (×)2,(×)1, was
assumed to vary from 1 to 4. Here, the number of
peptides was assumed to be five.
3.2 Simulation Results
3.2.1 Results for Random Effects
Figure 1 shows the type I error rate of LMMs when
the effects were random. Among four models,
LMMs with random subject effect, LMM(RF) and
LMM(RR), controlled type I error well, while
LMMs with fixed subject effect, LMM(FF) and
LMM(FR), did not. LMM(FR) showed the highest
type I error rate. The AIC value of LMM(FF) tended
to be the smallest among four LMMs. Thus,
LMM(FF) was most frequently selected as the best
LMM and accordingly its behavior was very similar
to that of LMM(FF).
Figure 1. Type I error rate of LMMs when effects
were randomly generated.
The number of peptides was 2 and 5 for the top panel and
the bottom panel, respectively. The x-axis and y-axis
represent the number of samples and type I error rate,
respectively. Grey dotted horizontal line represents the
significant level.
Regarding the effect of the number of
peptides on the type I error, there are some
differences among the models. For LMM(RF) and
LMM(RR), there is no effect of the number of
peptides, while LMM(FF) and LMM(FR) showed
inflated type I error rates. LMM(best) was not
affected by the number of peptides either. Since
LMM(FF) and LMM(FR) did not control type I
error well, their power was higher than those of
LMM(RF) and LMM(RR) for scenarios 2 to 4, as
shown in Figure 2. LMM(FR) showed higher power
than LMM(FF), as similarly observed in Scenario 1.
LMM(RR) showed relatively higher power than
LMM(RF). The behaviors of the LMM(best) for
scenarios 2 to 4 were similar to what we observed in
Type I error rate
20 50 100
0.00 0.05 0.10 0.15 0.20 0.25
Type I error rate
20 50 100
0.00 0.05 0.10 0.15 0.20 0.25
LMM(FF) LMM(FR) LMM(RF) LMM(RR)
DESIGN, CONSTRUCTION, MAINTENANCE
DOI: 10.37394/232022.2022.2.1
Jongsoo Jun, Taesung Park
E-ISSN: 2732-9984
3
Volume 2, 2022
Scenario 1. Four LMMs showed consistent power
patterns under all simulated cases for scenarios 2 to
4. That is, the power of LMMs has the following
ordering: LMM(FR) > LMM(FF) > LMM(RR) >
LMM(RF).
Figure 2. Estimated empirical power of LMMs when
effects were randomly generated.
Top panel (Scenario 2): = and {(×
)}=. Middle panel (Scenario 3): = and
{(×)}=.. Bottom panel (Scenario 4):
=. and {(×)}=.. The
number of peptides was 2. The x-axis and y-axis
represent the number of samples and estimated empirical
power, respectively. Grey dotted horizontal line
represents the significant level.
The effect of sample sizes on power
depended on the group and interaction effects, but
showed very consistent patterns for all LMMs.
When the group effect 21= 1 and the
interaction effect Var{(×)}= 0, the sample
size 20 yielded power of 0.6, while the sample size
50 yielded higher than 0.8. When the group effect
21= 0.5 and the interaction effect Var{(×
)}= 0.1, the sample size of 100 yielded power
higher than 0.6. On the other hand, when the group
effect 21= 0 and the interaction effect
Var{(×)}= 0.2 , the sample size of 100
produced power lower than 0.6.
3.2.2 Results for Fixed Effects
Figure 3 shows the type I error rate of LMMs when
the effects were fixed (Scenario 5). The type I error
rates of LMMs with fixed effects showed different
patterns from those of LMMs with random effects.
All four LMMs controlled type I error well, as
shown in Figure 3. LMM(RF) and LMM(RR) tend
to control type I error rate more strongly than
LMM(FF) and LMM(FR). Regarding the effect of
the number of peptides on the type I error, there is
no strong effect of the number of peptides.
Empirical power
20 50 100
0.0 0.2 0.4 0.6 0.8 1.0
Empirical power
20 50 100
0.0 0.2 0.4 0.6 0.8 1.0
Empirical power
20 50 100
0.0 0.2 0.4 0.6 0.8 1.0
LMM(FF) LMM(FR) LMM(RF) LMM(RR)
DESIGN, CONSTRUCTION, MAINTENANCE
DOI: 10.37394/232022.2022.2.1
Jongsoo Jun, Taesung Park
E-ISSN: 2732-9984
4
Volume 2, 2022
Figure 3. Type I error rate of LMMs when effects
were fixedly generated.
The number of peptides was 2 and 5 for the top panel and
the bottom panel, respectively. The x-axis and y-axis
represent the number of samples and type I error rate,
respectively. Grey dotted horizontal line represents the
significant level.
The power comparison results are
summarized in Figures 4 and 5. The effect of sample
sizes on power depended on the group and
interaction effects. Figure 4 shows the results for the
interaction model IM1, when the number of peptides
was two. Four LMMs showed consistent power
patterns under all simulated cases for scenario 6 to 8.
The power of LMM(FF) and LMM(FR) was
relatively higher than those of LMM(RF) and
LMM(RR) for scenarios 6 to 8. LMM(FR) showed
the highest power among four LMMs. The power of
LMM(RF) was slightly higher than that of
LMM(RR).
The effect of sample sizes on power
depended on the group and interaction effects, but
showed very consistent patterns for all LMMs.
Sample size 50 yielded power of 0.8 for scenarios 6
to 8.
Figure 5 shows the results for the
interaction model IM2. Here, the number of
peptides was assumed to be four; two peptides have
positive effects and the other two have negative
effects. As expected, the opposite direction of
effects dramatically decreased power of all four
LMMs.
Figure 6 shows the results for the
interaction model IM3. The number of peptides was
fixed to be five. Here, 21= 0 and the
squared average of (×) was set to 0.2. As
the number of positive (×)2,(×)1,
increases, the power of all LMMs increases. The
increase patterns of LMM(FF) and LMM(FR) are
different from those of LMM(RF) and LMM(RR).
The power of LMMs has the following ordering:
LMM(FR) > LMM(FF) > LMM(RF) > LMM(RR).
Figure 4. Estimated empirical power of LMMs when
the effects are fixed and the interaction model is IM1
Type I error rate
20 50 100
0.00 0.05 0.10 0.15 0.20 0.25
Type I error rate
20 50 100
0.00 0.05 0.10 0.15 0.20 0.25
LMM(FF) LMM(FR) LMM(RF) LMM(RR)
Empirical power
20 50 100
0.0 0.2 0.4 0.6 0.8 1.0
Empirical power
20 50 100
0.0 0.2 0.4 0.6 0.8 1.0
Empirical power
20 50 100
0.0 0.2 0.4 0.6 0.8 1.0
LMM(FF) LMM(FR) LMM(RF) LMM(RR)
DESIGN, CONSTRUCTION, MAINTENANCE
DOI: 10.37394/232022.2022.2.1
Jongsoo Jun, Taesung Park
E-ISSN: 2732-9984
5
Volume 2, 2022
Top panel (Scenario 6): = and {(×
)}=. Middle panel (Scenario 7): = and
{(×)}=.. Bottom panel (Scenario 8):
=. and {(×)}=.. The
number of peptides was 2. The x-axis and y-axis
represent the number of samples and estimated empirical
power, respectively. Grey dotted horizontal line
represents the significant level.
Figure 5. Estimated empirical power of LMMs when
effects are fixed and the interaction models are IM1
and IM2
The x-axis represents the sample size and the y-axis
represents estimated empirical power. The top panel: The
number of peptides is four with the interaction model
IM1. The bottom panel: The total number of peptides is
four with the interaction model IM2.
Figure 6. Estimated empirical power of LMMs when
effects are fixed and the interaction model is IM3
The x-axis represents the number of positive (×
)2,(×)1, when the number of peptides was
five. The y-axis respresents estimated empirical power
when the sample size was 20. Here, 21= 0 and
the squared average of (×) was set to 0.2.
4 Conclusion
LMMs have been widely used to identify significant
protein from MRM assay. However, LMM approach
provides various significance results for the same
MRM data depending on which effects are treated
as random or fixed. It is well known properties of
LMMs that the variance of model parameters are
underestimated when the fixed effect model is fitted
when the true effects are random and vice versa
[14]. As a result, the significance result of LMMs
may vary depending on whether the true effect is
random or fixed. Thus, it is important to specify
correctly the effect as random or fixed.
We examined the performance of LMMs
through extensive simulation studies. We utilized
AIC for the model selection. However, our
empirical study showed that LMM(FF) has the
smallest AIC for all simulation settings, which made
it difficult to use AIC as a model selection criterion.
Based on our simulation results, we
suggest the following practical guideline to use
LMMs for MRM data analysis. First, if there is a
strong evidence that effects are fixed, then use
LMM(FR), because it controlled type I error well
and provided the highest power among four LMMs.
Second, if there is no evidence that effects are fixed,
then use LMM(RR), because it controlled type I
error and showed higher power than LMM(RF).
Third, when some of peptides in a protein behaved
oppositely from the others, we found out that LMMs
did not perform well. Thus, the nonsignificant
results should be more carefully examined.
Empirical power
20 50 100
0.0 0.2 0.4 0.6 0.8 1.0
Empirical power
20 50 100
0.0 0.2 0.4 0.6 0.8 1.0
LMM(FF) LMM(FR) LMM(RF) LMM(RR)
Empirical power
123 4 5
0.0 0.2 0.4 0.6 0.8 1.0
LMM(FF) LMM(FR) LMM(RF) LMM(RR)
DESIGN, CONSTRUCTION, MAINTENANCE
DOI: 10.37394/232022.2022.2.1
Jongsoo Jun, Taesung Park
E-ISSN: 2732-9984
6
Volume 2, 2022
Acknowledgement:
This work was supported by the Industrial Strategic
Technology Development Program (#10045352),
funded by the Ministry of Knowledge Economy
(MKE, Korea).
References:
[1] FRANTZI, Maria; BHAT, Akshay;
LATOSINSKA, Agnieszka. Clinical proteomic
biomarkers: relevant issues on study design &
technical considerations in biomarker
development. Clin Transl Med, 2014, 3.1: 7.
[2] WHITEAKER, Jeffrey R.; PAULOVICH,
Amanda G. Peptide immunoaffinity enrichment
coupled with mass spectrometry for peptide
and protein quantification. Clinics in
laboratory medicine, 2011, 31.3: 385-396.
[3] PAN, Sheng, et al. Mass spectrometry based
targeted protein quantification: methods and
applications. Journal of proteome research,
2008, 8.2: 787-797.
[4] SHI, Tujin, et al. Advancing the sensitivity of
selected reaction monitoringbased targeted
quantitative proteomics. Proteomics, 2012,
12.8: 1074-1092.
[5] LIN, De, et al. Comparison of protein
immunoprecipitation-multiple reaction
monitoring with ELISA for assay of biomarker
candidates in plasma. Journal of proteome
research, 2013, 12.12: 5996-6003.
[6] MESRI, Mehdi. Advances in Proteomic
Technologies and Its Contribution to the Field
of Cancer. Advances in Medicine, 2014, 2014.
[7] HUANG, Susan M., et al. An endogenous
capsaicin-like substance with high potency at
recombinant and native vanilloid VR1
receptors. Proceedings of the National
Academy of Sciences, 2002, 99.12: 8400-8405.
[8] ZHANG, Haixia, et al. Methods for peptide and
protein quantitation by liquid chromatography-
multiple reaction monitoring mass
spectrometry. Molecular & Cellular
Proteomics, 2011, 10.6: M110. 006593.
[9] ZHANG, Pingbo, et al. Multiple reaction
monitoring to identify site-specific troponin I
phosphorylated residues in the failing human
heart. Circulation, 2012,
CIRCULATIONAHA. 112.096388.
[10] CHANG, Ching-Yun, et al. Protein
significance analysis in selected reaction
monitoring (SRM) measurements. Molecular &
Cellular Proteomics, 2012, 11.4: M111.
014662.
[11] PURSIHEIMO, Anna, et al. Optimization of
statistical methods impact on quantitative
proteomics data. Journal of proteome research,
2015, 14.10: 4118-4126.
[12] MCDONALD, John H. Handbook of biological
statistics. Baltimore, MD: Sparky House
Publishing, 2009.
[13] HEDGES, Larry V.; OLKIN,
Ingram. Statistical method for meta-analysis.
Academic press, 2014.
[14] CHUNG, Yeojin; RABEHESKETH, Sophia;
CHOI, InHee. Avoiding zero betweenstudy
variance estimates in randomeffects
metaanalysis. Statistics in medicine, 2013,
32.23: 4071-4089.
Creative Commons Attribution License 4.0
(Attribution 4.0 International, CC BY 4.0)
This article is published under the terms of the Creative
Commons Attribution License 4.0
https://creativecommons.org/licenses/by/4.0/deed.en_US
DESIGN, CONSTRUCTION, MAINTENANCE
DOI: 10.37394/232022.2022.2.1
Jongsoo Jun, Taesung Park
E-ISSN: 2732-9984
7
Volume 2, 2022