
4 Applications
For a given experimental data, usually the
distribution of data does not precisely give a clue
about the degree of the polynomial to be used.
Sometimes, a polynomial regression is not suitable
at all. With worked examples and the theorems
given, guidelines for selection of the degree of a
polynomial regression are established in this
section.
Table 1 is produced from an approximately
cubic polynomial data. The major indicators are the
determinant, the coefficients, the sum of the
coefficients, the regression error and the difference
of the regression errors between the n-1’th degree
and n’th degree. At n=4, the determinant becomes
very singular, the highest degree coefficient appears
to be small (a4=0.1626), the sum of the coefficients
start deviating from 1 (Theorem 3), the standard
regression error is higher than the previous case and
the difference turns out to be negative all indicating
that n=3 is the best choice.
In Table 2, a functional type of data is
considered. The data is an approximation of a
logarithmic relationship. Given the previous
criterion, the ideal representation of the data is a
cubic polynomial because the determinant is too
much singular for n=4, the sum of the coefficients
start deviating from 1 and the difference of the
standard errors become negative. Note that since the
original data is not of a polynomial form, the highest
order coefficient at n=4 is not small, but at this
stage, one has larger opposite sign coefficients
(Theorem 2) which is an indicator that one should
stop and take the n value of the previous stage.
5 Concluding Remarks
The basics of the algorithm can be summarized as
follows.
1) Try first a linear relationship and increase
the degree by one at each stage.
2) Form a similar table as given in Tables 1
and 2.
3) Check the singularity of the determinant,
the highest degree coefficient, the
magnitudes of the coefficients, the sum of
the coefficients, the standard regression
error and the differences of errors at each
step.
4) Stop at degree n when the highest degree
coefficient is small, and/or the difference of
the errors is negative and use n-1 as the
ideal degree.
5) Although not compulsory, may stop at n
when the determinant is too singular,
there are opposite sign large
coefficients, the sum of the coefficients
start deviating from the ideal value, the
standard error is small and may use n-1
as an ideal representation.
Acknowledgment: - The support of the Turkish
Academy of Sciences (TÜBA) for the expenses of
the conference is highly appreciated.
References:
[1] T. W. Anderson, The choice of the degree of a
polynomial regression as a multiple decision
problem, The Annals of Mathematical
Statistics, Vol.33, 1962, pp. 255-265.
[2] H. J. Motulsky and L. A. Ransnas, Fitting
Curves to Data Using Nonlinear regression: A
Practical and Nonmathematical Review,
FASEB Journal, Vol.1, 1987, pp. 365-374.
[3] H. Dette, Optimal designs for identifying the
degree of a polynomial regression, The Annals
of Statistics, Vol.23, 1995, 1248-1266.
[4] H. Dette and W. J. Studden, Optimal designs
for polynomial regression when the degree is
not known, Statistica Sinica, Vol.5, 1995, 459-
473.
[5] B. R. Jayasuriya, Testing for polynomial
regression using nonparametric regression
techniques, Journal of the American Statistical
Association, Vol.91, 1996, 1626-1631.
[6] S. C. Chapra and R. P. Canale, Numerical
Methods for Engineers, Mc Graw Hill, 2014.
Creative Commons Attribution License 4.0
(Attribution 4.0 International, CC BY 4.0)
This article is published under the terms of the Creative
Commons Attribution License 4.0
https://creativecommons.org/licenses/by/4.0/deed.en_US
PROOF
DOI: 10.37394/232020.2022.2.4