A NOVEL ENSEMBLE BASE MODELS SELECTION APPROACH FOR ESTIMATING CREDIBLE RUBBER CONVEYOR BELTING CURE TIMES FROM A SMALL SAMPLE SIZE MRSM DATASET

Authors

  • Domingo Pavolo Department of Mathematical Statistics and Actuarial Sciences, Faculty of Natural and Agricultural Sciences, University of the Free State, P.O. Box 339, Bloemfontein 9300, South Africa
  • Delson Chikobvu Department of Mathematical Statistics and Actuarial Sciences, Faculty of Natural and Agricultural Sciences, University of the Free State, P.O. Box 339, Bloemfontein 9300, South Africa
  • Louis Lagrange Department of Engineering Sciences, Faculty of Natural and Agricultural Sciences, University of the Free State, P.O. Box 339, Bloemfontein 9300, South Africa

Keywords:

Multiple response surface methodology, Ensembling, Small data analytics, Simultaneous optimisation, Model selection criteria, Nesting

Abstract

Multiple response surface methodology (MRSM) is small sample dataset analytics in nature. There are experiences associated with small sample size dataset problems in regression models, model selection and generalisability of models that affect the MRSM solution credibility. In this work, ensembling is used to account for the small sample size problems of an MRSM dataset and to avoid the tradition of simultaneously optimizing only single “best” models for each response. A novel ensemble base model(s) selection methodology is used to manage the number of simultaneous optimization computations. Fifteen model selection criteria are used to vote for the “best” fitting and parsimonious model and, with it, all response models nesting it are included as base models of the ensemble. Simultaneous optimizations and frequency analysis of solutions and model complexities are performed to arrive at a solution. The ensemble solution is estimated by weighted averaging of simultaneous optimization solutions using the solution frequencies. When the methodology is applied to the rubber-covered conveyor belt problem of Pavolo and Chikobvu (2022), the same estimated credible results are obtained, albeit with fewer simultaneous optimization computations. The results also show that the “best” fitting and parsimonious model is an under-fit, and its solution is different from the credible ensemble solution.  The approach is recommended for similar small sample size problems.

Downloads

Download data is not yet available.

References

a

Published

2023-08-20

How to Cite

Domingo Pavolo, Delson Chikobvu, & Louis Lagrange. (2023). A NOVEL ENSEMBLE BASE MODELS SELECTION APPROACH FOR ESTIMATING CREDIBLE RUBBER CONVEYOR BELTING CURE TIMES FROM A SMALL SAMPLE SIZE MRSM DATASET. Operational Research in Engineering Sciences: Theory and Applications, 6(2). Retrieved from https://oresta.org/menu-script/index.php/oresta/article/view/589