TY - JOUR
T1 - A column-oriented optimization approach for the generation of correlated random vectors
AU - Sefair, Jorge A.
AU - Guaje, Oscar
AU - Medaglia, Andrés L.
N1 - Funding Information: The authors would like to thank Professor Jim Wilson from NC State University for sharing his encouraging and valuable input at an earlier stage of this work. The authors sincerely thank Professor Douglas Montgomery at Arizona State University for his valuable comments to improve the manuscript. Also, authors thank Gurobi and FICO for providing access to their commercial optimization solvers under their academic licensing programs. The authors would like to thank the two anonymous reviewers, whose comments greatly improved the article. This material is based upon work supported by Dr. Sefair’s National Science Foundation Grant No. 1740042. Funding Information: The authors would like to thank Professor Jim Wilson from NC State University for sharing his encouraging and valuable input at an earlier stage of this work. The authors sincerely thank Professor Douglas Montgomery at Arizona State University for his valuable comments to improve the manuscript. Also, authors thank Gurobi and FICO for providing access to their commercial optimization solvers under their academic licensing programs. The authors would like to thank the two anonymous reviewers, whose comments greatly improved the article. This material is based upon work supported by Dr. Sefair’s National Science Foundation Grant No. 1740042. Publisher Copyright: © 2021, The Author(s), under exclusive licence to Springer-Verlag GmbH, DE part of Springer Nature.
PY - 2021/9
Y1 - 2021/9
N2 - To induce a desired correlation structure among random variables, widely popular simulation software relies upon the method of Iman and Conover (IC). The underlying premise is that the induced Spearman rank correlation is a meaningful way to approximate other correlation measures among the random variables (e.g., Pearson’s correlation). However, as expected, the desired a posteriori correlation structure often deviates from the Spearman correlation structure. Rooted in the same principle of IC, we propose an alternative distribution-free method based on mixed-integer programming to induce a Pearson correlation structure to bivariate or multivariate random vectors. We also extend our distribution-free method to other correlation measures such as Kendall’s coefficient of concordance, Phi correlation coefficient, and relative risk. We illustrate our method in four different contexts: (1) the simulation of a healthcare facility, (2) the analysis of a manufacturing tandem queue, (3) the imputation of correlated missing data in statistical analysis, and (4) the estimation of the budget overrun risk in a construction project. We also explore the limits of our algorithms by conducting extensive experiments using randomly generated data from multiple distributions.
AB - To induce a desired correlation structure among random variables, widely popular simulation software relies upon the method of Iman and Conover (IC). The underlying premise is that the induced Spearman rank correlation is a meaningful way to approximate other correlation measures among the random variables (e.g., Pearson’s correlation). However, as expected, the desired a posteriori correlation structure often deviates from the Spearman correlation structure. Rooted in the same principle of IC, we propose an alternative distribution-free method based on mixed-integer programming to induce a Pearson correlation structure to bivariate or multivariate random vectors. We also extend our distribution-free method to other correlation measures such as Kendall’s coefficient of concordance, Phi correlation coefficient, and relative risk. We illustrate our method in four different contexts: (1) the simulation of a healthcare facility, (2) the analysis of a manufacturing tandem queue, (3) the imputation of correlated missing data in statistical analysis, and (4) the estimation of the budget overrun risk in a construction project. We also explore the limits of our algorithms by conducting extensive experiments using randomly generated data from multiple distributions.
KW - Correlated random vectors
KW - Data imputation
KW - Iman–Conover method
KW - Kendall coefficient of concordance
KW - Pearson product-moment correlation
KW - Phi correlation coefficient
KW - Relative risk
KW - Simulation
KW - Spearman rank correlation
UR - http://www.scopus.com/inward/record.url?scp=85101571547&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85101571547&partnerID=8YFLogxK
U2 - 10.1007/s00291-021-00620-5
DO - 10.1007/s00291-021-00620-5
M3 - Article
SN - 0171-6468
VL - 43
SP - 777
EP - 808
JO - OR Spectrum
JF - OR Spectrum
IS - 3
ER -