Abstract
In the last couple of decades, data analytics-based pattern classification methods for disease detection have gained much traction in healthcare research and applications. The current study builds linear programming (LP) models for detecting disease incidence. We propose sequential steps of a convex programming algorithm to construct decision boundary functions to classify patterns in disease detection data. We compare the performance of our LP-based classifier with others (neural network, decision tree, k-nearest-neighbor, logistic regression, naïve-Bayes, and support-vector-machine) on four datasets: two different ones for breast cancer, and one each for diabetes and diabetic retinopathy. Statistical tests reveal that the LP classifier did significantly better than the other methods in five out of eight false-positive and false-negative test cases. There is not a statistically significant difference in performance in the remaining three tests between the LP classifier and the best alternative method. Most importantly, the LP classifier has significantly superior performance in both diabetes detection and diabetic retinopathy data. The success of the proposed LP classifier results from avoiding “modeling noise” and “memorization of training data.” We recommend that our proposed LP classifier be among the set of classifiers for use in disease detection analytics.
Original language | English (US) |
---|---|
Pages (from-to) | 661-698 |
Number of pages | 38 |
Journal | Decision Sciences |
Volume | 52 |
Issue number | 3 |
DOIs | |
State | Published - Jun 2021 |
Keywords
- Clinical Data
- Convex Programming
- Decision Tree
- Disease Detection Analytics
- Linear Programming
- Neural Network
- Pattern Classification
ASJC Scopus subject areas
- General Business, Management and Accounting
- Strategy and Management
- Information Systems and Management
- Management of Technology and Innovation