TY - GEN
T1 - Acclimatizing taxonomic semantics for hierarchical content classification
AU - Tang, Lei
AU - Zhang, Jianping
AU - Liu, Huan
PY - 2006
Y1 - 2006
N2 - Hierarchical models have been shown to be effective in content classification. However, we observe through empirical study that the performance of a hierarchical model varies with given taxonomies; even a semantically sound taxonomy has potential to change its structure for better classification. By scrutinizing typical cases, we elucidate why a given semantics-based hierarchy does not work well in content classification, and how it could be improved for accurate hierarchical classification. With these understandings, we propose effective localized solutions that modify the given taxonomy for accurate classification. We conduct extensive experiments on both toy and real-world data sets, report improved performance and interesting findings, and provide further analysis of algorithmic issues such as time complexity, robustness, and sensitivity to the number of features.
AB - Hierarchical models have been shown to be effective in content classification. However, we observe through empirical study that the performance of a hierarchical model varies with given taxonomies; even a semantically sound taxonomy has potential to change its structure for better classification. By scrutinizing typical cases, we elucidate why a given semantics-based hierarchy does not work well in content classification, and how it could be improved for accurate hierarchical classification. With these understandings, we propose effective localized solutions that modify the given taxonomy for accurate classification. We conduct extensive experiments on both toy and real-world data sets, report improved performance and interesting findings, and provide further analysis of algorithmic issues such as time complexity, robustness, and sensitivity to the number of features.
KW - Hierarchical Classification
KW - Hierarchical Modeling
KW - Taxonomy Adjustment
KW - Text Classification
UR - http://www.scopus.com/inward/record.url?scp=33749539119&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33749539119&partnerID=8YFLogxK
M3 - Conference contribution
SN - 1595933395
SN - 9781595933393
T3 - Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
SP - 384
EP - 393
BT - KDD 2006
T2 - KDD 2006: 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
Y2 - 20 August 2006 through 23 August 2006
ER -