TY - JOUR
T1 - Effect of LSU and ITS genetic markers and reference databases on analyses of fungal communities
AU - Xue, Chao
AU - Hao, Yuewen
AU - Pu, Xiaowei
AU - Penton, Christopher
AU - Wang, Qiong
AU - Zhao, Mengxin
AU - Zhang, Bangzhou
AU - Ran, Wei
AU - Huang, Qiwei
AU - Shen, Qirong
AU - Tiedje, James M.
N1 - Funding Information: Funding This work was funded by the DOE Great Lakes Bioenergy Research Center, DOE BER Office of Science (DE-FC02-07ER64494 and DE-FG02-99ER62848), the Jiangsu Science and Technology Department (BK20160730), the China Postdoctoral Science Foundation (2017M621761 and 2018T110510), and the Fundamental Research Funds for the Central Universities (KYZ201720 and KYZ201877). We also acknowledge the support of the GLBRC field staff who maintains the field experiment, and for the funding that supports that staff and field operations by the National Science Foundation Long-term Ecological Research Program (DEB 1637653), and by Michigan State University AgBioResearch. Publisher Copyright: © 2018, Springer-Verlag GmbH Germany, part of Springer Nature.
PY - 2019/1/1
Y1 - 2019/1/1
N2 - The effect of genetic markers and reference databases on analyses of fungal communities were estimated using fungal large subunit (LSU) and internal transcribed spacer (ITS) amplicon datasets in consecutive years of rhizosphere samples from three candidate biofuel crops, corn (Zea mays), switchgrass (Panicum virgatum), and miscanthus (Miscanthus × giganteus). These two marker genes were selected to contrast possible differences in biological conclusions. In addition, two ITS schemes based on two ITS reference databases were used to assess differences due to reference database composition. A taxonomy-supervised method was invoked using the Ribosomal Database Project naïve Bayesian classifier that accesses all three databases. The UNITE classification scheme had the highest number of classified taxa in the raw classification result; however, it also had the highest proportion of unknown taxa (sequences that were classified to “unclassified,” “unidentified,” incertae sedis or, in the case of Warcup, to matches containing two unique names). After removal of these unknown taxa, LSU had the highest classification rate followed by Warcup and UNITE. As expected, the communities resolved using the two ITS databases, based on the same sequences, were relatively more similar than those from the lower-coverage LSU classification scheme. The choice of marker gene or even the same reads with different classification databases revealed different community patterns due to database coverage, e.g., the relative abundance of the most abundant groups changed or were only detected in one or two of the classification schemes, such as for Mortierella, Fusarium, and Phoma. No marked difference in fungal beta-diversity was identified among the three methods. Differentiation between the three biofuel crops and between the drought and normal rainfall years was apparent, regardless of method. Though classification rates, taxonomic conflicts, and coverage differences within the high-abundance fungal groups varied according to classification scheme, there was no overall impact on beta diversity among the three methods.
AB - The effect of genetic markers and reference databases on analyses of fungal communities were estimated using fungal large subunit (LSU) and internal transcribed spacer (ITS) amplicon datasets in consecutive years of rhizosphere samples from three candidate biofuel crops, corn (Zea mays), switchgrass (Panicum virgatum), and miscanthus (Miscanthus × giganteus). These two marker genes were selected to contrast possible differences in biological conclusions. In addition, two ITS schemes based on two ITS reference databases were used to assess differences due to reference database composition. A taxonomy-supervised method was invoked using the Ribosomal Database Project naïve Bayesian classifier that accesses all three databases. The UNITE classification scheme had the highest number of classified taxa in the raw classification result; however, it also had the highest proportion of unknown taxa (sequences that were classified to “unclassified,” “unidentified,” incertae sedis or, in the case of Warcup, to matches containing two unique names). After removal of these unknown taxa, LSU had the highest classification rate followed by Warcup and UNITE. As expected, the communities resolved using the two ITS databases, based on the same sequences, were relatively more similar than those from the lower-coverage LSU classification scheme. The choice of marker gene or even the same reads with different classification databases revealed different community patterns due to database coverage, e.g., the relative abundance of the most abundant groups changed or were only detected in one or two of the classification schemes, such as for Mortierella, Fusarium, and Phoma. No marked difference in fungal beta-diversity was identified among the three methods. Differentiation between the three biofuel crops and between the drought and normal rainfall years was apparent, regardless of method. Though classification rates, taxonomic conflicts, and coverage differences within the high-abundance fungal groups varied according to classification scheme, there was no overall impact on beta diversity among the three methods.
KW - Fungal database
KW - ITS
KW - LSU
KW - UNITE
KW - Warcup
UR - http://www.scopus.com/inward/record.url?scp=85057007693&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85057007693&partnerID=8YFLogxK
U2 - 10.1007/s00374-018-1331-4
DO - 10.1007/s00374-018-1331-4
M3 - Article
SN - 0178-2762
VL - 55
SP - 79
EP - 88
JO - Biology and Fertility of Soils
JF - Biology and Fertility of Soils
IS - 1
ER -