Scalable method for exploring phylogenetic placement uncertainty with custom visualizations using treeio and ggtree

Meijun Chen, Xiao Luo, Shuangbin Xu, Lin Li, Junrui Li, Zijing Xie, Qianwen Wang, Yufan Liao, Bingdong Liu, Wenquan Liang, Ke Mo, Qiong Song, Xia Chen, Tommy Tsan-Yuk Lam, Guangchuang Yu

iMeta, 2025, Vol. 4, Issue 1: e269

DOI: 10.1002/imt2.269
METHOD

Abstract

Phylogenetic placement plays a critical role in metabarcoding research, for example in taxon identification. However, many existing phylogenetic placement methods lack comprehensive features for downstream analysis and visualization. Visualization tools often ignore placement uncertainty, making it difficult to explore and interpret placement data effectively. To overcome these limitations, we introduce a scalable approach that uses treeio and ggtree to parse and visualize phylogenetic placement data. The treeio-ggtree method supports placement filtration, uncertainty exploration, and customized visualization. It improves scalability for large analyses by letting users extract subtrees from the full reference tree and focus on specific samples within a clade. Additionally, this approach represents phylogenetic placement uncertainty more clearly by visualizing the associated placement information on the final placement tree.

Keywords

ggtree / phylogenetic placement / placement uncertainty / treeio / visualization

Cite this article

Meijun Chen, Xiao Luo, Shuangbin Xu, Lin Li, Junrui Li, Zijing Xie, Qianwen Wang, Yufan Liao, Bingdong Liu, Wenquan Liang, Ke Mo, Qiong Song, Xia Chen, Tommy Tsan-Yuk Lam, Guangchuang Yu. Scalable method for exploring phylogenetic placement uncertainty with custom visualizations using treeio and ggtree. iMeta, 2025, 4(1): e269. DOI: 10.1002/imt2.269



RIGHTS & PERMISSIONS

© 2025 The Author(s). iMeta published by John Wiley & Sons Australia, Ltd on behalf of iMeta Science.
