First of all, thank you very much for GTDB! We are big fans here!
I couldn’t easily find this information so I thought I’d try the forums! I was wondering if GTDB included MAGs from large-scale metagenomic assembly projects that are focusing on characterizing unknown diversity, and that don’t seem to be depositing their sequences on RefSeq. I imagine that GTDB does not include anything not on RefSeq (but I’m not 100% sure, please correct if I am wrong). If they aren’t in GTDB, are there any plans to do so?
I am thinking of two studies in particular:
Pasolli et al. (2019) Cell 176:649–662 in which ~9500 metagenomes were assembled into ~150,000 MAGs corresponding to ~75% “unknown” species from repositories. The paper mentions that they are available on this website.
Almeida et al. (2019) Nature 568:499–504 Similarly, >92,000 MAGs from were deposited on (the ENA repository) but I’m not sure if they are on RefSeq.
Thank you very much for your answer!