Hi CX,
- Unfortunately, we do not provide 23S rRNA sequences at this time.
- The `ssu_all_r207.tar.gz` file contains the 16S rRNA sequences identified across all genomes in GTDB, while the `ar53_ssu_reps.tar.gz` file is restricted to just archaeal genomes selected as GTDB representatives of a species. The `FILE_DESCRIPTIONS` file gives more information about what is contained in each file provided on the GTDB FTP site.
- We identify 16S rRNA genes de novo and thus our results may differ slightly from those at NCBI.
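
If it helps when comparing against NCBI, here is a minimal Python sketch for pulling the sequences out of one of these archives without unpacking it. It assumes every regular file inside the tarball is plain FASTA text; the function name `iter_fasta_from_tar` is made up for illustration and is not part of any GTDB tooling.

```python
import io
import tarfile


def iter_fasta_from_tar(path):
    """Yield (header, sequence) for each FASTA record in a .tar.gz archive.

    Assumption: every regular file in the archive is plain-text FASTA.
    """
    with tarfile.open(path, "r:gz") as tar:
        for member in tar:
            if not member.isfile():
                continue  # skip directories, links, etc.
            handle = tar.extractfile(member)
            header, chunks = None, []
            for raw in io.TextIOWrapper(handle, encoding="utf-8"):
                line = raw.strip()
                if not line:
                    continue
                if line.startswith(">"):
                    # A new record starts: emit the previous one, if any.
                    if header is not None:
                        yield header, "".join(chunks)
                    header, chunks = line[1:], []
                elif header is not None:
                    chunks.append(line)
            # Emit the last record of this file.
            if header is not None:
                yield header, "".join(chunks)
```

For example, looping over `iter_fasta_from_tar("ssu_all_r207.tar.gz")` and keeping the records where `"GCA_019058055.1" in header` would give you the sequence lengths to set against the NCBI annotation. That check assumes the accession appears somewhere in the FASTA header, so please confirm the header layout in the file itself.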
Thank you for pointing out the discrepancy with the GCA_019058055.1 16S fragment. I will need to dig into this to determine why our results differ from those at NCBI. It seems we start the 16S fragment 2 bases earlier and terminate it 1 base prior and thus are 1 bp shorter.
Cheers,
Donovan