16S, 23S and ssu_all_r207

Hi CX,

  1. Unfortunately, we do not provide 23S rRNA sequences at this time.
  2. The ssu_all_r207.tar.gz file contains 16S rRNA sequences identified across all genomes in GTDB, while the ar53_ssu_reps.tar.gz file is restricted to archaeal genomes selected as GTDB species representatives. The FILE_DESCRIPTIONS file gives more information about what is contained in each file provided on the GTDB FTP site.
  3. We identify 16S rRNA genes de novo and thus our results may differ slightly from those at NCBI.
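
If you want to check the difference between the two archives yourself, a minimal Python sketch along the following lines counts the FASTA records in each file of a downloaded tarball. The function name count_fasta_records is hypothetical. The sketch also assumes the archive members are FASTA files, possibly gzipped, which I have not verified here against the actual layout of the archives.

```python
import gzip
import tarfile


def count_fasta_records(tar_path):
    """Return {member_name: FASTA record count} for each regular file in a .tar.gz archive."""
    counts = {}
    with tarfile.open(tar_path, mode="r:gz") as archive:
        for member in archive:
            if not member.isfile():
                continue  # skip directories and links
            stream = archive.extractfile(member)
            # Assumption: a member ending in .gz is itself a gzipped FASTA file.
            if member.name.endswith(".gz"):
                stream = gzip.open(stream)
            # Each FASTA record starts with a '>' header line.
            counts[member.name] = sum(1 for line in stream if line.startswith(b">"))
    return counts
```

Calling count_fasta_records("ssu_all_r207.tar.gz") and count_fasta_records("ar53_ssu_reps.tar.gz") on local copies should make the difference in scope between the two files visible.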

Thank you for pointing out the discrepancy with the GCA_019058055.1 16S fragment. I will need to dig into this to determine why our results differ from those at NCBI. It seems we start the 16S fragment 2 bases earlier and terminate it 1 base prior and thus are 1 bp shorter.

Cheers,
Donovan