Same "order", different "phylum"

I don’t understand how the GTDB taxonomy lineage is constructed such that the same “order” name can appear under different “phylum” names.

Consider the following example:

Bacteria;Bacillota;Clostridia;Lachnospirales;Lachnospiraceae;Blautia_A;Blautia_A wexlerae

Bacteria;Bacillota_A;Clostridia;Clostridiales;Clostridiaceae;Clostridium;Clostridium septicum

Shouldn’t the higher-resolution ranks (order, family, genus, species, strain) stay separate once the lineages have diverged at a lower-resolution rank (kingdom, phylum), since the reference genomes are already differentiated at the phylum level? Why does the taxonomy lineage regroup at the level of “order”?

PS: there are many more species that follow this pattern.

Hi,

You are certainly correct that this shouldn’t occur. In GTDB R226 (current version), I have the following classification:

Where did you find a classification for s__Clostridium septicum with it being assigned to Bacillota_A?

Cheers,
Donovan

Hey Donovan, thanks for your reply!

I have been using R214 for my project for a while now, so I never realized that R226 had been released (with the reclassification of Clostridium septicum).

I reckon the best way forward is to update my GTDB taxonomy to the latest release as well.

Hi,

We update the GTDB every April and the website always reflects the latest release. Generally, there will be some conflicting classifications if one ends up mixing results from different GTDB releases.

Cheers,
Donovan
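
For anyone wanting to check their own tables for this kind of mix-up, here is a minimal sketch in Python that flags any taxon name appearing under more than one parent. It assumes plain semicolon-separated lineages with seven fields (domain to species) and no rank prefixes, as in the example above; the function name `find_parent_conflicts` is hypothetical and not part of any GTDB tool.

```python
from collections import defaultdict

# Rank labels for the seven fields of a lineage string (assumed layout).
RANKS = ["domain", "phylum", "class", "order", "family", "genus", "species"]

def find_parent_conflicts(lineages):
    """Return {(rank, name): parents} for taxa seen under more than one parent."""
    parents = defaultdict(set)
    for lineage in lineages:
        taxa = [t.strip() for t in lineage.split(";")]
        # Pair each taxon with the taxon one rank above it.
        for rank, parent, child in zip(RANKS[1:], taxa, taxa[1:]):
            parents[(rank, child)].add(parent)
    return {key: ps for key, ps in parents.items() if len(ps) > 1}

lineages = [
    "Bacteria;Bacillota;Clostridia;Lachnospirales;Lachnospiraceae;Blautia_A;Blautia_A wexlerae",
    "Bacteria;Bacillota_A;Clostridia;Clostridiales;Clostridiaceae;Clostridium;Clostridium septicum",
]

conflicts = find_parent_conflicts(lineages)
for (rank, name), ps in conflicts.items():
    print(f"{rank} {name} appears under: {sorted(ps)}")
```

On the two lineages from the question this reports Clostridia (the third field) as sitting under both Bacillota and Bacillota_A. A hit like this can be a sign that classifications from more than one release ended up in the same table.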

Note that the suffix on higher taxon names (family and above) only indicates polyphyly of a group in a given reference FastTree, so for all intents and purposes Bacillota = Bacillota_A, etc. With a perfect reference tree these suffixes wouldn’t exist.
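
If you want to treat suffixed higher taxa as one group when summarising results, a rough sketch in Python could look like the following. It assumes seven semicolon-separated fields without rank prefixes and that suffixes are an underscore followed by uppercase letters; `strip_higher_rank_suffixes` is a hypothetical helper, not a GTDB function. Genus and species are left untouched because the note above only covers family and above. Whether collapsing is appropriate depends on your analysis.

```python
import re

# Assumed suffix pattern: underscore plus uppercase letters at the end of a name.
SUFFIX = re.compile(r"_[A-Z]+$")

def strip_higher_rank_suffixes(lineage):
    """Remove placeholder suffixes from domain..family; keep genus and species as-is."""
    taxa = lineage.split(";")
    # Fields 0-4 are domain, phylum, class, order, family.
    higher = [SUFFIX.sub("", t) for t in taxa[:5]]
    return ";".join(higher + taxa[5:])

example = "Bacteria;Bacillota_A;Clostridia;Lachnospirales;Lachnospiraceae;Blautia_A;Blautia_A wexlerae"
collapsed = strip_higher_rank_suffixes(example)
print(collapsed)
```

The phylum becomes Bacillota while the genus Blautia_A keeps its suffix.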