I have noticed that genomes excluded from RefSeq for having “many frameshifted proteins” are sometimes selected as GTDB representatives.
Yet higher-quality genomes are sometimes available as alternatives.
Below are some examples, listed as current GTDB representative -> higher-quality alternative:
GCA_000614735.1 (excluded from RefSeq) -> GCF_001434975.1
GCA_001311765.1 (excluded from RefSeq) -> GCF_001434815.1
Could you fix this in the next release of GTDB?