Converting NCBI to GTDB


I am using the mapping file bac120_metadata.tsv (from the GTDB website) to convert NCBI taxonomies to GTDB taxonomies. How was this mapping achieved? As I understand it, it is difficult, if not impossible, to map NCBI to GTDB one-to-one.

Kind regards,


I am also interested in this.

I guess you are using the NCBI columns of that file, then? GTDB also offers some Excel files for download, with names like gtdb_vs_ncbi_domain.xlsx, as well as the reverse mapping. However, these are also a little difficult to understand, and as far as I could see they do not contain the full taxonomy either.

You are right in thinking there is no 1-to-1 mapping here, but it would be nice if someone with the expertise at GTDB could comment on this. Converting between these taxonomies is bound to be an important topic for many use cases.
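In the meantime, here is a rough sketch of how one could tabulate the NCBI-to-GTDB relationship at a single rank from the metadata file. It says nothing about how GTDB itself produced the file. It assumes the file has tab-separated columns named `gtdb_taxonomy` and `ncbi_taxonomy` holding semicolon-separated, rank-prefixed strings (please check this against your own download). The sample rows, the taxon names, and the helper functions `rank_name`, `tally_mapping` and `majority_vote` are all made up for illustration, and the majority vote is just one simple way to force a single answer, not GTDB's method.

```python
import csv
import io
from collections import Counter, defaultdict

# Toy stand-in for bac120_metadata.tsv. The rows and taxon names are invented;
# the column names are an assumption about the real file.
SAMPLE_TSV = (
    "accession\tgtdb_taxonomy\tncbi_taxonomy\n"
    "acc1\td__Bacteria;g__Alpha\td__Bacteria;g__Alpha\n"
    "acc2\td__Bacteria;g__Alpha\td__Bacteria;g__Alpha\n"
    "acc3\td__Bacteria;g__Alpha_A\td__Bacteria;g__Alpha\n"
    "acc4\td__Bacteria;g__Gamma\td__Bacteria;g__Beta\n"
    "acc5\td__Bacteria;g__Gamma\td__Bacteria;g__\n"
)


def rank_name(taxonomy, prefix):
    """Return the taxon carrying the rank prefix (e.g. 'g__'), or None if absent or empty."""
    for taxon in taxonomy.split(";"):
        taxon = taxon.strip()
        if taxon.startswith(prefix) and len(taxon) > len(prefix):
            return taxon
    return None


def tally_mapping(handle, prefix="g__"):
    """Count, for each NCBI taxon at one rank, how many genomes fall in each GTDB taxon."""
    counts = defaultdict(Counter)
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        ncbi = rank_name(row["ncbi_taxonomy"], prefix)
        gtdb = rank_name(row["gtdb_taxonomy"], prefix)
        if ncbi and gtdb:  # skip genomes unclassified at this rank
            counts[ncbi][gtdb] += 1
    return counts


def majority_vote(counts):
    """Pick the most frequent GTDB taxon for each NCBI taxon (one possible heuristic)."""
    return {ncbi: gtdb.most_common(1)[0][0] for ncbi, gtdb in counts.items()}


counts = tally_mapping(io.StringIO(SAMPLE_TSV))

# NCBI taxa whose genomes are spread over more than one GTDB taxon
ambiguous = {ncbi: dict(gtdb) for ncbi, gtdb in counts.items() if len(gtdb) > 1}

print(ambiguous)
print(majority_vote(counts))
```

For the real file you would replace `io.StringIO(SAMPLE_TSV)` with an opened file handle. Looking at `ambiguous` before trusting the majority vote shows directly where the mapping is not 1-to-1.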
