Hi, Please help me to resolve the following error :
[2024-06-25 15:03:32] INFO: GTDB-Tk v2.4.0
[2024-06-25 15:03:32] INFO: gtdbtk classify_wf --genome_dir /Volumes/Ext.HD-NRLC_Old/genomes --out_dir /Volumes/Ext.HD-NRLC_Old/gtdbtk_output_new --mash_db /Volumes/Ext.HD-NRLC_Old/gtdbtk_mash_sketch.msh --extension fasta --cpus 4 --tmpdir /Volumes/Ext.HD-NRLC_Old/tmp
[2024-06-25 15:03:32] INFO: Using GTDB-Tk reference data version r220: /Volumes/Ext.HD-NRLC_Old/gtdbtk_data/release220
[2024-06-25 15:03:32] INFO: Loading reference genomes.
[2024-06-25 15:03:32] INFO: Using Mash version 2.3
[2024-06-25 15:03:32] INFO: Loading data from existing Mash sketch file: /Volumes/Ext.HD-NRLC_Old/gtdbtk_output_new/classify/ani_screen/intermediate_results/mash/gtdbtk.user_query_sketch.msh
[2024-06-25 15:03:32] INFO: Creating Mash sketch file: /Volumes/Ext.HD-NRLC_Old/gtdbtk_mash_sketch.msh
[2024-06-25 15:53:29] INFO: Completed 113,104 genomes in 49.94 minutes (2,264.78 genomes/minute).
[2024-06-25 15:53:29] ERROR: Error generating Mash sketch:
Sketching /Volumes/Ext.HD-NRLC_Old/gtdbtk_data/release220/skani/database/GCA/000/008/085/GCA_000008085.1_genomic.fna.gz…
Sketching /Volumes/Ext.HD-NRLC_Old/gtdbtk_data/release220/skani/database/GCA/000/008/885/GCA_000008885.1_genomic.fna.gz…
Sketching /Volumes/Ext.HD-NRLC_Old/gtdbtk_data/release220/skani/database/GCA/000/009/845/GCA_000009845.1_genomic.fna.gz…
Sketching /Volumes/Ext.HD-NRLC_Old/gtdbtk_data/release220/skani/database/GCA/000/010/565/GCA_000010565.1_genomic.fna.gz…
Sketching /Volumes/Ext.HD-NRLC_Old/gtdbtk_data/release220/skani/database/GCA/000/011/445/GCA_000011445.1_genomic.fna.gz…
…
…
…
(omitting the processes)
…
…
…
Sketching /Volumes/Ext.HD-NRLC_Old/gtdbtk_data/release220/skani/database/GCF/963/378/075/GCF_963378075.1_genomic.fna.gz…
Sketching /Volumes/Ext.HD-NRLC_Old/gtdbtk_data/release220/skani/database/GCF/963/378/095/GCF_963378095.1_genomic.fna.gz…
Sketching /Volumes/Ext.HD-NRLC_Old/gtdbtk_data/release220/skani/database/GCF/963/378/105/GCF_963378105.1_genomic.fna.gz…
Writing to /Volumes/Ext.HD-NRLC_Old/gtdbtk_mash_sketch.msh…
libc++abi: terminating with uncaught exception of type kj::ExceptionImpl: kj/io.c++:405: failed: ::writev(fd, current, iov.end() - current): Invalid argument; fd = 3
stack: 104b2c679 104b2c97a 104b03884 104af375c 104aba5aa 104af4d14 104afa9c4 104ab07a3
[2024-06-25 15:53:29] ERROR: Controlled exit resulting from an unrecoverable error or warning.
An error would have been occurred when a mash sketch file (gtdbtk_mash_sketch.msh) was being generated (kj::ExceptionImpl). I am using an external HD (1.77TB space/2TB) to keep enough space to write. Is it better to use home directory (only 396.4 GB space/1TB)? Or are there any other solutions?
Hi. The error looks to be with mash itself, a 3rd party program we use internally in GTDB-Tk. I would try writing this to your local disk. Disk space shouldn’t be an issue here, but it might be an I/O issue if mash is producing results far quicker than can be written to an external disk.
Hi. The issue is obvious to me. I’d confirm that you aren’t running out of disk space, including that your tmp directory is not reaching 100% capacity.