Outputs
plassembler
will output a _plasmids.fasta
file, which will contain the assembled plasmid sequence(s) in FASTA format (including long and short read copy numbers in the header), and a _plasmids.gfa
file, which will contain the assembly graph from Unicycler that can be visualised in Bandage.
plassembler
also outputs a _summary.tsv
file, which gives the estimated copy number for each plasmid, for both short reads and long reads (see this paper for more details about plasmid copy numbers) and also gives each contig's top hit by mash distance in the PLSDB (if there is a hit), along with all its supporting information.
If plassembler
fails to assemble any plasmids at all in _plasmids.fasta
, all these files will still exist, but will be empty (to ensure plassembler can be easily integrated into workflow managers like Snakemake).
plassembler
will also output a log file, a flye_output
directory, which contains the output from Flye (it may be useful to decide whether you need more sequencing reads, or some strange assembly artifact occured) and a unicycler_output
directory containing the output from Unicycler. If --use_raven
is specified, a raven_output
director will exist instead of flye_output
.
Other Outputs
If you use the --keep_fastqs
flag, plassembler
will keep FASTQ files containing all reads that went into the Unicycler assembly. These will be kept in the plasmid_fastqs
directory. It will also keep all long reads that map to more than one contig as multimap_long.fastq
.
If you use --keep_chromosome
, Plassembler will keep the unpolished chromosome(s) as chromosome.fasta
.
What happens if plassembler
fails to find a plasmid?
-
There are two reasons why
plassembler
will fail to find a plasmid: -
Where Flye assembles a complete chromosome, and then Plassembler fails to find any plasmids using the short reads. Most of the time, this simply means that there are no plasmids in the your bacterial isolate. Plassembler will still make output files (to ensure you can easily use it with workflow managers like Snakemake), but these will be empty.
- Where there is insufficient coverage for Flye to assemble a complete circular chromosome. In these cases, it is recommended that you either do more long read sequencing so that you can assemble a complete circular chromosome, or check that your -c parameter is correcting set.