I am newbie in python programming, kindly help as I am having multi-DNA sequence file in FASTA format (test.fasta) as:
>AB1|sp.1 CTCTTCGTTCCATCTCATGCGGTTCTTTCTTCACGAGTATTTGAATTCGAACCGTCTTGTATCTAAAAGAACTCTA
>AB2|sp.1 CTCTTCGTTCCATTTATTACGGTTCTTTCTTCACGAGTATTTGAATTCGAACCGTCTTATTACTACAAAGAACTCTA
>AB3|sp.2 TTACTAAACATTATCCAAGATTGTTATTGTTTCTATATAATTCTCATGTATATGAATATGAATCCATCCTCTTTTTTCT
>AB4|sp.2 TTACTAAAAGTAATCCAAGATTGTTATTGTTTCTATATATTCGAACGTATATGAATATGAATCCATCCTCTTTTTTCT
and want to format it to text (with tab delimitation) as:
Sequence Class
CTCTTCGTTCCATCTCATGCGGTTCTTTCTTCACGAGTATTTGAATTCGAACCGTCTTGTATCTAAAAGAACTCTA AB1|sp.1
CTCTTCGTTCCATTTATTACGGTTCTTTCTTCACGAGTATTTGAATTCGAACCGTCTTATTACTACAAAGAACTCTA AB2|sp.1
TTACTAAACATTATCCAAGATTGTTATTGTTTCTATATAATTCTCATGTATATGAATATGAATCCATCCTCTTTTTTCT AB3|sp.2
TTACTAAAAGTAATCCAAGATTGTTATTGTTTCTATATATTCGAACGTATATGAATATGAATCCATCCTCTTTTTTCT AB4|sp.2
Kindly provide python code (3.x) for formatting as above.
I tried importing sequence file, I was able to display it but was not able to write further command to format it into tab-delimited … Kindly help
from Bio import SeqIO
for seq_record in SeqIO.parse("test.fasta", "fasta"):
print(seq_record.seq)
print(seq_record.id)