I need some actual DNA sequences for a computer program.

Discussion of all aspects of biological molecules, biochemical processes and laboratory procedures in the field.

Moderators: honeev, Leonid, amiradm, BioTeam

Post Reply
Awol
Garter
Garter
Posts: 13
Joined: Fri Dec 03, 2010 2:20 am

I need some actual DNA sequences for a computer program.

Post by Awol » Thu Mar 29, 2012 9:14 pm

I am currently writing a computer program to search DNA sequences. i need some DNA to test my code on. Any ideas where I could get a few thousand pb from?


User avatar
JackBean
Inland Taipan
Inland Taipan
Posts: 5694
Joined: Mon Sep 14, 2009 7:12 pm

Post by JackBean » Fri Mar 30, 2012 8:19 am

Why do you need actual sequences? Why cannot it be just random?
http://www.biolib.cz/en/main/

Cis or trans? That's what matters.

Awol
Garter
Garter
Posts: 13
Joined: Fri Dec 03, 2010 2:20 am

Post by Awol » Fri Mar 30, 2012 1:10 pm

I have been using random sequences up until now. Its easy enough to pick out a gene looking for start/stop codons (or Shine-Dalgarno sequences) but the 'gene' it identifies is not a multiple of 3 bases.
The program then assumes a deletion/insertion mutation (or equivalent) resulting an erroneous amino acid sequence.

User avatar
JackBean
Inland Taipan
Inland Taipan
Posts: 5694
Joined: Mon Sep 14, 2009 7:12 pm

Post by JackBean » Fri Mar 30, 2012 3:37 pm

Well, he program should read only in-frame start and stop codons.
http://www.biolib.cz/en/main/

Cis or trans? That's what matters.

Awol
Garter
Garter
Posts: 13
Joined: Fri Dec 03, 2010 2:20 am

Post by Awol » Fri Mar 30, 2012 3:58 pm

If not sure if I follow what you are trying to say....

If the random sequence is AUGAAAGGUAG (with AUG = Start and UAG = Stop) the 'gene' = AAAGG. Which is
5 bases = 2/3 codons. Where as an actually DNA sequence will tend to have genes of multiples of 3 unless mutated.

User avatar
JackBean
Inland Taipan
Inland Taipan
Posts: 5694
Joined: Mon Sep 14, 2009 7:12 pm

Post by JackBean » Tue May 15, 2012 6:12 am

No, the gene is
AUG AAA GGU AG...
because the STOP codon is not in-frame, it won't be read. It will be like if there was none.
http://www.biolib.cz/en/main/

Cis or trans? That's what matters.

User avatar
JackBean
Inland Taipan
Inland Taipan
Posts: 5694
Joined: Mon Sep 14, 2009 7:12 pm

Post by JackBean » Tue May 15, 2012 6:12 am

Ane I was trying to point out also, that the START codon AUG is part of the gene ;)
http://www.biolib.cz/en/main/

Cis or trans? That's what matters.

Post Reply

Who is online

Users browsing this forum: No registered users and 5 guests