Login

|
|
Want to identify common nucleotide motifsModerator: BioTeam
6 posts • Page 1 of 1
Want to identify common nucleotide motifsI'm looking for a program/website that can identify motifs common between a number of long nucleotide sequences. I'd like to compare sequences (the more the better, 5-10 would be the minimum) each up to 150kb. I've found programs for 2 long sequences, or lots of short ones, but nothing for several long ones. Anyone know of a program/website that can do this?
Thanks.
Hi biznatch,
Which motif-finders have you tried? MEME is my favorite. However, I'm not sure that it can be used to discover motifs in such a large sequence set - typically, the bigger your sequences are, the more difficult it is to establish the statistical significance of any particular discovered motif (because the more sequence you have, the more likely it is that a 20-mer, for example, occurs by chance). I can come up with a list of other motif-finders, but I think most of them will suffer from the statistical problem I just described. What is the biological context for this project? These aren't 150kb promoters, are they? Maybe we can come up with another solution if I know what it is you're trying to find. Vi veri veniversum vivus vici
Thanks, I've tried MEME but ya it has a maximum of 60kb for all the sequences. I'm not really sure what I'm looking for, just doing a screen for a group of genes I've found to be commonly regulated and want to find possible common motifs in their genomic sequences. I'd like to compare the gene itself plus about 10kb up and downstream. The problem is that some of the genes are very big. I'm focusing more on larger motifs like over 100bp so that should make it easier to establish significance.
Hi biznatch,
Do you have any reason to expect that the elements controlling these genes' regulation are 100bp in length? Do you have any reason to expect that the elements controlling these genes' regulation are in the exons/introns, as opposed to the 5'UTR/3'UTR and 10kb promoter? I can tell you now that the social conventions for promoter element detection are: (i) Promoter elements are usually short (6-30bp) (ii) They are usually found in the UTRs, first intron, and promoter Granted, there is an issue of ascertainment bias in these claims, but these are the assumptions that motif-detection algorithms (like the ones MEME and AlignACE use) are based on. If you are dead set on finding ~100bp motifs, I guess you could try local/multiple sequence alignments. A motif of that size would probably be detectable using standard local alignment statistics. Alternatively, if you expect these motifs to pop up a lot in each sequence, you could try breaking the sequences into subsequences and finding the motif inside those. Vi veri veniversum vivus vici
Go to;
http://www.ncbi.nlm.nih.gov Then go for BLAST --- Nucleotide compare Progamme or FAST ---- Proteom compare programme You will get results in 15 sec. I oftnely use this site. It is a large database site. One of my favorites. Nature Bangs On My Mind
How is that supposed to help find ~100mer motifs in many input oligonucleotide sequences? We aren't interested in BLASTing against a database in this problem. Vi veri veniversum vivus vici
6 posts • Page 1 of 1
Who is onlineUsers browsing this forum: No registered users and 0 guests |

© Biology-Online.org. All Rights Reserved. Register | Login | About Us | Contact Us | Link to Us | Disclaimer & Privacy
Science Network - Braintrack.com - University Directory | Chemicool.com - Chemistry | EquationSheet.com - Equations | Logo design by LogoBee