|
Table 1 Statistical information of the genome-wide dataset.
|
Organism
|
# of genes
|
# of introns
|
Avg. introns
|
%GC
|
Phase distribution (%)
|
|
|
|
|
|
|
|
|
|
|
|
P-0
|
P-1
|
P-2
|
|
N. crassa
|
8817
|
15856
|
1.80
|
56.0
|
38.2
|
34.1
|
27.7
|
D. melanogaster
|
8932
|
27135
|
3.04
|
53.7
|
42.5
|
31.8
|
25.7
|
H. sapiens
|
11058
|
89508
|
8.09
|
52.3
|
45.4
|
31.2
|
23.4
|
F. graminearum
|
8168
|
18695
|
2.29
|
51.8
|
38.6
|
34.4
|
27.0
|
C. neoformans
|
5603
|
26945
|
4.81
|
51.0
|
41.6
|
30.7
|
27.7
|
X. tropicalis
|
7793
|
61999
|
7.96
|
47.2
|
46.7
|
30.3
|
23.0
|
A. thaliana
|
9734
|
52856
|
5.43
|
44.1
|
57.6
|
20.6
|
21.8
|
C. elegans
|
11128
|
60110
|
5.40
|
43.1
|
47.8
|
26.5
|
25.7
|
S. pombe
|
3791
|
3924
|
1.04
|
39.4
|
42.2
|
31.2
|
26.6
|
P. falciparum
|
3828
|
6127
|
1.60
|
23.6
|
44.8
|
32.6
|
22.6
|
|
| The 10 eukaryotes are arranged in descending order of GC contents (%) from top to bottom. P-0, phase-0; P-1, phase-1; P-2, phase-2. |
..................................................
Table 2 Prediction of intron phase distributions using the all-pattern intron insertion model and mutation correction.
Organism
Best mrate
Best %GC
Phase distribution (%)
Error
σ
P-0
P-1
P-2
N. crassa
-0.004
55.9
38.2
34.2
27.6
0.08*
0.001
D. melanogaster
-0.011
53.5
42.8
32.6
24.6
19.58
0.055
H. sapiens
+0.178
55.0
45.4
31.1
23.5
0.27*
0.066
F. graminearum
+0.163
54.3
38.7
34.7
26.6
1.75*
0.024
C. neoformans
+0.503
59.9
41.7
31.1
27.2
4.09*
0.125
X. tropicalis
+0.279
52.3
46.7
30.1
23.2
1.41*
0.138
A. thaliana
+0.632
57.6
57.5
20.7
21.8
0.69*
0.147
C. elegans
+0.539
54.6
48.6
28.4
23.0
274.98
2.260
S. pombe
+0.406
49.2
42.3
33.3
24.4
13.39
0.099
P. falciparum
+0.390
34.7
42.3
30.9
26.8
56.88
0.347
"Best mrate" shows the best mutation rate. "Best %GC", "phase distribution (%)", and "error" are averages of 20 simulations for mutation correction at the best mutation rate. Error is measured as the χ2 value between the observed and predicted intron phase distributions. "σ" shows the standard deviation of error in 20 simulations. P-0, phase-0; P-1, phase-1; P-2, phase-2.* Not significant at the P
..................................................
Source: BMC Evolutionary Biology 2006, 6:69
© 2006 Nguyen et al; licensee BioMed Central Ltd.