CIS700 class notes by Junwen Wang
12Jan06
General advice for reading paper
Alternative splicing
Motif finding algorithm
Graph alignment
Answer for first exon conservation
based on human mouse conservation on all refSeq genes, (hg16 and mm5), I calculated the average conservation human-mouse genes. As shown in table below, first and last exon are more conserved than the exons in the middle.
| type | sample size | average conservation | conservation at upper 5 percentile |
| 5' utr | 19965 | 9.69% | 86.73% |
| 3' utr | 20337 | 9.13% | 81.84% |
| all intron | 199808 | 9.82% | 70.15% |
| all exon | 220902 | 16.43% | 94.78% |
| first exon | 19816 | 23.90% | 94.12% |
| middle exon | 180025 | 15.00% | 95.05% |
| last exon | 19783 | 22.26% | 89.53% |