Identifying proteins from cDNAs

You are mapping the gene for an inherited hypouracilemia (lack of uracil) in an inbred strain of mice. You have narrowed the disease gene's location to a 500kb interval. You identify and sequence four different transcripts (mRNAs) that come from this region. Which of these is the best candidate for the disease gene?

Suggestions: If you need help determining the likely function of the genes, try looking for articles in a library database such as PubMed (http://ncbi.nlm.nih.gov/PubMed), or try finding them in a metabolic pathway at KEGG (http://www.genome.ad.jp/kegg). {WARNING. There is a huge amount of useful information at KEGG, but the newer versions have become progressively more difficult to use}

Sequences:

>Transcript 1
ttgccgctgtcgccgcggtgagggaagtggacgcgatggccgggtccgcgtgggtgtccaaggtctctcggctgctgggt
gcattccacaacacaaaacaggttcagacagtaactttaattcctggagatggaattggcccagaaatttcagcctcagt
catgaagatttttgatgctgccaaagcacctattcagtgggaggagcgcaatgtcacagcaattcaaggaccaggaggaa
agtggatgatccctccagaagccaaggagtccatggataagaacaagatgggcttgaaaggcccactaaagaccccaata
gccgctggccatccatctatgaatctgttgcttcgtaagacatttgacctttatgccaatgtccggccatgtgtctcaat
tgaaggttataaaaccccttacacggatgtaaatatcgtcaccatccgagagaacacggaaggagaatacagtggaattg
a


>Transcript 2
caacagccccagctcctgtgctggcctcttcattgcttcacacatcgggtttgactggcccggggtctgggtccacctgg
acatcgctgctccagtgcatgctggcgagcgagccacaggctttggggtggctctcctactggctctttttggccgtgcc
tccgaggacccgctgctgaacctggtatccccgctggactgtgaggtggatgcccaggaaggcgacaacatggggcgtga
ctccaagagacggaggctcgtgtgagggctacttcccagctggtgacacagggttccttacctcattttgcactgactga
tttaagcaattgaaagattaactaactcttaagatgagtttggcttctccttccgtgcccagtggtgacaggagtgagcc

>Transcript 3
gcagaaatgagctctgctggctccttggccactggaaactacaccaaggcagcagtcgggatggctgaggagcactgtga
atttgtcattggcttcatttctggctcccgagtaagcatgaaaccagagtttcttcacttgaccccaggagttcagttag
aaacaggaggggatcaccttggccagcagtacaatagtccccaagaagtgatcgggaaacgcggttctgatgtcatcatt
gtaggccggggaatacttgcagcggctaaccgcctagaagcagctgagatgtacaggaaggctgcctgggaggcatatct
gagtaggcttgctgttcagtgagacagaagacactgaaaatgggtggtatcccaaaagccgctggcttcagagaccagtg
ctcagggccctgcacagggtctgtgaagagagactctgcagcattgtatcatggtctaacttgtatcactgcatggtacc
tgtatgagcagccctttgaaagcttctcaagactggtattggtttggcagctgccacctttcaga

>Transcript 4
aggttgataccattcctgacctagcaaacctgtgtggtcctgagacacttcagtactatgagtgggtgcccgtttgcagg
aaacagtgtaggatacactttgaaaaacgtatctatggaggacaatgaagaagacagagctcaaactggtgtaaacagag
ccagcaaaggaggacttatctatggcaattacttgcagttggaaaagattttgaatgcgcaagaacttcagagtgaagta
aaagggaacaaaatccatgacgagcacctattcgtcataactcatcaagcttatgaactttggtttaaacaaatcctctg
ggaactagattctgttcgtgaaatcttccaaaatggccatgtcagggatgagaggaacatgctcaaggtgatagctcgga
tgcatcgtgtggtg

Back to courses