Analyze the following sequence in the attached file. Make sure to highlight all the key features.
The sequence shall, from now on, be referred to as TR-5. Translation from DNA to protein was done using the computer program DNAMAN (ver 4.11, Lynnon BioSoft, 1994). TR-5 is a 1292 bp sequence with a base composition of 26%A, 33%C, 23%G, and 18%T (Appendix A). The sequence was translated by DNAMAN (1994), in three frames giving the largest ORF in the plus 1 frame (Fig. 1).
The potential ORF (+1 reading frame) encodes a protein that is 329 amino acids in length, a pI of 8.22, and with a calculated molecular weight of 35012 Daltons (Appendix B). Using ProtParam tool from the ExPASy web-site (expasy.nhri.org.tw), the pI was calculated as 8.56 and the MW was 35065.6 Da (Appendix C). The results from ExPASy were similar to DNAMAN. The slight differences seen are probably due to the number of significant figures each tool uses in their calculations.
Figure 2 shows the cDNA sequence of TR-5 and the deduced amino acid sequence given a plus 1 reading frame. There is a polyadenylation (AATAAA) signal at 1239bp (Takagaki, et al. 1988). There is also a CAAT box at 18bp that represents a DNA transcription factor CTF/NF1 binding site, but no TATAA box in the sequence (Muller, et al. 1988). ...
This solution describes the analysis of a DNA sequence. The DNA sequence is attached as a word document. The solution also provides peer-reviewed references that help in answering the question. This solution is a good example of what is typically required in the analysis of an unknown DNA sequence.