See attached for full problem description
Which frame is most likely coding?
Which is the most significant domain match?
Does this protein have anything to do with cancer?
How about cell differentiation?
How about viral nucleic acid binding?
What chromosome is this DNA located?
What kind of cells express this DNA and make its coded protein?
For the first step, you need to copy the DNA sequence on the bottom of the page and go to the website listed for expasy. You paste your DNA sequence into the box and click on TRANSLATE SEQUENCE (I like the Compact format). This gives you the six different reading frames translated for your DNA sequence with 3 frames coming from each strand. It's important to understand why there are these 6 options. Remember that each amino acid in a protein is coded by 3 nucleotides, but you can start at any spot in the sequence.
As an example, your sequence starts with TTCTTTTCTAAGCAAACTTT
You could make this TTC TTT TCT AAG CAA ACT TT
or T TCT TTT CTA AGC AAA CTT T
or TT CTT TTC TAA GCA AAC TTT
Even though these three options all have the exact same DNA sequence, you can see that they can be grouped differently into various combinations for amino acids. Every 3-letter combination will give you a specific amino acid.
The solution analyzes a given sequence of DNA and finds the implications of it.