1. Introduction

Transcription and translation in a cell. 1. Outer membrane. 2. Area inside the cell. 3. Double helix molecule. 4. Large circular membrane inside the cell. 5. Double helix molecule with a bubble separating the strands and new bases being added on one side. 6. Single stranded molecule. 7. Hole in 4. 8. Roughly T-shaped loop with 3 bases at one end. 9. Linked chain of small circles emerging from 8 when interacting with 10 and 11. 10. Two-part structure interacting with 8 and 11. 11. Single-stranded molecule in 2 interacting with 8 and 10.In the previous tutorial, we looked at transcription, the first step in gene expression. During transcription (shown at 5 in the diagram to your left) the template strand of DNA (3) is transcribed into a sequence of single-stranded RNA (6)

After genes are transcribed, they need to be translated into protein. That process, translation, is the subject of this tutorial.

Why is the conversion of information in DNA into RNA called transcription, while the conversion of RNA information into a protein is called translation? Transcription is when spoken words are written down. The form changes, but the language remains the same. When DNA is transcribed into RNA, the language — nucleic acids — stays the same.

Translation involves moving information from one language to another (from English to Mandarin, for example). When information in RNA is translated by a ribosome into protein, the cell is changing the information from one that’s encoded in a sequence of nucleotides to one that’s encoded in a sequence of amino acids. Because of the magnitude of the change, this is called translation.

2. A quick review of proteins

Proteins are polymers of amino acids. In the diagram on the right, the amino acids are represented by three-letter codes: Asn stands for asparagine. Gly stands for glycine.

Polypeptide chain with 19 amino acids starting with Asn and ending with His.

A note about terminology: A string of amino acids like the one shown on the right is called a polypeptide. When a polypeptide folds up into its three-dimensional shape, it’s considered to be a protein.

All proteins are composed of various combinations of the 20 amino acids found in nature. Because a protein’s shape depends on its sequence of amino acids, the cell must be able to translate the information in RNA into the sequence of amino acids that defines a protein’s form and function.

But that leads to a question: how can a message written in RNA, which consists of only four bases (A, U, C, G), specify all twenty amino acids?

3. A Coding Challenge

[qwiz qrecord_id=”sciencemusicvideosMeister1961-A Coding Challenge (v2.0)”]

[q labels = “top”]The table below is a code dictionary. Use it to decipher the coded message.

 

 

DECIPHER THIS CODED MESSAGE:

 ___  ___  ___  ___  ___  ___  ___
CAA AGU GAU AGU UAA CGU ACA
 ___  ___  ___ ___
ACA GCA AGG AGU

[l]A

[fx] No. Please try again.

[f*] Excellent!

[l]B

[fx] No, that’s not correct. Please try again.

[f*] Excellent!

[l]C

[fx] No. Please try again.

[f*] Great!

[l]D

[fx] No. Please try again.

[f*] Excellent!

[l]E

[fx] No, that’s not correct. Please try again.

[f*] Excellent!

[l]F

[fx] No, that’s not correct. Please try again.

[f*] Correct!

[l]G

[fx] No, that’s not correct. Please try again.

[f*] Excellent!

[l]H

[fx] No. Please try again.

[f*] Correct!

[l]I

[fx] No, that’s not correct. Please try again.

[f*] Correct!

[l]J

[fx] No. Please try again.

[f*] Correct!

[l]K

[fx] No, that’s not correct. Please try again.

[f*] Excellent!

[l]L

[fx] No, that’s not correct. Please try again.

[f*] Excellent!

[l]M

[fx] No. Please try again.

[f*] Good!

[l]N

[fx] No, that’s not correct. Please try again.

[f*] Great!

[l]O

[fx] No, that’s not correct. Please try again.

[f*] Great!

[l]P

[fx] No. Please try again.

[f*] Good!

[l]Q

[fx] No, that’s not correct. Please try again.

[f*] Good!

[l]R

[fx] No. Please try again.

[f*] Good!

[l]S

[fx] No. Please try again.

[f*] Good!

[l]T

[fx] No. Please try again.

[f*] Great!

[l]U

[fx] No, that’s not correct. Please try again.

[f*] Correct!

[l]V

[fx] No. Please try again.

[f*] Good!

[l]W

[fx] No. Please try again.

[f*] Great!

[l]X

[fx] No, that’s not correct. Please try again.

[f*] Good!

[l]Y

[fx] No, that’s not correct. Please try again.

[f*] Excellent!

[l]Z

[fx] No. Please try again.

[f*] Good!

[/qwiz]

Let’s think about what you just did.

The two-word answer is written in our 26-letter English alphabet. But the answer was encoded using just four letters: A, U, C, and G. The way that four letters were able to represent our 26-letter alphabet was to have 3-letter combinations of A, U, C, and G code for one letter of our alphabet. AAA coded for the letter “A.” AAC coded for the letter “B,” and so on.

I used the letters A, U, C, and G because each of these letters corresponds to one of the four RNA nucleotides (adenine, uracil, cytosine, and guanine). Cells use this same coding system to translate messenger RNA nucleotide sequences into the amino acid sequences of proteins. In the genetic code, groups of three mRNA nucleotides stand for one amino acid. You can see this in the image below:

mRNA strand separated by lines every three bases and amino acids above each group. From top to bottom and left to right: Met-AUG. Val-GUC. Lys-AAG. Leu-CUC. Asp-GAU. Ala-GCA. Val-GUC. Arg-CGU.

Three mRNA nucleotides are called a codon. Codon means “codes one.”  In the example above, the first codon is AUG, which codes for the amino acid methionine (Met). The second codon, GUC, codes for valine (Val), and so on.

4. Using a Genetic Code Dictionary

Genetic code dictionaries let you translate sequences of RNA into sequences of amino acids. In other words, if you’re given a sequence of mRNA, you can use a genetic code dictionary to figure out what amino acid sequence that mRNA is coding for.

Let’s say that you wanted to see what amino acids would result from the following mRNA sequence (it’s the one shown in the image above, with the corresponding amino acids removed).

AUGGUCAAGGUUCUCGAUGCAGUCGU

The first step is to find the start codon, AUG. In cells, AUG signals to ribosomes (the cell’s protein factories) where to start translating mRNA into protein. To make things easy, I’ve put the start codon at the start. Next, divide the mRNA into codons (groups of three RNA nucleotides). Do this by drawing a line after every third codon. You’ll wind up with something like what’s shown below.

mRNA codon AUG GUC AAG GUU CUC GAU GCA GUC CGU

The next step is to look up each codon in a Genetic Code dictionary. One form of the dictionary is the circular one shown below. I think it’s the easiest one to use, so we’ll start with that one.

Circular codon wheel. Innermost circle is yellow and divided into four quadrants, with black letters reading clockwise from the upper left: A, C, G, U. The second ring is orange and separated into 16 segments so that each yellow inner quadrant is lined up with 4 orange segments (A, C, G, U). The third ring is dark orange and aligns each orange segment from the second ring with 4 letters (U, C, A, G). The fourth ring is cyan and contains the names of all amino acids and two STOPs associated with different segments of the wheel. The fifth outermost ring is green and contains the three letter abbreviations for the amino acids and STOPs in the cyan ring.
A modified version of an image from http://www.clker.com/

Here’s how it works.

  1. The inner ring (yellow) represents the first nucleotide in a codon.
  2. The second ring (light orange) represents the second nucleotide
  3. The third ring (orange/red) represents the third nucleotide
  4. The fourth ring (light blue) tells you the amino acid
  5. The last ring (green) gives you the three-letter abbreviation for the amino acid.

Upper left quadrant of the colored circular codon wheel. From the center to the outside, the following are circled: A. U. G. Methionine. Met.To translate an mRNA codon to an amino acid, just start from the inside and work your way out. Here’s how you would do it for AUG:

The first “A” takes you to the upper left of the inside circle, as shown on the left. Then move out to “U,” then out again to “G.” The fourth ring shows you that the corresponding amino acid to AUG is “methionine,” and the outermost ring tells you that “methionine” can be abbreviated as Met.

The table below shows you how to translate the next two codons: GUC, and AAG.


CODON GUC AAG
DICTIONARY Lower right quadrant of the colored circular codon wheel. From the center to the outside, the following are circled: G. U. C. Valine. Val. Upper left quadrant of the colored circular codon wheel. From the center to the outside, the following are circled: A. A. G. Lysine. Lys.
AMINO ACID Valine Lysine
Abbreviation Val Lys

Now that you’ve worked with the genetic code dictionary, there are two things to note.

  1. Many codons are synonymous with one another. That means that they code for the same amino acid. Here’s why. With four bases in combinations of 3, there are 4possible codons. 43 = 64. With only 20 amino acids to code for, it follows that many codons would code for the same amino acid.
  2. The code includes some “punctuation.”
    1. The codon AUG codes for methionine, but it’s also the start codon. All genes for proteins begin with the codon AUG, and all resulting proteins begin with the amino acid methionine. Later, when the protein is processed into its final form, the methionine is often snipped off. AUG, in other words, is like the capital letter that begins a sentence.
    2. The codons UAA, UAG, and UGA are stop codons. Like the period at the end of the sentence, these codons tell the ribosome when to stop translating mRNA.
  3. Even though all proteins begin with methionine and the corresponding codon, AUG, many practice problems will leave this out. If that’s the case, just assume that the first three bases make up the first codon for you to translate.

Another widely used genetic code dictionary has the codons arrayed in a table, as shown below.

Codon table. Left side is labeled "First letter", top side is labeled "Second letter" and right side is labeled "Third letter". Left column contains one letter in each of the four rows, starting with U and descending to C, A, and G. Top row contains four columns each with one letter, from left to right: U, C, A, G. Rightmost column contains four letters (U, C, A, G) in each of its four rows. Inside the table, the codons are all listed, with their associated amino acids, abbreviated, to the right. For example, the upper left cell contains UUU and UUC for Phe and UUA and UUG for Leu.
Source: http://www.mun.ca/biology/scarr/MGA2_03-20.html

Codon table. Left side is labeled "First letter", top side is labeled "Second letter" and right side is labeled "Third letter". Left column contains one letter in each of the four rows, starting with U and descending to C, A, and G. Top row contains four columns each with one letter, from left to right: U, C, A, G. Rightmost column contains four letters (U, C, A, G) in each of its four rows. Inside the table, the codons are all listed, with their associated amino acids, abbreviated, to the right. The U on the left, C in the second column, and A on the top rightmost cell are boxed in red. UCA and Ser, located in the first row, second column, are boxed in red.Let’s say that you wanted to translate the codon UCA using this table. Here’s how you’d do it.

  1. Start with the column on the left. Find “U,” the first letter of the codon.
  2. Use the row on top to find “C,” the second letter of the codon.
  3. Find the square where this row and column intersect. Now, look over to the right to find “A,” the third letter.
  4. The specific row where all three intersect lists the codon that you’re looking for (which in this example is UCA). Right next to UCA, you can see Ser, the abbreviation for Serine, the amino acid that corresponds to UCA. Note that this dictionary doesn’t give you the full name for the amino acid, which you might need to look up (which is for me, another disadvantage of this dictionary).

When you’re working on your own, you can choose whichever dictionary is easiest for you. In class, especially on a test, you might have to work with whatever dictionary your teacher provides, so it’s probably a good idea to learn how to use both. For a printer-friendly version of the code, click here.

5. Genetic Code Practice Problems 1: mRNA to Protein

Translate the following mRNA sequences into an amino acid sequence. Dictionaries have been provided for each problem. To give you practice with each, some problems use a circular dictionary, and others use a tabular dictionary. Feel free to print out a copy of the genetic code dictionary, or, if you’re doing this in a classroom, ask your teacher or professor.

[qwiz qrecord_id=”sciencemusicvideosMeister1961-Genetic Code Practice Problems 1 (v2.0)”]

[h]Genetic Code Practice Problems: mRNA to Protein

[q labels = “top”]

 

amino acids _____ _____ _____ _____ _____ _____
mRNA codons UAC UGC AGU CCC GUG UGA
 

[l]Ala

[fx] No, that’s not correct. Please try again.

[f*] Excellent!

[l]Arg

[fx] No, that’s not correct. Please try again.

[f*] Correct!

[l]Asn

[fx] No. Please try again.

[f*] Excellent!

[l]Asp

[fx] No. Please try again.

[f*] Great!

[l]Cys

[fx] No. Please try again.

[f*] Excellent!

[l]Gln

[fx] No. Please try again.

[f*] Excellent!

[l]Glu

[fx] No, that’s not correct. Please try again.

[f*] Good!

[l]Gly

[fx] No, that’s not correct. Please try again.

[f*] Excellent!

[l]His

[fx] No. Please try again.

[f*] Correct!

[l]Ile

[fx] No, that’s not correct. Please try again.

[f*] Excellent!

[l]Leu

[fx] No, that’s not correct. Please try again.

[f*] Correct!

[l]Lys

[fx] No. Please try again.

[f*] Great!

[l]Met

[fx] No. Please try again.

[f*] Correct!

[l]Phe

[fx] No. Please try again.

[f*] Correct!

[l]Pro

[fx] No. Please try again.

[f*] Correct!

[l]Ser

[fx] No. Please try again.

[f*] Great!

[l]Thr

[fx] No. Please try again.

[f*] Correct!

[l]Trp

[fx] No. Please try again.

[f*] Excellent!

[l]Tyr

[fx] No. Please try again.

[f*] Excellent!

[l]Val

[fx] No, that’s not correct. Please try again.

[f*] Good!

[l]STOP

[fx] No. Please try again.

[f*] Excellent!

[q labels= “top”]

 

amino acids _____ _____ _____ _____ _____ _____ _____
mRNA codons GAG CAU AUA CUG UUU AGA UUC

[l]Ala

[fx] No. Please try again.

[f*] Good!

[l]Arg

[fx] No. Please try again.

[f*] Good!

[l]Asn

[fx] No. Please try again.

[f*] Excellent!

[l]Asp

[fx] No, that’s not correct. Please try again.

[f*] Great!

[l]Cys

[fx] No. Please try again.

[f*] Good!

[l]Gln

[fx] No. Please try again.

[f*] Good!

[l]Glu

[fx] No, that’s not correct. Please try again.

[f*] Correct!

[l]Gly

[fx] No. Please try again.

[f*] Excellent!

[l]His

[fx] No, that’s not correct. Please try again.

[f*] Good!

[l]Ile

[fx] No, that’s not correct. Please try again.

[f*] Great!

[l]Leu

[fx] No. Please try again.

[f*] Good!

[l]Lys

[fx] No, that’s not correct. Please try again.

[f*] Great!

[l]Met

[fx] No. Please try again.

[f*] Correct!

[l]Phe

[fx] No, that’s not correct. Please try again.

[f*] Excellent!

[l]Pro

[fx] No. Please try again.

[f*] Good!

[l]Ser

[fx] No. Please try again.

[f*] Excellent!

[l]Thr

[fx] No. Please try again.

[f*] Good!

[l]Trp

[fx] No. Please try again.

[f*] Great!

[l]Tyr

[fx] No. Please try again.

[f*] Excellent!

[l]Val

[fx] No. Please try again.

[f*] Correct!

[l]STOP

[fx] No, that’s not correct. Please try again.

[f*] Correct!

[x]

[restart]

[/qwiz]

6. Genetic Code Practice Problems Set 2: DNA to Protein

In the problems above, you translated sequences of mRNA into the sequences of amino acids that make up a protein.

To demonstrate that you understand molecular biology’s central dogma (DNA makes RNA makes protein), you’ll want to be able to start with sequences of DNA, transcribe them into RNA, and then use a genetic code dictionary to translate the mRNA into protein. In what follows, I’ll mark DNA, RNA, and protein with colors to clarify what’s what.

Let’s use this DNA sequence: CCGGCTAGCGTCTGA as an example. 

The first step is to transcribe the DNA into RNA. Remember that T(thymine) in DNA is replaced by U (Uracil) in RNA. That means that during transcription, A in DNA generates a U in RNA, as shown below. For exercises transcribing DNA into RNA, click here.

mRNA codons GGC CGA UCG CAG ACU
DNA (triplets) CCG GCT AGC GTC TGA 

Next, use the genetic code dictionary to translate the mRNA into an amino acid sequence.

Amino acids Gly Arg Ser Glu Thr
mRNA codons GGC CGA UCG CAG ACU
DNA (triplets) CCG GCT AGC GTC TGA 

Ready? Try a few practice problems to see if you’ve got it. To prompt you to remember to transcribe the DNA to RNA before translating, I’ve left an empty row in the table. A good technique would be to write down the mRNA codons on a piece of scrap paper, then translate them into amino acids.

[qwiz qrecord_id=”sciencemusicvideosMeister1961-Genetic Code Practice Problems 2 (v2.0)”]

[h]Genetic Code Practice 2: DNA to Protein

[q labels = “top”] Remember to transcribe before translating!

amino acids _____ _____ _____ _____ _____
mRNA codons
DNA GTA CTC TTA ACC ACT

[l]Ala

[fx] No. Please try again.

[f*] Excellent!

[l]Arg

[fx] No. Please try again.

[f*] Excellent!

[l]Asn

[fx] No, that’s not correct. Please try again.

[f*] Great!

[l]Asp

[fx] No. Please try again.

[f*] Correct!

[l]Cys

[fx] No. Please try again.

[f*] Excellent!

[l]Gln

[fx] No. Please try again.

[f*] Excellent!

[l]Glu

[fx] No, that’s not correct. Please try again.

[f*] Great!

[l]Gly

[fx] No. Please try again.

[f*] Great!

[l]His

[fx] No, that’s not correct. Please try again.

[f*] Good!

[l]Ile

[fx] No. Please try again.

[f*] Good!

[l]Leu

[fx] No, that’s not correct. Please try again.

[f*] Good!

[l]Lys

[fx] No. Please try again.

[f*] Excellent!

[l]Met

[fx] No, that’s not correct. Please try again.

[f*] Excellent!

[l]Phe

[fx] No. Please try again.

[f*] Excellent!

[l]Pro

[fx] No. Please try again.

[f*] Excellent!

[l]Ser

[fx] No, that’s not correct. Please try again.

[f*] Correct!

[l]Thr

[fx] No. Please try again.

[f*] Correct!

[l]Trp

[fx] No, that’s not correct. Please try again.

[f*] Great!

[l]Tyr

[fx] No, that’s not correct. Please try again.

[f*] Excellent!

[l]Val

[fx] No. Please try again.

[f*] Correct!

[l]STOP

[fx] No. Please try again.

[f*] Correct!

[q labels = “top”]

 

amino acids _____ _____ _____ _____ _____
mRNA codons
DNA CCA AAG AAC TCA AGT
 

[l]Ala

[fx] No. Please try again.

[f*] Correct!

[l]Arg

[fx] No. Please try again.

[f*] Correct!

[l]Asn

[fx] No, that’s not correct. Please try again.

[f*] Correct!

[l]Asp

[fx] No, that’s not correct. Please try again.

[f*] Good!

[l]Cys

[fx] No, that’s not correct. Please try again.

[f*] Correct!

[l]Gln

[fx] No, that’s not correct. Please try again.

[f*] Good!

[l]Glu

[fx] No, that’s not correct. Please try again.

[f*] Excellent!

[l]Gly

[fx] No. Please try again.

[f*] Correct!

[l]His

[fx] No. Please try again.

[f*] Correct!

[l]Ile

[fx] No, that’s not correct. Please try again.

[f*] Correct!

[l]Leu

[fx] No, that’s not correct. Please try again.

[f*] Excellent!

[l]Lys

[fx] No, that’s not correct. Please try again.

[f*] Correct!

[l]Met

[fx] No. Please try again.

[f*] Great!

[l]Phe

[fx] No. Please try again.

[f*] Excellent!

[l]Pro

[fx] No, that’s not correct. Please try again.

[f*] Excellent!

[l]Ser

[fx] No, that’s not correct. Please try again.

[f*] Correct!

[l]Thr

[fx] No, that’s not correct. Please try again.

[f*] Correct!

[l]Trp

[fx] No, that’s not correct. Please try again.

[f*] Great!

[l]Tyr

[fx] No. Please try again.

[f*] Excellent!

[l]Val

[fx] No, that’s not correct. Please try again.

[f*] Great!

[l]STOP

[fx] No. Please try again.

[f*] Correct!

[/qwiz]

7. What’s Next?