Glycine
Smallest amino acid. Highly flexible due to no side chain. Found in collagen (every 3rd residue).
These 20 molecular building blocks combine in endless ways to create every protein in every living organism on Earth.
For a protein of length n:
20ⁿ possible sequences
10 amino acids: 20¹⁰ = 10,240,000,000,000 (10 trillion)
100 amino acids: 20¹⁰⁰ = 10¹³⁰ combinations
300 amino acids (average protein): 20³⁰⁰ = 10³⁹⁰ combinations
For perspective: There are only about 10⁸⁰ atoms in the observable universe. The number of possible 300-amino-acid proteins (10³⁹⁰) is unimaginably larger!
~500,000+
Proteins in UniProt (reviewed)
~250 million+
Total protein sequences in databases
~200 million
Structures predicted by AlphaFold
Human proteome: ~20,000 protein-coding genes producing ~100,000+ distinct proteins through alternative splicing and modifications.
Only proteins that provide survival advantages are selected. Most random sequences would not fold properly or have useful functions.
Only ~1 in 10⁷⁷ random sequences can fold into a stable, functional structure. Most sequences are "junk" that would misfold or aggregate.
Proteins evolved from common ancestors. There are only ~2,000 distinct protein fold families, with variations built on these templates.
Smallest amino acid. Highly flexible due to no side chain. Found in collagen (every 3rd residue).
Simple, small hydrophobic residue. One of the most common amino acids. Important in energy metabolism.
Branched-chain amino acid (BCAA). Essential amino acid. Important for muscle metabolism.
Branched-chain amino acid. Most abundant essential amino acid. Key regulator of protein synthesis (mTOR).
Branched-chain amino acid. Essential. Important for hemoglobin synthesis and blood sugar regulation.
Unique cyclic structure. Introduces kinks in protein chains. Abundant in collagen. Disrupts secondary structures.
Essential aromatic amino acid. Precursor to tyrosine. People with PKU cannot metabolize it properly.
Precursor to dopamine, epinephrine, thyroid hormones. Can be phosphorylated for cell signaling.
Largest amino acid. Essential. Precursor to serotonin and melatonin. Rarest amino acid in proteins.
Polar, can be phosphorylated. Important in enzyme active sites. Precursor to glycine and cysteine.
Essential amino acid. Can be phosphorylated. Important for collagen, elastin, and tooth enamel.
Contains sulfur. Forms disulfide bonds critical for protein structure. Important antioxidant (glutathione).
Essential. Start codon (AUG) codes for Met. Important for methylation reactions. Contains sulfur.
Amide of aspartic acid. Common site for N-linked glycosylation. Important for protein folding.
Most abundant amino acid in blood. Important nitrogen carrier. Fuel for rapidly dividing cells.
Negatively charged at pH 7. Important in enzyme active sites. Precursor to other amino acids.
Negatively charged. Major excitatory neurotransmitter. Used as flavor enhancer (MSG).
Essential. Positively charged. Important for collagen crosslinking. Can be acetylated/methylated (histones).
Positively charged. Precursor to nitric oxide. Important in immune function and wound healing.
pKa near physiological pH - acts as buffer. Essential for infants. Found in hemoglobin active site.
Must be obtained from diet - body cannot synthesize them.
Remember: "PVT TIM HaLL" - Phe, Val, Thr, Trp, Ile, Met, His, Arg (conditionally), Leu, Lys
Body can synthesize these from other molecules.
Note: Arginine, Cysteine, Glutamine, Tyrosine, Glycine, and Proline are "conditionally essential" during illness or stress.
Each amino acid is encoded by one or more three-nucleotide sequences (codons) in mRNA. This redundancy helps protect against mutations.
| Amino Acid | 1-Letter | 3-Letter | Codons | # Codons |
|---|---|---|---|---|
| Alanine | A | Ala | GCU, GCC, GCA, GCG | 4 |
| Arginine | R | Arg | CGU, CGC, CGA, CGG, AGA, AGG | 6 |
| Asparagine | N | Asn | AAU, AAC | 2 |
| Aspartic Acid | D | Asp | GAU, GAC | 2 |
| Cysteine | C | Cys | UGU, UGC | 2 |
| Glutamic Acid | E | Glu | GAA, GAG | 2 |
| Glutamine | Q | Gln | CAA, CAG | 2 |
| Glycine | G | Gly | GGU, GGC, GGA, GGG | 4 |
| Histidine | H | His | CAU, CAC | 2 |
| Isoleucine | I | Ile | AUU, AUC, AUA | 3 |
| Leucine | L | Leu | UUA, UUG, CUU, CUC, CUA, CUG | 6 |
| Lysine | K | Lys | AAA, AAG | 2 |
| Methionine | M | Met | AUG (Start) | 1 |
| Phenylalanine | F | Phe | UUU, UUC | 2 |
| Proline | P | Pro | CCU, CCC, CCA, CCG | 4 |
| Serine | S | Ser | UCU, UCC, UCA, UCG, AGU, AGC | 6 |
| Threonine | T | Thr | ACU, ACC, ACA, ACG | 4 |
| Tryptophan | W | Trp | UGG | 1 |
| Tyrosine | Y | Tyr | UAU, UAC | 2 |
| Valine | V | Val | GUU, GUC, GUA, GUG | 4 |
| Stop Codons | * | Stop | UAA, UAG, UGA | 3 |
Total: 64 codons encoding 20 amino acids + 3 stop signals = genetic code redundancy
See how these 20 building blocks combine to create the proteins that power life.