Biochemistry Foundations — Chemistry Reference
Biochemistry is the chemistry of living systems: the molecules, reactions, energy flows, and information transfer that constitute life. It sits at the interface of organic chemistry, physical chemistry, structural biology, and molecular biology. This note covers the canonical foundations — water, amino acids and proteins, enzymes and kinetics, cofactors, carbohydrates, lipids, nucleic acids, metabolism, bioenergetics, signaling, structural methods, industrial enzymology, clinical biochem — and the 2024-26 wave of AI-driven biochemistry (AlphaFold-3, AlphaProteo, RFdiffusion, Boltz).
Companion references: [[Chemistry/organic-chemistry-foundations]] (functional groups, stereochemistry, reaction mechanisms), [[Chemistry/analytical-chemistry-methods]] (spectroscopy, chromatography, MS), [[Biology/cell-molecular-biology]] (membranes, organelles, signaling pathways), [[Biology/genetics-and-genomics]] (DNA replication, transcription, translation).
1. Water, pH, and Buffers
1.1 Water as the universal biological solvent
Water (H₂O) is polar (dipole moment 1.85 D), hydrogen-bonded (≈4 H-bonds per molecule in bulk liquid), and amphoteric. Its high heat capacity (4.184 J·g⁻¹·K⁻¹), high heat of vaporization (2257 J·g⁻¹ at 373 K), and high dielectric constant (ε_r ≈ 78.4 at 298 K) make it ideal for:
- Solvating ions and polar molecules (hydration shells, ΔH_solv exothermic for most salts).
- Driving the hydrophobic effect — entropic burial of non-polar surfaces is the dominant force in protein folding and membrane self-assembly (Kauzmann 1959).
- Mediating enzyme catalysis (proton shuttling, nucleophilic attack, leaving-group stabilization).
1.2 pH and the ion product
Self-ionization: 2 H₂O ⇌ H₃O⁺ + OH⁻, with K_w = [H⁺][OH⁻] = 1.0 × 10⁻¹⁴ M² at 298 K. Sørensen 1909 defined pH = −log₁₀[H⁺]. Physiological pH is tightly held at 7.35-7.45 in human blood; deviations of ±0.1 pH units cause acidosis or alkalosis.
1.3 Henderson-Hasselbalch and buffers
For a weak acid HA ⇌ H⁺ + A⁻ with K_a:
pH = pKa + log₁₀([A⁻]/[HA])
(Henderson 1908, Hasselbalch 1917). Buffer capacity peaks at pH = pKa and within ±1 pH unit. Key physiological buffers:
- Bicarbonate: H₂CO₃ ⇌ H⁺ + HCO₃⁻ (pKa 6.1; coupled to pCO₂ via carbonic anhydrase; dominant blood buffer).
- Phosphate: H₂PO₄⁻ ⇌ H⁺ + HPO₄²⁻ (pKa 6.86; dominant intracellular).
- Histidine imidazole (pKa ~6.0); explains hemoglobin’s Bohr effect.
- Protein side chains collectively buffer cytoplasm.
Common lab buffers: Tris (pKa 8.06, temperature-sensitive ΔpKa/ΔT = −0.028 K⁻¹), HEPES (pKa 7.55), MES, MOPS, PIPES — the Good buffers (Good et al. 1966).
2. Amino Acids and the Peptide Bond
2.1 The 20 proteinogenic amino acids + 2 non-canonical
All α-amino acids share NH₃⁺-CHR-COO⁻ at physiological pH (zwitterion; isoelectric point pI is the pH at which net charge = 0). Side chain R determines properties:
- Non-polar aliphatic: Gly (G), Ala (A), Val (V), Leu (L), Ile (I), Pro (P), Met (M).
- Aromatic: Phe (F), Tyr (Y), Trp (W).
- Polar uncharged: Ser (S), Thr (T), Cys (C), Asn (N), Gln (Q).
- Positively charged (basic): Lys (K, pKa 10.5), Arg (R, pKa 12.5), His (H, pKa 6.0).
- Negatively charged (acidic): Asp (D, pKa 3.9), Glu (E, pKa 4.1).
Non-canonical genetically-encoded:
- Selenocysteine (Sec, U) — 21st amino acid, encoded by recoded UGA stop with SECIS element (Böck 1986); in glutathione peroxidase, thioredoxin reductase.
- Pyrrolysine (Pyl, O) — 22nd, encoded by UAG in some archaea/bacteria (Krzycki 2002).
All except Gly have a chiral α-carbon; ribosomally-synthesized proteins use L-isomers (S configuration except L-Cys which is R due to Cahn-Ingold-Prelog priority of sulfur).
2.2 The peptide bond
Condensation of two amino acids yields a peptide bond (amide) with loss of water. Pauling and Corey (1951) showed it is planar due to partial double-bond character (resonance between C=O and C-N), with ω torsion ≈ 180° (trans, ~99.9% of non-Pro residues) or 0° (cis, predominantly before Pro). Bond length 1.32 Å (vs 1.47 Å typical C-N single).
The Cα-N (φ, phi) and Cα-C (ψ, psi) torsions are rotatable. Ramachandran plot (Ramachandran, Ramakrishnan, Sasisekharan 1963) maps allowed (φ, ψ) — clusters at α-helix (φ ≈ −60°, ψ ≈ −45°), β-sheet (φ ≈ −120°, ψ ≈ +120°), and left-handed α (rare, mostly Gly).
2.3 Protein structure hierarchy
- Primary (1°): amino acid sequence, N→C, encoded in DNA.
- Secondary (2°): local H-bond patterns. α-helix (Pauling, Corey, Branson 1951; 3.6 residues/turn, 5.4 Å pitch, H-bond i→i+4); β-sheet (parallel/antiparallel; H-bond between strands); 3₁₀-helix (i→i+3); π-helix (i→i+5); turns (β-turns: types I-VIII, 4 residues).
- Tertiary (3°): global 3D fold of single chain. Stabilized by hydrophobic core, disulfide bonds (Cys-Cys, oxidative), salt bridges, H-bonds.
- Quaternary (4°): assembly of multiple subunits (hemoglobin α₂β₂; ATP synthase ~30 subunits; ribosome ~80).
2.4 Folds, domains, and motifs
A domain is an independently folding unit (50-250 residues). Catalogs: SCOP (Murzin 1995), CATH (Orengo 1997), Pfam/InterPro, ECOD. Iconic folds:
- TIM barrel — (β/α)₈ (triose phosphate isomerase, ~10% of all enzymes).
- Rossmann fold — βαβαβ binds NAD(P)/FAD; dehydrogenases.
- Greek key, jelly roll (viral capsids, immunoglobulins).
- Immunoglobulin fold — β-sandwich; antibodies, cell-surface receptors.
- Helix-turn-helix, zinc finger, leucine zipper, helix-loop-helix — DNA-binding motifs.
- WD40 propeller, ankyrin repeat, armadillo, TPR — scaffolding.
2.5 Protein-protein interactions (PPI)
~650,000 PPIs in human proteome (Stumpf 2008 estimate, refined by AlphaFold-Multimer). Interfaces typically 1,200-2,000 Ų with “hot spot” residues contributing most ΔG (Clackson, Wells 1995). Methods: yeast two-hybrid (Fields, Song 1989), co-IP/MS, BioID, proximity labeling (TurboID), AlphaFold-Multimer screens (Humphreys 2021).
2.6 Intrinsically disordered proteins (IDPs)
~30% of eukaryotic proteome lacks stable 3D structure under native conditions (Dunker 2001, Tompa 2002). Enriched in P, E, S, K, A; depleted in W, C, F, I, Y, V. Function via:
- Coupled folding-upon-binding (e.g. p53 transactivation domain).
- Polyvalent low-affinity interactions driving liquid-liquid phase separation (LLPS, biomolecular condensates; Brangwynne, Hyman 2009; P-bodies, stress granules, nucleoli).
- Hub roles in signaling (BRCA1, p53, α-synuclein — also implicated in Parkinson’s aggregation).
Databases: DisProt, MobiDB. Predictors: PONDR, IUPred, AlphaFold pLDDT < 50 correlates with disorder.
3. Enzymes and Kinetics
3.1 Michaelis-Menten kinetics
Michaelis and Menten (1913), refined by Briggs and Haldane (1925) with the steady-state assumption:
E + S ⇌ ES → E + P v = V_max · [S] / (K_M + [S])
where V_max = k_cat · [E]₀ and K_M = (k₋₁ + k_cat)/k₁. k_cat (turnover number) is reactions per active site per second; K_M approximates substrate affinity (lower = tighter). The specificity constant k_cat/K_M measures catalytic efficiency; the diffusion limit is ~10⁸-10⁹ M⁻¹s⁻¹ (“catalytic perfection” — superoxide dismutase 7×10⁹, carbonic anhydrase 1.5×10⁹, triose phosphate isomerase 2.4×10⁸; Knowles, Albery 1976).
3.2 Linearizations (historical but still pedagogically used)
- Lineweaver-Burk (1934, double reciprocal): 1/v = (K_M/V_max)·(1/[S]) + 1/V_max. Distorts errors; superseded for fitting.
- Eadie-Hofstee: v = V_max − K_M·(v/[S]).
- Hanes-Woolf: [S]/v = (1/V_max)·[S] + K_M/V_max.
Modern practice: nonlinear least-squares fit to the rectangular hyperbola (e.g. GraphPad Prism, DynaFit, KinTek Explorer).
3.3 Enzyme Commission (EC) classification
IUBMB EC numbers (Enzyme Commission, est. 1956; 7 top-level classes after 2018 expansion):
- EC 1 Oxidoreductases — electron transfer (dehydrogenases, oxidases, peroxidases, reductases).
- EC 2 Transferases — group transfer (kinases, transaminases, methyltransferases).
- EC 3 Hydrolases — bond cleavage by water (proteases, lipases, glycosidases, phosphatases).
- EC 4 Lyases — non-hydrolytic, non-oxidative bond breakage (decarboxylases, aldolases).
- EC 5 Isomerases — intramolecular rearrangement (racemases, mutases, topoisomerases).
- EC 6 Ligases — bond formation coupled to ATP (synthetases, aminoacyl-tRNA synthetases).
- EC 7 Translocases — ion/molecule transport across membranes (added 2018; ATP synthase, Na⁺/K⁺-ATPase).
3.4 Catalytic strategies
- Proximity and orientation (effective concentration ~10⁵ M).
- Transition-state stabilization (Pauling 1948) — basis for transition-state-analog inhibitors and catalytic antibodies (Lerner, Schultz 1986).
- General acid/base catalysis (His, Glu, Asp, Lys, Cys).
- Covalent catalysis (Ser proteases, cysteine proteases, thiamine, PLP).
- Metal-ion catalysis (Zn in carbonic anhydrase + carboxypeptidase, Mg in kinases, Fe-S in oxidoreductases).
- Strain/distortion of substrate (lysozyme; Phillips 1965 mechanism).
3.5 Allosteric regulation
MWC (Monod-Wyman-Changeux 1965) — concerted/symmetry model: oligomer exists in T (tense) and R (relaxed) states; substrate binding shifts equilibrium. Predicts cooperativity (Hill coefficient n_H > 1; hemoglobin n_H ≈ 2.8-3.0).
KNF (Koshland-Némethy-Filmer 1966) — sequential/induced-fit model: each subunit changes conformation upon binding; intermediate states populated.
Real enzymes blend both. Examples: aspartate transcarbamoylase (ATCase, Lipscomb 1976 Nobel for structure), phosphofructokinase-1, glycogen phosphorylase.
3.6 Covalent post-translational modifications (PTMs)
Reversible and irreversible covalent regulation expands the proteome’s functional space ~100-fold beyond gene count:
- Phosphorylation — Ser/Thr/Tyr by ~520 human kinases; reversed by ~200 phosphatases (PP1, PP2A, PTPs). Krebs and Fischer (1992 Nobel) discovered reversible phosphorylation in glycogen phosphorylase.
- Acetylation — Lys-ε-N by HATs (p300/CBP, GCN5); reversed by HDACs/sirtuins. Histone code (Strahl, Allis 2000).
- Methylation — Lys, Arg by PRMTs and KMTs (using SAM); reversed by LSD1, JmjC.
- Ubiquitination — 76-residue Ub conjugated to Lys via E1/E2/E3 cascade (Hershko, Ciechanover, Rose 2004 Nobel); polyUb K48 → 26S proteasome; K63 → signaling/repair.
- SUMOylation — Small Ub-like Modifier; alters localization and PPIs.
- Glycosylation — N-linked (Asn-X-Ser/Thr sequon, ER/Golgi) and O-linked (Ser/Thr); also O-GlcNAc nutrient sensor (Hart 1984).
- Lipidation — myristoylation (N-Gly), palmitoylation (Cys), prenylation (farnesyl/geranylgeranyl on CAAX motif).
- ADP-ribosylation — PARPs in DNA damage response.
3.7 Enzyme inhibition
- Competitive — binds active site, competes with S; ↑K_M, V_max unchanged. K_i = [E][I]/[EI].
- Non-competitive — binds elsewhere, lowers k_cat; ↓V_max, K_M unchanged (true non-comp rare).
- Uncompetitive — binds ES only; ↓V_max and ↓K_M proportionally; lithium on inositol phosphatases.
- Mixed — both E and ES; ↓V_max and Δ K_M.
- Irreversible — covalent. Suicide/mechanism-based inactivators (Abeles 1976): aspirin (acetylates COX-1/2 Ser530), penicillin (acylates transpeptidase Ser), DFP (Ser proteases), allopurinol (xanthine oxidase).
4. Cofactors and Coenzymes
Cofactors are non-protein chemical components required for activity. Coenzymes are organic cofactors (often vitamin-derived). Prosthetic groups are tightly/covalently bound.
| Cofactor | Carries | Vitamin source | Key reactions |
|---|---|---|---|
| NAD⁺/NADH | 2 e⁻ + H⁺ (hydride) | Niacin (B3) | Catabolic oxidations (glycolysis GAPDH, TCA, β-oxidation) |
| NADP⁺/NADPH | hydride | Niacin (B3) | Anabolic reductions, pentose phosphate pathway, antioxidant defense |
| FAD/FADH₂, FMN | 2 e⁻ ± H⁺ | Riboflavin (B2) | Succinate DH (Complex II), MAO, glutathione reductase |
| Coenzyme A (CoA-SH) | acyl groups | Pantothenate (B5) | Acetyl-CoA, fatty acid metabolism |
| ATP/ADP/AMP | phosphoryl, energy currency | adenine + ribose + Pi | Kinases, ATPases, biosynthesis |
| GTP | phosphoryl | guanine + ribose + Pi | G proteins, protein synthesis, gluconeogenesis (PEPCK) |
| SAM (S-adenosylmethionine) | methyl | Met + ATP | Methylation of DNA, histones, small molecules; “biological methyl donor” |
| PLP (pyridoxal-5’-phosphate) | amino groups (Schiff base) | Pyridoxine (B6) | Transaminases, decarboxylases, racemases, glycogen phosphorylase |
| Biotin | CO₂ | Biotin (B7) | Carboxylases (pyruvate carboxylase, acetyl-CoA carboxylase) |
| TPP (thiamine pyrophosphate) | aldehyde/ketol | Thiamine (B1) | PDH, α-KG-DH, transketolase, pyruvate decarboxylase |
| Lipoic acid | acyl + reducing equivalents | de novo or diet | PDH, α-KGDH, BCKDH, glycine cleavage |
| Tetrahydrofolate (THF, FH₄) | 1-C units (methyl, methylene, formyl) | Folate (B9) | Nucleotide synthesis, methionine cycle |
| Cobalamin (B12) | methyl, alkyl radicals | Cobalamin (B12) | Methionine synthase, methylmalonyl-CoA mutase |
| Ascorbate (vitamin C) | reducing agent | Diet (humans lack GULO) | Collagen prolyl/lysyl hydroxylases, antioxidant |
| Heme | O₂, e⁻ (Fe²⁺/Fe³⁺) | porphyrin synthesis | Hemoglobin, cytochromes, catalase, cyt P450 |
| Fe-S clusters | e⁻ | Fe + S²⁻ | ETC Complexes I, II, III; aconitase |
| Coenzyme Q (ubiquinone) | 2 e⁻ + 2 H⁺ | mevalonate pathway | ETC mobile carrier |
| Cytochrome c | 1 e⁻ | heme c | ETC Complex III → IV; apoptosis trigger |
5. Carbohydrates
5.1 Monosaccharides
CₙH₂ₙOₙ. Trioses (glyceraldehyde, dihydroxyacetone), tetroses (erythrose), pentoses (ribose, deoxyribose, xylose, arabinose, ribulose), hexoses (glucose, fructose, galactose, mannose), heptoses (sedoheptulose). Aldose vs ketose (CHO vs C=O).
Fischer projection (Emil Fischer 1891 Nobel ‘02): straight-chain stereochemistry; D/L by C farthest from carbonyl (D-glucose dominates in nature). Haworth projection (Walter Haworth 1937 Nobel): cyclic pyranose (6-ring) or furanose (5-ring) after intramolecular hemiacetal formation.
Anomers: α (OH at C1 trans to C6 in pyranose, axial in D-glucose) vs β (cis, equatorial). Mutarotation — interconversion through open-chain form via the anomeric carbon; equilibrium α-D-glucose 36%, β-D-glucose 64%; rate catalyzed by mutarotase.
5.2 Disaccharides and polysaccharides
Glycosidic bonds form by condensation. Examples:
- Sucrose (Glc-α1,2-β-Fru), lactose (Gal-β1,4-Glc), maltose (Glc-α1,4-Glc).
- Starch — amylose (linear α1,4) + amylopectin (α1,4 + α1,6 branches every ~25 residues).
- Glycogen — like amylopectin but branched every 8-12 residues; mammalian glucose store.
- Cellulose — β1,4-glucose, planar sheets, H-bonded microfibrils; humans lack cellulase.
- Chitin — β1,4-GlcNAc; arthropod exoskeleton, fungal cell walls.
5.3 Glycoproteins and glycosaminoglycans (GAGs)
N-linked (Asn) glycans (high-mannose, complex, hybrid) attached in ER via dolichol-P-P-oligosaccharide. O-linked (Ser/Thr) initiated in Golgi. Functions: folding chaperoning (calnexin/calreticulin cycle), cell-cell recognition (selectins, sialic acid), serum half-life (asialoglycoprotein receptor), blood groups ABO.
GAGs: long repeating disaccharides of amino sugar + uronic acid. Heparin/heparan sulfate (anticoagulant, antithrombin III activation), hyaluronan (extracellular space, joints), chondroitin sulfate (cartilage), keratan sulfate, dermatan sulfate. Cross-ref [[Biology/cell-molecular-biology]].
6. Lipids
6.1 Fatty acids
Long-chain carboxylic acids. Saturated (palmitic 16:0, stearic 18:0) vs unsaturated (oleic 18:1 ω-9, linoleic 18:2 ω-6, α-linolenic 18:3 ω-3, arachidonic 20:4 ω-6, EPA 20:5 ω-3, DHA 22:6 ω-3). ω-3 and ω-6 are essential. cis double bonds are normal; trans-fats (industrial hydrogenation, Procter & Gamble’s Crisco 1911) are atherogenic and largely banned post-2018 in US/EU.
6.2 Lipid classes
- Triacylglycerols (TAG) — glycerol + 3 fatty acids; energy storage in adipocytes (~9 kcal/g vs 4 kcal/g for carbs).
- Phospholipids — glycerophospholipids (PC, PE, PS, PI, PG, cardiolipin) and sphingophospholipids (sphingomyelin). Amphipathic; form lipid bilayers (Singer-Nicolson fluid mosaic 1972).
- Glycolipids — cerebrosides, gangliosides (GM1, GM2, GM3; Tay-Sachs lacks hexosaminidase A → GM2 accumulation).
- Cholesterol — sterol nucleus (3 cyclohexane + 1 cyclopentane); membrane fluidity modulator; precursor to bile acids, steroid hormones, vitamin D. Synthesized via mevalonate/HMG-CoA reductase (statin target; Brown & Goldstein 1985 Nobel for LDL receptor).
- Steroid hormones — glucocorticoids (cortisol), mineralocorticoids (aldosterone), androgens (testosterone), estrogens (estradiol), progestins.
- Eicosanoids — 20-C signaling lipids from arachidonate: prostaglandins, thromboxanes (COX-1/2 → aspirin target; Vane 1982 Nobel), leukotrienes (LOX).
- Sphingolipids — sphingosine backbone; ceramide is central. Ceramide and sphingosine-1-phosphate are bioactive (apoptosis vs survival rheostat; Hannun, Obeid 2008).
- Lipoproteins — chylomicrons, VLDL, IDL, LDL, HDL; transport hydrophobic lipids in plasma.
6.3 Membrane biophysics
Bilayer thickness ~4-5 nm. Asymmetry maintained by flippases (ATP-dependent), floppases, scramblases. Lipid rafts — cholesterol + sphingolipid microdomains, ordered (Lo) phase (Simons, Ikonen 1997). Membrane proteins: integral (single-pass, multi-pass, β-barrel), peripheral, lipid-anchored (GPI, prenyl).
7. Nucleic Acids
7.1 Building blocks
Purines — adenine (A), guanine (G); fused 5+6 ring. Pyrimidines — cytosine (C), thymine (T, in DNA), uracil (U, in RNA); 6-ring.
Nucleoside = base + sugar (N-glycosidic, β configuration; ribose in RNA, 2’-deoxyribose in DNA). Nucleotide = nucleoside + 1-3 phosphates (5’-mono/di/triphosphate). Energy carriers (ATP, GTP), regulators (cAMP, cGMP), signaling (NAD⁺, FAD, CoA, SAM all built on AMP).
7.2 Phosphodiester backbone
5’→3’ polymer of nucleotides; backbone 5’-phosphate to 3’-OH via phosphodiester. Negatively charged. Watson-Crick base pairing (1953): A=T (2 H-bonds), G≡C (3 H-bonds). Double helix B-form: 10.5 bp/turn, 3.4 Å rise per bp, 23.7 Å diameter, major + minor grooves.
A-form (RNA + DNA-RNA hybrids), Z-form (left-handed, alternating purine-pyrimidine, GC-rich). Tertiary structures: tRNA cloverleaf → L-shape, ribozymes, riboswitches, G-quadruplexes (telomeres, oncogene promoters).
Cross-ref: [[Biology/genetics-and-genomics]] (replication, transcription, translation, central dogma Crick 1958).
8. Metabolism
Metabolism = catabolism (degradative, energy-yielding, oxidative) + anabolism (biosynthetic, energy-consuming, reductive). Pathways are organized around shared intermediates (pyruvate, acetyl-CoA, α-KG, OAA) and currencies (ATP, NADH, NADPH, FADH₂).
8.1 Glycolysis (Embden-Meyerhof-Parnas, EMP)
Cytosolic; one glucose → two pyruvate; net 2 ATP + 2 NADH + 2 H₂O. Embden, Meyerhof, Parnas elucidated 1930s. Ten steps:
- Hexokinase (HK1-4; HK4 = glucokinase, liver/β-cell) — Glc + ATP → Glc-6-P + ADP. Irreversible.
- Phosphoglucose isomerase — G6P → F6P.
- Phosphofructokinase-1 (PFK-1) — F6P + ATP → F1,6BP. Rate-limiting; allosterically inhibited by ATP, citrate; activated by AMP, F2,6BP.
- Aldolase — F1,6BP → DHAP + GAP.
- Triose phosphate isomerase (TPI) — DHAP ⇌ GAP.
- GAPDH — GAP + NAD⁺ + Pi → 1,3-BPG + NADH. Substrate-level couples redox to phosphorylation.
- Phosphoglycerate kinase — 1,3-BPG + ADP → 3-PG + ATP. Substrate-level phosphorylation.
- Phosphoglycerate mutase — 3-PG → 2-PG.
- Enolase — 2-PG → PEP + H₂O.
- Pyruvate kinase (PK; PKM2 in cancer/proliferating cells — Warburg effect) — PEP + ADP → Pyruvate + ATP. Irreversible.
Fate of pyruvate: aerobic → PDH → acetyl-CoA → TCA; anaerobic → lactate (lactate DH; muscle) or ethanol + CO₂ (yeast, pyruvate decarboxylase + ADH).
8.2 Gluconeogenesis
Reverse net flux from pyruvate/lactate/glycerol/amino acids → glucose. Liver + kidney cortex. Bypasses three irreversible glycolysis steps:
- Pyruvate carboxylase (mitochondrial, biotin) + PEPCK (cytosolic, GTP).
- F1,6BPase.
- G6Pase (ER lumen).
Cori cycle — muscle lactate → liver glucose; Carl & Gerty Cori 1947 Nobel.
8.3 Pentose phosphate pathway (PPP)
Cytosolic; parallel to glycolysis. Oxidative arm: G6P → 6-PG → ribulose-5-P + CO₂ + 2 NADPH (G6PDH = rate-limiting; G6PDH deficiency causes hemolytic anemia, favism). Non-oxidative arm: transketolase (TPP) + transaldolase interconvert C3-C7 sugars; provides ribose-5-P for nucleotides.
8.4 TCA cycle (Krebs / citric acid cycle)
Hans Krebs 1937, Nobel 1953. Mitochondrial matrix. Per acetyl-CoA: 3 NADH + 1 FADH₂ + 1 GTP + 2 CO₂. Per glucose: 2 turns.
Steps: Acetyl-CoA + OAA → citrate (citrate synthase) → isocitrate (aconitase) → α-KG + CO₂ (isocitrate DH, NAD⁺) → succinyl-CoA + CO₂ (α-KGDH complex, like PDH) → succinate + GTP (succinyl-CoA synthetase) → fumarate (succinate DH = Complex II) → malate (fumarase) → OAA (malate DH).
Anaplerotic reactions replenish intermediates: pyruvate carboxylase (Pyr → OAA), glutamate DH (Glu → α-KG). Cataplerotic: gluconeogenesis, biosynthesis pull intermediates out.
8.5 Electron transport chain (ETC)
Inner mitochondrial membrane. Electrons flow from NADH/FADH₂ to O₂ through 4 complexes, pumping H⁺ across the inner membrane to build the proton motive force (Δp = ΔpH·2.303RT/F + Δψ).
- Complex I (NADH:ubiquinone oxidoreductase) — 45 subunits, 7 mtDNA-encoded; FMN + 8 Fe-S; pumps 4 H⁺ per 2 e⁻.
- Complex II (succinate DH) — TCA enzyme; FAD + 3 Fe-S + heme b; no H⁺ pumping.
- Complex III (cyt bc₁) — Q cycle (Mitchell); pumps 4 H⁺ per 2 e⁻ (2 from QH₂ + 2 net).
- Cytochrome c — mobile peripheral carrier.
- Complex IV (cytochrome c oxidase) — heme a, a₃, Cu_A, Cu_B; reduces O₂ → 2 H₂O; pumps 2 H⁺ per 2 e⁻.
Inhibitors: rotenone (I), antimycin A (III), cyanide/azide/CO (IV), oligomycin (ATP synthase F₀), DNP (uncoupler).
Chemiosmotic theory — Peter Mitchell 1961, Nobel 1978. Initially rejected, validated by 1970s liposome reconstitution.
8.6 Oxidative phosphorylation and ATP synthase
ATP synthase (Complex V, F₁F₀-ATPase) couples 8-10 H⁺ flow back through F₀ rotor (c-ring) to rotation that drives ADP + Pi → ATP in F₁ catalytic head (α₃β₃). Binding change mechanism — Paul Boyer 1979, structural basis from John Walker’s crystal structure (Boyer + Walker 1997 Nobel).
P/O ratio: ~2.5 ATP per NADH, ~1.5 ATP per FADH₂ (revised down from older 3 + 2 figures). Per glucose aerobically: ~30-32 ATP.
8.7 Photosynthesis
Plants, algae, cyanobacteria. Two stages:
Light reactions (thylakoid membrane). Z-scheme (Hill, Bendall 1960): PSII (P680, water-splitting Mn₄CaO₅ cluster, Joliot-Kok S-state cycle) → plastoquinone → cyt b₆f → plastocyanin → PSI (P700) → ferredoxin → FNR → NADPH. ATP synthase uses thylakoid pmf to make ATP.
Calvin-Benson cycle (stroma; Calvin, Benson, Bassham 1950s; Calvin Nobel 1961). 3 phases:
- Carboxylation — RuBisCO (ribulose-1,5-bisphosphate carboxylase/oxygenase; most abundant enzyme on Earth, ~50 Gt globally) fixes CO₂ to RuBP → 2× 3-PGA.
- Reduction — 3-PGA → G3P using ATP + NADPH.
- Regeneration — 5 G3P → 3 RuBP (uses ATP).
Photorespiration — RuBisCO oxygenase activity: RuBP + O₂ → 3-PGA + 2-phosphoglycolate; wasteful in hot dry conditions.
C4 plants (maize, sugarcane; Hatch-Slack pathway 1966) — PEP carboxylase fixes CO₂ in mesophyll → malate/aspartate shuttled to bundle sheath, releases CO₂ near RuBisCO. CAM (cacti, pineapple) — temporal separation: night CO₂ fixation via PEPC, day decarboxylation behind closed stomata.
8.8 Fatty acid β-oxidation
Mitochondrial (long-chain via carnitine shuttle / CPT1-CPT2). Per cycle (4 steps): acyl-CoA → trans-Δ²-enoyl-CoA (acyl-CoA DH, FAD) → L-3-hydroxyacyl-CoA (hydratase) → 3-ketoacyl-CoA (HAD, NAD⁺) → acetyl-CoA + acyl-CoA shortened by 2 C (thiolase). Palmitate (C16) → 8 acetyl-CoA + 7 FADH₂ + 7 NADH → ~106 ATP net.
Variants: peroxisomal (very long chain, branched), α-oxidation (phytanic acid; Refsum disease), ω-oxidation.
8.9 Ketogenesis
Liver mitochondria during fasting/diabetes/keto diet. Acetyl-CoA → acetoacetyl-CoA → HMG-CoA → acetoacetate → β-hydroxybutyrate (+ minor acetone). Exported to brain, heart, muscle as alternative fuel.
8.10 Amino acid metabolism
Transamination — α-amino + α-keto interconversion (PLP-dependent; ALT, AST clinical markers). Oxidative deamination — glutamate DH releases NH₄⁺. Urea cycle (liver; Krebs-Henseleit 1932) — 5 enzymes (CPS-I, OTC, ASS, ASL, arginase) detoxify NH₄⁺ → urea; urea cycle defects → hyperammonemia.
Glucogenic vs ketogenic amino acids. Essential (in humans, mnemonic PVT TIM HALL): Phe, Val, Thr, Trp, Ile, Met, His, Arg (conditionally), Leu, Lys. Branched-chain (BCAAs: Val, Leu, Ile) catabolized by BCKDH (defect → maple syrup urine disease).
8.11 Nucleotide metabolism
De novo purine synthesis — 11 steps building purine ring on PRPP; produces IMP → AMP or GMP. Glutamine, glycine, aspartate, CO₂, formate (from THF) are precursors. Methotrexate, 6-MP, allopurinol target purine biosynthesis/salvage.
De novo pyrimidine synthesis — pyrimidine ring built first then attached to PRPP; UMP → UTP, CTP, dTMP (via thymidylate synthase + dihydrofolate reductase — methotrexate, 5-FU targets).
Salvage — HGPRT (purines; Lesch-Nyhan from deficiency), APRT, TK1/TK2. Critical because de novo is energetically expensive.
9. Bioenergetics
9.1 Gibbs free energy
ΔG = ΔG°’ + RT ln Q
where Q = product/reactant ratio at non-standard state; ΔG°’ at 298 K, pH 7, 1 M, 1 atm. ΔG < 0 spontaneous; ΔG = 0 equilibrium; ΔG > 0 non-spontaneous.
9.2 ATP and the high-energy phosphate
ATP hydrolysis ΔG°’ = −30.5 kJ/mol (ATP + H₂O → ADP + Pi); in cells with [ATP]/[ADP][Pi] ≈ 500, actual ΔG ≈ −50 to −60 kJ/mol. Other “high-energy” compounds: PEP (−61.9), 1,3-BPG (−49.4), creatine-P (−43.1), acetyl-CoA (−31.5).
Energy coupling — endergonic reactions are driven by ATP hydrolysis (or NTPs) via shared phosphorylated/activated intermediates. Hexokinase: Glc + ATP → Glc-6-P + ADP; overall ΔG°’ = −16.7 kJ/mol couples (Glc + Pi → G6P, +13.8) with (ATP → ADP + Pi, −30.5).
9.3 Reduction potentials
Standard ΔE°’ values (V at pH 7): NAD⁺/NADH −0.32, FAD/FADH₂ (free) −0.22, UQ/UQH₂ +0.045, cyt c (Fe³⁺/²⁺) +0.254, ½O₂/H₂O +0.816. Spontaneous flow toward higher E°’; ΔG°’ = −nFΔE°‘.
9.4 Membrane transport
- Passive diffusion — down gradient, no protein (O₂, CO₂, small uncharged).
- Facilitated diffusion — down gradient, via channel or carrier (GLUTs for glucose, aquaporins for water — Agre 2003 Nobel).
- Primary active transport — ATP-driven against gradient (Na⁺/K⁺-ATPase exports 3 Na⁺ / imports 2 K⁺ per ATP, Skou 1997 Nobel; Ca²⁺-ATPase / SERCA; H⁺/K⁺-ATPase gastric; ABC transporters: P-gp, CFTR, MRP).
- Secondary active transport — gradient-driven cotransport. Symport same direction (SGLT1 Na⁺/glucose), antiport opposite (Na⁺/Ca²⁺ exchanger, NHE Na⁺/H⁺).
10. Signaling
Cell signaling propagates information from extracellular cues to gene expression and behavior. Major systems (overlap with [[Biology/cell-molecular-biology]]):
- GPCRs (G-protein coupled receptors) — 7-TM helical bundle; ~800 in humans, ~30% of drug targets. Ligand binding → GDP/GTP exchange on Gα → effectors (AC, PLC, ion channels). Kobilka, Lefkowitz 2012 Nobel.
- Kinase cascades — RTKs (EGFR, insulin receptor) → Ras → Raf → MEK → ERK (MAPK); PI3K → Akt → mTOR. Stress-activated p38, JNK.
- Second messengers:
- cAMP (Sutherland 1971 Nobel) — AC from ATP; activates PKA.
- cGMP — NO/sGC → cGMP → PKG (vasodilation; sildenafil inhibits PDE5).
- Ca²⁺ — ER store + extracellular; sensed by calmodulin, troponin C; activates CaMK, PKC, calcineurin.
- IP₃ + DAG — PLCβ/γ cleaves PIP₂; IP₃ opens ER Ca²⁺ channels; DAG activates PKC. Berridge 1984.
- cADPR, NAADP — Ca²⁺ release.
- JAK-STAT — cytokine receptors → JAK transphosphorylates → STAT dimerizes → nucleus → gene transcription.
- Wnt — Frizzled + LRP5/6 → β-catenin stabilization → TCF/LEF transcription.
- TGF-β / SMAD, Notch (RIP cleavage), Hedgehog (Patched/Smoothened/Gli).
- Nuclear receptors — ligand-gated TFs (steroid, thyroid, retinoid, vitamin D, PPARγ).
11. Structural Biology Methods
- X-ray crystallography — Bragg 1915 Nobel; first protein structure: myoglobin (Kendrew 1958) and hemoglobin (Perutz 1959), both Nobel 1962. Modern XFEL (LCLS, European XFEL) enables femtosecond serial crystallography of dynamics + radiation-sensitive metalloenzymes.
- NMR spectroscopy — Wüthrich 2002 Nobel for protein NMR. Solution structures of < 35 kDa proteins; dynamics on ps-µs-ms timescales (relaxation dispersion, CPMG). Solid-state ssNMR for membrane proteins, amyloids.
- Cryo-EM — Henderson, Frank, Dubochet 2017 Nobel (“resolution revolution”; direct electron detectors + Bayesian software RELION/cryoSPARC pushed cryo-EM from ~10 Å to 2-3 Å routinely, sub-2 Å for select targets). Single-particle, cryo-electron tomography (CET), MicroED for crystals.
- Mass spectrometry — native MS (mass + assembly), HDX-MS (dynamics), XL-MS (crosslinking), top-down MS for proteoforms.
- SAXS/SANS — low-res envelopes, multi-state ensembles for IDPs.
- Single-molecule — FRET, optical tweezers (Ashkin 2018 Nobel), magnetic tweezers, AFM.
- AlphaFold-2 (Jumper et al. DeepMind 2021; Hassabis + Jumper 2024 Nobel Chemistry, shared with David Baker) — transformer-based structure prediction at near-experimental accuracy for single chains; ~200M+ predicted structures in AlphaFold DB.
- AlphaFold-3 (May 2024 Nature) — multi-modal joint prediction of protein-protein, protein-nucleic acid, protein-ligand, protein-ion, with covalent modifications. Diffusion-based architecture. Initially restricted (web-only); weights released for academic use Nov 2024.
12. Enzymology in Industry
- Detergents — proteases (subtilisin, Novozymes Savinase / Alcalase; Genencor Purafect), amylases (Termamyl), lipases (Lipolase), cellulases for color care. Engineered for high pH + temperature + surfactant tolerance.
- Starch processing — α-amylase + glucoamylase + glucose isomerase (xylose isomerase) → high-fructose corn syrup (HFCS). Multi-billion-dollar industrial process.
- Brewing/baking — α-amylase, β-glucanase, protease, xylanase; yeast pyruvate decarboxylase + ADH for ethanol.
- Biofuels — cellulases (Trichoderma reesei; Novozymes Cellic CTec series), hemicellulases for lignocellulosic ethanol. Iogen, POET-DSM commercial plants.
- Pharmaceutical chiral synthesis — engineered transaminases (Codexis sitagliptin process; replaced Rh catalysis 2010), ketoreductases, halohydrin dehalogenases, P450s. Codexis evolved transaminase via ~12 rounds of directed evolution.
- Directed evolution — Frances Arnold 2018 Nobel Chemistry (with Smith + Winter for phage display). Iterative mutagenesis + screening/selection; underpins industrial enzyme + AAV + antibody engineering.
- Food/dairy — chymosin (rennet; first GMO food enzyme, Pfizer 1990), lactase (lactose-free dairy), invertase (confectionery), pectinase (juice clarification).
- Diagnostics — Taq polymerase (Cetus 1988, basis of PCR; Mullis 1993 Nobel), reverse transcriptase, glucose oxidase (glucometer strips), uricase, urease, HRP/AP for ELISA.
13. Clinical Biochemistry
13.1 Routine panels
- CMP (comprehensive metabolic panel) — glucose, BUN, creatinine, eGFR, Na⁺, K⁺, Cl⁻, HCO₃⁻, Ca²⁺, albumin, total protein, ALT, AST, ALP, total bilirubin.
- Lipid panel — total cholesterol, LDL-C (Friedewald or direct), HDL-C, triglycerides, non-HDL-C, ApoB increasingly preferred.
- LFT (liver function) — ALT, AST, ALP, GGT, bilirubin (direct/indirect), albumin, PT/INR. AST/ALT > 2 suggests alcohol; ALP+GGT cholestatic.
- RFT (renal function) — creatinine, BUN, eGFR (CKD-EPI 2021 race-free), cystatin C, urine ACR.
- CBC with diff (hematology adjacent).
- HbA1c — glycated hemoglobin; integrates ~3-month glycemia; diagnosis at ≥6.5%, prediabetes 5.7-6.4%.
13.2 Cardiac + endocrine + tumor markers
- Troponin I/T — cardiac myonecrosis; hs-cTn assays (Roche, Abbott) detect ng/L; 4th universal definition of MI (Thygesen 2018).
- BNP/NT-proBNP — heart failure.
- CK-MB (older), myoglobin (early but non-specific).
- TSH, free T4, free T3 — thyroid.
- PSA (prostate), CA 19-9 (pancreatic), CA 125 (ovarian), CEA (colorectal), AFP (HCC, germ-cell), β-hCG (pregnancy, trophoblastic).
- CRP / hs-CRP — inflammation/CV risk.
- D-dimer — VTE rule-out.
13.3 Inborn errors of metabolism (IEM)
Newborn screening (Guthrie 1963 PKU; expanded MS/MS panels post-1990s).
- PKU (PAH deficiency) — phenylalanine accumulation.
- Galactosemia (GALT).
- MCAD deficiency — fatty acid oxidation.
- Tay-Sachs (HexA), Gaucher (GBA), Niemann-Pick (SMPD1).
- MSUD (BCKDH), homocystinuria (CBS), tyrosinemia.
- Urea cycle: OTC deficiency (X-linked, most common).
Enzyme replacement therapy (Cerezyme/imiglucerase 1994), substrate reduction therapy, gene therapy (Luxturna 2017, Zolgensma 2019, Casgevy 2023 — first CRISPR therapy approved).
14. 2024-26 AI in Biochemistry
The 2024-26 period saw AI move from prediction to design and from single-protein to multi-modal complex prediction. Key tools:
- AlphaFold-3 (Abramson et al., DeepMind + Isomorphic Labs, Nature 2024-05) — diffusion-based joint prediction of proteins + nucleic acids + small-molecule ligands + ions + covalent modifications. Outperforms physics-based docking (AutoDock Vina, GOLD) on PoseBusters. Released as AlphaFold Server (free academic), code+weights Nov 2024 with non-commercial license. Cross-ref
[[Compute/transformer-architecture]]. - AlphaProteo (DeepMind, Aug 2024) — de novo protein binder design from target structure; 3-300× higher experimental hit rate than baselines (RFdiffusion + ProteinMPNN); validated against SARS-CoV-2 spike RBD, VEGF-A, BHRF1, TrkA, IL-7Rα, PD-L1.
- AlphaMissense (Cheng et al. 2023 Science) — pathogenicity prediction for all ~71M possible human missense variants; classified 32% likely-pathogenic, 57% likely-benign; integrated into ClinVar workflows.
- AlphaFold-Multimer (Evans 2021, Nature Methods 2022) — protein complex prediction; AF-3 supersedes for most use cases.
- ESM-2 and ESM-3 (Meta FAIR / EvolutionaryScale 2023-24) — protein language models. ESM-2 up to 15B params; ESMFold for fast inference (~6× AF-2 speed, lower accuracy). ESM-3 (June 2024) — generative multi-modal foundation model trained on sequence + structure + function tokens; “ESM3 designed esmGFP” — fluorescent protein 58% identity to closest natural GFP.
- RFdiffusion (Watson, Juergens, Bennett et al., Baker lab, Nature 2023; refined 2024) — generative protein design via diffusion on backbone coordinates; binder + symmetric oligomer + scaffolding design. RFdiffusion-AA (all-atom, 2024) — designs binders to small molecules.
- ProteinMPNN (Dauparas et al., Baker 2022) — inverse folding, sequence design from backbone. Standard companion to RFdiffusion.
- Chroma (Generate Biomedicines, 2023) — diffusion-based protein generative model with programmable conditioning.
- Boltz-1 (MIT, Wohlwend, Corso et al., Oct 2024) — first open-source AlphaFold-3-equivalent; MIT license, code + weights free. Boltz-2 (2025) — adds affinity prediction and improved ligand handling; community-driven alternative to AF-3 for commercial use.
- Chai-1 (Chai Discovery 2024) — open multi-modal model (free for non-commercial, API for commercial).
- Profluent’s OpenCRISPR-1 (Ruffolo et al. 2024) — first AI-designed CRISPR-Cas9 family nuclease; functional in mammalian cells; protein language model trained on CRISPR-Cas operon database; released open-source as a precedent for AI-generated gene editors.
- Generate Biomedicines — Chroma + machine-learning drug discovery platform; clinical candidates in progress.
- Cradle.bio (Delft, 2021-) — protein engineering SaaS; ML-guided directed evolution for enzymes.
- Isomorphic Labs (Alphabet spin-out 2021) — drug discovery pipeline using AlphaFold-3; partnerships with Eli Lilly + Novartis announced 2024; first clinical candidates expected 2026.
- Cradle, Cradle-NeuralPLexer, DiffDock-PP, NeuralPLexer-3 — docking + co-folding tools growing rapidly.
- Evo, Evo-2 (Arc Institute + Stanford 2024-25) — DNA/genomic language model; predicts mutational effects across coding + non-coding; designs functional DNA sequences.
- Lab automation + ML — Insitro, Recursion, Schrödinger combine high-throughput biology with ML; Recursion + NVIDIA BioNeMo partnerships.
Practical impact by 2026: AI-designed binders enter clinical trials (multiple in I/II); AlphaFold-3 the default starting point for structure-based drug design; lab effort shifts from “what is the structure” to “how do I design or modulate this complex.”
15. Pitfalls and Common Errors
- Treating K_M as K_d — K_M only equals K_d when k_cat « k₋₁. Always check the kinetic regime.
- Reporting V_max without [E] — without enzyme concentration k_cat is unobtainable; insist on specific activity or [E]₀.
- Lineweaver-Burk fitting — unweighted least squares on a double-reciprocal plot weights low [S] points heavily; use nonlinear regression on raw v vs [S].
- Ignoring substrate depletion — initial-rate assumption fails once > 10% S consumed.
- Buffer-enzyme interactions — Tris inhibits many enzymes; phosphate inhibits kinases; pick buffer carefully.
- DMSO/co-solvent inhibition — > 1% DMSO inhibits ~10-30% of enzymes; titrate.
- Confusing standard ΔG°’ (pH 7) with ΔG° — biochem convention is ΔG°’ at pH 7, 25 °C, 1 M except H₂O and H⁺.
- Misreading membrane permeability — ions, ATP do not cross membranes passively; check transporter availability.
- Trusting AlphaFold pLDDT blindly — low-confidence (< 70) regions and inter-domain orientations are unreliable; PAE matrix matters more for assemblies; AF rarely captures induced-fit or active-state conformations.
- AF-3 ligand pose interpretation — even with high pLDDT, predicted ligand poses can be wrong; benchmark on PoseBusters; use experimental validation (X-ray, cryo-EM, ITC, MST).
- Ignoring proteoform diversity — sequence alone misses isoforms, PTMs, processing; top-down MS or proteoform-aware bottom-up MS for completeness.
- Over-interpreting in vitro k_cat/K_M — cellular conditions (crowding, partners, gradients) shift kinetics; verify in cellulo when possible.
- Linear Lineweaver-Burk extrapolations for allosteric enzymes — fit Hill equation v = V_max·[S]^n / (K^n + [S]^n) instead.
- Forgetting metals — many “ATP-dependent” assays need Mg²⁺ (1-5 mM); check stoichiometry.
16. Cross-References
[[Chemistry/organic-chemistry-foundations]]— functional groups, stereochemistry, mechanisms underpinning biochem (carbonyl, acyl substitution, Schiff bases, redox).[[Chemistry/analytical-chemistry-methods]]— UV-Vis, fluorescence, HPLC, LC-MS/MS, NMR, CD, SPR, ITC, MST.[[Biology/cell-molecular-biology]]— membranes, organelles, signaling pathways, the central dogma in cellular context.[[Biology/genetics-and-genomics]]— DNA replication, transcription, translation, regulation, sequencing.[[Engineering/pharma-process-engineering]]— large-scale fermentation, downstream purification, formulation.[[Engineering/bioinstrumentation]]— biosensors, point-of-care diagnostics, lab automation.[[Compute/transformer-architecture]]— the ML substrate of ESM, AlphaFold, ESM-3, RFdiffusion attention layers.
17. Citations and Foundational References
- Pauling, L., Corey, R. B., & Branson, H. R. (1951). The structure of proteins: two hydrogen-bonded helical configurations of the polypeptide chain. PNAS 37: 205-211.
- Ramachandran, G. N., Ramakrishnan, C., & Sasisekharan, V. (1963). Stereochemistry of polypeptide chain configurations. J Mol Biol 7: 95-99.
- Michaelis, L., & Menten, M. L. (1913). Die Kinetik der Invertinwirkung. Biochem Z 49: 333-369.
- Briggs, G. E., & Haldane, J. B. S. (1925). A note on the kinetics of enzyme action. Biochem J 19: 338-339.
- Lineweaver, H., & Burk, D. (1934). The determination of enzyme dissociation constants. JACS 56: 658-666.
- Monod, J., Wyman, J., & Changeux, J.-P. (1965). On the nature of allosteric transitions. J Mol Biol 12: 88-118.
- Koshland, D. E., Némethy, G., & Filmer, D. (1966). Comparison of experimental binding data and theoretical models in proteins containing subunits. Biochemistry 5: 365-385.
- Krebs, H. A., & Johnson, W. A. (1937). The role of citric acid in intermediate metabolism in animal tissues. Enzymologia 4: 148-156. [Nobel 1953]
- Mitchell, P. (1961). Coupling of phosphorylation to electron and hydrogen transfer by a chemi-osmotic type of mechanism. Nature 191: 144-148. [Nobel 1978]
- Boyer, P. D. (1993). The binding change mechanism for ATP synthase — some probabilities and possibilities. BBA 1140: 215-250. [Boyer + Walker Nobel 1997]
- Watson, J. D., & Crick, F. H. C. (1953). Molecular structure of nucleic acids. Nature 171: 737-738. [Nobel 1962 with Wilkins]
- Calvin, M., & Benson, A. A. (1948). The path of carbon in photosynthesis. Science 107: 476-480. [Calvin Nobel 1961]
- Hill, R., & Bendall, F. (1960). Function of the two cytochrome components in chloroplasts: a working hypothesis. Nature 186: 136-137.
- Kendrew, J. C., et al. (1958). A three-dimensional model of the myoglobin molecule. Nature 181: 662-666. [Kendrew + Perutz Nobel 1962]
- Sumner, J. B. (1926). The isolation and crystallization of the enzyme urease. JBC 69: 435-441. [Nobel 1946 — first protein crystallized]
- Cori, C. F., & Cori, G. T. (1929-47). lactate-glucose cycle; Nobel 1947.
- Krebs, E. G., & Fischer, E. H. (1955). The phosphorylase b to a converting enzyme of rabbit skeletal muscle. JBC 216: 121-132. [Nobel 1992]
- Hershko, A., Ciechanover, A., & Rose, I. (1980s). Ubiquitin discovery. [Nobel 2004]
- Skou, J. C. (1957). The influence of some cations on an adenosine triphosphatase from peripheral nerves. BBA 23: 394-401. [Nobel 1997]
- Sutherland, E. W. (1971 Nobel). cAMP as second messenger.
- Brown, M. S., & Goldstein, J. L. (1985 Nobel). LDL receptor.
- Berridge, M. J., & Irvine, R. F. (1984). Inositol trisphosphate, a novel second messenger in cellular signal transduction. Nature 312: 315-321.
- Arnold, F. H. (1993-). Directed evolution. [Nobel 2018]
- Kobilka, B. K., & Lefkowitz, R. J. (2007 Nature β2-AR structure). [Nobel 2012]
- Henderson, R., Frank, J., & Dubochet, J. — cryo-EM. [Nobel 2017]
- Jumper, J., et al. (2021). Highly accurate protein structure prediction with AlphaFold. Nature 596: 583-589. [Hassabis + Jumper Nobel Chemistry 2024]
- Baker, D. (Rosetta + de novo design). [Nobel Chemistry 2024, shared]
- Abramson, J., et al. (2024). Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630: 493-500.
- Watson, J. L., et al. (2023). De novo design of protein structure and function with RFdiffusion. Nature 620: 1089-1100.
- Dauparas, J., et al. (2022). Robust deep learning-based protein sequence design using ProteinMPNN. Science 378: 49-56.
- Wohlwend, J., Corso, G., et al. (2024). Boltz-1: Democratizing biomolecular interaction modeling. MIT preprint, Oct 2024.
- Hayes, T., et al. (2024). Simulating 500 million years of evolution with a language model. EvolutionaryScale (ESM3) preprint.
- Ruffolo, J. A., et al. (2024). Design of highly functional genome editors by modeling the universe of CRISPR-Cas sequences. Profluent / bioRxiv.
- Cheng, J., et al. (2023). Accurate proteome-wide missense variant effect prediction with AlphaMissense. Science 381: eadg7492.
- Berg, J. M., Tymoczko, J. L., Gatto, G. J., & Stryer, L. Biochemistry 9th ed. (2019) — canonical textbook.
- Lehninger, A. L., Nelson, D. L., & Cox, M. M. Principles of Biochemistry 8th ed. (2021).
- Voet, D., & Voet, J. G. Biochemistry 5th ed. (2019).
- Fersht, A. Structure and Mechanism in Protein Science (1999) — enzymology gold standard.
Last updated 2026-05-17. Maintain alongside [[Chemistry/_index]]. When AlphaFold-4 / Boltz-3 / ESM-4 land, update Section 14 and citations.