Document 29557

CY60004 Biophysical Chemistry
Structures of biological macromolecules (proteins and polynucleic acids);
Molecular Mechanics: simulating macromolecular structure;
Spectroscopic (NMR, Fluorescence and Circular Dichroism) methods to study
structure of proteins and DNA; solving macromolecular structures by X-ray
diffraction; structural transitions in polypeptides, proteins and polynucleic acids.
Interactions
between
macromolecules:
thermodynamics
of
folding/stability by fluorescence and circular dichroism techniques.
protein
Binding of small ligands by biological macromolecules: kinetics and energetics
of protein-drug, protein-surfactant and DNA-drug interactions by fluorescence,
CD and calorimetric methods.
Books :
Biophysical Chemistry, Parts I, II and III, Cantor and Schimmel
Principles of Physical Biochemistry, van Holde, Johnson and Ho
Protein Structure
Amino acids
Sidechain (R)
O
20 common amino acids
R side chain
Three letter code
One letter code
NH2
C
CH2
CH2
HO
C
O
carboxyl
group
Cα
H
• Peptide = a short chain of amino acids
• Polypeptide = a longer chain of amino acids
• Protein = a polypeptide that occurs in nature and folds
into a defined three-dimensional structure
N H
H
amino
group
Building blocks
H
+
H N
H
H
Cα
O
C
H
N
H
Cα
H
+
H N
H
O
C
Cα
O
C
R
O-
H
N
H
Cα
R
C
O
H
N
H
Cα
O
C
O-
R
The beginning of the protein is known as the
amino-terminus and the end of the protein is
known as the carboxyl-terminus.
R
R
H
Peptide bond
Amino Acid Characteristics
•
•
•
•
•
Hydrophobicity
Size
Charge
Secondary structure preference
Aromaticity
Special characteristics:
•
•
•
•
bridge forming by cysteines,
rigidity of prolines,
titrating at physiological pH of histidine,
flexibility of glycines
Nomenclature
• Glx can be Glu or Gln
• Asx can be Asp or Asn
• Polypeptide chains are always described from the N-terminus to the Cterminus
Nomenclature
• Nonhydrogen atoms of the amino acid side chain are named in sequence
with the Greek alphabet
ISOELECTRIC POINT (pI)
Definition: the pH at which a molecule carries
no net electric charge
pH
12
: using the Henderson-Hasselbalch pK2 10
equation
H3+NCH2COO- ⇌
H2NCH2COO- + H+
8
pI = ½(pK1 + pK2)
: for amino acids
pK1 = pK α-COOH
pK2 = pK α-NH3
Isoelectric Focusing
: separation of proteins according to charge
: pH gradient set up by ampholytes
: electric field applied such that one pole is
positively charged and one pole negatively
charged
: proteins migrate in the pH gradient until
their net charge = 0 (isoelectric point)
+
pI 6
4
pK1 2
H3+NCH2COOH ⇌
H3+NCH2COO- + H+
0
0.5
0
1.0
1.5
Titration of glycine
pI
-
Ionization of Amino Acids
Equilibrium dissociation constant
Acid-base properties of amino acids
The dissociation of first proton
from the α-carboxyl group is
Gly+ + H2O
Gly0 + H3O+
The dissociation of the second
proton from the α-amino group
Gly0 + H2O
Gly- + H3O+
Henderson-Hasselbach Equation
K1=
[Gly0][H3O+]
K2=
[Gly+]
[Gly-][H3O+]
[Gly0]
The pKa’s of these two groups are far enough apart that they can be
approximated by Henderson-Hasselbalch
pH = pK1 + log
[Gly0]
[Gly+]
2.0
H+ ions dissociated/molecule
pH = pK2 + log
[Gly-]
[Gly0]
Isoelectric Point of Amino Acids
Titration curve of glycine
COO-
+
H3N
C
H
H
Neutral
form
Titration of Gly
COOH
+
H3N
C
H
Gly+
H
pK1
+
H3N
C
H
pH 2.3
COO-
COO-
Gly0
H
pK2
H2N
pH 9.6
C
H
H
Gly-
From the pK values we can calculate the pI (isoelectric point) where the
amino acid is neutral.
pI ≈ average of (pK below neutral+ pK above neutral)
So, for Gly,
pI = (pK1 + pK2)/2
= (2.3 + 9.6)/2 ≈ 6
The Acid-Base Chemistry of the Amino Acids
pI = 2.35 + 9.78 = 6.1
2
Lysine
Lys
Glycine
Gly
Glutamic acid
Glu
pI = 2.10 + 4.07 = ~3.1
2
pI = 9.18 + 10.79 = ~10.0
2
Charged polar (acidic) side chains
Henderson-Hasselbach Equation
9.5
COOH 2.1
+
H3N
C
COOH 2.0
9.8
+
H3N
H
CH2
Buffering zone = pK +/- 1 pH unit
At pH = pK, [HA] = [A-]
CH2
At pH below pK, [HA] > [A-]
C
O
At pH above pK, [HA] < [A-]
C
H
CH2
C
O
OH 4.1
OH 3.9
Aspartic acid
Glutamic acid
Asp
Glu
D
E
Side chain
The 5 Complex Amino Acids are:
R
Glutamic acid (Glu)
H
Aspartic acid (Asp)
H
Lysine (Lys)
H
Arginine (Arg)
Amino
group
Histidine (His).
+
N
O
C
Cα
OH
H
Carboxyl
group
Each of these 5 amino acids has 3 ionizable groups ⇒ 3 pKs.
• His has 3 ionizing groups, alpha-carboxylic acid (pK 1.8),
• side-chain amino (pK 6.0) and alpha-amino (pK 9.2):
• His+2 goes to His+1 via pK 1.8;
• His+1 goes to His0 via pK 6.0;
• His0 goes to His-1 via pK 9.2.
• Therefore, pI = (6.0 + 9.2)/2 = 7.6.
pI = average of (pK-below) + (pK-above)
where pK-below is the pK between AA+1 and AA0
and pK-above is the pK between AA0 and AA-1.
Histidine Titration
Histidine Titration
Free Amino acid
COO
Amino acid on
surface of enzyme
-
N
H
3
CH 2
C
N
H
-1
-
COO
N
2
H
O
C
H
C
CH 2
C
N
N
NH 3 +
C
H2
N
NH
H
0
pKa=6.0
-
COO
H
N+
1
H
C
NH 3
CH
2
N
+
H
+1
pKa=1.8
COOH
H
2
4
6
8
pH
10
12
14
C
CH
0
0
N
H
+2
O
C
H
C
H
+
N
NH
C
H2
N
H
+
2
NH 3 +
0
H
pKa=6.0
H
N
0
0
Mols OH- added
Mols OH- added per mol histidine
NH 2
pKa=9.3
2
4
6
8
pH
10 12 14
+1
Histidine Titration
Amino acid in a
pocket (active site)
1. Carboxylic acids ionize at acidic pH; ie. Carboxylic acids
give up their protons at acid pHs.
O
C
H
C
N
pKa<6.0
+
+
N
0
H
Mols OH- added
+
C
H2
NH
pKa=6.0
3. Carboxylic acids near an amino group have a more acidic
pK than isolated carboxylic acids.
+
pKa>6.0
O
C
H
C
+
H
N+
C
H2
N
NH
H
-
+1
2. Amino groups ionize at basic pH; ie. Amines give up their
protons at basic or alkaline pHs.
+
-
4. Amino groups near a carboxylic acid have a more acidic pK
than isolated amines.
0
0
2
4
6
8
-
10 12 14
-
pH
-
For Ala-Lys, there are 3 ionizable groups:
For the tetrapeptide: Glu-Ala-Lys-Tyr
1) alpha-amino group contributed by Ala - assign pK 9.9.
Write out the structure.
2) alpha-carboxylate group from Lys - assign pK 2.2.
With the assigned pK values, determine the net charge at pH 1, 3, 5, 7, 10, 11.
3) side chain amino group from Lys - assign pK 10.8.
pH 1
pH 5
pH 7
pH 10
Calculate the pI of this tetrapeptide.
pH 12
The pK values of the amino acids are:
Ala- 2.4, 9.9
α−amino
Glu - 2.1, 4.1, 9.5
α-carboxylate
Lys - 2.2, 9.2, 10.8
Tyr - 2.2, 9.7
Side chain amino
Net Charge
Peptide Bond - Structure
1) DOUBLE BOND CHARACTER
O
..
C N
O+
C N
: fractional charges
: no rotation between C and N
: HIGH DEGREE OF ROTATION RESTRICTION
2) COPLANAR NATURE
: αC-N can rotate through angle φ
α R1
: αC-C can rotate through angle ψ
C
ψ
: R groupings can rotate through a
H
characteristic angle
rotation
C N rotation
: as it rotates, gives 3D structure to
O φ
the protein
C
: degree of rotation will depend upon R group
R2 α
1) BULK
held in
eg. free rotation with glycine (R = H)
position
2) CHARGED STATE
because
of double
two positives repel one another
bond
3) WATER REACTION
polarity of R groupings
polar - rotate out to H bond with water
nonpolar - on inside of molecule
phi (ϕ) torsion angle around N-CA bond
psi (ψ) torsion angle around CA-C bond
omega (ω) peptide bond; trans ~180 degrees, cis ~0
Torsion angles
O
R2
C
H2N
φ
CH
ψ
CH
ω
R1
N
OH
ψ
φ
C
O
H
Residue
Globular protein
Membrane protein
Non-polar
VLIMFYW
In interior
Hydrophobic core
Surface – lipid
anchor
Polar charged
RKDEH
Surface
Catalytic sites
Hydrophilic core
Polar neutral
STNQYW
H bond network
Inside surface –
part of channel
Steric hindrance:
• Most – Pro
• Least - Gly
Importance of Protein Structure
Amino Acid Partitioning Into Membrane Regions
Region
Amino Acids
Bulk water
Arg, Asn, Asp, Gln, Glu, His, Lys, Pro
Bulk water +
interfacial
Ala, Cys, Gly, Ser, Thr
Interfacial
Tyr
Hydrophobic
Ile, Leu, Met, Phe, Trp, Val
Sequestering hydrophobic residues in the protein core protects proteins
proteins
from hydrophobic agglutination.
Hemoglobin A: Val-His-Leu-Thr-Pro-Glu-Glu-LysHemoglobin S: Val-His-Leu-Thr-Pro-Val-Glu-Lys-
http://chemed.chem.purdue.edu/genchem/topicreview/bp/1biochem/amino2.html
“sticky patch” causes hemoglobin S to agglutinate (stick together) and form
fibers which deform the red blood cell
Classes of Proteins
Fibrous Proteins
Based on structure and solubility,
proteins can be grouped into three large classes:
Fibrous
Globular
Membrane
• Fibrous proteins contain
polypeptide chains
organized parallel along a
single axis, producing
long fibers or large sheets.
• They are mechanically
strong, play structural
roles in nature;
• Difficult to dissolve in
water;
α-Keratins and Collagen
are examples of fibrous
proteins
α-keratins are found in hair, fingernails, claws, horns and beaks;
•Sequence consists of long alpha helical rod segments capped with non-helical
N- and C-termini
β-keratins are found in silk and consist of gly-ala repeat sequences;
•Ala is small and can be packed within the sheets
Globular Proteins
• Globular proteins are classified according to the type and arrangement
of secondary structure
– Antiparallel alpha helix proteins
– Parallel or mixed beta sheet proteins
– Antiparallel beta sheet proteins
Metal binding sites examples
Protein Architecture
Primary structure (1°) : the amino acid sequence.
Secondary structure (2°) : helices, sheets and turns.
Tertiary structure (3°) : side chain packing in the 3-D structure.
Quaternary structure (4°) : association of subunits.
1) M is Fe (rubredoxin) or Zn (aspartate transcarbamoylase)
2) Carboxypeptidase A
3) Catalytic ion in liver alcohol dehydrogenase
4) Azurin and plastocyanin
Protein Structure
• Physical properties of protein that influence stability &
therefore, determine its fold:
Secondary Structure
Primary Structure
Two major types:
Alpha Helical Regions
Beta Sheet Regions
– Rigidity of backbone
– Amino acid interaction with water
• Hydropathy index for side chains
– Interactions among amino acids
• Electrostatic interactions
• Hydrogen bonds
• S-S bonds
• Volume constraints
•
Other classification schemes:
Turns
Transmembrane regions
Internal regions
External regions
Antigenic regions
Features
α-helix is one of two secondary structures
(the other being the β -sheet) predicted and
discovered by Linus Pauling in 1951.
Secondary Structure Conformations
φ
alpha helix
ψ
-57
alpha-L
-47
57
47
3-10 helix
-49
-26
π helix
-57
-80
type II turn
-79
150
β-sheet parallel
-119
113
β-sheet antiparallel
-139
135
It is a right-handed helix with the following
spatial parameters:
φ= -57°
ψ = -47°
n = 3.6 (number of residues per turn)
pitch 0.54nm (or 5.4Å)
β-sheet: parallel (φ =-119 and ψ =113) or
βanti-parallel pleated sheet structures
Residue conformational preferences:
Glu, Ala, Leu, Met, Gln, Lys, Arg - helix
Val, Ile, Tyr, Cys, Trp, Phe, Thr - strand
Gly, Asn, Pro, Ser, Asp - turn
other regular structures
a) 310 helix - i, i+3 hydrogen bonding pattern
b) π-helix - i, i+5 hydrogen bonding pattern
c) turns and loops
antiparallel beta-sheet parallel beta-sheet
Properties of the alpha helix
• φ ≈ ψ ≈ −60°
• Hydrogen bonds
between C=O of
residue n, and
NH of residue
n+4
• 3.6 residues/turn
• 1.5 Å/residue rise
• 100°/residue turn
All the common
secondary
structure
conformations fit
in the sterically
allowed regions of
phi-psi space
alpha-helix
Properties of α-helices
• 4 – 40+ residues in length
• Often amphipathic or “dual-natured”
– Half hydrophobic and half hydrophilic
– Mostly when surface-exposed
• If we examine many α-helices,
we find trends…
– Helix formers: Ala, Glu, Leu, Met
– Helix breakers: Pro, Gly, Tyr, Ser
the alpha-helix: repeating i, i+4 Hbonds
11
Helix nomenclature: α-helix
example
1 Hydrogen bond between
C=O
(residue i)
10
12
right-handed helical
region of phi-psi space
9
8
– 5 turns
– 18 residues per repeat
5
4
185 helix
3 Loop formed between
hydrogen bond
1
3
1←5 helix
2 Repeating unit:
7
6
H-N
(residue i+4)
Which of residues 1-12 are in the
helix? Does this coincide with the
residues in the helical region of phipsi space?
2
C=O H-N
1 ←
– 13 atoms
– 3.6 residues per turn
3.613 helix
5
nomenclature used for
310 helix
α helices extend with approximately 1.5 Angstrom per
residue, 5.4 Angstrom per turn.
Helical Wheel
310 helix i, i+3
α-helix i, i+4
π-helix i, i+5
Helical Wheel
Beta Strands
For each residue the
rotation is 1000
lys his gly val thr val leu thr ala leu gly ala ile leu lys lys
phi(deg) psi(deg) omega (deg)
-----------------------------------------------------------------β strand
-120
120
180
-----------------------------------------------------------------
beta strands/sheets
beta-strand region of
phi-psi space
57
56
54
53
52
51
50
parallel or anti-parallel sheet?
49
Secondary Structure: Beta Sheets
Turns and Loops
BETA PLEATED SHEET: a result of H-bonding between polypeptide chains
Four-stranded beta
sheet that contains
three antiparallel
and one parallel
strand.
Hydrogen bonds are
indicated with red
lines (antiparallel
strands) and blue
lines (parallel
strands)
β-turns in proteins:
reversing the chain direction
turn
residues
Α β-turn consists of two
residues, where there is a
hydrogen bond between the
carbonyl of the residue
preceding the turn and the
amide nitrogen following the
turn. There are a number of
ways to configure the backbone
to achieve this
direction of
polypeptide chain
• Secondary structure elements are connected by regions of
turns and loops
• Turns – short regions
of non-α, non-β
conformation
• Loops – larger stretches with no secondary structure. Often
disordered.
– “Random coil”
– Sequences vary much more than secondary structure
regions
Secondary Structure: Turns
Gamma-turn
It involves three amino acid residues and the intraturn hydrogen bond
for a gamma-turn is formed between the backbone CO(i) and the
backbone NH(i+2).
Beta-turn
A beta-turn involves four amino acid residues and may or may not be
stabilized by the intraturn hydrogen bond between the backbone CO(i)
and the backbone NH(i+3).
Alpha-turn
An alpha-turn involves five amino acid residues where the distance
between the Calpha(i) and the Calpha(i+4) is less than 7Å and the
pentapeptide chain is not in a helical conformation.
Pi-turn
It is the largest tight turn which involves six amino acid residues.
Tertiary Structure of Protein
Tertiary Structure: Aggregation of individual protein.
1.
Hydrophobic attraction: the close association
attraction of hydrocarbon side-chains.
2.
Ionic bond: between positively charged groups and
negatively charged groups.
3.
Hydrogen bonds
4.
Disulfide bonds
A protein has size and shape as well as unique arrangement
through hydrogen, ionic, hydrophobic and disulfide bonds.
Side chain conformations--canonical staggered forms
Side chain
conformation
Newman projections for χ1 of glutamate:
O
C δ
O
• side chains differ in their number
of degrees of conformational
freedom
•but side chains of very different
size can have the same number of
χ
angles.
C
γ
CH2
β
CH2
CH
α
CO
CO
χ3
Hβ
χ2
N
χ1
NH
Cγ
Cγ
Hα
N
CO
Hβ
Hβ
Hα
N
Hβ
Hα
Hβ
Cγ
χ1 = 180°
χ1 = +60°
χ1 = -60°
t
g+
Hβ
g–
O
glutamate
t=trans, g=gauche
name of conformation
Side chain angles are defined moving outward from the backbone, starting
with the N atom: so the χ1 angle is N–Cα–Cβ–Cγ, the χ2 angle is Cα–Cβ–Cγ –Cδ ...
IUPAC nomenclature:
http://www.chem.qmw.ac.uk/iupac/misc/biop.html
Rotamers
•
•
•
•
Anfinsen’s experiments, late 1950’s through 1960’s
a particular combination of side chain torsional angles χ1, χ2, etc. for a
particular residue is known as a rotamer.
for example, for aspartate, if one considers only the canonical staggered
forms, there are nine (32) possible rotamers: g+g-, g+g+, g-g-, g-g+, tg+, g+t, tg-,
g-t, tt
not all rotamers are equally likely.
for example, valine prefers
distribution of
its t rotamer
valine rotamers
in protein structures
CO
Hβ
Cγ1
N
Ribonuclease, an enzyme involved in cleavage
of nucleic acids. Structure has a combination of
α and β segments and four disulfide bridges
Hα
Cγ2
χ1=180°, trans or t
χ1=0
180
360
What are Disulfide Bridges?
Active,
native
structure
Oxidation
Cys-SH +
Cys-SH
Cys-S
S-Cys
Reduction
add β-mercaptoethanol (BME)
HS-CH2-CH2-OH
add urea, H2N-C-NH2
BME is reducing agent
Urea unfolds proteins
O
SH
Cys110
SH
Cys110
Cys58
SH
SH
SH
Cys58
SH
SH
SH
Denatured, inactive, “random coil”, many conformations
SH
SH
SH
SH
SH
SH
SH
Many Conformations
SH
SH
SH
SH
SH
SH
SH
Many Conformations
Remove BME
SH
SH
Remove BME
Remove urea
S S
S S
- Native structure
- fully active
- 4 disulfide bond
correct
One
Conformation
S S
Remove
urea
S
S
Mixture of 105
different conformations, 1% active
One
Conformation
Sequence specifies structure
• Anfinsen’s experiments
– Denatured ribonuclease
– Spontaneously regained enzymatic
activity
– Evidence that it re-folded to native
conformation
The essential
structure
information is stored
in the primary
sequence of amino
acids
The “native” structure of a protein is the form we
find when we isolate that protein in an active state
from a natural source.
If the protein loses that structure, by unfolding, or
unwinding, it loses activity.
Therefore, native = folded
denatured = unfolded
The “native” structure is necessary to create the
binding pockets that make up the active site of
an enzyme.
Structure Stabilizing Interactions
Structure Stabilizing Interactions
• Noncovalent
– Van der Waals forces (transient, weak electrical
attraction of one atom for another)
– Hydrophobic (clustering of nonpolar groups)
– Hydrogen bonding
δ- δ+
δ-
Higher interaction energy than a
simple van der Waals interaction
H-bond
D–H
A
• Covalent
– Disulfide bonds
β sheet
α helix
The hydrophobic effect
H
H
H
H
H
H
H
H
H
H
H
H
H
H
H
+
H
Partitioning of the molecules between water and an organic solvent such as octanol
H
-
H
H
H
H
H
H
H
H
-
H
H
+
H
H
H
H
Quantification of contribution to the hydrophobic effect
(Hydropathy)
H
H
H
H
H
H
H
H
Amino acid placed in two-solvent system
Hydropathy represented by partition coefficient P,
Hydropathy represented by
P=
− log( P )
χ aq
χ org
H
Non-polar solute
Polar solute
The dissolution of a non-polar solute in water is unfavourable –
for polar solutes there is favourable hydrogen bonding
Structure Stabilizing Interactions
Solving Protein Structures
Asn
H H H H H +
Lys C C C C N H
H H H H H
O H
O C C Asp
H
H C H
H
N C
H
δ O
δ+
H
O
H C H
Ser
Only 2 kinds of techniques allow one to get atomic resolution
pictures of macromolecules
• X-ray Crystallography (first applied in 1961 - Kendrew &
Perutz)
• NMR Spectroscopy (first applied in 1983 - Ernst & Wuthrich)
• Structure
Function
• Structure
Mechanism
• Structure
Origins/Evolution
• Structure-based Drug Design
• Solving the Protein Folding Problem
QHTAWCLTSEQHTAAVIWDCETPGKQNGAYQEDC
HHHHHHCCEEEEEEEEEEECCHHHHHHHCCCCCC
Hydrophobicity scales
Hydropathy plots
•An hydropathy plot is a graphical display of the local hydrophobicity of
amino acid side chains in a protein.
A positive value indicates a
hydrophobic residue and a
negative value a hydrophilic residue
•A positive value indicates local hydrophobicity and a negative value
suggests a water-exposed region on the face of a protein.
•Hydropathy plots are generally most useful in predicting transmembrane
segments and N-terminal secretion signal sequences.
Hydropathy index
Hydropathy plots
Sliding Window Approach
Calculate property for first
sub-sequence
•A window of 9 or 11 is generally
optimal for recognizing the long
hydrophobic stretches that typify
transmembrane stretches.
I L I K E I R
4.50+3.80+4.50-3.90
-3.50+4.50-4.50 = 5.40
= 5.4/7=0.77
•In an α-helix the rotation is 100 degrees per amino acid
•The rise per amino acid is 1.5 Å
•To span a membrane of 30 Å approx.
30/1.5 = 20 amino acids are needed
Move to the next position
Transmembrane Helix Predictions
•Not many structures known
of transmembrane helix
proteins
•Hydropathy analysis can be
used to locate possible
transmembrane segments
•The main signal is a stretch
of hydrophobic and helixloving amino acids
Hydropathy plots
•The window size can be changed.
A small window produces
"noisier" plots that more
accurately reflect highly local
hydrophobicity.
Hydropathy plot for rhodopsin