Genome Structure, Chromatin and The Nucleosome
I. Introduction
A. Introduction: The Human Genome Project
1. Since the first gene was sequenced in 1972, Molecular Biologists have wished to take a systematic approach to understanding the human genome
2. From the idea of the systematic approach, the Human Genome Project was inaugurated in 1990
a. Billed as Biology’s first “big project”
b. Projected time frame to finish the project was 15 years
3. A Molecular Biologists defines genome as the total set of DNA molecules present in either an organism, a cell or an organelle
4. When the Human Genome and Human Genome project is being discussed, the total set of DNA molecules in the cell is being looked at
a. Chromosomes (24 types)
b. Mitochondrial DNA
5.The goal of the Human Genome project was to acquire fundamental information about our genetic makeup
a. How many genes are present in the human genome
b. How many genes are actually similar to each other in DNA sequence
c. How many base pairs are present in the Human Genome
d. How many base pairs are present in a chromosome
e. How many base pairs are not part of genes (intervening sequences)
f. Understand how genes interact with each other
6. Although the basic goals of the project are listed above, the major justification for a project lasting 13 years and costing $2.7 Billion were the possible medical benefits
a. Allow for more accurate diagnosis of inherited disorders
b. Provide a framework for the development of new therapies
c. Personalized medicine: Wide-scale application of mutation screening which better allows a move from treating advanced symptoms of disease to preventative disease
7. With personalized medicine comes significant ethical, legal and societal implications-~20% of the $2.7 Billion spent went into this category
B. Introduction: Human Genome Project Findings
1. The Human genome consists of roughly 3 Billion base pairs spread across 46 chromosomes (23 pairs)
2. Human chromosomes can range in size from 50 million to 300 million base pairs
a. The smallest chromosome is chromosome 21
b. The largest chromosome is chromosome 1
3. The Human Genome has roughly 20,000 genes (protein coding only)
a. Chromosome 1 has the most genes at about 3000
b. The Y chromosome has the least number of genes having only 70-200 genes
c. Chromosome 5 has lowest gene density
d. Chromosome 19 has the highest gene density
4. The 20,000 genes is only a rough estimate and was determined through using computer based methods
a. Genes that encode proteins
b.Evidence of evolutionarily conserved gene sequences
5. The human genome does not have many more genes than other less complex organisms
a. C. eleganshas 20,000 genes and a genome size of 100 million base pairs
b. Drosophila has 14,000 genes and a genome size of 165 million base pairs
6. The reason why humans can get away with so few genes is that on average one human gene encodes not just one protein but actually three
7. Introduction: Human Genome Project Findings: Supplemental Figure
II. Fitting the Human Genome Into The Cell
A. Fitting the Human Genome Into The Cell: Introduction
1. Fitting the 3.0 billion base pair genome into a human cell nucleus is a difficult proposition for the cell
2. If you were to stretch out the 3.0 billion bases, one would find that the DNA molecule would be about 2m long
a. A human cell nucleus is going to be approximately 10 μm in diameter
b. Represents a packing ratio of 1000 – 10,000 fold
3. Another way of looking at this is if we were to take a string the length of the Willis Tower, and fit it into a tennis ball, how easy would it be to do that?
4. Somehow, the cell must have a way to actually package all of the DNA into the nucleus
5. The way in which eukaryotic DNA is packaged in the cell nucleus is one of the wonders of macromolecular structure”
–G. Michael Blackburn, Nucleic Acids in Chemistry and Biology (1990) p. 65
6. In order to fit the large human genome into a relatively small nucleus human cells have an intricate mechanisms for packaging the DNA
a. Dividing the genome into linear chromosomes
b. Winding the DNA around proteins called histones
B. Fitting the Human Genome Into The Cell: Dividing the Genome Into Chromosomes
1. By definition, a chromosome is a structure of a long DNA molecule and its associated proteins, which carries all or part of the hereditary information for an organism
2. All organisms and viruses carry their genomes in the form of chromosomes
3. The number of chromosomes an organism has varies, and does not correlate with the complexity of the organism (see table 7-1)
4. In general, most prokaryotes have one circular chromosome
5. Most eukaryotes have multiple linear chromosomes of varying size and copy number
a. Most eukaryotes have 2 copies of each chromosome (homologs)
b. Some eukaryotes may have only one copy of each chromosome
c. In rare cases, some eukaryotes may have more than two copies of each chromosome (Tetrahymena)
C. Fitting the Human Genome Into The Cell: The Purpose of Dividing The Genome Into Chromosomes
1. Dividing the genome into chromosomes is actually very important for all organism
a. Allows for easier compaction to fit the DNA into the nucleus (or nucleoid)
b. Protects the DNA from damage
c. Provides an easy way to transmit the DNA to daughter cells when a cell divides
2. For sexually reproducing organisms, having chromosomes allows for easy transmission of half the genome to each gamete in a process called meiosis
3. By diving the genome into chromosomes gives the cell the ability to control DNA structure, and by proxy control gene expression
a. Allows some genes to be “expressed”
b. Allows some genes to not be “expressed”
D. Fitting the Human Genome Into The Cell: The Human Chromosomal Content
1. Humans have a total of 46 chromosomes, with two copies of each chromosome
2. Humans have 23 pairs of chromosomes
3. Each member of a chromosome pair (homolog) will have the exact same genes
4. However, each member will not necessarily have the same sequence (variant or allele) for each gene
a. In cases where both chromosomes have the same sequence of a specific gene, the individual is a homozygote for that gene
b. In cases where both chromosomes have different sequence for a specific gene, the individual is a heterozygote for that gene
D. Fitting the Human Genome Into The Cell: The Human Chromosomal Content and Sickle Cell Anemia (Supplemental Figure)
E. Fitting the Human Genome Into The Cell: The Human Chromosomal Content and Sickle Cell Anemia (Supplemental Figure)
F. Fitting The Genome Into The Cell: Chromosome Structure
1. Eukaryotic chromosomes have several important features
a. Allow for greater DNA stability
b. Allow for efficient gene expression
c. Allow for control of gene expression
2. Each end of a linear chromosome is capped with a telomere for maintenance of chromosome stability
3. Throughout each chromosome reside origins of replication, which direct the start of DNA synthesis
4. Each eukaryotic chromosome will have a more or less centrally located centromere, which is important for chromosome segregation in mitosis/meiosis
a. An elaborate protein complex known as the kinetochore binds the centromere
b. Microtubules will in turn interact with the kinetochore to pull the chromosomes to opposite poles of the cell during mitosis
5. The centromere divides the chromosome into arms
a. p-arm is the short arm (petit)
b. q-arm is the long arm
6. Fitting The Genome Into The Cell: Chromosome Structure (Supplemental Figure)
7. It is not the norm to have chromosomes with multiple centromeres
8. However, this does occasionally happen in humans
9. Having a chromosome with two centromeres creates a problem during mitosis as each centromere may be getting pulled in opposite directions leading to chromosomal breakage
10. Having no centromeres is also not the norm in human populations, and results in completely random segregation during mitosis
III. Chromatin Formation
A. Winding The DNA Into Chromatin: Introduction
1. Although dividing the genomic DNA into chromosomes helps with packaging, it clearly is not enough
2. In addition to having chromosomes, the chromosomal DNA is wound around proteins to form chromatin
3. Chromatin is the genomic DNA and its associated proteins
4. By winding the chromosomal DNA around proteins, it is very easily fit into the nucleus of the cell
5. To form chromatin, the genomic DNA must be wound around a specific structure called a nucleosome
6. Each chromosome will have many nucleosomes associated with it
7. Each nucleosome contains the following
a. Chromosomal DNA
b. Histone proteins around which the DNA is wound
8. The nucleosome allows for compaction of the DNA about 6-fold
9. This is only the first stage in compaction, as the genome needs to be compacted 100-10,000 fold to efficiently fit into the nucleus
B. Winding The DNA Into Chromatin: The Histone Proteins
1. Each nucleosome consists of a core of eight histone proteins and the chromosomal DNA wound around them
2. The histone proteins were discovered back in 1928 by Albrecht Kossel, who isolated from goose erythrocytes
a. Discovered that they were basic (have a positive charge)
b. Since then histones have been discovered in many other organisms
3. Eukaryotic nucleosomes generally have a total of 5 abundant histones divided into two sub-categories
a. Core histones, which show strong conservation
b. Linker histones
4. The Core histones, form the core of the nucleosome, and will form an octamer (complex of 8 protiens) and are found in the cell in roughly equal amounts
a. Histone H2A
b. Histone H2B
c. Histone H3
d. Histone H4
5. The core histones to be basic,
a. Rich in arginine
b. Rich in lysine
6. The protein core of a nucleosome is a disk-shaped structure that assembles in an order fashion only in the presence of DNA (discovered through in vitro experiments)
7. Without DNA, the core histones will create intermediate assemblies, which are mediated by the conserved histone fold domain (discovered through in vitro experiments
B. Winding The DNA Into Chromatin: The Core Histone Proteins Come Together To Form an Octomer
1. In order to form the core, the Histone H3 and Histone H4 proteins form a central tetramer
a. H3 and H4 proteins form heterodimers
b. Two H3/H4 heterodimers interact to form the tetramer
2. Once the tetramer H3/H4 tetramer forms, it binds to DNA to start the winding process (~147 bp of DNA)
3. Next, Histones H2A and H2B form heterodimers which then bind onto each end of the H3/H4 tetramer
4. The DNA that is wound around the core histones is considered the core DNA and is wound ~1.65 times around the octomer like thread around a spool
5. The fifth histone present in a nucleosome is the linker histone
6. There are two possible types of linker histones
a. Histone H1
b. Histone H5 (only found in avian erythrocytes)
7. Compared to the core histones, the linker histones are larger, with a molecular weight of greater than 20 kD
8. Histone H1 is half as abundant in the cell as the core histones
9. Histone H1 is not part of the core nucleosomeparticle, instead it binds to the linker DNA
10. If one counts the DNA wound around the core nucleosomes and the linker DNA which is associated with Histone H1 there are about 180 bp (in humans)
11. The role of the linker Histone H1 is to induce tigher wrapping of the DNA around the nucleosome
12. Histone H1 binding relative to the nucleosome is different than the core histones
a. Only one Histone H1 protein binds
b. Histone H1 binding sites are located asymetrically with respect to the nucleosome
13. The two Histone H1 binding sites are as follows:
a. The linker DNA at one end of the nucleosome (only linker DNA on one side of the nucleosome is protected from mnase digestion)
b. At the mid-point of the 147 bp associated with the core histone complex
14. In the end, the whole nucleosome includes the following
a. Core histones and the DNA that is wrapped around them
b. Linker histone, and the DNA that is covered by it
C. Winding The DNA Into Chromatin: Determining How Many Base Pairs of DNA Are Associated With A Nucleosome
1. In order to figure out the amount of DNA wound around a nucleosome, the enzyme Micrococcal nuclease was used
a. Micrococcal nuclease will digest free double stranded DNA
b. Micrococcal nuclease will not digest double stranded DNA associated with proteins
2. To perform the Mnase experiment chromatin was isolated and subjected to treatment with a low concentration of micrococcal nuclease
3. Once treatment was finished, protein components were removed, leaving only the DNA that was associated with the nucleosomes
4. The DNA was then run on an agarose gel
5. On the agarose gel, they saw DNA bands at multiples of 180 bp
6. Winding The DNA Into Chromatin: Determining How Many Base Pairs of DNA Are Associated With A Nucleosome
7. As we saw before, the 180 bp refers to the amount of DNA wound around the core nucleosome and the linker DNA associated with Histone H1
8. In order to figure out how much DNA was wound around the core nucleosome, the chromatin was exposed to more extensive micrococcal nuclease treatment
a. The linker Histone (Histone H1) is not tightly bound to the DNA
b. Extensive treatment with Mnase results in the release of Histone H1
c. The release of Histone H1 allows for the Mnase to in turn degrade the rest of the DNA Mnase was bound to
9. After extensive treatment only a 147 base pair band was seen-indicative of the amount of DNA wound around the core nucleosome
D. Winding The DNA Into Chromatin: The Winding of the DNA Around The Core Histones
1. Although the nucleosome is not perfectly symmetryical, it does have a twofold axis of symmetry called the dyad axis
a. Imagine the nucleosome as a clock, with the midpoint of the 147 bp of DNA at the 12 o’clock position
b. This places the ends of the DNA at the 11 o’clock and 1 o’clock positions
2. Throughout the nucleosome, there are 14 distinct sites of contact between the core histones and the DNA
3. An interaction occur each time the minor groove of the DNA faces the histoneoctamer
4. Winding The DNA Into Chromatin: The Winding of the DNA Around the Core Histones
5. The core histone proteins bind the DNA through H-bonds in a sequence non-specific manner (minor groove binding)
a. Majority of H-bonds are between the Histones and the oxygens in the phospho-diester backbone
b. Few H-bonds are formed between the histones and the nitrogenous bases
6. In order to get the DNA wound around a nucleosome, it must be bent
7. In general DNA is a rigid molecule due to the fact the fact the negatively charged phosphates in the backbone (resists bending)
8. The total number of Hydrogen bonds between the core histones and the DNA is twice as many as a typical sequence specific DNA binding protein
a. Core histones interact with the DNA through 40 Hydrogen bonds
b. Typical Sequence specific DNA binding proteins interact with the DNA through 20 hydrogen bonds
9. Two aspects of the core histone-DNA interaction result in allowing the rigid DNA to bend
a. Increased hydrogen bonding
b. Basic nature of the histones masks the negative charge of the phosphate groups
IV. Packaging The Genome
A. Packaging The Genome: Introduction
1. Winding the DNA around nucleosomes only compacts the DNA 6 fold-well short of the needed 1000-10,000 fold compaction that is necessary
2. To further look at DNA compaction, chromatin was carefully isolated from nuclei and viewed by electron microscopy using a low salt extraction
3. In 1975, Chambon’s lab published electron micrographs of the eukaryotic genome revealed the existence of uniformly sized particles with a repeating pattern
4. The genome looked like “Beads-on-a-string” by electron microscopy
a. The beads represent DNA wrapped around the histone core octamer
b. The string represents the free DNA helix between nucleosomes
c. Note: these methods of extraction are non-physiological and therefore, it is unknown whether this conformation exists in vivo-however it is likely this conformation is not present in vivo
B. Packaging The Genome: The 30 nm Fiber
1. The binding of Histone H1 not only increases how tight the DNA is wound around the nucleosome, but also stabilizes higher order chromatin structures
2. If the chromatin is studied In the test tube, as salt concentrations are increased, the addition of Histone H1 results in the nucleosomal DNA forming a 30 nm fiber
3. Chromatin in situ experiments using electron microscopy also further revealed thatthe arrays of nucleosomes formed a more compact fiber
a. Revealed the 30 nm fiber (30 nm in diameter)
b. Difficult to maintain the 30 nm structure upon purification and so the structure remains not well understood, as it is present in vivo
4. Two possible models have been proposed for the structure of the 30 nm fiber-based on high salt extractions