Control of Gene Expression
Control of Gene Expression
Control of Gene Expression
Why the pancreatic cells produces Insulin but not Melanin and the
skin cell produces melanin but not insulin?
Human pancreas Human Skin
Insulin Melanin
Overview of Eukaryotic gene expression
Overview of Prokaryotic gene expression
Different cell types in a multicellular organism differ dramatically in
both structure and function. If we compare a nerve cell with a liver
cell they are completely different in terms of functions.
All the cells have identical genetic makeup; the only difference lies
in their pattern of gene expression
When the nucleus of a fully differentiated frog cell is injected into
frog egg, whose nucleus has been removed is capable of producing
a normal tadpole. The tadpole contains full range of differentiated
cells, that derived from the nucleus of donor cell. Thus it can be
concluded that the donor nuclei contained all the important DNA
sequence in it.
Different cell types synthesize different sets of RNAs and protein
• Many process of cell types are common like Histones for DNA packaging, DNA &
RNA polymerase, ribosomal proteins, proteins for cytoskeleton(Actin/myosin).
Hence these genes are transcribed and translated all types of cells.
• Some RNA and proteins are abundant in the specialized cells in which they
function. e.g Hemoglobin protein is only expressed in RBC where it carries oxygen.
The enzyme tyrosin amino transferase which breaks down tyrosin in foods is
expressed in liver but not in other tissues.
How external signals are responsible to change the expression of the gene
3’UTR
TSS
ATG 3’UTR
PROMOTER of Gene B
PROMOTER of Gene B TSS
ATG 3’UTR
Cis elements
TSS
ATG 3’UTR
RNA
pol
mRNA
ATCAATG TATA
box
ATG
Promoter TSS
RNA
pol
mRNA
How sequence specific DNA binding proteins/ transcription factors
control transcription process
TF
• Each TF makes a series of contacts with
the DNA, involving hydrogen bonds, ionic
bonds, hydrophobic interactions.
ATTAACAGTTTGTTGCCAAAAATATTTCTTTTTTTTAAAATAAAAAATCCTTTATTTTTTCCATCATCTTCTTCCTT
ATCTAAGTTTTTTATTTATAATTATAATATTGCAGTGACCTTAATGTGTACAAGTATATTCGTTACATTATTTGG
GTCTCTGTAAAAACAAGTGCAGAAACACTAAAAACTATGAAGGCCTTTAATGGAACCAATGTCATCAAGAAAC
AATTTTCTTTAATTTTTGTAAAATTTACAGTTTTTGTTCTGTTGTTGGGTCTTGCCTATCGACTCATCTTGTCCAAT
TTTGAACAGTTCTCTCAGATTGAAGTGAAAAATACTGCTCTGTTTTCCGACAACACATTGCCACAGGCGCCGGT
GACCGTGAGTGAACCTCCGGTGACTGTCAATGAACCTCTGGTGACCGTTGATGAACCTACGGTGACTGTGAGT
GAATCTCTGGTGACTATTGATCTTCCAGAAAACCAAACTTTGCGAAATGGTTAGTTTATATTTGCATGAATGCT
CAGTTCTTGTAAAAATTGAATCTTGAAAGATTAATATTTTCTGTTTTTTTTTAGCTTATTGTAATTTGTTTGCGTT
GGATTTTGTGTAATCAATGACGTGGGTAATTTTCCTTCACTTCTCGTCTCTGTGTTTTGTGATTAATTTTGTATAT
TTGGTATTTGAGGATCAAGTCTTTCATCCATATTTGTAGTGAAAATCTCTTTTCTACGGAATCAAACTCTCACCA
ATCAACTGAGCTATTAAGATTTTCTAGTGGAATATATAAAATATGGATGGTGGATTAATTTTTTATGATATTATC
TTTGGCTGGTAGTCATGAATTTTTTTAGTAATAGAAAAAGGAACAACTTTGAGTACAAATAATAAGATTTTGCA
CTTTGAGCTTAGGTCACTAATGTATGAGTCCTTACAATTAACTGACTCTTTTTTTTTTCTTTTTCAGGAACATGTG
ATTTATTTATAGGAAATTGGGTTTATGATCCAAGTGGTCCAGTATACACGAACGCGACTTGCTATTCAATTGAA
TCTCATCAGAACTGTATGAAAAATGGGAGACCTGATACTGAATATCTTAACTGGAGATGGAACCCAACAGACT
GTGAGCTACCGAGATTTAGCCGTAAGAAATTCCTCAATTTGATGCGAAACAAATCATTCACCTTCATAGGCGAT
TCAATCATGCGTAATCATGTGCAGTCATTGCTTTGTATTCTCTCACAGGTACATTTCAAGCTCAAAAAGAGTTAT
ACTTGAACCAGGCAATTAGCTGCTCTACATTTCATTTGTTTTGTTTTGTTTCAGTTTGTTGTGTTAGTGAAATTG
GTAATTGCAAGCGTCATGATTTAGTTGTTTAGTCGTAACAATAATTGCTCTTAAATGGTAATGTTGATATCAAA
TCAAGTAAATACAAGCTTGTTGTGGCTGTTTGTGTATGGCATTACAATTTGTGGGGGCACTCTTGTGTTGTATA
AGCTGGTGCCTTATCTAGTTGTATCTCACTAACCATAATTAGGATCTTTTGTGTATGAGGTGGCATAGCTACAT
ATTGGATTTATCGGTAATGTCACTCTCAGTTACATGGATGTTGGTCCCGCGAGCCAGCTTGTAAAGGCTCTTTG
TAGTTATAACATTTGTTTCGAAAGTTGTTAAGAAGTTGATTTAAAGTTAAGTGTCAAACCTAACTAAGTGTACA
TTTAAGACTATTACTAAAGTTTTGCTGTTGTTCCATCTCCTTTGACCAAGAAAAATTAGCTGGCCTCATTATTAT
CTCCTTGAGGTTTCAGCAAAAATTGATGTGTAATGATATGACACTGGGGTCCAACTCAACCCCGGATATCGTA
AGTCCATATAAGCAGACGATCAGTCTAGTCTAACACCCGTCTTCACACCCAGGCCCATAATTCTTTGTGTTGTTT
AATTTTGCAAGAGATGACAGGCAGTTTTTCTAAGGACAAAACCTTAATGGTAATTGTCATAGAGTAATAAGGA
TCATTATGGTAGAAATCTTACTTACCTGTTGTGTCTTATATAGAACCCTCTTATACAAATGTCGTTTTAGATGGT
TGGCTGATCGTGCTGATTCTTTCTTTAGTACAGTACTTCTTTTACTTGGAATTGACTGACTGAATATGGAATTTT
CTTTACTATATAAAATCGAGGCTGCTAATTCGAGAATCTATATCCTAAATTTAGAATTCGATATTGTTGAATAAC
CCTTTGCTCTTTTGTACACAAGCTTCATTCGGGACATCATTGGCACGTTAAATTCATGTTGCTTCATTTCCTTGG
GAATTTTACTATACTTTTTGAGATTTTTTATTTGTAGTTGATTACGTTTCATCGTGTTGCAATGTGTTGGGTTGTC
CTATGCTGCCGAAACTTATAAGGGAAAAGATAGTTTTTGTTTCTAATCACGTTTACATTGTGAAACAGGAGGA
AGAAGCTGATGATGTGTACCATGACGAGCAATACAAAAGCAGAAGATGGTACTTTCCGCTCCACGACTTCAAT
CTTTCTGTGATTTGGTCACCTTTTCTCGCAAAATCAACTGTTTTTGAAGATGACAATGGAGCTTCAACGGACATT
ACTCAGCTTCATCTTGACAAACTGGATGATGTTTGGACTCAACAATTCGACAATTTTGATTATGTATTTGTAGCT
GGTGGCAAATGGTACCTGAAATCAACAATTTACCTTGAGAATGATACGATTGTTGGGTGTCACAACTGTCCTG
GTAAGAACATTACACAGGTGGGATTTGAATACGCCTATCGCAAAGCATTGAATACAACTTTTAAATTCATCATG
AACTCGAAGAACAAGGCGTATACATTTTTCAGAACCACTACCCCTGATCACTTTGAGAATGGTGAATGGAATA
GTGGAGGTTATTGTAACAGAACAGGACCTTTCAAAGAAGGTGAGATTGACATTGGTTATGTAGACGAGGTAA
TGCGCAAAGTTGAGCTGGAAGAATTCAAAAGAGCATCTGGATTGGGTTTGAAGGTGAAATTGTTTGACACAA
CCTTACTTTCACTGCTGAGACCAGACGGGCATCCCGGAGTCTACAGGCAATATCAGCCATTTGCTGTAGAAAA
TAAGAAGAAAAAAATTCAGAATGACTGTCTACATTGGTGTTTGCCTGGTCCAATAGACTCGTGGAACGATATA
Identification of promoter region and Promoter analysis
3’UTR
Sequence retrieval from browser
https://phytozome.jgi.doe.gov/pz/portal.html#!info?alias=Org_Csativus
Bioinformatic analysis
http://plantpan2.itps.ncku.edu.tw/
TSS
ATG
Identification of promoter region and Promoter analysis
3’UTR
Sequence retrieval from browser https://phytozome.jgi.doe.gov/pz/port
al.html#!info?alias=Org_Csativus
This mechanism of control was discovered first in Trp operon transcription in E. Coli.
The transcription of Trp operon is repressed by trp repressor when the conc. Of
Tryptophan is high in the media. But in that scenario still very low level of expression still
occurs , which is further controlled by the process attenuation.
The first 140nt of the trp operon does not encode proteins required for tryptophan
biosynthesis. But this 140nt region consist of short peptide “Leader sequence”
5’ 1 2 3 4 UUUU 3’
Transcription
start codon
• Region one of the leader sequence consist of two successive trp codons.
• Region 3 can base pair with either 2 or 4 .
• The ribosome follows closely behind the RNA polymerase. When the 5’ of the mRNA
emerges ribosome starts translating the mRNA.
Condition 1: When there is sufficient Tryptophan amino acids in the
media.
RNA
pol
UU 3’
1 2
4
3
tRNA trp
Condition 2: When there is insufficient Tryptophan amino
acids in the media.
3
RNA
pol 1
No phosphorylation
Gln
Gln of Aspertic acid in
Gln
regulatory domain
Ntr-c protein
Ntr-B protein
Sensory domain
Regulatory
D
domain
DNA binding
domain
His-kinase transmitter domain
Conformational change occurs
due to binding of Glutamine
(becomes inactive)
Can’t bind to the enhancer region of
the gln A gene
High metabolite
conc.
RNA RNA
pol. pol.
RNA No transcription
pol.
How Riboswitches regulates Transcription?
When metabolite and amino acid concentration Is lower
When the concentration of metabolite is lower it can’t bind to aptamers allowing the
transcription of the gene.
RNA RNA
pol. pol.
Helix- Discove Consist of Two helices connected by short They binds with DNA as dimer in the
turn- red in chain of amino acids. The Cterminal helix major groove.
Helix bacteria is known as recognition helix which binds
l TFs with major groove of DNA
protein
2
1
1
2 2
1
Homeo First Consist of three alpha helices which They bind as monomer. The
domain discover are packed tightly by hydrophobic Asparagine of helix 3 contacts with
proteins ed in interactions. The third helix is called adenine residue of DNA in major
drosoph
ila recognition helix which binds with groove.
major groove of DNA.
2
2 3 1
A sn
3
A sn
1
Types Discovery Structure Binding with DNA Structure
Zn
ATCAATG TATA
box
ATG
Promoter TSS
RNA
pol
mRNA
Types of DNA-protein interaction
DNA-protein
interaction
In vivo In vitro
1. Chromatin Immuno 1. Electrophoretic mobility shift
Assay (EMSA)
precipitation Assay (ChIP) 2. DNase I Foot printing assay
2. Trans activation assay
3. Yeast one hybrid assay
Basic Principle of Gel Mobility Shift Assay / EMSA
Incubation of labelled
Electrophoretic
probe with TF in room
analysis Autoradiography
temperature for 30 min
5’
TF 3’
3’ Shifted
5’
5’ 5’ 3’ band
Labelling 3’
TF
3’ 5’
3’
of Probe TF 3’ 5’
5’
5’ TF
3’ 3’
5’ 5’
3’ 5’ 3’ Free
3’ 5’ probe
TF
How to label the duplex DNA ? Why labelling is necessary?
TF
TTGACC TATA
box
ATG
Promoter TSS
5’ 20-30 bp 3’
3’ 5’
Design of DNA duplex
γ P32 ATP
5’ end Labeling
T4 PNK
5’ 3’
3’ 5’
TF
5’ 3’
3’ 5’ 5’ 3’
Hot probe 3’ 5’
Cold probe
Design of EMSA Experiment
+ + Hot probe
Protein purified
- +
Shifted band
DNA –protein comple
Autoradiography
Design of competition EMSA Experiment and super shift EMSA
It’s a competition between the hot and cold probe to bind to the protein.
- - - Cold probe
+ + + + + Hot probe
+ Protein purified
- + + +
+ Antibody
- - - -
1 2 3 4 5 6
1 2 3 4 5 6
DNA protein
complex
Free
probe
Autoradiography
Original Gel picture of EMSA and competition EMSA
+ Protein purified
- +
1 2 3
TF can’t bind in
the mutant site
Autoradiography
EMSA reveals the binding of OsDREB Tf in the promoter of rd29A (a dehydration
responsive gene)
Shifted band
Free probe
Chromatin immunoprecipitation
ChIP
(Formaldehyde)
input
ChIP PCR
sonicating
antibody
F R
input
F R
F R
F R
ChIP PCR
ChIP-on-chip / ChIP-chip
Agroinfiltration Assay
Among the 7 different cis-elements which one is most crucial for
transcriptional regulation?
TSS
ATG
Construction of 5’ deletion promoter fragments
TSS
5’ GUS
TSS
5’ GUS
TSS
5’ GUS
TSS
5’ GUS 1
TSS
5’ GUS 2
TSS
5’ GUS 3
Promoter
construct
Agrobacterium
1
3
2 1 3
agroinfiltration Histochemical gus assay
Site directed mutagenesis assist in identification of specific transcription
factors that are involved in transcriptional regulation
TSS
GUS Wt
TSS
GUS Mut 1
TSS
GUS
Mut 2
Wt
Mut1
Wt Mut 1
Mut2
Mut 2
Post transcriptional modifications
The schematic representation of mRNA processing and transport
Eukaryotes
Structure of prokaryotic and Eukaryotic mRNA
Initiation of RNA processing
Mechanism of 5’ CAPPING
After the mRNA transcript is about 25-30 nucleotides long a 7-methylguanosine is
added to the 5’ end. Protects the 5’ from enzymatic degradation in the nucleus and
assists in export to the cytosol.
5’ 3’ RNA
5’ ppp N p N RNA 3’
phosphatase
pi phosphatase
cleaves the end phosphate
from the nascent RNA
pp N p N RNA
Guanyl transferase adds the GMP
residue along with a phosphate in a
GTP Guanyl transferase
inverted orientation 5’ to 5’ instead of ppi
5’to 3’
Gppp N p N RNA
Methyl transferase adds
the methyl group to the 7th Methyl transferase
position of Guanine
residue m7Gppp N p N RNA
All the three enzymes bind to the RNA pol tail phosphorylated at the ser 5 residue. As
soon as the RNA emerges from the polymerase these three protein starts adding 5’cap.
Requirement of 5’ CAPPING
The presence of 5’ methyl cap helps the cell to distinguish mRNA from other type of
RNA molecules.
RNA pol I and III produces uncapped RNA molecules during transcription.
..
OH A
exon1 exon2
Eukaryotic introns are longer, hence the lone pair electrons can’t attack the intron-exon junction. So for a
successful nucleophilic attack two exons should come closer by folding of introns. The ribonucleoproteins helps in
folding phenomenon.
U1
..
OH A Spliceosome mediated splicing occurs in
exon1
U2
exon2 step1 higher Eukaryotes that involves U1,U2, U4,
U5, and U6 ribonucleo proteins.
U5 Branch point
U6
U4
U1 and U4 would be released
U4
U1 U5 will hold the exon1 and exon2 , now
U1
exon1 lone pair of electron can do nucleophilic
U4 exon1 attack to the intron-exon junction
step2 U5
..
U5 leading to cleavage of phosphodiester
bond
U6 exon2
A
U6 exon2 OH A
U2
U2
Nuclear exonucleases cut the lariet RNA from U1 and U4 would be released
both the 3’ and 5’ ends, thus recycling the ..
nucleotides.
OH exon1
exon1 exon2 U5
U6 exon2
The lone pair of electron of OH group of exon1 can do nucleophilic attack to the
A
U2
intron-exon junction of second exon leading to cleavage of phosphodiester bond
and ultimately leads to joining of exon1 and exon2 step3
Effect of Random mutation on splicing
Transcription termination signal
3’POLYADENYLATION
• Primary transcript is cleaved and a poly A tails is added to the free 3’ OH.
• Up to 250 A residues may be added.
• Carried out by poly(A) polymerase
• Protects the mRNA from degradation from the 5’ end.
5’ 3’ 5’ 3’
3’ 3’
3’ 5’ 3’ 5’
CPSF cstF
AAUAAA AAUAAA
Poly Adenylation 5’
Signal 5’
A Set of protein (CPSF)recognizes the poly adenylation signal
and forms the RNA cleavage complex With cstF and other
protein. This complex having endonuclease activity cleaves
the RNA at a certain point.
3’ AAAAAAAAA Poly A
PAB PAB poymerase
AAUAAA