Single Mammalian Cells Compensate for Differences in

Article
Single Mammalian Cells Compensate for Differences
in Cellular Volume and DNA Copy Number through
Independent Global Transcriptional Mechanisms
Graphical Abstract
Authors
Olivia Padovan-Merhar,
Gautham P. Nair, ..., Abhyudai Singh,
Arjun Raj
Correspondence
rajlaboratory@gmail.com
In Brief
Padovan-Merhar et al. combine singlemolecule transcript counting with
computational measurement of cellular
volume, showing that single cells
maintain transcript abundance despite
variability in cell size. Large cells have
greater transcriptional burst size,
whereas burst frequency halves upon
DNA replication.
Highlights
d
Transcription scales with cell volume to maintain transcript
concentration
d
Cell fusion shows that increasing cellular content can
increase transcription
d
Transcriptional burst size changes with cell volume and burst
frequency with cell cycle
d
The burst frequency mechanism allows for proper
transcription during early S phase
Padovan-Merhar et al., 2015, Molecular Cell 58, 1–14
April 16, 2015 ª2015 Elsevier Inc.
http://dx.doi.org/10.1016/j.molcel.2015.03.005
Accession Numbers
GSE66053
Please cite this article in press as: Padovan-Merhar et al., Single Mammalian Cells Compensate for Differences in Cellular Volume and DNA Copy
Number through Independent Global Transcriptional Mechanisms, Molecular Cell (2015), http://dx.doi.org/10.1016/j.molcel.2015.03.005
Molecular Cell
Article
Single Mammalian Cells Compensate for Differences
in Cellular Volume and DNA Copy Number through
Independent Global Transcriptional Mechanisms
Olivia Padovan-Merhar,1 Gautham P. Nair,2 Andrew G. Biaesch,2 Andreas Mayer,3 Steven Scarfone,1 Shawn W. Foley,4
Angela R. Wu,5 L. Stirling Churchman,3 Abhyudai Singh,6 and Arjun Raj2,*
1Department
of Physics and Astronomy, University of Pennsylvania, Philadelphia, PA 19104, USA
of Bioengineering, University of Pennsylvania, Philadelphia, PA 19104, USA
3Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
4Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
5Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
6Department of Electrical and Computer Engineering, University of Delaware, Newark, DE 19716, USA
*Correspondence: rajlaboratory@gmail.com
http://dx.doi.org/10.1016/j.molcel.2015.03.005
2Department
SUMMARY
Individual mammalian cells exhibit large variability in
cellular volume, even with the same absolute DNA
content, and so must compensate for differences
in DNA concentration in order to maintain constant
concentration of gene expression products. Using
single-molecule counting and computational image
analysis, we show that transcript abundance correlates with cellular volume at the single-cell level due
to increased global transcription in larger cells. Cell
fusion experiments establish that increased cellular
content itself can directly increase transcription.
Quantitative analysis shows that this mechanism
measures the ratio of cellular volume to DNA content,
most likely through sequestration of a transcriptional
factor to DNA. Analysis of transcriptional bursts reveals a separate mechanism for gene dosage compensation after DNA replication that enables proper
transcriptional output during early and late S phase.
Our results provide a framework for quantitatively
understanding the relationships among DNA content, cell size, and gene expression variability in single cells.
INTRODUCTION
Within a population, individual mammalian cells can vary greatly
in their volume, often independently of their position in the cell
cycle (Bryan et al., 2014; Crissman and Steinkamp, 1973; Tzur
et al., 2009). Biochemical reaction rates, however, depend on
the concentration of reactants and enzymes. Thus, to maintain
proper cellular function, most molecules must be present in the
same concentration despite these volume variations, meaning
that the absolute numbers of molecules would have to scale
roughly linearly with cellular volume (see Marguerat and Ba¨hler,
2012 for an excellent review).
One critical molecule whose concentration need not scale with
cellular volume, however, is DNA. Most mammalian cells have
two or four copies of the genome per cell, and even cells with
the same number of genomes can differ widely in size; thus,
DNA concentration can vary dramatically from cell to cell. This
poses a problem: if two otherwise identical cells with the same
DNA content had different volumes, then the larger cell must
somehow maintain a higher absolute number of biomolecules
despite them being expressed from the same amount of DNA.
Previous efforts to resolve this puzzle have largely focused on
analyzing bulk population measurements of size-altering mutants. A number of such studies have shown that the amount
of both RNA and protein generally scales with cellular volume
(Marguerat and Ba¨hler, 2012; Marguerat et al., 2012; Schmidt
and Schibler, 1995; Watanabe et al., 2007; Zhurinsky et al.,
2010) and ploidy (Wu et al., 2010), with some further finding
that transcription changes in mutants with larger or smaller cell
volumes (Fraser and Nurse, 1979; Schmidt and Schibler, 1995;
Zhurinsky et al., 2010). Most of these studies utilized yeast,
with a few notable exceptions (Miettinen et al., 2014; Schmidt
and Schibler, 1995; Watanabe et al., 2007).
These experiments do not, however, establish a causal relationship between cellular volume changes and transcript abundance. Causality could change the interpretation of gene expression measurements because, if cellular volume changes can in
and of themselves change global expression levels, observations of changes in global expression levels in response to
various perturbations may actually be the indirect consequence
of changes to cellular volume rather than resulting from direct
global transcriptional responses to the perturbations, per se.
Also unclear is how or even whether these mechanisms manifest in individual cells. Much recent evidence has shown that
individual transcripts levels can vary from cell to cell due to stochastic effects in gene expression (Raj and van Oudenaarden,
2008, 2009; Sanchez and Golding, 2013) such as transcriptional
bursts (Chubb et al., 2006; Golding et al., 2005; Raj et al., 2006;
Suter et al., 2011; Zenklusen et al., 2008). Yet it remains unclear
how differences in cellular volume affect the interpretation of
such measurements or whether transcriptional measurements
can reveal further characteristics of homeostatic mechanisms.
Molecular Cell 58, 1–14, April 16, 2015 ª2015 Elsevier Inc. 1
Please cite this article in press as: Padovan-Merhar et al., Single Mammalian Cells Compensate for Differences in Cellular Volume and DNA Copy
Number through Independent Global Transcriptional Mechanisms, Molecular Cell (2015), http://dx.doi.org/10.1016/j.molcel.2015.03.005
We used single-molecule RNA imaging and computational
image analysis to measure transcript abundance and cellular
volume simultaneously in individual human cells. Cell fusion
experiments showed that cellular size can directly and globally
affect gene expression by modulating transcription. Quantitative
analysis of these experiments revealed that the mechanism underlying this global regulation does not merely sense cellular volume but rather integrates both DNA content and cellular volume
to produce the appropriate amount of RNA for a cell of a given
size, consistent with a model in which a factor limiting for transcription is sequestered to the DNA, either through direct titration by DNA or restriction to the nuclear compartment. We also
provide a quantitative framework for interpreting gene expression variability in single cells and extended this framework
to genome-wide single-cell RNA-sequencing analysis, showing
that cell-type-specific genes are more variable than ubiquitously
expressed genes.
RESULTS
mRNA Counts Scale with Cellular Volume in Single
Mammalian Cells
We first looked at the number of mRNA molecules in individual
primary fibroblast cells (human primary foreskin fibroblasts,
CRL2097) within a population to see whether mRNA counts
scale with cellular volume at the single-cell level. We measured
both mRNA abundance and volume simultaneously using single-molecule multicolor mRNA fluorescence in situ hybridization
(RNA FISH; Femino et al., 1998; Raj et al., 2008), which allowed
us to detect the positions of individual mRNAs in three dimensions as fluorescent spots in the microscope (Figure 1A). We
measured the abundance of a particular mRNA (e.g., TBCB)
labeled with one color and then calculated the volume of the
cell using the 3D positions of mRNA from a ‘‘volume guide’’
gene labeled with another color to define the cellular boundary
(Figure 1B; Experimental Procedures). The cell volumes we
measured varied over a 6-fold range, agreeing with other estimates (Bryan et al., 2014; Tzur et al., 2009), and are robust to
choice of guide gene and the fixation procedure itself (Figure S1).
We ultimately measured the abundance of 25 to 30 different
mRNA species in both this primary fibroblast line and a lung cancer line (A549).
For most genes, mRNA counts and volumes in single cells exhibited a strongly positive, linear correlation (e.g., Figure 1C; see
Figure S2 for all genes examined). Because larger cells had proportionally more transcripts than did smaller cells, the mRNA
concentration remained relatively constant from cell to cell
despite considerable variation in absolute mRNA numbers. This
scaling property was not confined to high-abundance mRNAs
such as GAPDH and EEF2—genes expressing as few as 10 to
20 mRNA per cell such as ZNF444 and KDM5A scaled similarly,
as did rRNA (Figure S2). We also observed the same behavior
for short-lived mRNA such as UBC and IER2 mRNA, whose
half-lives are 2.9 and 2.2 hr, respectively (Tani et al., 2012).
We checked whether the scaling of mRNA count with volume
depended on cell-cycle progression or cell growth. We costained cells with cell-cycle markers (Eward et al., 2004; Levesque and Raj, 2013; Robertson et al., 2000; Whitfield et al.,
2 Molecular Cell 58, 1–14, April 16, 2015 ª2015 Elsevier Inc.
2002) to classify them as being in the G1, S, or G2 phases of
the cell cycle (Figure S3). Cell volume varied as much for cells
in individual phases of the cell cycle as the population overall,
with a shift in the distribution toward G2 cells being larger, and
the linear relationship between mRNA count and volume did
not depend on cell-cycle phase (Figure 1D), showing that
mRNA count did not depend on DNA content of the cell. We
also note that the primary fibroblast cells exhibit normal ploidy
(Levesque and Raj, 2013), so our results are not simply explained
by differences in ploidy. We also found that nuclear size
increased somewhat with cellular volume and that nuclear size
increased in later stages of the cell cycle (Figure S3). To check
whether progression through the cell cycle or cell growth is
responsible for maintaining scaling, we grew the primary fibroblasts for 7 days in medium lacking serum, making them quiescent. Despite growth and cell cycle arrest, we found that mRNA
count and volume still scaled strongly for all genes examined,
showing that neither progression through cell cycle nor continual
cell growth is required for mRNA count to scale with cellular volume (Figure 1F). Interestingly, we found that although the mean
count of mRNA decreased in quiescent cells as compared with
proliferating cells, the cells maintained a similar concentration
of GAPDH and other mRNA between the two conditions (Figures
1G–1H and S1).
We also checked whether we could observe similar behavior in
intact organisms. We looked at both RNA and DNA density in the
heads and gonads of adult nematodes, comparing measurements from both wild-type worms and worms with mutations
leading to decreased organismal size but with roughly the
same number of cells (Figures 2A–2B) (Watanabe et al., 2007).
We found that the RNA concentrations were roughly the same
between the two strains and that the number of RNA per molecule of DNA decreased by a factor similar to that of the volume
differences between the strains (Figures 2C–2D), verifying that
our observations can hold in vivo.
It is important to note that although the mRNA abundance is
strongly correlated with cellular volume, the y intercept (a) of a
line fit to the data (mRNA = a + bV) was nonzero, indicating
that mRNA count in individual cells has a volume-independent
component in addition to the volume-correlated component.
Quantifying the relative fractions of mRNA that are volume correlated versus volume independent in a cell of average volume
(Figure 1E) [i.e., a / (a + bVavg) versus bVavg / (a + bVavg)] revealed
a range of values for different genes (Figure S1), although the volume-dependent fraction was dominant for most genes examined. Thus, the mRNA concentration is actually somewhat
greater in smaller cells than in larger ones; for most genes, the
smallest cells have an mRNA concentration 1.2 to 3 times greater
than that of the largest cells (Figure S2). We later describe a
mathematical model providing a potential explanation for this
increased concentration based on nuclear volume measurements (see Supplemental Information).
Transcriptional Activity, Not mRNA Degradation, Scales
Globally with Cellular Volume
These data show that larger cells have a proportionally higher
number of mRNA than smaller cells, even if they have the
same absolute number of DNA molecules. To maintain this
Please cite this article in press as: Padovan-Merhar et al., Single Mammalian Cells Compensate for Differences in Cellular Volume and DNA Copy
Number through Independent Global Transcriptional Mechanisms, Molecular Cell (2015), http://dx.doi.org/10.1016/j.molcel.2015.03.005
A
C
B
D
E
F
G
H
Figure 1. mRNA from Many Genes Scales with Cellular Volume
(A) Single-molecule RNA FISH. The DAPI stain is in blue, and the TBCB mRNA FISH probe is in white.
(B) Representative outline of a primary fibroblast cell found using our volume calculation algorithm.
(C) mRNA versus volume for EEF2, LMNA, and TBCB. Each point represents one single-cell measurement. Each dataset is a combination of at least two biological
replicates, with at least 30 cells per replicate.
(D) GAPDH mRNA and volume in primary fibroblast cells. Marginal histograms show volume and mRNA distributions. Colors indicate cell-cycle stage determined
by Cyclin A2 (CCNA2) mRNA count. Dashed diagonal line is the best linear fit of RNA versus volume. Vavg indicates the average primary fibroblast volume. We
determined volume-independent and -dependent transcript levels using the linear fit and Vavg. Data are a 15% subset of 1,868 cells spanning >30 biological
replicates.
(E) Fraction of volume-independent and -dependent RNA expression from the linear fit of RNA versus volume for 21 genes in primary fibroblast cells (we omitted
highly variable genes whose volume-independent fractions were less than zero). Data for each gene are a combination of at least two biological replicates, with at
least 30 cells per replicate.
(F) GAPDH mRNA versus volume in cycling and quiescent primary fibroblast cells. Dashed lines are best fit line for GAPDH in cycling cells. Data are an 8% subset
of 1,868 cells spanning >30 biological replicates for cycling cells and 10% subset of 1,105 cells for quiescent. We only analyzed quiescent cells that had less than
20 CCNA2 mRNA.
(G and H) Mean GAPDH mRNA count (G) and concentration (H) in different growth conditions for data from (F).
All error bars represent SEM. See also Figures S1–S3.
Molecular Cell 58, 1–14, April 16, 2015 ª2015 Elsevier Inc. 3
Please cite this article in press as: Padovan-Merhar et al., Single Mammalian Cells Compensate for Differences in Cellular Volume and DNA Copy
Number through Independent Global Transcriptional Mechanisms, Molecular Cell (2015), http://dx.doi.org/10.1016/j.molcel.2015.03.005
A
C
B
D
Figure 2. mRNA Scales with Volume In Vivo
(A) Images of the two C. elegans strains.
(B) Quantification of the relative sizes of the two strains (n = 24 for N2, n = 20 for
CB502).
(C) Number of mRNA molecules per cell in the gonad region for each type of
worm for genes ama-1 and arf-3. We estimated the number of cells in each
segment by counting nuclei stained with DAPI. Each bar is a compilation of
three biological replicates, with >3 worms per replicate.
(D) Concentration of mRNA in the gonad region. All scale bars represent 10 mm.
All error bars represent SEM.
proportionality, larger cells must either transcribe more mRNA
from the same number of DNA molecules or degrade those
mRNA more slowly. To distinguish these possibilities, we determined the rate of mRNA degradation in cells of different sizes by
measuring volumes and mRNA counts for UBC and IER2 after a
period of 4 hr of transcriptional inhibition (transcriptional inhibition did not affect cell volume; Figure 3D). By measuring mRNA
counts before and after transcription inhibition (Figures 3A–3B,
inset; see Experimental Procedures for details), we calculated
the effective decay constant for each measured cell (Figures
3A–3B). We found that the degradation rate was the same in cells
of all volumes, showing that slower degradation is not responsible for the increased number of mRNA in larger cells.
We next checked whether larger cells transcribe more than
smaller cells (as observed in bulk populations; Fraser and Nurse,
1979; Schmidt and Schibler, 1995; Zhurinsky et al., 2010). We inferred global transcription rate by incorporating a labeled uridine
into all newly synthesized RNA produced during a 60-min time
window (Figure 3C), which we then rendered fluorescent via click
chemistry (Jao and Salic, 2008). The intensity of fluorescence is
equivalent to the global transcription rate. We found that transcription rate is linearly proportional to volume, thus showing
that individual cells vary in their overall transcription (das Neves
et al., 2010), and these variations correlate strongly with volume.
We conclude that larger cells maintain proportionally higher
levels of RNA by increased transcription rather than decreased
degradation as compared to smaller cells. Also, quantification
of fluorescence from probes targeting the internal transcribed
spacer of the rRNA (the ‘‘intronic’’ sequence of rRNA) showed
4 Molecular Cell 58, 1–14, April 16, 2015 ª2015 Elsevier Inc.
that transcription of rRNA also scales linearly with volume (Figure S2), indicating that RNA polymerase I transcription is also
volume dependent.
The scaling of transcription with cellular volume could be due to
global factors regulating transcription of all genes in a volumecorrelated manner, or it could be that gene regulatory networks
sense deviations in each particular gene’s protein concentration
and modulate transcription to restore concentration. In the latter
case, reducing protein concentration of any one gene would
result in increased transcription to compensate, whereas in the
global scenario, reducing the concentration of any one gene
would not appreciably affect the cellular volume, thus leaving
the gene’s transcription unchanged. We tested this by reducing
the level of lamin A/C mRNA and protein in the cell via small interfering RNA (siRNA) (Figures 3E and 3F); we chose lamin A/C
because its expression scales strongly with volume (Figure 1C)
and is thought to be tightly regulated (Swift et al., 2013). To measure the transcriptional response, we took advantage of the fact
that transcription occurs in bursts (Chubb et al., 2006; Dar et al.,
2012; Golding et al., 2005; Suter et al., 2011; Vargas et al., 2005;
Zenklusen et al., 2008), and genes that are actively undergoing a
transcriptional burst have bright accumulations of nascent RNA
at the site of transcription itself (Levesque and Raj, 2013; Levsky
et al., 2002; Raj et al., 2006; Zenklusen et al., 2008) (note that
siRNA does not affect nuclear RNA; Maamar et al., 2013). We
measured both the average number of active lamin A/C transcription sites per cell and the intensity of those transcription sites,
finding both metrics unchanged upon reduction of lamin A/C protein levels (Figure 3G). We conclude that increased mRNA counts
in larger cells result from a global difference in transcription rather
than the activity of a particular gene network that regulates the
concentration of lamin A/C. There may be other situations in
which mRNA levels are regulated by specific networks.
A Diffusible trans Factor Sensing DNA Content and
Volume Links Cellular Volume and Transcription
What links volume and transcription? One possibility is that the
total cellular content itself exerts a global influence on transcription, thus making transcription scale with cellular volume; alternatively, transcription may affect cellular volume. To distinguish
these possibilities, we fused a small human melanoma cell that
expressed GFP mRNA (WM983b-GFP-NLS) at constant density
to the larger fibroblasts that do not express GFP (Figure 4A) to
form heterokaryons (Pomerantz et al., 2009). We found that absolute GFP mRNA counts increased in fused cells as compared
to the original small cells (Figure 4B), showing that increasing total cellular content is by itself sufficient to increase absolute
mRNA abundance. Moreover, the GFP mRNA counts scaled
with heterokaryon volume (Figure 4C), suggesting that the rate
of GFP transcription scaled with the ultimate volume of the fused
cell. The fact that the nucleus from the WM983b-GFP-NLS cell
could change its overall transcriptional activity showed that the
modulation occurs via the activity of a diffusible trans factor.
How might such a factor transmit volume information to the
GFP gene in order to increase its transcription concordantly
with the increase in cellular volume? There are two broad categories of mechanism: (1) the factor acts as a ‘‘volume sensor’’
and does not know about the amount of DNA in the cell. An
Please cite this article in press as: Padovan-Merhar et al., Single Mammalian Cells Compensate for Differences in Cellular Volume and DNA Copy
Number through Independent Global Transcriptional Mechanisms, Molecular Cell (2015), http://dx.doi.org/10.1016/j.molcel.2015.03.005
A
C
B
D
E
F
G
Figure 3. Cells Exhibit Global Volume-Dependent Transcriptional Control over mRNA Abundance
(A and B) We inhibited transcription in primary fibroblast cells using actinomycin D for 4 hr and allowed UBC (A) and IER2 (B) mRNA to degrade. Inset shows mRNA
before and after inhibition. Each point represents a single-cell measurement. We calculated the decay constant for each cell using the best-fit line before inhibition
(see Experimental Procedures). Blue line shows fit if degradation were volume-dependent; red line shows fit if transcription were volume-dependent. Data
represent one of two biological replicates.
(C) We fluorescently labeled nascent RNA produced in 1 hr using the Click-iT eU assay in primary fibroblast cells and quantified the total fluorescence intensity by
imaging the nuclei of single cells. Inset shows raw micrograph data. Blue line shows fit for volume-dependent degradation; red line shows fit for volumedependent transcription. Data shown are from quiescent cells and are one of three biological replicates.
(D) Distribution of cell volumes before and after transcription inhibition.
(E) We performed siRNA treatment for 72 hr in primary fibroblast cells using either a control siRNA (left) or an siRNA targeting LMNA mRNA (right). DAPI stain is
shown in purple, and LMNA mRNA FISH probe is shown in white. White arrows indicate active transcription sites.
(F) Quantification of cytoplasmic LMNA mRNA knockdown by RNA FISH. Inset shows protein knockdown.
(G) Comparison of the number of LMNA transcription sites and transcription site intensity in siRNA control and LMNA knockdown conditions. We detected
transcription sites through intron/exon colocalization using RNA FISH. All error bars represent SEM. Data in (D) and (E) are a combination of two biological
replicates, n = 323 cells for control siRNA and 284 cells for LMNA siRNA.
example could be a modifiable global transcription factor protein
whose degree of modification/activity is proportional to cellular
size (Figure 4D, left). Or (2) the factor acts as a ‘‘volume/DNA
sensor’’ whose activity depends on both cellular volume and
DNA content. One such mechanism is the existence of a general
transcription factor of limiting abundance relative to the number
of binding sites in the DNA (limiting factor). Here, the DNA
‘‘counts’’ how big the cell is by binding all available factor molecules (Figure 4D, right), thus increasing transcription in bigger
cells as more factor binds to DNA. Another possibility is
Molecular Cell 58, 1–14, April 16, 2015 ª2015 Elsevier Inc. 5
Please cite this article in press as: Padovan-Merhar et al., Single Mammalian Cells Compensate for Differences in Cellular Volume and DNA Copy
Number through Independent Global Transcriptional Mechanisms, Molecular Cell (2015), http://dx.doi.org/10.1016/j.molcel.2015.03.005
A
B
C
Prediction for volume
sensor hypothesis
D
Figure 4. A trans-Acting Limiting Factor Links Gene Expression to Volume
(A) Representative image of fused cells (heterokaryon, left) and unfused cells (WM983b, primary fibroblast, right). DAPI stain is in orange, GFP mRNA is in green,
and GAS6 mRNA is in white. White arrows indicate transcription sites.
(B) Quantification of GFP mRNA in unfused and fused cells. Box extends to first and third quartiles, and whiskers extend to the maximum-distance points within
1.5 interquartile ranges of the box. Data are a combination of two biological replicates.
(C) GFP versus volume for fused and unfused cells. Upper dashed line represents fit for unfused cells. Lower dashed line has a slope that is half of the upper fit line.
(D) Schematic of transcriptional output of fused cells if the scaling of expression with volume were mediated by a volume sensor or a volume/DNA sensor. All error
bars represent SEM.
sequestration of the factor to the nucleus, which can also
achieve the same volume/DNA-sensing behavior if nuclear volume is only weakly dependent on cellular volume (see Supplemental Information).
We distinguished these two alternatives by comparing the
concentration of GFP mRNA in the fused and unfused cells (Figure 4C). In the volume sensor scenario, the fusion cell will have
the same concentration of GFP mRNA as the original small cells
because the factor transmits the volume information to the GFP
gene independent of the number of nuclei in the cell. In the volume/DNA sensor scenario, the fusion cell will have half the concentration of GFP mRNA because the factor senses both the
increased volume and the two nuclei (for example, a limiting
factor would be diluted between the two nuclei in the fused
cell). We found that the concentration of GFP mRNA in the fused
cells is strictly less than and very close to half the concentration
in unfused cells. We conclude that the factor responsible for
increased transcription in smaller cells is not a volume sensor
but responds to both the size and DNA content. These results
also suggest that perturbations that change cell size will indirectly change global transcript counts per cell through this
generic mechanism.
6 Molecular Cell 58, 1–14, April 16, 2015 ª2015 Elsevier Inc.
Transcriptional Burst Size Increases in Larger Cells
We next sought to characterize this mechanism further by
examining the relationship between volume and transcription
of individual genes. mRNA is produced in bursts, marked by
bright accumulations of nascent mRNA at the site of transcription. We characterize this bursting behavior through burst size
(how much RNA is produced during a single burst) and burst
fraction (how often a gene is actively transcribing, which is
related to burst frequency; Levesque and Raj, 2013). To quantify
burst size, we measured the intensity of transcription sites for
four genes in our fibroblasts and found higher intensity transcription sites in larger cells (Figure 5A). We verified that the intensity of transcription sites reflected the degree of transcriptional activity by treating primary fibroblast cells with 100 nM
triptolide, which targets RNA polymerase II for degradation
(Bensaude, 2011), reducing its levels (Figure 5D). After 1 hr,
we saw a reduction of bright transcription sites for two different
genes, showing that transcription site intensity depends directly
on the amount of transcriptional machinery available. This intensity is often proportional to burst size (Levesque and Raj, 2013;
Senecal et al., 2014). Interestingly, transcription site intensity
did not depend on cell-cycle stage (Figure 5B). We concluded
Please cite this article in press as: Padovan-Merhar et al., Single Mammalian Cells Compensate for Differences in Cellular Volume and DNA Copy
Number through Independent Global Transcriptional Mechanisms, Molecular Cell (2015), http://dx.doi.org/10.1016/j.molcel.2015.03.005
A
B
C
Figure 5. Transcriptional Burst Size Increases in Larger Cells
(A) Transcription site intensity and volume in primary fibroblast cells for genes UBC, MYC, EEF2,
and TUSC3. Each data point represents the mean
transcription site intensity per cell for a quartile of
cells classified by volume or GAPDH. We detected
transcription sites through intron/exon colocalization using RNA FISH. We calculated volume for
EEF2 data using EEF2 as a guide and volume for
MYC data using GAPDH. We use GAPDH as a
proxy for volume for UBC and TUSC3.
(B) Transcription site intensity and cell-cycle stage
in primary fibroblast cells. We determined cell-cycle
stage by Cyclin A2 and the histone 1H4E mRNA
counts (see Experimental Procedures and Figure S3). For intensity measurements, data for UBC,
MYC, and EEF2 are from one of two biological
replicates (EEF2: n = 190, UBC: n = 202, and MYC:
n = 103 transcription sites). Data for TUSC3 are
combined from two biological replicates (n = 255
transcription sites).
(C) Western blot analysis reveals that >99% of the
C-terminal domain hyperphosphorylated form of
RNA polymerase II (IIO) is present in the chromatin
fraction. The hypophosphorylated form of Pol II (IIA)
is captured in all cellular fractions. We generated
subcellular lysates from the same batch of primary
fibroblast cells and probed with the F-12 antibody
(Santa Cruz Biotechnology) that is directed against
the N-terminal region of RPB1, the largest subunit
of RNA polymerase II. We adjusted sample volumes
so that western blot signals of the subcellular
fractions are comparable.
(D) Quantification of transcription site intensity
before and after treatment with 100 nM triptolide for
1 hr. p value represents the probability of randomly
finding the distributions of bright transcription
sites (values to the right of the black line) in each
condition.
All error bars represent SEM. See also Figures S3
and S4.
D
that the factor connecting gene expression and cellular volume
affected mRNA abundance through modulation of transcriptional burst size.
What might this factor be? In order to
formalize the possibilities, we developed
a mathematical model for how this factor
may act (see Supplemental Information),
assuming that the factor is almost purely
nuclear and that the total amount of factor
in the cell is proportional to cellular volume.
Our model of perfect linear scaling of transcription with volume requires that either
(1) the volume of the nucleus remains fixed
between cells irrespective of cytoplasmic
volume or (2) the factor has such a high affinity for DNA that it is essentially titrated by
DNA (or both). In both scenarios, either the
nucleus or the DNA ‘‘counts’’ the amount
of factor in the cell to modify transcription,
thus making transcription proportional to the total amount, rather
than concentration, of the factor. Further, in scenario 1, it predicts
that if nuclear volume changes with cell size, it will manifest as a
Molecular Cell 58, 1–14, April 16, 2015 ª2015 Elsevier Inc. 7
Please cite this article in press as: Padovan-Merhar et al., Single Mammalian Cells Compensate for Differences in Cellular Volume and DNA Copy
Number through Independent Global Transcriptional Mechanisms, Molecular Cell (2015), http://dx.doi.org/10.1016/j.molcel.2015.03.005
deviation from pure linear scaling of transcription with cellular volume. Our data cannot distinguish these two possibilities, but our
analysis did predict that in scenario 1, the slight increase in nuclear volume we saw in larger cells (Figure S3) could lead to a
slight decrease in mRNA concentration, manifesting as a nonzero
intercept when fitting RNA abundance to volume, as observed in
Figure 1D. Indeed, the quantitative magnitude of the increased
concentration at low cell volumes matches what our model
predicts based on our measured relationship between nuclear
volume and cellular volume (see Supplemental Information),
perhaps favoring the model in which transcriptional scaling
results from sequestration of this factor to the nucleus over a
pure titration mechanism. It is possible that the factor is some
component of the general transcriptional machinery, which is
94% nuclear based on fractionation (Figure 5C) (A.M. and
L.S.C., unpublished data).
A DNA-Linked cis-Acting Factor Reduces Transcription
Frequency, Not Burst Size, Immediately after DNA
Replication
We have shown that cells in the G1 and G2 phases of the cell cycle can have the same volume and same mRNA density, although
cells in G2 have twice the DNA content as those in G1. How, then,
do two cells of the same size produce the same amount of mRNA,
despite having different amounts of DNA?
We already found that transcriptional burst size does not
change dramatically between phases of the cell cycle (Figure 5B),
so we now measured burst fraction. To measure transcriptional
burst fraction, we counted active transcription sites in each cell
and divided by the total number of gene copies for each stage
of the cell cycle (two copies in G1, four copies in G2). For all of
the genes we measured, the number of active sites per gene
copy in G2 was approximately half of that in G1 (Figure 6A).
This is not due to repression of replicated copies of DNA, as
we observed G2 cells with four transcription sites (Figure S4).
This showed that the cell has a mechanism to precisely reduce
transcription fraction in G2 to keep overall transcription constant
across the G1 and G2 phases of the cell cycle. Burst fraction did
not depend on cell volume (Figure 6B), showing that the mechanism is distinct from the volume-compensating mechanism
described above.
We were surprised that the mechanism for cell cycle was
different from the one compensating for volume. In principle,
limiting factor models that may be responsible for volume conservation would also compensate for increased gene copy number due to DNA replication. However, these models predict an
inappropriate boost in transcription for genes that replicate early
in S phase because a limiting factor would distribute itself over all
the DNA, essentially ‘‘double counting’’ the small percentage of
genes that replicate considerably earlier than the majority of DNA
(Figure 6C). To see whether this was the case, we measured
transcriptional burst fraction for early-replicating genes in S
phase (EEF2, MYC, UBC; see Figure S4 for replication timing).
These genes all showed the same number of active transcription
sites per gene copy in S and G2, ruling out the possibility that a
factor simply gets diluted between copies of replicated DNA.
This leaves two alternatives for burst fraction reduction
between G1 and G2. Transcription fraction could universally
8 Molecular Cell 58, 1–14, April 16, 2015 ª2015 Elsevier Inc.
decrease by a factor of two upon the initiation of S phase,
but this would potentially cause the opposite problem—genes
that replicate late in S phase would be under-transcribed for
the majority of S phase. Alternatively, transcription fraction could
decrease on a gene-by-gene basis (i.e., in cis) immediately upon
DNA replication. To test between these alternatives, we imaged
transcription of a gene that replicates very late in S phase
(TUSC3; Figure S4). If transcription fraction was universally
reduced by a factor of two at the beginning of S phase, this
gene would have the same transcription fraction in S and G2.
However, we found that this late-replicating gene maintains G1
levels of transcription through S phase and does not reduce transcription fraction until G2. Therefore, we conclude that there exists a mechanism whereby transcription fraction is reduced by a
factor of two immediately upon replication of that gene, with
different timing for different genes. Candidates for a mechanism
include the partitioning or modification of DNA-linked factors,
such as histones with particular modifications, upon DNA replication, resulting in half the transcriptional burst fraction as before
replication.
Together, our data demonstrate the existence of two separate
transcriptional mechanisms that allow cells to maintain RNA
concentration homeostasis despite changes in DNA content
and cellular volume. Cells modulate transcriptional burst size
through a trans mechanism to allow larger cells to produce
more mRNA from the same amount of DNA and modulate burst
frequency over the cell cycle in cis to maintain RNA concentration despite changes in DNA content.
Single-Cell RNA Sequencing Reveals that Cell-TypeSpecific Genes Are ‘‘Noisier’’ Than Ubiquitously
Expressed Genes
Our RNA FISH data revealed that, although the expression of
most genes was consistent with a volume-dependent transcription rate, many of the genes also showed strong variability in
transcript concentration from cell to cell (Li and Xie, 2011; Raj
and van Oudenaarden, 2009; Raj et al., 2008; Sanchez and Golding, 2013).
To accurately quantify gene expression noise while accounting for cell volume, we developed a metric we call volume-corrected noise measure (Nm; Figure 7A), defined as (Bowsher
and Swain, 2012; Johnston et al., 2012; Swain et al., 2002)
follows:
bhVi
Covðm; VÞ
;
Nm = CVm2 a + bhVi
hmihVi
where
CVm =
sm
;
hmi
and a and b indicate the intercept and slope of a best-fit line for
mRNA and volume,
hmi = a + bhVi:
Volume-corrected noise measure is in principle similar to the
squared coefficient of variation of mRNA concentration but accounts for volume-independent transcription (Figure S5).
Please cite this article in press as: Padovan-Merhar et al., Single Mammalian Cells Compensate for Differences in Cellular Volume and DNA Copy
Number through Independent Global Transcriptional Mechanisms, Molecular Cell (2015), http://dx.doi.org/10.1016/j.molcel.2015.03.005
A
B
Figure 6. A cis-Acting Factor Decreases
Transcription Frequency Immediately upon
DNA Replication
(A) Number of transcription sites by cell-cycle stage
in primary fibroblast cells. We determined cell-cycle
stage by Cyclin A2 and histone 1H4E mRNA counts.
Dashed lines represent half the number of transcription sites in G1. We normalize all G1 data to two
gene copies and all G2 data to four gene copies. For
EEF2, MYC, and UBC (early replicators), we
normalize S-phase data to four gene copies. For
TUSC3 (late replicator), we normalize S-phase data
to two gene copies.
(B) Number of transcription sites per gene copy
classified by volume in primary fibroblast cells. Each
data point represents the mean number of transcription sites for a quartile of cells classified by
volume. We calculated volume for EEF2 data using
EEF2 as a guide and volume for MYC data using
GAPDH. We use GAPDH as a proxy for volume for
UBC and TUSC3. For frequency measurements,
data for EEF2, UBC, and TUSC3 are a combination
of two biological replicates (EEF2: n = 516, UBC: n =
332, and TUSC3: n = 255 transcription sites). Data
for MYC is from one of two biological replicates
(MYC: n = 103 transcription sites).
(C) Schematic depicting different potential mechanisms for changing gene expression with cell cycle.
All error bars represent SEM. See also Figures S3
and S4.
C
We evaluated Nm for all of the genes we measured by RNA
FISH. A standard measure of gene noise is the coefficient of variation (CV; standard deviation divided by mean), which takes into
account only the spread and the mean of the expression data.
With such a measurement, most of the genes we measured by
RNA FISH would be deemed ‘‘noisy,’’ or far from Poisson noise
levels (Figure S5). However, when we instead measured Nm
for each of our RNA FISH genes, we saw that many of them
displayed levels of variability near to or
indistinguishable from Poisson (Figure 7B)
once we accounted for measurement error
(upper bound of approximately 15%) (Raj
et al., 2006).
To quantify this variability genome-wide,
we performed single-cell RNA sequencing
(Brennecke et al., 2013; Gru¨n et al., 2014;
Shalek et al., 2013) on human foreskin
fibroblast cells, calibrating the sequencing
data using our RNA FISH results (Figures
7C and S6) and adding synthetic RNA at
known concentrations to estimate cell
volume. Briefly, we used the ratio of
RNA-sequencing reads mapping to the
transcriptome to the reads mapping to
spiked-in RNA from the External RNA
Controls Consortium (ERCCs) (Devonshire
et al., 2010) to estimate the total relative
amount of RNA in the cell (Marinov et al.,
2014; Wu et al., 2014), dropping a small
number of cells that were qualitatively inconsistent with the
rest (Figure S7). We used the measured relationship between
GAPDH mRNA count and cellular volume from RNA FISH to
convert total RNA to relative cellular volume, matching this to
our measured volume distribution to obtain cellular volumes.
We then used the correlation between FPKM (fragments per kilobase per million fragments mapped) and RNA FISH counts for
our gene panel to provide estimates of absolute RNA counts
Molecular Cell 58, 1–14, April 16, 2015 ª2015 Elsevier Inc. 9
Please cite this article in press as: Padovan-Merhar et al., Single Mammalian Cells Compensate for Differences in Cellular Volume and DNA Copy
Number through Independent Global Transcriptional Mechanisms, Molecular Cell (2015), http://dx.doi.org/10.1016/j.molcel.2015.03.005
A
B
C
D
F
E
Figure 7. Connection between Single-Cell RNA Sequencing and RNA FISH Reveals that Cell-Type-Specific Genes Exhibit Higher Noise
Levels
(A) RNA FISH data. TBCB mRNA abundance and volume in primary fibroblast cells. Each point represents a single-cell measurement. Histogram indicates mRNA
distribution. Arrow indicates volume-corrected noise measure. Gray line is best linear fit.
(B) Volume-corrected noise measure values for different genes in primary fibroblast cells. Each data point represents a collection of single-cell measurements for
one gene. The straight gray line represents the Poisson limit. The curved gray line is the Poisson limit plus our experimental noise limit, a combination of the
Poisson limit and a 15% measurement error. Data for each gene is a combination of at least two biological replicates, with at least 30 cells per replicate.
(C) Pipeline for converting FPKM from single-cell sequencing to RNA-FISH-equivalent counts and cellular volume in picoliters.
(D) Qualitative comparison of count versus volume from RNA FISH and single-cell RNA sequencing. Example low-Nm (GAPDH) and high-Nm genes (MYC).
(E) Comparison between Nm calculated from RNA FISH data and single-cell RNA-seq data. Each point represents a single gene. Nm is calculated by bootstrapping.
(F) Breakdown of high- and low-noise genes into ubiquitously expressing genes and genes that express in a cell-type-dependent manner.
All error bars represent 95% confidence intervals as calculated by bootstrapping. See also Figures S5–S7.
for all genes in individual cells. (It is important to note, however,
that there is substantial variance in this relationship and that the
relationship is nonlinear.) We found that both RNA FISH and single-cell mRNA sequencing yielded similar results for noisy and
non-noisy genes (Figures 7D and 7E).
Upon quantifying Nm for the genes in our RNA FISH experiments in both human foreskin fibroblast and lung cancer cells,
we noticed that three of the four genes (ICAM1, LUM, and
ACTA2) with the strongest degree of cell-type specificity were
also the three genes with the highest noise measure of all the
genes in our study (Figure S7). To see whether this trend held
more generally, we used our single-cell RNA-sequencing data
to explore noise measure across all genes. We selected genes
with high or low noise measures and looked for enrichment in
genes exhibiting cell-type-specific expression between human
lung cancer cell and foreskin fibroblast cell data. We found that
10 Molecular Cell 58, 1–14, April 16, 2015 ª2015 Elsevier Inc.
the set of high noise measure genes contained a significantly
higher proportion of cell-type-specific genes than did the set of
low noise genes (Figure 7F; see Figure S7 for classification details). Such findings mirror those showing that more ubiquitously
expressed ‘‘housekeeping’’ genes typically exhibit lower levels
of noise than other types of genes, although the notion of celltype specificity is more difficult to relate to studies performed
in single-celled organisms (Bar-Even et al., 2006; Newman
et al., 2006; Taniguchi et al., 2010).
DISCUSSION
We have shown that individual cells globally control transcription
to compensate for variability in the ratio of DNA to cellular
content. Our results point to two independent mechanisms:
one that compensates for cell size fluctuations and one that
Please cite this article in press as: Padovan-Merhar et al., Single Mammalian Cells Compensate for Differences in Cellular Volume and DNA Copy
Number through Independent Global Transcriptional Mechanisms, Molecular Cell (2015), http://dx.doi.org/10.1016/j.molcel.2015.03.005
compensates for DNA content changes during the cell cycle.
These compensatory mechanisms help to maintain the concentration of mRNA in the cell, which is presumably useful from the
perspective of the cell because the rates of most chemical reactions in the cell depend on concentration rather than absolute
number. Importantly, the fact that these mechanisms seem to
be global in nature, and not gene-specific, means that other
specific forms of transcriptional regulation, for instance, during
development or in response to particular cues, can still function
properly in both large and small cells without the need for a
complex interplay between the specific regulation and the
mechanism governing concentrational homeostasis; indeed,
the expression of most genes is likely not specifically compensated to maintain concentration (Springer et al., 2010). This is
not to say that the concentration of most gene expression products is arbitrary and unregulated. Rather, these global mechanisms provide a means by which any such regulation may operate
to achieve said concentration without having to take into account
differences in DNA content due to cellular volume or DNA content
differences. This is important in a number of biological contexts
such as development and embryogenesis, in which rapid cell divisions lead to an exponential decrease in individual cell volume,
but the organism must maintain the concentration of most proteins while still enabling dynamic transcriptional programs to
occur (Nair et al., 2013).
Our work also highlights the importance of taking cellular volume into account when interpreting gene expression data and
points to the significance of global factors in studying single
cell expression in general (Elowitz et al., 2002; das Neves
et al., 2010; Volfson et al., 2006). In particular, our cell fusion experiments show that changing the amount of cellular content in
and of itself can lead to changes in total RNA abundance,
whereas previous experiments largely relied on cell-size mutants
that make inferences of cause and effect more difficult (Fraser
and Nurse, 1979; Miettinen et al., 2014; Schmidt and Schibler,
1995; Zhurinsky et al., 2010). These cell fusion experiments
directly establish that any perturbation that changes overall
cellular volume may result in global changes in overall transcript
abundance as a secondary rather than primary effect per the
generic mechanism of a diffusible trans factor that senses an
increased ratio of volume to DNA. Thus, we believe one must
take care in interpreting experiments showing global changes
in transcript abundance (Lin et al., 2012; Nie et al., 2012), both
from the perspective of establishing causal relationships, given
that cellular volume/content can by itself change transcription
rates, and in the interpretation of the functional significance,
given that the concentration of many transcripts will remain
roughly the same despite these overall changes. Our study
does not, however, address the question of why cells have
different volumes and how expression plays a role in that heterogeneity—such questions necessarily involve the examination of
mechanisms regulating cell growth and proliferation. Rather,
our results show how cells may globally cope with such changes
to maintain biomolecule concentration.
We do not yet know the factor (or factors) linking the ratio of
cellular volume to DNA content to the amount of transcription.
One potential candidate is some element of the general transcription machinery, such as the RNA polymerase II holoen-
zyme. We have shown that RNA polymerase II is almost completely nuclear, and it is required for transcription; indeed, its
reduction changes burst size, much as reduced volume does.
Its transcription also scales with volume (Figure S4). Other
studies (Marguerat and Ba¨hler, 2012; Zhurinsky et al., 2010)
have speculated that RNA polymerase II holoenzyme may act
as a limiting factor titrated by DNA, but various studies (Kimura
et al., 1999, 2002) show that RNA polymerase II is directly associated with DNA only for short periods of time. Thus, we believe
that the scenario in which the factor remains in the nucleus and
nuclear volume scales only weakly with cell size is the most plausible given our current evidence (see Supplemental Information).
Indeed, our model shows that the weak scaling of nuclear volume with cell size we observe can also explain why we observed
higher mRNA concentrations in smaller cells, sometimes by a
factor of two or more, further supporting this view, although
more experiments will be required to establish this model.
Regardless of the origin of the effect, it is clear that mRNA concentration is typically higher in smaller cells. We do not know the
consequences of this effect, in particular on cell growth, or how it
may vary in different cell types and contexts. One practical
consequence of this finding is that the time-honored practice
of normalizing transcript levels to GAPDH mRNA abundance,
although largely sound, does not fully account for differences
in mRNA concentration between small and large cells. This
suggests that new strategies may be required for measuring
cellular volume when interpreting single-cell RNA-sequencing
experiments.
We found it striking that the volume compensation mechanism
is distinct from the one that compensates for changes in DNA
content as the cell cycle progresses. We found that the burst frequency appears to decrease upon DNA replication for each gene
rather than at a particular time in the cell cycle for all genes. One
plausible explanation for this feature is to ensure proper transcriptional output regardless of whether a gene is replicated
early or late in S phase, which can proceed for many hours.
The molecular underpinnings of this mechanism remain unclear,
although our results demonstrate that it must be a factor that remains bound to DNA and changes in character during DNA replication. A likely candidate may be a DNA or histone modification
that completely coats the DNA during G1 but that is diluted by a
factor of two upon DNA replication in S phase.
Together, these findings provide a deeper quantitative understanding of single-cell gene expression and its role in maintaining cellular homeostasis. Further work may elucidate how
these homeostatic mechanisms for maintaining biomolecule
concentration manifest themselves in biological contexts and
whether they are an important point of dysregulation in disease
processes.
EXPERIMENTAL PROCEDURES
Cell Culture
We grew primary human foreskin fibroblast cells (CCD-1079Sk, ATCC CRL2097) and A549 cells (human lung carcinoma, A549, ATCC CCL-185)
in Dulbecco’s Modified Eagle Medium supplemented with 10% FBS and
50 U/mL penicillin and streptomycin (Pen/Strep). To create quiescent cells,
we grew primary fibroblast cells in DMEM with Pen/Strep, without FBS for
7 days. We cultured WM983b-GFP-NLS cells (melanoma cell line from the
Molecular Cell 58, 1–14, April 16, 2015 ª2015 Elsevier Inc. 11
Please cite this article in press as: Padovan-Merhar et al., Single Mammalian Cells Compensate for Differences in Cellular Volume and DNA Copy
Number through Independent Global Transcriptional Mechanisms, Molecular Cell (2015), http://dx.doi.org/10.1016/j.molcel.2015.03.005
lab of Meenhard Herlyn) in Tu 2% media. The WM983b-GFP-NLS contains
EGFP fused to a nuclear localization signal driven by a cytomegalovirus promoter that we stably transfected into the parental cell line.
RNA Fluorescence In Situ Hybridization and Imaging
We performed single-molecule RNA FISH on the samples as described previously (Femino et al., 1998; Raj and Tyagi, 2010; Raj et al., 2008). We co-stained
the actin cytoskeleton with Phalloidin-Alexa 488 (Life Technologies) to detect
cell boundaries.
We used Cyclin A2 mRNA to specifically label cells in S, G2, and M phases
(Eward et al., 2004). To distinguish cells in S phase from those in G2 (Robertson
et al., 2000; Whitfield et al., 2002), we labeled histone H4 mRNA (see Figure S3).
Table S1 lists all sequences of oligonucleotide probes.
We imaged the cells with a Nikon Ti-E equipped with appropriate filter sets.
We took a series of optical z sections, each 0.2–0.35 microns high, that
spanned the vertical extent of the cell.
Image Analysis and Quantification
We manually identified cell boundaries and counted and localized RNA spots
using custom software written in MATLAB (Raj and Tyagi, 2010; Raj et al.,
2008). We estimate the technical error in our RNA count determination to be
at most 15%.
To compute the volume of a cell, we used the 3D positions of a highly abundant mRNA found by RNA FISH to define the outline of the cell, applying corrections for bias in volume estimation. The volume computation did not
depend on the number of spots identified nor on the choice of mRNA (Figure S1). We limited ourselves to the cytoplasmic volume by removing a vertical
cylinder corresponding to the nuclear outline.
We identified transcription sites through intron/exon probe colocalization.
We manually annotated transcription sites by visually inspecting images of
intron and exon probes to determine instances of colocalized signal and
computationally determined their intensity.
RNA Degradation
We measured UBC and IER2 mRNA degradation by inhibiting transcription for
4 hr by applying actinomycin D at 1 ug/ml. We interpreted the data using
models of volume-dependent or -independent degradation.
LMNA siRNA Knockdown
We incubated primary fibroblast cells with an siRNA targeting LMNA for 72 hr,
verifying protein knockdown via western blot.
Heterokaryon Formation
We created heterokaryons by pelleting cultures of primary fibroblast cells
and WM983b-GFP-NLS cells and resuspending in PEG for 2 min. We
added media over the course of 5 min to allow cells to fuse, and then we
plated the cells onto two-well chambered coverglasses (Lab-Tek) and fixed
the cells after 12 hr.
Fractionation and RNA Polymerase II Western Blot
We performed cell fractionation and blotting as described in (Bhatt et al., 2012)
and based on (Wuarin and Schibler, 1994) with modifications.
Triptolide
We degraded RNA polymerase II in primary fibroblast cells by incubating cells
in 100 nM triptolide for 1 hr and then fixed cells in methanol.
Repli-seq Analysis
We accessed Repli-seq data from Hansen et al. (2010) using the UW Repli-seq
track on the UCSC Genome Browser.
Bulk RNA Sequencing
We sequenced total RNA from primary fibroblast cells. We used the NEB Next
Ultimate Library Preparation Kit for Illumina and the Ribo-Zero Magnetic Gold
Kit. We used 50 b single-end reads and sequenced each of two replicates at a
depth of 10–15 M reads. We aligned reads to hg19 using STAR’s included
annotation. We quantified reads per gene using HTSeq and a RefSeq hg19
12 Molecular Cell 58, 1–14, April 16, 2015 ª2015 Elsevier Inc.
annotation. We calculated FPKM for each gene using R. All sequencing data
are available at GEO accession number GSE66053.
Single-Cell RNA Sequencing
We prepared 96 cells for RNA sequencing on a Fluidigm C1 Single-Cell Auto
Prep System using a large-size chip. We added ERCC (External RNA Controls
Consortium) RNA controls, Mix 1 (Ambion 4456740) at a concentration of
1:10,000 before loading and prepared cDNA libraries as per the Fluidigm instructions. We obtained 75b paired-end reads to a depth of 1–2 M reads per sample.
We quantified reads per gene using HTSeq and a RefSeq hg19 annotation and
transformed the data to obtain an estimate of molecule count per cell. All
sequencing data are available at GEO accession number GSE66053.
C. elegans Growth and Imaging
We grew N2 (wild-type) and CB502 (sma-2 mutant) C. elegans at 20 C under
standard conditions. We performed RNA FISH and analyzed the data as previously described (Raj et al., 2008), determining the volume of each analyzed
region computationally.
ACCESSION NUMBERS
The GEO accession number for the single-cell RNA sequencing (primary fibroblast cell line) and the bulk RNA sequencing (primary fibroblast and lung cancer cell lines) data reported in this paper is GSE66053.
SUPPLEMENTAL INFORMATION
Supplemental Information includes Supplemental Experimental Procedures,
seven figures, and one table and can be found with this article online at
http://dx.doi.org/10.1016/j.molcel.2015.03.005.
ACKNOWLEDGMENTS
We thank members of the Raj lab for many helpful suggestions. We thank
Jeff Carey for suggesting the LMNA knockdown experiment and Sydney
Shaffer for the WM983b-GFP-NLS cell line. We thank Jan Skotheim for useful
discussions about mechanism and John I. Murray for pointing out that the
simple trans factor model would lead to overtranscription in early S phase.
We thank Hyun Youk and Uschi Symmons for a careful reading of the manuscript. We thank the Herlyn lab for providing the WM983b cell line. A.R. acknowledges support from an NSF CAREER award, NIH New Innovator Award
1DP2OD008514, and a Burroughs Wellcome Fund Career Award at the Scientific Interface. G.P.N. is a Howard Hughes Medical Institute Fellow of the Life
Sciences Research Foundation (http://www.lsrf.org). A.M. was supported by
Long-Term Postdoctoral Fellowships of the Human Frontier Science Program
(LT000314/2013-L) and EMBO (ALTF858-2012). L.S.C. acknowledges support
from an NIH grant (NHGRI R01HG007173), a Damon Runyon Cancer Research
Foundation Frey Award, and a Burroughs Wellcome Fund Career Award at the
Scientific Interface.
Received: September 22, 2014
Revised: January 16, 2015
Accepted: March 4, 2015
Published: April 9, 2015
REFERENCES
Bar-Even, A., Paulsson, J., Maheshri, N., Carmi, M., O’Shea, E., Pilpel, Y., and
Barkai, N. (2006). Noise in protein expression scales with natural protein abundance. Nat. Genet. 38, 636–643.
Bensaude, O. (2011). Inhibiting eukaryotic transcription: Which compound to
choose? How to evaluate its activity? Transcription 2, 103–108.
Bhatt, D.M., Pandya-Jones, A., Tong, A.-J., Barozzi, I., Lissner, M.M., Natoli,
G., Black, D.L., and Smale, S.T. (2012). Transcript dynamics of proinflammatory genes revealed by sequence analysis of subcellular RNA fractions. Cell
150, 279–290.
Please cite this article in press as: Padovan-Merhar et al., Single Mammalian Cells Compensate for Differences in Cellular Volume and DNA Copy
Number through Independent Global Transcriptional Mechanisms, Molecular Cell (2015), http://dx.doi.org/10.1016/j.molcel.2015.03.005
Bowsher, C.G., and Swain, P.S. (2012). Identifying sources of variation and the
flow of information in biochemical networks. Proc. Natl. Acad. Sci. USA 109,
E1320–E1328.
Lin, C.Y., Love´n, J., Rahl, P.B., Paranal, R.M., Burge, C.B., Bradner, J.E., Lee,
T.I., and Young, R.A. (2012). Transcriptional amplification in tumor cells with
elevated c-Myc. Cell 151, 56–67.
Brennecke, P., Anders, S., Kim, J.K., Ko1odziejczyk, A.A., Zhang, X.,
Proserpio, V., Baying, B., Benes, V., Teichmann, S.A., Marioni, J.C., and
Heisler, M.G. (2013). Accounting for technical noise in single-cell RNA-seq experiments. Nat. Methods 10, 1093–1095.
Maamar, H., Cabili, M.N., Rinn, J., and Raj, A. (2013). linc-HOXA1 is a noncoding RNA that represses Hoxa1 transcription in cis. Genes Dev. 27, 1260–1271.
Marguerat, S., and Ba¨hler, J. (2012). Coordinating genome expression with cell
size. Trends Genet. 28, 560–565.
Bryan, A.K., Hecht, V.C., Shen, W., Payer, K., Grover, W.H., and Manalis, S.R.
(2014). Measuring single cell mass, volume, and density with dual suspended
microchannel resonators. Lab Chip 14, 569–576.
Marguerat, S., Schmidt, A., Codlin, S., Chen, W., Aebersold, R., and Ba¨hler, J.
(2012). Quantitative analysis of fission yeast transcriptomes and proteomes in
proliferating and quiescent cells. Cell 151, 671–683.
Chubb, J.R., Trcek, T., Shenoy, S.M., and Singer, R.H. (2006). Transcriptional
pulsing of a developmental gene. Curr. Biol. 16, 1018–1025.
Marinov, G.K., Williams, B.A., McCue, K., Schroth, G.P., Gertz, J., Myers,
R.M., and Wold, B.J. (2014). From single-cell to cell-pool transcriptomes: stochasticity in gene expression and RNA splicing. Genome Res. 24, 496–510.
Crissman, H.A., and Steinkamp, J.A. (1973). Rapid, simultaneous measurement of DNA, protein, and cell volume in single cells from large mammalian
cell populations. J. Cell Biol. 59, 766–771.
Miettinen, T.P., Pessa, H.K.J., Caldez, M.J., Fuhrer, T., Diril, M.K., Sauer, U.,
Kaldis, P., and Bjo¨rklund, M. (2014). Identification of transcriptional and metabolic programs related to mammalian cell size. Curr. Biol. 24, 598–608.
Dar, R.D., Razooky, B.S., Singh, A., Trimeloni, T.V., McCollum, J.M., Cox,
C.D., Simpson, M.L., and Weinberger, L.S. (2012). Transcriptional burst frequency and burst size are equally modulated across the human genome.
Proc. Natl. Acad. Sci. USA 109, 17454–17459.
Nair, G., Walton, T., Murray, J.I., and Raj, A. (2013). Gene transcription is coordinated with, but not dependent on, cell divisions during C. elegans embryonic fate specification. Development 140, 3385–3394.
das Neves, R.P., Jones, N.S., Andreu, L., Gupta, R., Enver, T., and Iborra, F.J.
(2010). Connecting variability in global transcription rate to mitochondrial variability. PLoS Biol. 8, e1000560.
Newman, J.R.S., Ghaemmaghami, S., Ihmels, J., Breslow, D.K., Noble, M.,
DeRisi, J.L., and Weissman, J.S. (2006). Single-cell proteomic analysis of S.
cerevisiae reveals the architecture of biological noise. Nature 441, 840–846.
Devonshire, A.S., Elaswarapu, R., and Foy, C.A. (2010). Evaluation of external
RNA controls for the standardisation of gene expression biomarker measurements. BMC Genomics 11, 662.
Elowitz, M.B., Levine, A.J., Siggia, E.D., and Swain, P.S. (2002). Stochastic
gene expression in a single cell. Science 297, 1183–1186.
Eward, K.L., Van Ert, M.N., Thornton, M., and Helmstetter, C.E. (2004). Cyclin
mRNA stability does not vary during the cell cycle. Cell Cycle 3, 1057–1061.
Nie, Z., Hu, G., Wei, G., Cui, K., Yamane, A., Resch, W., Wang, R., Green, D.R.,
Tessarollo, L., Casellas, R., et al. (2012). c-Myc is a universal amplifier of expressed genes in lymphocytes and embryonic stem cells. Cell 151, 68–79.
Pomerantz, J.H., Mukherjee, S., Palermo, A.T., and Blau, H.M. (2009).
Reprogramming to a muscle fate by fusion recapitulates differentiation.
J. Cell Sci. 122, 1045–1053.
Raj, A., and van Oudenaarden, A. (2008). Nature, nurture, or chance: stochastic gene expression and its consequences. Cell 135, 216–226.
Femino, A.M., Fay, F.S., Fogarty, K., and Singer, R.H. (1998). Visualization of
single RNA transcripts in situ. Science 280, 585–590.
Raj, A., and van Oudenaarden, A. (2009). Single-molecule approaches to stochastic gene expression. Annu. Rev. Biophys. 38, 255–270.
Fraser, R.S., and Nurse, P. (1979). Altered patterns of ribonucleic acid synthesis during the cell cycle: a mechanism compensating for variation in gene concentration. J. Cell Sci. 35, 25–40.
Raj, A., and Tyagi, S. (2010). Detection of individual endogenous RNA transcripts in situ using multiple singly labeled probes. Methods Enzymol. 472,
365–386.
Golding, I., Paulsson, J., Zawilski, S.M., and Cox, E.C. (2005). Real-time kinetics of gene activity in individual bacteria. Cell 123, 1025–1036.
Raj, A., Peskin, C.S., Tranchina, D., Vargas, D.Y., and Tyagi, S. (2006).
Stochastic mRNA synthesis in mammalian cells. PLoS Biol. 4, e309.
Gru¨n, D., Kester, L., and van Oudenaarden, A. (2014). Validation of noise
models for single-cell transcriptomics. Nat. Methods 11, 637–640.
Raj, A., van den Bogaard, P., Rifkin, S.A., van Oudenaarden, A., and Tyagi, S.
(2008). Imaging individual mRNA molecules using multiple singly labeled
probes. Nat. Methods 5, 877–879.
Hansen, R.S., Thomas, S., Sandstrom, R., Canfield, T.K., Thurman, R.E.,
Weaver, M., Dorschner, M.O., Gartler, S.M., and Stamatoyannopoulos, J.A.
(2010). Sequencing newly replicated DNA reveals widespread plasticity in human replication timing. Proc. Natl. Acad. Sci. USA 107, 139–144.
Jao, C.Y., and Salic, A. (2008). Exploring RNA transcription and turnover in vivo
by using click chemistry. Proc. Natl. Acad. Sci. USA 105, 15779–15784.
Johnston, I.G., Gaal, B., Neves, R.P., Enver, T., Iborra, F.J., and Jones, N.S.
(2012). Mitochondrial variability as a source of extrinsic cellular noise. PLoS
Comput. Biol. 8, e1002416.
Kimura, H., Tao, Y., Roeder, R.G., and Cook, P.R. (1999). Quantitation of RNA
polymerase II and its transcription factors in an HeLa cell: little soluble holoenzyme but significant amounts of polymerases attached to the nuclear substructure. Mol. Cell. Biol. 19, 5383–5392.
Kimura, H., Sugaya, K., and Cook, P.R. (2002). The transcription cycle of RNA
polymerase II in living cells. J. Cell Biol. 159, 777–782.
Levesque, M.J., and Raj, A. (2013). Single-chromosome transcriptional
profiling reveals chromosomal gene expression regulation. Nat. Methods 10,
246–248.
Levsky, J.M., Shenoy, S.M., Pezo, R.C., and Singer, R.H. (2002). Single-cell
gene expression profiling. Science 297, 836–840.
Li, G.-W., and Xie, X.S. (2011). Central dogma at the single-molecule level in
living cells. Nature 475, 308–315.
Robertson, K.D., Keyomarsi, K., Gonzales, F.A., Velicescu, M., and Jones, P.A.
(2000). Differential mRNA expression of the human DNA methyltransferases
(DNMTs) 1, 3a and 3b during the G(0)/G(1) to S phase transition in normal
and tumor cells. Nucleic Acids Res. 28, 2108–2113.
Sanchez, A., and Golding, I. (2013). Genetic determinants and cellular constraints in noisy gene expression. Science 342, 1188–1193.
Schmidt, E.E., and Schibler, U. (1995). Cell size regulation, a mechanism that
controls cellular RNA accumulation: consequences on regulation of the ubiquitous transcription factors Oct1 and NF-Y and the liver-enriched transcription
factor DBP. J. Cell Biol. 128, 467–483.
Senecal, A., Munsky, B., Proux, F., Ly, N., Braye, F.E., Zimmer, C., Mueller, F.,
and Darzacq, X. (2014). Transcription factors modulate c-Fos transcriptional
bursts. Cell Rep. 8, 75–83.
Shalek, A.K., Satija, R., Adiconis, X., Gertner, R.S., Gaublomme, J.T.,
Raychowdhury, R., Schwartz, S., Yosef, N., Malboeuf, C., Lu, D., et al.
(2013). Single-cell transcriptomics reveals bimodality in expression and
splicing in immune cells. Nature 498, 236–240.
Springer, M., Weissman, J.S., and Kirschner, M.W. (2010). A general lack of
compensation for gene dosage in yeast. Mol. Syst. Biol. 6, 368.
Suter, D.M., Molina, N., Gatfield, D., Schneider, K., Schibler, U., and Naef, F.
(2011). Mammalian genes are transcribed with widely different bursting kinetics. Science 332, 472–474.
Molecular Cell 58, 1–14, April 16, 2015 ª2015 Elsevier Inc. 13
Please cite this article in press as: Padovan-Merhar et al., Single Mammalian Cells Compensate for Differences in Cellular Volume and DNA Copy
Number through Independent Global Transcriptional Mechanisms, Molecular Cell (2015), http://dx.doi.org/10.1016/j.molcel.2015.03.005
Swain, P.S., Elowitz, M.B., and Siggia, E.D. (2002). Intrinsic and extrinsic contributions to stochasticity in gene expression. Proc. Natl. Acad. Sci. USA 99,
12795–12800.
Watanabe, N., Ishihara, T., and Ohshima, Y. (2007). Mutants carrying two sma
mutations are super small in the nematode C. elegans. Genes Cells 12,
603–609.
Swift, J., Ivanovska, I.L., Buxboim, A., Harada, T., Dingal, P.C.D.P., Pinter, J.,
Pajerowski, J.D., Spinler, K.R., Shin, J.-W., Tewari, M., et al. (2013). Nuclear
lamin-A scales with tissue stiffness and enhances matrix-directed differentiation. Science 341, 1240104.
Whitfield, M.L., Sherlock, G., Saldanha, A.J., Murray, J.I., Ball, C.A., Alexander,
K.E., Matese, J.C., Perou, C.M., Hurt, M.M., Brown, P.O., and Botstein, D.
(2002). Identification of genes periodically expressed in the human cell cycle
and their expression in tumors. Mol. Biol. Cell 13, 1977–2000.
Tani, H., Mizutani, R., Salam, K.A., Tano, K., Ijiri, K., Wakamatsu, A., Isogai, T.,
Suzuki, Y., and Akimitsu, N. (2012). Genome-wide determination of RNA stability reveals hundreds of short-lived noncoding transcripts in mammals.
Genome Res. 22, 947–956.
Wu, C.-Y., Rolfe, P.A., Gifford, D.K., and Fink, G.R. (2010). Control of transcription by cell size. PLoS Biol. 8, e1000523.
Taniguchi, Y., Choi, P.J., Li, G.-W., Chen, H., Babu, M., Hearn, J., Emili, A., and
Xie, X.S. (2010). Quantifying E. coli proteome and transcriptome with singlemolecule sensitivity in single cells. Science 329, 533–538.
Wu, A.R., Neff, N.F., Kalisky, T., Dalerba, P., Treutlein, B., Rothenberg, M.E.,
Mburu, F.M., Mantalas, G.L., Sim, S., Clarke, M.F., and Quake, S.R. (2014).
Quantitative assessment of single-cell RNA-sequencing methods. Nat.
Methods 11, 41–46.
Tzur, A., Kafri, R., LeBleu, V.S., Lahav, G., and Kirschner, M.W. (2009). Cell
growth and size homeostasis in proliferating animal cells. Science 325,
167–171.
Wuarin, J., and Schibler, U. (1994). Physical isolation of nascent RNA chains
transcribed by RNA polymerase II: evidence for cotranscriptional splicing.
Mol. Cell. Biol. 14, 7219–7225.
Vargas, D.Y., Raj, A., Marras, S.A.E., Kramer, F.R., and Tyagi, S. (2005).
Mechanism of mRNA transport in the nucleus. Proc. Natl. Acad. Sci. USA
102, 17008–17013.
Zenklusen, D., Larson, D.R., and Singer, R.H. (2008). Single-RNA counting reveals alternative modes of gene expression in yeast. Nat. Struct. Mol. Biol. 15,
1263–1271.
Volfson, D., Marciniak, J., Blake, W.J., Ostroff, N., Tsimring, L.S., and Hasty, J.
(2006). Origins of extrinsic variability in eukaryotic gene expression. Nature
439, 861–864.
Zhurinsky, J., Leonhard, K., Watt, S., Marguerat, S., Ba¨hler, J., and Nurse, P.
(2010). A coordinated global control over cellular transcription. Curr. Biol. 20,
2010–2015.
14 Molecular Cell 58, 1–14, April 16, 2015 ª2015 Elsevier Inc.
Molecular Cell
Supplemental Information
Single Mammalian Cells Compensate for Differences
in Cellular Volume and DNA Copy Number through
Independent Global Transcriptional Mechanisms
Olivia Padovan-Merhar, Gautham P. Nair, Andrew G. Biaesch, Andreas Mayer, Steven
Scarfone, Shawn W. Foley, Angela R. Wu, L. Stirling Churchman, Abhyudai Singh, and
Arjun Raj
D
5
●
●
●
4
●
●●
●
●
●
●
● ●●
●
2
1
●
●
●
●
●
●
●●
●
●●
●
●
●●
●
●●
●
●
●
●
●
●
0
6
4
2
0
Single cells
5
●
●
●
4
●
●
●
●
3
●
●
2
●
1
●●
●
0
G
●
0.10
●
●
●
●
●
●
0.01
●
●
●
●
V
expression (Quiescent)
●
●
●
1000
●
●
●
●
100
●
●
●
●
●
10
●
●
●
●
●
●
●
0.01
0.10
1.00
Cytoplasmic RNA concentration
(cycling) (count/pL)
10
●
●
●●
●●●
●● ●
●●
●
●
●●
F
●
●
1
2
3
4
5
Volume calculated from
GAPDH (pL)
●
●
●
●
●
0
1
2
3
4
5
Volume calculated from
GAPDH (pL)
1.00
●●●
●● ●
●
●
100
1000
V
expression (Cycling)
V
expression (Quiescent)
●
●
3
0
Cytoplasmic RNA concentration
(quiescent) (count/pL)
●
Live
Fixed
Permeabilized
Volume calculated from
EEF2 (pL)
Single cells
C
E
Cell height (µm)
B
4000
3000
2000
1000
0
Volume calulated from half
GAPDH molecules (pL)
A
Cell area (µm2)
Supplemental Figure 1
1000
●
●
●
●
●
●
10
●
●
●
●
●
●
●
●
●
10
1000
V
expression (Cycling)
Figure S1, related to Figure 1.
A and B. We monitored primary fibroblast cells on the microscope throughout the process of fixation. We took measurements of the same cells live, after
fixing in 4% formaldehyde for 10 minutes, and after permeabilizing in ethanol for 30 minutes.
A. We measured the areas of the cells through brightfield images.
B. We measured the height of the cells by coating the cells with fluorescent beads. These measurements indicate that the cells remain roughly the same
size throughout the fixation and permeabilization process.
C. To demonstrate the robustness of the volume calculation algorithm, we calculated volume for the same cells using all the GAPDH mRNA spot coordinates as detected by RNA FISH, or using only half of the points, chosen randomly. Both methods result in approximately the same volume, suggesting
that the number of points we use is sufficient to calculate the volume accurately.
D. We calculated volume using a different gene, EEF2. On average, EEF2 has an abundance that is less than half that of GAPDH (mean EEF2 = 1079
mRNA/cell, mean GAPDH = 2673 mRNA/cell). Volume calculated using EEF2 is systematically lower than that calculated using GAPDH, but the values
are similar.Black lines in C, D indicate a fit with intercept = 0 and slope = 1.
E. mRNA concentration is similar in cycling and quiescent cells. We calculated the average mRNA concentration for 17 genes in both the cycling and
quiescent state in human foreskin fibroblast cells. Each data point represents one gene. Each gene had a minimum of 2 biological replicates, with at least
thirty cells per replicate. Line has intercept 0 and slope 1. Error bars represent standard error.
F,G. We compared volume-dependent and -independent abundance for cycling and quiescent cells. Both volume-independent and volume-dependent
expression are lower in quiescent cells. All error bars represent confidence intervals of the slope or intercept of the fit, normalized to the scale of the plot.
In C and D, we omitted error bars that extended below zero. Each gene had a minimum of two biological replicates, with at least 30 cells per replicate.
We omitted highly variable genes with intercept terms less than zero.
Supplemental Figure 2
ACTA2
●
●
●
●●●
●●
●
●
●
●● ● ●
●
●●●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●●●●
●
●●
●
mRNA Count
FUCA1
20
10
0
●
KDM5A
●
●
● ●
●
● ●●
●
● ●● ●
●
●
●●●
●
● ●●● ●
●●●
●
●
● ●
●●
●
●●
●● ●●
●
●
●●
●
●
●
●●●
●●
●
●
●
●●
●
●●●
●●
●
●●●●
●
●
●
●
●
●
●●●
●●
●
●●
●
●
●
●
●
●● ●
●
GAA
8
6
4
2
0
FUCA1
●
●●
●
●
●
● ●
●●
●●●
●
● ●●
●● ● ●
●●
●●●●
● ●● ●●
●
●●●●
●
●●
●
●
●●●
●
●
●
●
●
● ●
●
●
● ●●
●
●●●●
●●●●● ●
●
●●
●●
●
●
● ●●
●
●
●
●
●●
●
●
●
●●●
● ● ●
●
●
●
●
●
●
●●
●
●●●
● ●
●●●●
●
●●
●
●
●
●
●
●
●●● ●
●●●
●
●●●
●
●
●●
●
● ●
●●●
●●
● ●●
● ●
●
C
GAPDH
●●
●
●
●●
●
● ●●
●●● ●
●
●●
●
● ●●
●●
●●●
●
●
● ● ●●
●
●● ●
●●●
● ●
●
●
● ●
●
●●
●●
●●
●●
●●●
●●
●●
●
●
1500
1000
500
0
● ●
●
●
●
●●
●●
● ● ●●
● ●
●●●
●● ● ●● ●● ●
●● ● ●● ● ●
● ●●
●
●
● ●●●
●
●
●
●
●
●
●●
●●
KDM5B
●●
●●
● ●
●● ●● ●
●
●●
●●● ●
●● ● ●
● ●
● ● ●
●
●●●
●
●
●●●
●
●
● ●●●
●●
●
●
●
●
●
●
●●●●●
●
●
● ●●
●
●●
●
●
●
PABPC1
●
●
●
●
750
500
250
0
●
● ●
●
●
●
●
●
●●● ● ●
●
●●
●●
● ●● ●●
●●
● ●
●●
●● ●
●●
●
●●
●
●● ●
●
●●
●● ●
●
●
●● ●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●●
●
●
●
●●
● ●●
●
●●
●
●
● ●
●●
●●
●
TBCB
200
150
100
50
0
●
DNMT1
UBC
●
●
●
●
●
●●●●●●
●●
●●
●
●
●●
●
●
●●
●
●●●
●●
● ●
●
●
●
●
●
●
●●
●
●
●●●● ●
●●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●●●●
●●
●●
●
●
●● ●
1500
1000
500
0
●
●
● ●
●
●● ● ●●
●●
●
●●
●
● ●
●
●
● ●●
●
●
●●
●
●●●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●●●
●●●● ●
●●
●●●
●●
● ●●●
●● ● ●
●●
●
●
●
●● ●
●●
●
●●
●●
●
●●●
●
● ●●
●●
●●●●●
●
●
●●
●
●●
● ●
●
● ●● ●●
●●
●●
●
●●
●●● ●
● ●
●
●● ●
●
●●●●
●
●
●● ●
●
●
●●●
●●
●● ●●
●●
● ●●
●●
● ● ●●
●
●
●●
●
●●
●
●●
●
●●
●
●
●
●
● ●
●
●
●
●
●●●
●●●
●●
●●
● ●
●●
●
● ●●
●●
●●
●
●
●
●●
● ●●●●
●
● ●
●● ●●
●●
●
GAS6
300
200
100
0
●
●
●●
●
●●
●
●
●●
●
●● ●
● ●●
●
●●●●
●
●
●
●●
●
●
●●
●
● ●●●
●
●
●
●
●
●
●
●
●●●
●
MYC
●●
●
●
●
●●
●●
●
●●
●● ● ●
● ●●●
● ●
●
●●
● ●● ●●
●
●
● ●
●●
● ●● ●
●
●● ●●● ●
●
●
●
●
●
●●
●
●
SUPT5H
●●
●●
●●
●
●
●
●
● ●
●
●●●
●●●●
●
●
●
●
●
●
●
●●
●
●
●●
●●
●
●
●
●●●
●●
●●
●
●
●● ●
●
●
●
●
●
●●
●●
●
●
●
●●
●
●
●
●
●●●
●
●
●
●
●●
●
●
●● ●
● ● ●●
●●
●
●
TBCB
60
40
20
0
●
●
●
●
●
●●● ●
●
●
● ●
●
●
●
●
●
●
● ●●●
●
●●
●●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
● ●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●●●
●●
●
●
●
●
●●●
●●
●
●
●
●
●
●
●
●●●
●
●
●●
●●● ● ●
●
●
●
●
●
●●
●
●●●●
●
●● ●●
●●●●● ●
●
PABPC1
300
200
100
0
●
75
50
25
0
●
● ●
●
●
●
●
●
●
●
● ●●
●●
●
●●
●
●
●
●
●
●●
●
●
●
●●
● ●●●●
●●
●●●
●
●●
●
●●
●●●
●
● ●●
●
40
20
0
●●●
●
●●
●
●
● ●
● ●
●
●
●● ●●
●●
●●● ● ● ●
●●
●●●
●
●
●
● ●
●
●
●
●●
●
●
●
● ● ●●● ● ●
● ● ●●
IER2
150
100
50
0
●
30
20
10
0
LUM
●
FTL
1500
1000
500
0
ICAM1
●
● ● ●
●●
●
●● ● ●
●
●●
●
● ●
●
●
●●
●
●●
●●
●
●
●
●
●● ●
● ●●
●●
●
●
●
●
● ●
● ●
● ●●●
●
●●●●
● ●
●●
●●
●●● ●
●
●
●●
●
● ●●
●
●
● ●● ●
●
●
SLC1A5
200
150
100
50
0
●
EEF2
800
600
400
200
0
●
●●
●
●●
●
●● ●
● ●●
●
●
●●● ●
●
● ●
●●
●
●●
●
400
300
200
100
0
●
RND3
●●
●● ●
●
●
●● ●
●●
●●●
●●
● ●
●
●●
●
●
●
●●●●
●
●
●
●
●●
●
●●●●
●
●
●
●
●
●●
●
●
●
●●
●
●●
●●●●
●
●
●
●● ●●
●● ● ●
600
400
200
0
●
●●
●
●
●
●
●●
●●
●●
●
●●
●
●
●
●●
● ●
●●●●●●
●●
● ●● ●●
●
●● ●
●
●
● ●
●●● ●
● ●
● ●●
●
●
●
●●
●
●● ●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●●
●●
●●●
●
●●●
●
●●
●
● ●●
●
●
●
●
●●
UBC
●
● ●
●
●
● ●●
●
●●
●
●
● ●●
●
●
●
●
●
●
●
● ●●
●
●●●●
●
●
●
●
●
●
●●●
●●
●●
●
●
●●●●
●●
●●
●
●●●●
●●
●
●
●
●
●
●
●●●●
●
● ●●
●
●
●
● ●
●
●
● ● ● ●● ●
●● ●●●
●● ●●
● ●●
ZNF444
●
●
●●●●
● ●
●●
●
●● ●
●
●●
● ●●●
●●●●
●●
●
●● ●
●●
●
●●
●
●
●●●
●
●●●
●
●
●
●
●
●
●●●
●●
●
●
●
●
●
●
●●
●
●
●
● ● ●●
●
●
●
●
●
●
●●
●●
●●
●
●
●
●
●●
●●
●
●●●
●
●●
● ●●
● ●
●●● ●
●
●
Volume (picoliters)
●
rRNA Intensity (a.u.)
● ●●● ●
●
●●
● ● ●
●
●●
●
●
●
● ●
●●
●
●
●
●●
●●
●
●
●●●
●
●
●
●
●
●●●
●
●
●
●●●
●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●●
●
LMNA
400
300
200
100
0
●
30
20
10
0
80
60
40
20
0
●
●
●
●●
● ●● ●
●
●
● ● ●
● ●
● ●●●● ●●
●●
● ● ● ●● ●
● ●●
●
●●●
● ● ●
●
GAA
15
10
5
0
20
15
10
5
0
●
●
200
150
100
50
0
●
100
50
0
BABAM1
40
30
20
10
0
●
USF2
●
●
●●
●●
●●
● ●● ●
●
●●
●
●
●●
●
●●●●●●
●●● ● ●
● ●●
●●
●
●
●
●
●●●
●
●
●
●
●● ● ●
●●●● ● ● ●
0.0
2.5
5.0
7.5
60
40
20
0
MYC
●
SUPT5H
●●
●
●
Volume (picoliters)
ACTN4
600
400
200
0
●
●●
●●●
●
●
●● ●
●●●
●
●
●
●●●● ●●
●●
●●
●
●
● ●●
●
●
●
● ●●
●
●●
●
●
● ●
●
●
●
●
●
●
●
● ●
●
● ●
●
●
●
●
●
●●
●●●
●
●
● ●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
● ●
● ●●
●●●
●
●●
●● ●● ●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
● ●●
●●●●
●
●
●
●
●
●● ●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
● ●
● ●●●
●
●●
●● ● ●
●
●
●●
●●
●
●
●●
●
200
100
0
●
●
●
●●
●
●●
●●●
●
●
●
●●
● ●
●●
●
● ●●
●●
●●●
●● ●
●●●
● ● ●
●●
●●●●
●
●
●
●●●●
●● ●
●
●●
● ● ●●
●
●
RBM3
●
150
100
50
0
●
● ●
●
●●● ●●
●
●
●●●● ●
●
●
●
●●
●
●●
●● ●
●●
●
●
●
●
●
●
●● ●
●●●
●●
●
●
●●
●
●●
●
●
●●
●
●●
IER2
●
●
●●
LUM
●
●
●
● ●●● ●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●● ●
●●
●
●
●
●●
●●
●
●
●
●●
●●
●●●●
●
●
●
●●●
400
300
200
100
0
●
●
●
●
●
●● ●● ●●●
●
●
●
●●
●
●●●●●
●●●●●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
ICAM1
80
60
40
20
0
● ●●
●
●
●
●
●
● ●
●
●●
●
●
●
●
●●
●●
●
●●
●● ●
●●●
●● ●●●
●
●
●●
●● ● ●●
●
●
●●
●
●
●
●
●
● ●
●●
●
●
●
●
●
●
●
●●
●
●
● ●
●●
●
●
●
900
600
300
0
1200
800
400
0
●
●
●● ●● ●
● ●●●
●●●
●●
●
●
●
●●
●●●●
●
●
●
●●
●●
●
●
●
●
●●●●
●
●
●●●●
●
●
●
●
●●●
●
0.0
2.5
5.0
7.5
● ●●●
● ●
● ●
● ●●●●
●●
●● ●
● ●
●●
● ●●
●●
●
●
●
●
●
●
●●
●●
●
●
●
●●
●
●●●
●
●
●●
●●
●
●
●
●
●●
●●
●●
●
●
●
●
●
●●
●
●
●● ●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
● ●●
●
●
● ●●
●
●
●
●
●
●
●
●
●●
KDM5A
15
10
5
0
GAS6
●
FTL
10000
7500
5000
2500
0
●
●
● ●
● ●
●●
●
●●
●
●
●
●
●●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
● ●
●●
●
●
●
●
ZNF444
400
300
200
100
0
●
● ●●
● ●●
●
● ●
●
●
● ●
●●
●
●
●
● ●●
● ●●●
●
●
●
●
●●
●
●
●●●●
●
●
●
●●●●
● ●
SLC1A5
●
●
40
30
20
10
0
●
●
●
●●
LMNA
1000
500
0
●
●
● ●●
●
●
● ● ●
●
●●●●●
●
●
●●
●●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●●
●
●●
●
●
●●●●
●
●●
●
●
●●
●
●●
●●
ACTA2
●
●●
● ●
●●
●●
●●
●●
●
●●
●
●●
●
●
●
●●●
●●●
●●●
●
●
●
●
●
●● ●
●
●
●
●●
● ●
●
●
●
●
●
●●
●
●
●●●
0.0
2.5
5.0
7.5
mRNA Concentration (count/pL)
800
600
400
200
0
200
100
0
EEF2
3000
2000
1000
0
●
●
● ●
●
●
●
●●●
●
●
● ●
●●
●●
●●
●●
●
● ●●
GAPDH
RND3
0.0
2.5
5.0
7.5
B
DNMT1
●
●●
● ●
●●●
●
●● ●●● ● ● ●
●
●
●
●● ●
●●
●
●
●
●●
●●● ●●
●●
●
●
●●
●
●
● ●●● ●
●
●
●● ● ●
●●
●● ●
● ●
●
● ●
●
●
●●●●
●●
●●
●
●
●● ●●
●●●
●
●
●
●
●
●
●
●●●
●
●
● ●
●
●● ●●
●
●
●
USF2
●
● ● ●
●
●
●●● ●●
●
●●
●
●●
●
●
●
●
●
●
● ● ●●
●
●
●
●
●
●
●●
●●●
●
●
●●
●
●
●●●●●
●
●
● ●●
BABAM1
KDM5B
80
60
40
20
0
1500
1000
500
0
●●
●●●
●
●
●
●●
●
●
●●●●● ●
●
●
●
●
●
●
●●
●●
●
●
●●●
●
●●●
●●●
● ●
●
150
100
50
0
8000
6000
4000
2000
0
●
● ●●
●●
●
●
●
●
●●●
●
●● ● ● ●
● ●● ●
●●
●
●
●●
●
●
●
●●
●
●
● ●●
●●
●
●
●
●
● ●
●
●
●
●
●
RBM3
400
200
0
125
100
75
50
25
0
●
●
●
●
●
●
●
●
●
● ● ●
●
●
● ●
●
●●●
●●
●
●●
●●
●
●●●●
●●
●
●●
●
●●
●●● ●
●
●●●
●
●
●
●●
●●
●
●
●
50
40
30
20
10
0
●
●● ● ●
●
●
●
●
●●
●●●
● ●●
● ● ● ●
●
●
●
●●● ●
●●
●
●●
●●
●
●●
●
●
●
●
●
●●
●
●
50
40
30
20
10
0
ACTN4
1500
1000
500
0
●
2000
1500
1000
500
0
2.0e+09
1.5e+09
●
●
●
●
1.0e+09
●
●
● ●
●
● ●●
●
●●
● ●
●
●
●
●
●●
●
●
●
5.0e+08
●
0.0e+00
0
1
2
3
4
Volume (picoliters)
D
rRNA ITS Intensity (a.u.)
A
●
●
●
7.5e+07
●
●
●
5.0e+07
●
●
●
●
●
●●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
2.5e+07
●
0.0e+00
0
1
2
3
4
Volume (picoliters)
Figure S2, related to Figure 1.
A, B. mRNA count (A) and concentration (B) measurements for all genes in cycling fibroblast cells. Each data point is an individual single cell measurement. In count plots, red line indicates best linear fit to the data. In concentration plots, red line indicates mean mRNA concentration. Each data set is a
combination of at least two biological replicates.
C. We measured ribosomal RNA by quantifying total fluorescence intensity in the cytoplasm from an rRNA FISH probe in cycling fibroblast cells.
D. We measured the rRNA ITS (the rRNA “intron”) by quantifying total fluorescence intensity in the nucleus from an ITS RNA FISH probe.
rRNA and the rRNA ITS both scale with volume to some degree, suggesting that the production of ribosomal RNA scales with volume. We have shown
that mRNA scales with volume, so a similar scaling of rRNA is not inconsistent with the production of protein to scale with volume as well. The data shown
for rRNA is one of three biological replicates.
Supplemental Figure 3
A
B
G1
G2
●
0.4
S
Fraction of cells
G1
G2
1000
●
●
500
● ●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
0
●
●
●
● ●
●
●
●
●
●●
●
●
● ●● ● ●
●●
● ●●
●
●
●
● ●
0
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
● ●
●●●
● ●●
●●● ●
●
●
100
●
●
●
●
●
200
300
Cyclin mRNA
●
0.0
2.5
5.0
7.5
Volume (picoliters)
●
●
●
●
●
●
●
●
●
●
●
● ●
● ●
● ● ●●
●
●
●
●
●
●●
●
●
●
● ●●●
●
●
●
●
●
●●
●
●
●
●
●
●
●● ●
●
●
● ●
●
●
●● ● ●
●
●
●●
●●
●
●
● ●
●●
●
● ●●
●
●
●● ●
●● ●
● ●● ● ●
●
●●
● ●● ● ● ●●● ● ●●
● ●●
●
●
● ● ●
●
●
●
●
●● ●
●● ●
●
●●
● ●
● ●●● ●● ● ●
●
●
●
●
●
●
●
●●
●
●
● ●
●
●
●
●●
● ●
●
●
●
● ●●
●
●
● ●●●●●
●● ●
●
● ●●●
● ●●
●●● ● ●●
●
●
●
●● ●●●●
●
●
●
●●
●●
● ● ●●
● ● ●●●
● ●●
●
● ●
● ●● ●●●
● ● ●●
●
●● ●
●●
●●● ●
●
●
●●●● ● ●●●
●
●●●●●●●●
●●
●●
●
● ●
●●●● ●●
●●
●●
● ●
●●
●●●●●
●●●●●
●
●
●
●●
●●
●● ● ● ●●●
●
●
●● ●● ●● ● ● ●
●●
●● ●
●
●
●
●
●●
●
●●●
●
●●
●●●
●●
●●
● ●
●●●
●●
●
●
●●
● ●
●●●
●
●
●
● ●●
●
●
●● ●
● ●●
●
●
●● ●
●●●
●●
●●●●●
●
●●
●●
●
● ● ●●
●
●●
●
●
●
● ● ●
●
●
●●
●
●●
●
●
●
●
●● ●
●●
●
● ●
●
●
● ●●● ●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●●
●
●●
●
●
● ●●●●
●
● ●
●
●
●
●●
●
●
●
● ●●
●
●
●●
●
●●
●
●
●●
●
●
●
●●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
● ●●●●●
●
●
●●
●
●
●
●
● ●●●
●
●●
●
●●
●●
●●
●●
●
●
●●
●
●●
●
●
●●●●●●
● ●
●
●
●
●●
●●
●●●
●
●
●
●
●
●
●
●●
●●
●●
●●
●
●
● ● ●●
●
●
●
●●
●●
●
●
●
●●●
●
●
●
●●
●
●
●●
●
●● ●
●
●●●● ●
●
●●
●
●●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●●
●
●
●●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●●● ●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●●
● ●● ● ●●
●
●
●●●
●●
●
●
●● ●●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
● ●●●
●
●
●●
●
● ●●
●
●
●
● ●●● ● ●●● ●● ●
●
●
●● ●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●● ● ●●
●
● ●● ●
●
●
●
●
●●
●●
●
● ●●● ●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●●
●
●
●●
●
●
●
●● ●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●● ●●●
●
●
●
●
●
●
● ● ● ● ●●● ● ●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●●
●●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●
●
●
●
●
●●●
●
●●●●●
●
●
●
●
●
●
●
●●
●●
●
●●
●●
●
●
●
● ●
●● ●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
● ●● ● ●
●
●●
●
●
●●●●
●
●
●
●
●
●
●
●
●
●
●
● ●●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●●●
●
●●
●
●
●
●
●
●
●●●
●
●
●
●●
●●
●
●
●
●●
● ●●●
●
●
●●
●
●
●
●
●
●●●
●
●
●
●
● ●●
●
●
●
●●
●
●
●●
●
●●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●●
●
●●
●
●●●●
●
●
●●
●●●
●●
●●
●
● ●●●●●
●
●
●●
●●●
●
●●
●
●
●
●
●●
●
●
●
●●
● ●●● ●
●●
●
●
●●
● ●●
●●●●●
●
●
●●●● ●● ●
● ● ● ●●
●
●
●●
●●
●
●
G1
S
G2
●
●
Nuclear Area (µm2)
0.1
D
600
200
0.2
400
C
400
0.3
●
●
●
●
●
●
●
●
● ●
● ●
●
●
●
●
●
●
●
0.0075
Fraction of cells
Histone mRNA
1500
G1
S
G2
0.0050
0.0025
0.0000
0
0.0
2.5
5.0
7.5
Cellular volume (picoliters)
100 200 300 400 500 600
Nuclear Area (µm2)
Figure S3, related to Figures 1, 5, 6.
A. We simultaneously measured CCNA2 and HIST1H4E mRNA by RNA FISH to precisely determine cell cycle position. Each data point is a single cell
measurement. CCNA2 is highly expressed in S and G2, but not G1. HIST1H4E is highly expressed only in S phase. Cells with low CCNA2 and
HIST1H4E are in G1 (cutoff = 20 CCNA2 mRNA), cells with mid-range CCNA2 and high HIST1H4E are in S, and cells with high CCNA2 and low
HIST1H4E are in G2 (cutoff = 230 CCNA2 mRNA). We determined thresholds for all samples using this method. Data shown are from one of four biological replicates.
B. Volume distributions in G1 and G2. We determine cell cycle position using CCNA2. We note that G2 cells are larger than G1 cells, but only 10% larger
on average, possibly due to non-linearities in growth over the course of the cell cycle. n = 841 cells in G1, 191 cells in G2.
C. Nuclear area vs. cytoplasmic volume. We measure cytoplasmic volume using our standard method. We measure nuclear area using the DAPI stain.
We note that we only measure nuclear area and not volume.
D. Density plot of nuclear area across cell cycle stages. While nuclear area generally scales with cytoplasmic volume, there is considerable spread in
the data (R2 = 0.358). n = 1866 cells.
Supplemental Figure 4
B
●
POLR2A RNA
●
●
100
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●● ● ●
50
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
0
0
1000
1.5
1.0
0.5
0.0
Control 100nM
2000
Volume (picoliters)
D
2.0
UBC mRNA
Mean number of transcription
sites per cell
●
C
UBC
Mean number of transcription
sites per cell
A
Triptolide Concentration
E
EEF2
MYC
MYC
2.0
1.5
1.0
0.5
0.0
Control 100nM
Triptolide Concentration
UBC
TUSC3
G1
GM
12878
S
G2
WS
G1
K562
S
G2
WS
G1
HeLa
S
G2
WS
G1
HepG2 S
G2
WS
G1
HUVEC S
G2
WS
Figure S4, related to Figures 5 and 6.
A. Quantification of RNA Polymerase II mRNA (quantified by RNA FISH) vs. cytoplasmic volume in A549 cells. Data shown is from a single biological
replicate.
B,C.We treated primary fibroblast cells with 100nM triptolide (100nM) or simply replaced the media on cells (Control) and fixed in methanol after one
hour. For genes UBC and MYC, we identified active transcription sites through intron/exon colocalization and quantified the number of active transcription
sites per cell with and without triptolide. Bars represent mean of cells, error bars are SEM. The data shown here is from two replicates, for a total of 282
cells for UBC and 257 cells for MYC. For both genes, the change in frequency between conditions is small compared to the change in intensity (Fig. 5).
D. UBC mRNA in a CRL2097 cell. RNA FISH probe in white, DAPI stain in purple. White arrows indicate transcription sites. We detect transcription sites
through intron/exon colocalization by RNA FISH. This cell is in G2 and has four transcription sites, demonstrating that all gene copies are transcriptionally
competent after replication.
E. Tracks from UCSC genome browser displaying UW Repli-Seq data in GM12878 (lymphoblastoid), K562 (chronic myelogenous leukemia), HeLa
(cervical cancer), HepG2 (liver carcinoma), and HUVEC (human umbilical vein endothelial) cell lines. The track displays data for different points in the
cell cycle: G1, S1 (early S phase), S2 (middle-early S phase), S3 (middle-late S phase), S4 (late S phase), and G2. WS represents a wavelet-smoothed
transform of the six other tracks. This data was generated by sequencing newly-replicated DNA in each point in the cell cycle. Darkness of track
corresponds to read density. Each track shown corresponds to entire length of each gene. Data is shown for early replicating genes EEF2, MYC, UBC,
and a late replicating gene, TUSC3.
Supplemental Figure 5
B
●
●
●
Noise Measure
60
●
●
●
●
●
●
● ● ●● ● ●
●●
● ●
●
●●
●
●
●
● ● ●●
●
●●
● ●●
●● ●
●
●●
●
●●
40
20
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
2.0
●
1.5
●
1.0
0.5
●
0
2
4
6
Volume (picoliters)
●
●
●
●
●
●
●●
●
● ●
●
●
●
●
●
●
●
5
10 15 20 25
mRNA Halflife (hours)
Volume-corrected
Noise Measure
0
0.0
●
●
●
CV2 RNA Count
CV2 RNA Concentration
Nm
C
0.75
2
1
0
0.50
ACTA2
FUCA1
ICAM1
LUM
MYC
RND3
Volume-corrected
Noise Measure
2.5
●
V
TBCB mRNA Concentration
A
0.25
ZNF444
USF2
UBC
TBCB
SUPT5H
SLC1A5
RBM3
PABPC1
LMNA
KDM5B
KDM5A
IER2
GAS6
GAPDH
GAA
FTL
EEF2
DNMT1
BABAM1
ACTN4
0.00
Figure S5, related to Figure 7.
A. TBCB mRNA concentration vs. volume. These data are the same as in (Fig. 7A), but each is normalized by volume. Histogram indicates distribution
of mRNA concentration. Gray line indicates average concentration. Data are from a combination of two biological replicates.
B. We compared volume-corrected noise measure and mRNA half-life. We obtained half-life values from Tani et al., Genome Res. (2012). We find that
volume-corrected noise measure does not depend strongly on half-life. Each data point represents one gene. For each gene, we have at least two biological replicates with at least 30 cells per replicate. Error bars represent 95% confidence intervals, calculated by bootstrapping.
C. mRNA CV, concentration CV, and volume-adjusted CV for cycling primary fibroblast cells. Inset shows genes that exhibit higher cell-to-cell variability
in RNA, and had values too high for main axes. Generally, mRNA CV is highest, followed by concentration CV and volume-adjusted CV. Error bars
represent 95% confidence intervals by bootstrapping.
Supplemental Figure 6
A
B
1e+06
●
10000
●
●
●
●
●
●
●
●
●
●
●
●
Mean FPKM
●
●
●
●
●
1e+02
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
100
●
●
●
●
●
●
●
●
●
●
●
●
●
●
FPKM cutoff
●
●
●
●
1
●
●
●
●
●
●
1e+00
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
Mean FPKM
1e+04
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
10
1e+01
1e+03
ERCC reference concentration (attomoles/microliter)
C
100
1000
Mean molecule count by RNA FISH
D
1.00
y = 1.07x-532
●
●
7.5
●
●
●
●
●
0.75
Total GAPDH by FISH
0.50
Volume (picoliters)
Normalized Genomic/ERCC
FPKM ratio
5
0.00
●● ●
●
●
●
●
● ●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
● ●
● ●
● ●
●
●
●
● ●
●
●
● ●
●
●
●
●
●
● ●
●● ●
● ●
●
●
●
●
●
●
●●● ● ●
●
●
●●
●●●●
●
●
●
●●
●
● ●● ●
●
●
● ●●●
●
●●
●
●
● ● ●●
●
●●●●
● ●
●
●
● ●
●
● ●●
●
●
●
●
● ●
●
●
●
●
●
●●
●●
● ●●
● ●
●●●
●
●
●
●
●●
●●
● ●
●●
●●
●
● ● ● ●
●
● ● ● ● ●●
● ● ● ●● ● ● ●●
● ●
●
●
●
●● ●
●
●●
●
●● ● ●●
●
●
● ●● ●
● ●
● ●
●
●
●
●
●●
●
●●
●
●
●●● ●
●●
●
●●
●
●●
● ● ●●● ●● ● ● ● ●
● ● ●●
●
●
●● ●
●
● ● ● ●●●
●
●
●
●
●
●●
●●
● ●
●
● ●
●●
● ●
●
●●●●●● ● ●●
● ●●
● ●●
●
●●
●● ●●
●●
●●
● ●●
●
●● ● ● ●●●
●
●● ● ●●
●
●
●
●
●
●
●
●
●
●
●
●
● ● ● ●●
●
●●
● ●
●●
● ● ●
●
● ●
●
● ● ● ●●
●●
●●
● ●●
● ● ●●●
● ●●●
● ●●
● ●●
●●● ●● ●
●●
●
● ●● ● ●
●
●
●
● ●
●
●
● ● ●●●
●
●●●●●●
●
●
● ● ●
●
●●●
●
●●
●
●●● ●●● ●
● ●●
●
●
●
● ●●
●●
●●
●
● ●●
●●●
● ●●
●
●
●● ● ● ●●●
●
●
●
●●
● ●● ●● ●
●●
●
● ●●●
●● ●
●●●
●
●
●
●● ●●
● ●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●● ●
● ● ●● ●
●●
●
●
●
●
●●
●
●●
●
●●
●●●
●● ●
●●
●
●●
● ●
●●
●●
● ●● ●
●● ●
●
●●●
●
●●
●●
●
●
● ●●
●
● ●
●
●● ●
●● ●
●●●
●
● ●
●
●
●●●●
●
●
●
●
●●
●●
●
● ●
●●●●
●● ●
●
●
●●●
● ●● ●
●
●●●
● ●
●
●●
●
●
● ●● ●●
●
●
●● ●● ●● ●
●
●
●●
●
●
●● ●
●●●
●●
● ●
●
●● ●
●
●●
●
●●
●● ●●●
●● ●●●
●●
●●●
●
● ● ●●
●●●●
●
●
●●
●
●●
●
● ●●
●
●●
●●
● ●
●●●●● ●
● ●●
●●●
●
● ● ● ●● ●●●
●
● ●
●
●● ● ●
●
●
●
●
●●
●
●
●
● ●
●
●● ●●
● ●● ●
● ●
●●●● ●
●
●●●
●
●●
●●
●
● ●● ●●●●
●
●
●
● ● ● ●●●
●●●
●
●●●●●
●● ●●
● ●●
●
●●
●● ●
●
●● ● ●
●
●
● ● ● ● ●●
●● ●
●●●
●
●●
●●
●
●
●● ●
●
●
●●
● ●
●●
●
●●●●●
● ●
●●●●
●●
●●●
●
● ●● ● ●●
●
●
●●●●
●●● ●
●●
●
●
●●
●●
●●
●●
● ●
●
●●●
●●
●
●●
●●
●●
● ●●●
●●
●
●
●
●
●●●
●
●
●
●
●●●
●●●
●
●
●●
●● ●●
●
●●●●
●●
●●
●
● ●
● ●
●●●
●●
●●●● ●●●●
●
●
●
●
●
●
●●●
●●
●●●
●●
●
●
●●
●●
●
●
●● ●●●
●
●
●
●
●● ●
●
●
●
●
●●●
●●
●
●
●●
●●● ●●●
●
●●
●
●● ●
●
●●
●●
●●
●
●
● ●●
●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●● ●
●●
●● ● ●
●●●●
●
●●
●●● ●●
●
● ● ●●
●●
●
● ●●●●●
●
●
●
●●
●●
●
●●●
●
●●●
●
●
● ●
●
● ●●●
●●●●
●
●●●
●
● ●●●●●●●
●●
● ●●
●●
●●
●●
●
●●●
●
●●●● ●
●●
●
●
●
●
●
●
●●
●
●●●● ●
●●
●
●● ●
●
● ●●
●
●
●●
●●●
●●
●
●
●
● ● ●
●
●
●
●
●● ● ●
●●
●● ●●●●
●
●●●
●
●
●●●●
●
●
●●
●●
●
●●
●●
● ●●●
● ●
●
●●
●
●
●●
●
●●
●●●●● ●
●
●
●●
●●
●
●●
●
●●●
● ● ● ●●
●
●
●
●●● ●●
●●
●
●
●
●●
●●
●●●●
●
●●
●●
●●
●
● ●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●●●●●
●●
●●
●● ●
●●●● ●
●● ●
●●
●
●
●
●
●
●
●
●● ●
●● ● ●●
●●
●
●●
●
●
●
●
●
●
●
●
●
●● ● ●
●
●●
● ● ●●
●●●
●●● ●●
●
●● ●
●
●●●●●●
●
●
●
●●
● ●● ●
●
●●
●●
●
●● ●
●
● ●● ●●● ●
●●● ●
●
●
●●●
●
●●
● ●●●
●● ●●●
●
●●●
●● ●
●
●● ● ●
● ● ●●
●●
●●●
● ●
●●
●●● ●
●●
●
●●●
●
● ●●●
●
●
●
●● ●
●●
● ●● ●
●
● ●
●● ●
●
●
●
●
●●
●●
●
●
●
●
● ●
●
●
● ●● ●●
●●●
●
●
●●
● ●
●
●●
●
●
●
●
●●●
● ● ●
●●
●●● ● ● ●
● ● ●●●
●
●
● ●●●●●●● ●
●
●● ● ●
●
●
●
● ● ●●
●
●
●●
●
●● ●
●
●
●
●● ●
●● ●
●
● ● ●●● ●
●
● ●● ●
●
●
●
●
●● ●
● ●
●
●
● ● ●
●
●
●
●
●
2.5
0.25
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
Cumulative density
●
●
●
●
●
●
●
●
●
●
0
0
2500
5000
Total RNA per cell equivalent
7500
0
2000
4000
6000
GAPDH mRNA count (FISH)
8000
Figure S6, related to Figure 7.
A. Mean FPKM and known concentrations for each of the ERCC reference transcripts. Each point represents a single ERCC transcript and is an average
over all 96 samples. Below 10 FPKM, we begin to see significant dropouts, and therefore choose and FPKM of 10 as our cutoff for “reliable” measurements. All other data in the manuscript is taken from genes having greater than 10 FPKM.
B. Mean count as measured by RNA FISH vs. mean FPKM from single-cell RNA sequencing. Each point represents a single gene and is an average
over 44 single cells for single-cell sequencing, and an average over at least two biological replicates with at least 30 cells apiece for RNA FISH. These
data suggest that an FPKM of 1 corresponds to approximately 23.2 transcripts per cell, as measured by RNA FISH in our cells, although the relationship
between RNA FISH counts and FPKM scales nonlinearly (FPKM ~ (FISH)1.7, see Methods). We used this fitted relationship between RNA FISH count
and FPKM to transform FPKM into transcript counts. Error bars represent SEM.
C. Comparison of “total RNA” distributions from single-cell sequencing and RNA FISH. Data represent a collection of total RNA measurements from
single cells. We assume that total GAPDH mRNA counts by RNA FISH are proportional to total RNA. For sequencing data, we use the ratio of reads
mapped to genomic loci to reads mapped to ERCCs as a proxy for total RNA. We scaled this ratio to have the same mean as the distribution of total RNA
by RNA FISH. After scaling, the distributions are similar, suggesting that our method for measuring total RNA via the ratio of genomic to ERCC transcripts
is sound.
D. Mapping between total RNA count (here, total GAPDH mRNA in single cells) and volume, as measured by RNA FISH. Each point represents a single
cell. We use this mapping to convert total RNA from sequencing experiments to actual volume. The red line is the best fit, as computed by principle
components analysis.
3e+05
●
●
●
●
●
●
●
●
●
●
● ●
●●
●
●
●●●
● ● ● ●
●
●
●
●
●
●●
● ●●●
●
●
●
2e+05
1e+05
0e+00
●
200000
●
●
C
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
"Good” cells
Eliminated cells
150000
●
●
●
●
100000
●
●
● ●
●
● ●●
●
●
●●
●
●●
●
●
●● ● ● ●
50000
●●
●
●
0
0
30
60
90
120
Genomic/ERCC count ratio ("volume")
E
●
●
●
●● ●●
● ● ●●
● ●
● ●● ●
●
●●
●●
● ● ●
●●● ● ● ●●
● ● ● ● ●●●● ●●
●
D
10.0
1.0
●
●
●
●
● ●
●
●●
●
0.1
FPKM CV2
(single-cell RNA sequencing)
●
Count CV2
(single-cell RNA sequencing)
B
4e+05
Summed ERCC counts
A
GAPDH FPKM * count ratio ("count")
Supplemental Figure 7
●
●
●
●
● ●
●
r = 0.5898
●
●
F
●
●
●
1.0
●
●●
●
● ●
●
0.1
●
●
●
●
●
●
●
●
r = 0.6449
●
0.1
0.1
1.0
10.0
Count CV2 (RNA FISH)
0e+00 1e+06 2e+06 3e+06
Summed genomic counts
10.0
1.0
10.0
Count CV2 (RNA FISH)
G
●
●
●
●
●
●
●
●
●
●
●
●
●
●
ACTA2
●
●
●
LUM
●
ICAM1 ●
GAS6
LUM
●
●
●
0.1
10
1000
Mean mRNA (CRL2097)
●
Ubiquitously expressed genes
Fibroblast-specific genes
ICAM1
●
●
●
●
● ●● ●
●
●
●
●
●
●
10.0
0.1
Volume-corrected noise measure
(CRL2097)
H
I
1
Log 10 (Nm)
●
●
0
●
●
●
●
●
●
●
● ●
● ●
●
● ●
●●
●
● ●● ●
●
●
●
●
●
●
● ● ●● ● ● ●
●
●
●
●
● ●
●●
●● ●
●
●
●
●
●
●
●
●
●
●●
● ●
●
● ●● ●
●● ●
●
●
● ●
● ●●●
●
●●●
●●
●
●●
●●
● ●●
● ● ●
●
●
●
●
●
●
●
● ●●
● ●●●
●
●●
●
●
●●
●●
●
●●
●
●
●
●
●
● ●●
●●●
●
●
●
●
●
●
●● ● ●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
● ●
●
●●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●● ●●●
●● ●
●●●
● ●
●
●
●
●
●
●
●
●
●
●
●● ●●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●
●
●● ●
●
●●
●
●
●●
●● ●●●●
●●
●
●●
●
●
●●
●
●●
●
●
●
●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●●● ● ● ● ●
●●●
●
●●
●
●●
●
●
●
●
●
●
●
●● ●
●
●
●
●●
●
●
●
●
●●
●
●
●●
●●●
●
●
●
●
●
●●●● ●
●
●
●
●
● ●● ●●●●
●
●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●● ●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
● ● ●●
●●
●
●
●
●●●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●● ●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●● ●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
● ● ●●
●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●●
●
●
●
● ●
●●●●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
● ● ●
●
●
●
●
●
●
●
● ●
●●
●
●
●
●
●●●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●
●
● ●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●●
●
●
●
●
● ●●●
●
●●
●●
●
●●
●
●
●
●●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
● ●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
● ●●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
● ●●
●
●
●
●●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●●
●
●
●
●
●●●●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●● ●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●
●● ● ●● ● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
● ●●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●● ●
●●
●
●●
●●
●
●●
●
●●●
●
●
●
●●●●●
●●
●
●
●
●
●
● ●●●
●
●
●
●
●
●
●●
●
●●
●●
●
●
● ●●●●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●●●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●●
●● ●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●●
●●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
●
●
●
●
●
●●
●
●
●
●
●●●
●
●
●
●
●●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●●
●
●
●
●
●
●●●
●
●
●●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●●
●
●
●
●● ● ●
●
●
●●
●●
●
●
●
●
●●
●
●●
●
●●
●●●
●
●
●
●●
●●●
● ●
●
●
●
●
●
●
●●●
●
●
●●
●
●●●
●
●
●
●
●
●
●●●
●
●● ●
●●●
●●
●
●
●
●
●●
●●●
●
●
●
●● ●
●
●
●●
●●
●
●
●
●
●●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●
●●
●
●●●●
●
●
●
●●
●●
● ●
●
●
●
●
●
●
●
●
●
●●
●●●
●●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●●●
●
●
●
●
●●● ●●●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●●●●
●
●
●
●
●●
●
●
●
●● ●●●
●
● ●
●
●●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●●
●
●
●
●
●●
●●
●
●●●
●●●●●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●●●
●
●
● ●●●
●
●●●
●●
●
●
●
●●
●●
●●● ●●●●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●●●
●
●
●
●
●
●
●
●
●●
●
●
●●●●
●
●●
●●●●●
●
●
●
●
●
●
●
●
●
●
●● ● ●
●
●●
●
●
●
●●
●
●
●
●
●●
●●●
●
●
●
●
●
●
●
●
●●
●●●
●● ●
●
●
●
●
●●
● ●●●
●
●
●●
●
●
● ● ●●
●●●
●
●●●●●
●
●
●
●●
●●
●●
●
●●
●
●
●
●
●●
●●
●
●●
●
●
●
●
●●
●●●
●●
●
●
●
●●
● ●●
●●● ●●●
●
●
●
●
●
●●●
●●●●●
●●● ●
●
● ●
●
●
●
●
●●
● ●● ● ●
●
●
●
●
●
●
●
●●
●●●
●
●
●
●
●
●
●
●
●
●●
●● ●
●
●●●
●●●●
●
●●●●
●
●●●
●● ●
●
●●
●
●
●●
●●
●
●
●●●
●
●
●●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●●●
●●
●●
●
●●
● ●●●●
●
●
●●
●●●●
●● ●
●●
● ●
●
●
●
●●●
●
●
●● ●●
●●
●
●
●●●
●●●
●●
●
●●
●
●
●
●
●
●
●
●● ● ●●
●
●
●●
●● ●●
●
●
●●
●
●●
●●●●
● ●
●
●●●
●●●
● ●
●
●
●●
●
●●
●
●
● ●●
●
●
●●
●●
●
●●●●●
●
●
●●●
●
●●
●
●●
●●●●
●
●●●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●●
● ●●●●● ●
●●
●●
●●●● ● ●
●
●●
●
●●
● ●
●
●
● ●
●
●
●●●
●
●
●
● ●
● ●
●
●●
● ● ●●
●●
●●●
●●●●●
●
●
●
●●
●
● ●●●
●●
●
●
●
● ●● ● ●
●●
●
●
● ●
●
● ●●●●
●●
●
●
●●●●●● ●
●
●
●●
●
●●
●●
● ●●●
● ● ●● ● ● ●
●
●
●●●●
●
●
● ●
●●●●
● ●
●
●●●●
●
●●
●
●●
● ●●
● ●
●
●
●
●
●●
●
●●●●
●● ● ●
●● ● ●
●●
● ●
●
●
●
● ●
● ● ●●
●●
●
●●
●
●
●●●●●
●●●
● ● ●●
● ● ● ●●●●● ●
● ●●●
●
● ●
●
●
●
● ●●
●● ●
●
●●
●
●
● ● ●
●●●●
●
●
●
●
●● ●
●●
●
●●
● ●●
●
●●●
● ●●
●●
●●
● ● ● ●●
●
● ●●
●●
●
●
●
● ●●
●●
●
● ●● ●
●● ●●●●●●●
●
●
●● ●
●
● ●
●
●
●●● ●●●● ●●●
●
●
● ● ●● ●
●
●●●
●●
●
●●
● ●
● ● ● ●●
●
● ●●
● ●●
●
●●
●
●
●
●
●
●
●
●
●
● ●● ● ● ●
●
●●●
●
●
●●
●
●●●
● ● ●
●
●● ● ● ● ●
●
● ● ●●
●
●●● ●●●●
●●
●
● ●●●●● ●
●
●
●
●
●
●
●
●
●
●
● ●
●
● ●
●
●
●●
●●
●
● ● ●
●
●
●
●
●●
●
●●
● ●● ● ●
●
● ● ●●
●
●● ● ●
●●
●
●
●●
●
●
●
●
●
●
●
● ●●●
●
●
●
●
●
●● ●
1
2
3
Log 10 (mean FPKM)
●
●
●
●
●
Log 10 (Transformed Nm)
Mean mRNA (A549)
●
10.0
1000
10
●
Volume-corrected noise measure
(A549)
ACTA2
●
●
1
0
●
●
●
●
●
●
●
●
●
●
● ● ●
●
●
●
●
●
● ●●
●
●
● ●
●
●
●
●
●
● ●
● ● ● ●
●
●
● ●
●
●
●●
●
●
● ● ● ● ●
●● ●
● ●●
●● ●
● ●
●
●●
●● ● ●
● ●
●●
●
● ● ●●
●
●●
●
● ●
● ●●
● ●
●●●
●
●
● ● ● ●
●
●
●
●●●● ●● ● ● ● ●
● ●
●
●● ●
●
●● ●●
●
● ●
●
●
●
●
●
●
●
●
●
●
●●●
●
● ● ●● ●
●●
●
●● ● ● ●
● ●●
● ● ●
●
●
●●
●●
●
●
●
● ●●
●
●● ●●
●
●
●●
● ● ●
●●
●● ●●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●●
●
●
●● ●● ●●●● ● ●●
●
●
●
●
●●
●
●
●
●● ● ●●
●
●
●
● ●●●● ●● ●
●●●
●
●
●
●
●
●
●
●●●
●●
●
●
●
●
●
●● ●● ● ● ● ● ●
●
● ●●● ●
●
●●
●
●
●
●
●●
●
●
●
●●●●●
●
●
●● ●
●
●●●
● ● ●●
●
●
●
●
●●
●
●●●●
● ●● ●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●●●
● ●●
●●● ●
●
●●● ●
●
●●
●●
●
●● ●● ●
●●
●
●●
●●●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●●
●
● ●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●●
●●
●●
●
●
●
●
●
●
●●
●●
●
●
● ● ●
●●●
●●
●●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●●
●
●
●
●
●●
●● ●
●●
●
●●
●
●
●
●
●
●
●● ●●
●
●
●●
●
●●
●●
●
●
●
●●●● ●●●
●
●
●●
●
●
●
●
●●
●
●
●●
●
●●●●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●●●●
●
●● ●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●● ●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●●
●●●
●●
●●
●
●
●
●●●
●
●
●
●
●
●
●
●
●●●● ●
●
●
●●●
●
●
●
●
●
●
●●●
●
●
●
●
●
●●●●●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
● ●
●
●
●
●
●
●
●
●
●●●●●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●●●● ●
●
●
●
●
●
●
●
●
●
●
● ●
●●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●●●
●●
●●
●
●
●
●
●
●
● ●
●
●
●
●
●●●
●
●
●
●
●
●●●
●●
●
●
●
●
●
● ●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●●
●
●●
●
●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
● ●●● ●●●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
● ●● ● ● ●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
● ●●
●
●
●
●
●● ●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●● ●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●●
●
●●
●●
●
●
●●● ●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●● ● ● ●
●
●
●
●
●●
●
●
●
●●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●●
●●●●
●
●
●●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
● ●
●
●●
●
●
●
●
●
●
●
●
●
●●● ●● ●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●● ●●●●●●● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●
●● ●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●●
●
●
●●
●
●
●
●
●●
●●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●● ●● ● ● ●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
● ●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●●● ●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
● ●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
● ●●
●
●
●●
●●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
● ● ● ●● ●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●
●●
●●
●
●
●
●
●
●
●
● ●● ●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●● ●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●●
●
●●● ●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●●● ●●●
●
●
●
●
●
●
●
●
●
● ●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
●●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●●
●
●
●
●●
●
● ●●●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●●●
●
●●
●
●
●●
●●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●● ●
●
●● ●
●
●
●
●
● ●●●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●●
●
●
●
●●●
●
●●●●
●●
●●
●●
●
●●
●
●●
●●
●●
●
●●●
●
●●
●
●●
●
●●
●
●
●
●
●
●
●●
●
●● ● ● ●●●
●
●
●●●●●● ●●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●● ●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●●
●
●
●● ●●●
●
●
●●●●
●
●●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
● ●●
●
●●
●
●
●
●●
●
●
●
● ●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
● ● ● ●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
● ●● ●● ● ●●
●●
●
●
● ●●●● ●
●
●●●
●
●●●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●●●● ●
●
●●●
●● ●
●●
●
●●●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
● ● ● ●● ●
●●●
●
●
●
●●
●●●
●●
●
●●
●
●
●●●●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●●
●
●
●●
●
●
●●● ●●
●
●●
●●●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●●●●●●● ●
●
●●
●
●●●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●●
●
●●●●
●
●
●
●
●●
●●● ●●●
●●
●
●●
●
●●
●●●
●
●●
●●
●●
●
● ●● ●
●
●
●
●
●●
●●
●●
●
●●
● ●●
●
●
●●
●● ●●
● ●●● ● ●
●●
●●
●●
●
●
●●
●●●
●●
●
●
●●●
●●
●●
●
●
●
●
●● ●●
●●
● ● ● ●●
●●
●●
●●●
●●●
●
● ●
●
●
●
●
●●
●
●●
●
●●
● ●
●●
●●●
●
●
●●●●●
●●
● ●●
●
●●
●● ● ●
●●
●
● ●
●●
●●
●
●
●●
●
●●●
●
●●● ●
●●
●
●
●
●
●●
● ● ●● ●
●
●
●
● ●●
●●
● ●
●●
●
●●
●
●
● ●
●●● ●
● ● ●● ●
●
●
● ●●
●
●●
●● ●●●
●
●
●
● ●
●
●●
●
●● ●●
●
●●●
●●●
● ● ●●●
●● ● ●
●●●
● ●
●
● ●●
●●
●
●
●● ●● ●●● ●●●
● ●
●
● ● ● ● ● ●
●
●
●●
●
●
●
●● ●
●
●
●
●
●
●
●
●
● ●
●
● ●●
●
●●● ●
● ●● ●
●●
● ●
●●
●●
● ●
●●
●●
●
●
●
● ●
●
●●
●●
●
●
●
●
●● ●
●
●
●
●
●
●
●
●
●
●● ●
●●●
●
●●
●
●
●
●
●
●●
●
Cutoff for high Nm genes
Cutoff for low Nm genes
●
●
●
●
●
●
4
1
2
3
Log 10 (mean FPKM)
4
Figure S7, related to Figure 7.
A. “Count” vs. “volume” for GAPDH from single-cell sequencing data. We define “volume” as the ratio between genomic reads and ERCC reads for each
cell. This quantity is more representative of total RNA, which we know to be roughly proportional to volume, although the relationship is not exactly
proportional due to volume-independent transcription (see main text Figure 1). We observed two clearly distinct classes of cells, those with a volume
range that matches what we see by imaging and RNA FISH and those that have very low volumes. For unknown reasons, these cells ended up with a
considerably higher ratio of ERCC reads than genomic reads, and we eliminated them from our subsequent analyses.
B. ERCC counts vs. genomic counts for the cells that we kept and those we eliminated.
C. Correspondence between CV2 of RNA FISH counts and CV2 of inferred counts from single-cell sequencing (transforming FPKM as in Supplemental
Figure 6 and Methods). Each data point represents a single gene.
D. Same as C, except using CV2 of FPKM from single-cell sequencing instead. Correlation is slightly higher than in C.
For C and D, error bars represent 95% confidence intervals, calculated by bootstrapping.
E. Average mRNA counts in cycling primary fibroblasts and A549 cells, calculated using RNA FISH. Gray line indicates a 1:1 correspondence. Error bars
represent standard error of the mean.
F. Volume-corrected noise measure in cycling primary fibroblast and A549 cells, calculated using RNA FISH. Gray line indicates a 1:1 correspondence.
Nm calculated by bootstrapping; error bars represent 95% confidence interval. Data in E and F for each gene is a combination of at least two biological
replicates, with at least 30 cells per replicate.
G. FPKM measurements from bulk RNA-sequencing in primary fibroblast and A549 cells. Each point represents one gene. We classified genes as
“ubiquitously expressed” if they had >5 FPKM in both cell types and differed by less than a factor of 2 in FPKM across the two cell types. We considered
genes “fibroblast specific” if they had >5 FPKM in fibroblasts and their FPKM was greater than five times higher in fibroblasts that A549 cells.
H. Single-cell RNA-sequencing data in primary fibroblast cells. Each point represents one gene. We used the method described in Supplemental Figure
6 and Methods to calculate Nm for each gene. We observe that higher abundance genes typically have lower Nm values. Red line indicates best fit line.
I. The same data as in H, but transformed to remove the abundance dependence from Nm. Red line here is transformed fit line from H. We use this
transformed data to select abundance-matched “low” and “high Nm” genes using a cutoff of Nm=0.5 and Nm=-0.5, respectively. We selected 307 high
Nm genes and 257 low Nm genes. Note that these high Nm genes actually have a higher mean abundance (FPKM=196.5) than the low Nm genes
(FPKM=55.4), thus showing that the observed differences in noise levels are not due to the overall increase in noise in genes of low abundance.
Supplemental Experimental Procedures
Cell culture:
We grew primary human foreskin fibroblast cells (CCD-1079Sk, ATCC CRL-2097™) and A549
cells (human lung carcinoma, A549, ATCC CCL-185™) in Dulbecco’s Modified Eagle Medium
(DMEM) supplemented with 10% FBS and 50U/mL penicillin and streptomycin (Pen/Strep). To
create quiescent cells, we grew primary fibroblast cells in DMEM with Pen/Strep, without FBS
for seven days. We cultured WM983b-GFP-NLS cells (WM983b is a human melanoma from the
lab of Meenhard Herlyn) in Tu2% media (78% MCDB media, 20% Leibovitz’s L-15 media, 2%
FBS, and 1.68 mM CaCl2). The WM983b-GFP-NLS contains EGFP fused to a NLS driven by a
cytomegalovirus promoter that we stably transfected into the parental cell line. Before imaging,
we plated cells on two-well Lab-Tek chambered coverglasses.
RNA fluorescence in situ hybridization and imaging:
We performed single molecule RNA FISH on the samples as described previously (Femino et
al., 1998; Raj and Tyagi, 2010; Raj et al., 2008). Briefly, we fixed the cells in formaldehyde or
methanol, performed RNA FISH using the specified pools of oligonucleotides, then washed and
stained nuclei with DAPI. We fixed most cells in this study using formaldehyde, but used
methanol for the experiments involving transcription site quantification because it resulted in
more accurate transcription site detection. We stained the actin cytoskeleton with PhalloidinAlexa 488 (Life Technologies) to detect cell boundaries.
We typically co-stained with sets of probes targeting many different mRNA. Typically, we used
exon probes labeled in Alexa 594, introns with Cy3, Cyclin A2 with Atto 647N (which labels cells
in S, G2 and M phase (Eward et al., 2004)) and GAPDH with Atto 700. To distinguish cells in S
phase from G2 (Robertson et al., 2000; Whitfield et al., 2002), we labeled Histone H4 mRNA
with Atto 647N and Cyclin A2 mRNA with Atto 700 (see Supplemental Fig. 3). Supplemental
data file 1 lists all sequences of oligonucleotide probes.
We imaged the cells with a Nikon Ti-E equipped with appropriate filter sets. We took a series of
optical z-sections, each 0.2-0.35 microns high, that spanned the vertical extent of the cell.
Image analysis and quantification:
We manually identified cell boundaries and counted and localized RNA spots using custom
software written in MATLAB (Raj and Tyagi, 2010; Raj et al., 2008). We estimate the technical
error in our RNA count determination to be at most 15%.
To compute the volume of a cell, we detected the 3D positions of a highly abundant mRNA by
RNA FISH. We selected only the points that defined the outer boundary of the cell by examining
each point and its neighbors within a 4µm radius. We kept only the points that had a higher zposition than their neighbors (signifying the top of the cell) or points that had no neighbors within
180 degrees (signifying the side of the cell). Once we had the points, we interpolated the points
to identify a smooth representation of the cell surface. We repeated this in both an upward and
downward direction to identify the top and bottom of the cell. We calculated the volume of the
cell by summing the heights between the top and bottom.
Calculating the volume in this manner will always result in an underestimation of the actual
volume. To correct for this bias, we first computed the outline of the cell as described above. We
then dilated this hull, filled it with the same number of randomly distributed points, and then
repeated the algorithm on this new set of points. If this volume matched that computed with the
actual spots in the cell, we then computed the volume by integrating between the top and
bottom boundaries of the dilated hull.
We used GAPDH mRNA as the primary mRNA for our volume determinations, but the volume
computation did not depend on the number of spots identified nor on the choice of volume-filling
gene (Supplemental Fig. 1). We limited ourselves to the cytoplasmic volume by removing a
vertical cylinder corresponding to the nuclear outline. This procedure does exclude the
cytoplasmic volume above and below the nucleus, but that region only comprised a very small
proportion of the total cytoplasmic volume.
We identified transcription sites through intron/exon probe colocalization. We manually
annotated transcription sites by visually inspecting images of intron and exon probes to
determine instances of colocalized signal. To determine spot intensity, we identified the z-plane
of maximum intensity in a 0.375µm-square region around the manually selected spot. We
defined the intensity as the difference between this maximum value and a background value.
For the background value, we used the median intensity in a 3.75µm-square annular region
around the maximum intensity point. Note that transcription site intensity need not necessarily
linearly relate to transcriptional burst size (Senecal et al., 2014).
RNA degradation:
We measured RNA degradation by inhibiting transcription for four hours by applying
actinomycin D at 1ug/ml. We measured degradation of UBC and IER2 mRNA because they
exhibited a strong correlation with volume while having a half-life short enough to enable us to
observe substantial degradation within four hours of actinomycin D treatment while avoiding
non-specific effects at longer times.
We used a model to determine whether degradation was volume-dependent (degradation ~ 1/V)
or volume-independent (degradation ~ constant). We first fit the untreated mRNA vs. volume
data with a line having zero intercept. If degradation is volume-independent, we expect the
treated cells to also be well-fit by a line having zero intercept, where the slope is determined by
the untreated fit and an exponential decay term:
!!" (!) = !! !! !!" ,
where !!" is the mRNA count after 4 hours of treatment, !! is the slope of the untreated data
(! = 0), ! is the decay constant (degradation rate), and ! is the treatment time. Note that here !
is the only fit parameter and is independent of volume.
If degradation is volume-dependent, the equation becomes:
!!" (!) = !! !! !!"/! .
Here, !/! is the decay constant (degradation rate), but ! itself is independent of volume and is
the fit parameter.
The line and curve described by these equations are the fits to the raw data, and the decay
constants ! and !/! are the fits we show to the calculated decay constants that we show in Fig.
3A-B.
We calculated the actual decay constant (Fig. 3A-B) for each cell measured at the 4 hour
timepoint assuming exponential decay:
!!" (!) = !!" (!)! !!" ,
where we approximate !!" = !! !, and ! could in principle be either volume-dependent or independent.
LMNA siRNA knockdown:
We used an siRNA targeting LMNA (Cat. #: AM16708, ID: 40502) at 30nM and a “scramble”
control siRNA (Cat. #: AM4611) at 30nM. We incubated primary fibroblast cells with the siRNA
for 72 hours. We verified protein knockdown via Western blot, using the SC-20680 (rabbit)
antibody and a goat-anti-rabbit 680 RD secondary (Licor 926-68071).
Heterokaryon formation:
We created heterokaryons by separately culturing primary fibroblast cells and WM983b-GFPNLS cells. Once the plates were 70-90% confluent, we trypsinized the cells, resuspended them
in DMEM Complete media, and combined half of each plate of cells in a 15ml tube. We pelleted
the cells and resuspended in PEG for 2 minutes. We added media over the course of five
minutes to allow cells to fuse, then plated the cells onto two-well chambered coverglasses (LabTek) and fixed the cells after 12 hours.
We identified heterokaryons as cells with two nuclei that expressed both GFP (WM983b-GFPNLS only) and GAS6 mRNA (primary fibroblast) by RNA FISH. We eliminated all homokaryons
from our analyses.
Fractionation and RNA polymerase II Western blot:
We performed cell fractionation as described in (Bhatt et al., 2012) and based on (Wuarin and
Schibler, 1994) with modifications. We conducted all subsequent steps on ice or at 4°C and in
the presence of 25 µM α-amanitin (Sigma, A2263) and Protease inhibitors cOmplete (Roche,
11873580001) according to manufacturer’s instructions. We pre-chilled all buffers on ice before
use. We grew primary fibroblast cells to a confluency of 90%. We removed media and washed
plates twice with 1x PBS before scraping cells into 1x PBS. We collected cells by centrifuging at
500 g for 10 min. We gently resuspended the cell pellet corresponding to 1x107 cells in 200 µl
cytoplasmic lysis buffer (0.15% NP-40, 10 mM Tris-HCl pH 7.0, 150 mM NaCl). We incubated
the cell lysate for 5 min on ice, layered it onto 500 µl sucrose buffer (10 mM Tris-HCl pH 7.0,
150 mM NaCl, 25% sucrose) and centrifuged at 16,000 g for 10 min. We carefully removed the
supernatant (600 µl) corresponding to the cytoplasmic fraction. We gently resuspended the
nuclei pellet in 400 µl nuclei wash buffer (0.1% Triton-X-100, 1 mM EDTA, in 1x PBS) and
centrifuged it at 1,500 g for 1 min. We removed the supernatant and gently resuspended the
pellet in 200 µl glycerol buffer (20 mM Tris-HCl pH 8.0, 75 mM NaCl, 0.5 mM EDTA, 50%
glycerol, 0.85 mM DTT). Next, we added 200 µl nuclei lysis buffer (1% NP-40, 20 mM Hepes pH
7.5, 300 mM NaCl, 1M Urea, 0.2 mM EDTA, 1 mM DTT), vortexed, incubated on ice for 2 min
and centrifuged at 18,500 g for 2 min. We carefully removed the supernatant corresponding to
the nucleoplasmic fraction (350 µl) and added 250 µl 1x PBS/Protease inhibitors cOmplete to
adjust the volume for Western blot experiments (described below). We resuspended the
chromatin pellet in 600 µl chromatin resuspension solution (25 µM α-amanitin, Protease
inhibitors cOmplete, in 1x PBS).
We monitored the success of cell fractionation by Western blot analyses. For Western blot
analyses, we probed membranes with the following primary antibodies: Pol II (F-12, Santa Cruz
Biotechnology; directed against the N-terminal region of Rpb1), Pol II Ser2-P (3E10, Active
Motif), Pol II Ser5-P (3E8, Active Motif), Histone 2B (FL-126, Santa Cruz Biotechnology), U1
snRNP70 (C-18, Santa Cruz Biotechnology) and GAPDH (6C5, Applied Biosystems). Next, we
probed membranes with Cy5- and Alexa Fluor 647-conjugated secondary antibodies (Cy5 goat
anti-mouse, A10524; Cy5 goat anti-rabbit, A10523; Cy5 goat anti-rat, A10525; Alexa Fluor 647
rabbit anti-goat, A21446; Life Technologies), and scanned using a Typhoon 9400 scanner (GE
Healthcare). We quantified fluorescent signals with ImageJ 1.47v software.
Triptolide:
We degraded RNA polymerase II in primary fibroblast cells by incubating cells in 100nM
triptolide for one hour, then fixed cells in methanol (control cells remained untreated).
Cell size verification:
To check that fixation did not alter cell size, we monitored the size of cells through the fixation
and permeabilization process by fixing cells while on the microscope stage. We monitored cell
area by taking images in brightfield, and we monitored cell height by coating the cells with
fluorescent beads and imaging them in a series of optical z-sections. We took images of the
same cells after 10 minutes of fixing in 4% formaldehyde and after 30 minutes of
permeabilization in ethanol. We calculated cell area by segmenting the cells as usual, and we
determined height by identifying the plane of the bottom of the cell and the plane of the top of
the cell (the last plane where beads remain motionless) and subtracting the two values.
Quantification of cell-to-cell variability:
We developed a phenomenological metric for cell-to-cell variability that takes into account both
volume-correlated and volume-independent contributions to mRNA numbers per cell (see
supplemental note for derivation and further information). We also used a model of
transcriptional bursting with volume-dependent transcription that enabled us to quantify
transcriptional parameters from population distributions of mRNA counts and volumes.
Repli-seq analysis:
We accessed Repli-seq data from Hansen et al. 2010 (Hansen et al., 2010) using the UW Repliseq track on the UCSC Genome Browser.
Bulk RNA Sequencing:
We sequenced total RNA from primary fibroblast cells. We used the NEB Next Ultimate Library
Preparation Kit for Illumina and the Ribo-Zero Magnetic Gold Kit. We used 50b single-end reads
and sequenced each of two replicates at a depth of 10-15M reads. We aligned reads to hg19
using STAR’s included annotation. We quantified reads per gene using HTSeq and a RefSeq
hg19 annotation. We calculated FPKM for each gene using R. All sequencing data is available
at GEO accession number GSE66053.
Single-cell RNA Sequencing:
We isolated 96 single cells, lysed, and performed first- and second-strand synthesis on a
Fluidigm C1 Single-Cell Auto Prep System using a large size chip. We spiked in ERCC
(External RNA Controls Consortium) RNA controls, Mix 1 (Ambion 4456740) at a concentration
of 1:10,000 before adding the cells to the C1. We prepared cDNA libraries using the Nextera XT
DNA Sample Preparation Kit (Illumina, PN FC-131-1096) and used 96 paired barcodes from the
Nextera XT DNA Sample Preparation Index Kit (96 Indices, 385 Samples) (Illumina, PN FC131-1002) following the abbreviated Fluidigm protocol for the Nextera XT kit. We sequenced the
samples on a NextSeq 500 using 75b paired-end reads to a depth of ~1-2M reads per sample.
To quantify sequencing data, we aligned reads to hg19 (using STAR’s included annotation) and
the ERCC reference transcripts. We quantified reads per gene using HTSeq and a RefSeq hg19
annotation. All sequencing data is available at GEO accession number GSE66053.
Single-cell RNA Sequencing Calibration and Analysis:
We independently calculated ERCC and genomic FPKM for each sample, normalizing to the
total number of reads mapped to ERCC loci or genomic loci, respectively. All FPKM data shown
for endogenous genes is this genomic FPKM. For each cell, we considered the ratio of total
genomic reads to total ERCC reads to be proportional to the total starting amount of mRNA in
that cell.
We sequenced 96 wells total, of which 5 were “control” wells that contained no cells and 14
were wells containing fixed cells. We excluded these 19 cells from the analysis. Further, we
excluded 12 cells that had fewer than 1 million total reads, and 21 cells that had a
genomic/ERCC read ratio of less than 30. We performed all further analyses on the 44
remaining cells.
Transform read ratio to volume: We assumed that the ratio of genomic/ERCC reads for each
sample was proportional to the total mRNA in each cell. We also assumed that, for our RNA
FISH measurements, total GAPDH mRNA counts were proportional to the total amount of
mRNA in each cell. The distributions for total mRNA obtained in this manner were similar
between RNA FISH and single-cell RNA sequencing, but had different means. We therefore
normalized the sequencing data to have the same mean as the RNA FISH distribution. From
our RNA FISH dataset, we have many co-measurements of GAPDH mRNA (total mRNA) and
volume from which we establish a transformation equation between total mRNA and volume.
We obtained this transformation equation using PCA, or orthogonal regression. Using this
equation, we transformed total mRNA obtained through sequencing into actual volume in
picoliters.
Transform FPKM to molecule count: FPKM is more a measure of mRNA concentration than
mRNA count, as it is normalized to total reads. To get a measure more similar to mRNA count,
for each cell, we multiplied each gene’s FPKM by the genomic/ERCC count ratio (“volume”) of
the cell. For each gene in our RNA FISH dataset, we fit the log of the seq “counts” and the log of
the actual counts from RNA FISH by orthogonal regression. We then used this transform to
convert the FPKM of all genes to their RNA FISH count equivalent. Note that, because we fit in
log space, the transform between FPKM and count is nonlinear, and actually scales as
approximately FPKM ~ (RNA FISH)1.7.
Once we had our single-cell sequencing data in terms of RNA FISH count and volume in
picoliters, we calculated Nm as described for RNA FISH. We performed all of our sequencing
analysis in R.
C. elegans growth and imaging:
We grew N2 (wild type) and CB502 (sma-2 mutant) C. elegans on NGM agar plates with OP50
lawns, kept at 20º C. Every 2-3 days, we transferred a small portion of each strain to new plates
to prevent overgrowth.
We released the worms off of the plates using phosphate buffered saline (PBS) solution, then
fixed with 4% formaldehyde for 45 minutes. We permeabilized and stored the worms in 70%
ethanol. We performed the RNA FISH protocol, then mounted the sample between a slide and
coverglass before imaging.
We manually identified head and gonad boundaries and counted and localized RNA spots using
custom software written in MATLAB.
To compute the volume of each worm segment, we multiplied the area of the segment by the
height of the segment (thus approximating the segment as a prism). We determined the height
by taking the vertical difference between the highest and lowest RNA spots’ positions, as
determined by our software. We determined the number of cells in each segment through
manually counting the DAPI-stained nuclei.
We obtained data from multiple segments. When combining the data (number of mRNA spots
per volume or per nucleus), we weighted each segment by its volume.
Model of di↵usible trans factor for volume:DNA ratio sensing
Here, we outline a fairly generic model for how a di↵usible trans factor may transmit information
on the ratio of volume to DNA to lead to increased transcription in larger cells irrespective of DNA
content. The primary assumptions are that the factor is predominantly localized to the nucleus, the
factor is required for mRNA transcription, and the cellular concentration of the factor is roughly
constant irrespective of cellular volume (i.e., the total amount of factor is proportional to cellular
volume). RNA polymerase II holoenzyme satisfies these conditions, although we do not claim that
RNA polymerase II is the factor.
The model assumes binding of the factor to the DNA, and that only bound factor can result in
productive transcription. The goal of the model is to provide a basis for the empirical finding that
larger cells have increased transcription from the same absolute number of DNA molecules. Our
model encompasses two broad categories of mechanism that would lead to a perfectly linear scaling
of transcription with cytoplasmic volume: (1) The factor is sequestered entirely in the nucleus, and
so if the nucleus doesn’t change with cellular volume, the concentration of the factor in the nucleus
will be proportional to the total amount of factor. Thus, the factor will be proportionally more
bound to the DNA in a larger cell than a smaller cell, producing more transcription. (2) The factor
is a purely “limiting” factor in the sense that it has a very high affinity for DNA and the number of
binding sites exceeds the amount of factor. In this situation, essentially all available factor will be
bound to the DNA, and so for each gene, there would be proportionally more transcription in larger
cells because more factor would be bound to DNA. These mechanisms are not necessarily mutually
exclusive. The model incorporates affinity and nuclear volume as parameters, and so encompasses
both of these potential mechanisms.
Briefly, the conclusion we derive from our model is that both scenarios pose viable mechanisms
for scaling transcription with cellular volume. That said, we overall mildly favor scenario 1. Our
data show that nuclear volume increases somewhat with nuclear size, which the model predicts
should lead to a slight decrease in transcription in larger cells, and thus a higher concentration of
mRNA in smaller cells, which is precisely what we observe. Moreover, there is a rough quantitative
agreement between the degree of increased nuclear volume and the higher concentration of mRNA
in smaller cells. Definitively proving that the cell follows scenario 1 of our model will require further
experiments.
We begin with a few definitions. We use quantities within brackets to denote concentration
(molecules per volume) and quantities without brackets to denote number of molecules per cell.
For instance, factorfree is the number of free molecules of the factor, while [factorfree ] is the number of
free molecules per unit volume. factorDNA denotes the number of factor molecules instantaneously
bound to DNA, factortotal is the total amount of factor in the cell/nucleus, and DNA is the number of
binding sites on the DNA for the factor in the nucleus. KDNA is the binding affinity of the factor for
a particular gene. The cellular volume is given by V , and the nuclear volume by Vnuclear . Thus, given
our assumption of proportionality, we define pfactor to be a constant such that factortotal = pfactor V .
The total factor is given by
factortotal = factorfree + factorDNA ,
(1)
Dividing by the nuclear volume, we arrive at a relationship between concentrations:
[factortotal ] = [factorfree ] + [factorDNA ] .
(2)
The binding affinity is defined via mass action as
KDNA =
[factorfree ] [DNA]
.
[factorDNA ]
(3)
and may be di↵erent for di↵erent genes owing to promoters having di↵erent numbers of binding
sites for the factor or di↵erent binding affinities.
Thus, the total concentration of the machinery bound to DNA is
[factorDNA ] =
([factorDNA ]
[factortotal ]) [DNA]
.
KDNA
(4)
Solving for [factorDNA ], we find:
[factorDNA ] =
[factortotal ] [DNA]
.
KDNA + [DNA]
(5)
In the limiting case where KDNA = 0, we expect all of the factor to be bound to DNA, and in
that case, we find [factorDNA ] = [factortotal ], as expected.
Relating concentrations to volumes yields
[factortotal ] =
pfactor V
,
Vnucleus
(6)
where pfactor is the proportionality constant defined earlier. Similarly,
[DNA] =
DNA
.
Vnucleus
(7)
Hence,
[factorDNA ] =
V
DNA
pfactor Vnucleus
Vnucleus
KDNA +
DNA
Vnucleus
.
(8)
Simplifying,
[factorDNA ] =
✓
1
Vnucleus
◆
pfactor · V · DNA
.
KDNA · Vnucleus + DNA
(9)
Because [factorDNA ] = factorDNA /Vnucleus , we can solve for the total amount of transcriptional
machinery bound to DNA:
factorDNA =
pfactor · V · DNA
.
KDNA · Vnucleus + DNA
(10)
In the limiting case KDNA = 0 here, we find that factorDNA is directly proportional to volume,
and equal to the total amount of factor in the nucleus irrespective of nuclear volume. However,
in the case where KDNA is not zero, then the volume of the nucleus will result in deviations from
perfect scaling of transcription with cellular volume. Intuitively, if the volume of the nucleus
increases somewhat in larger cells, then the concentration of the factor and the DNA will decrease
and hence the amount of factor bound to DNA will be somewhat less than it would be otherwise.
In that case, larger cells would have somewhat less transcription than would be expected in the case
of perfect scaling of transcription with cellular volume, which fits with our experimentally observed
volume-independent transcript abundance (i.e., decreased mRNA concentration in larger cells).
We also observed that nuclear volume is somewhat greater in larger cells. Thus, it was possible,
at least qualitatively, that the increase in nuclear volume could explain the apparent decrease in
mRNA concentration in larger cells. We thus wanted to check whether there is a quantitative
agreement between our observed relationship between nuclear volume and cytoplasmic volume and
the increased mRNA concentration in smaller cells, which would establish the plausibility of such
a model.
As mentioned, our measurements show that nuclear area and cellular volume positively correlate.
Approximating nuclear volume by raising nuclear area to the 3/2 power, we find a linear relationship
between nuclear “volume” and cellular volume (Vnucleus / a + bV ), with y-intercept a = 2169
femtoliters (95% C.I. = (1923, 2381)), and slope b = 0.9354 (95% C.I. = (0.8313, 1.042)). It is
important to note that the while the relationship is well-fit by a line, the line does not pass through
zero, and so nuclear volume is not directly proportional to total cellular volume. Using this linear
relationship, we can express the total amount of factor bound to DNA as a function of cellular
volume:
pfactor · V
⌘
factorDNA (V ) = ⇣
,
(11)
a
˜ + ˜bV + 1
DNA
DNA
where a
˜ = KDNA
· a and ˜b = KDNA
· b. The ratio a/b (= a
˜/˜b) has units of volume, and is geometrically
equivalent to the x-intercept of the line of best fit between nuclear volume and cellular volume.
We now wanted to check whether the volume-independent transcription we observed in our
mRNA-volume plots would quantitatively agree with this model. Because the factor is required
for transcription and only transcribes when bound to DNA, then each gene essentially grabs a
fixed proportion of the amount of factor bound to DNA. (This fraction will depend on the specific
regulation of the gene.) So the total transcription of a gene will be proportional to factorDNA . Thus,
lumping together this proportionality constant along with mRNA production and degradation and
other associated constants into a constant c, the relationship between RNA and volume is given
by:
RNA(V ) = c · factorDNA (V ) .
(12)
We should be able to fit our RNA vs. volume data using the above equation to obtain estimates
for a
˜ and ˜b, in particular their ratio, which is directly comparable to the ratio a/b.
We did so for three genes and found fitting parameters:
a
˜ (fL)
95% C.I. (˜
a) (fL)
˜b
95% C.I. (˜b)
a
˜/˜b (fL)
95% C.I. (˜
a/˜b) (fL)
U BC
1.578
(0.8524, 2.461)
1.936⇥10 4
(-7.617⇥10 5 , 4.595⇥10
7044
(-62743, 111200)
4)
ZN F 444
95.68
(70.23, 127.9)
0.01495
(0.004163, 0.02394)
6446
(2988, 28110)
EEF 2
0.2747
(-0.1518, 0.5630)
0.0003658
(0.0002680, 0.0005879)
744.4
(-253.5, 2064)
For the fit of nuclear area to volume, we find the ratio a/b = 2329 femtoliters, with a 95%
confidence interval of (1844, 2851). We note that the ratios a
˜/˜b for all of our genes are of that
same order of magnitude, albeit with large error. This result suggests that the above equation
for factorDNA (V ) may be the equation governing the production of mRNA in cells. This model
provides an explanation that is quantitatively consistent with our data for why smaller cells exhibit
proportionally slightly more transcription than larger cells—nuclei in small cells are slightly smaller
than those in large cells, increasing the concentration of factorDNA (V ), and therefore increasing
transcription.
Our results are consistent with RNA polymerase II holoenzyme being the factor. RNA polymerase II is required for transcription, transcribes when bound to DNA, and is almost exclusively
localized to the nucleus. Also, most reports indicate that most RNA polymerase II in the nucleus
is not specifically bound to DNA. Based on that fact, one would expect that increased nuclear size
should lead to slight under-transcription, as we observed. Our analysis shows that this relationship is quantitatively plausible. Further studies will be required to rigorously establish that RNA
polymerase II holoenzyme is the factor that connects volume/DNA ratio to transcription.
Note, however, that in many situations such as in early embryogenesis, nuclear size does change
dramatically as a function of cellular volume. In these situations, the mechanism we describe would
face a challenge because the concentration of RNA polymerase II holoenzyme in the cell’s nucleus
would remain the same after division (and the associated decrease in cellular volume), leading to
over-transcription. The limiting factor model (scenario 2, with KDNA very small) would not su↵er
from these issues. It is possible that some intermediate scenario is at play in early embryogenesis.
Another problem with these models is the potential for runaway positive feedback, in which a
random increase in the factor would lead to more production of the factor, thus leading to runaway
transcription. For this reason, we expect that the cell maintains strong control of factor levels
to avoid these issues. Ultimately, a complete understanding of factor dynamics will likely require
adding growth to models of transcriptional homeostasis.
Computing volume-corrected noise measure from single-cell mRNA
and volume measurements
We define volume-corrected noise measure as the cell-to-cell expression variability in mRNA levels
that cannot be accounted for by cell-to-cell di↵erences in volume. Throughout the section, we
2 .
denote the variance of a random variable X by X
Let m and V be random variables denoting single-cell mRNA level and volume, respectively. The
expected number of mRNA transcripts in a cell given its volume V is assumed to increase linearly
with V , i.e.,
hm|V i = a + bV =) hmi = a + bhV i,
(13)
where h.i represents the expected value, and a, b are gene-specific constants (volume-independent
and volume-correlated transcript abundance, respectively). From (13), the covariance between m
and V is given by
Cov(m, V ) = hmV i
hmihV i = h(a + bV )V i
(a + bhV i)hV i = b
2
V.
(14)
The extent of cell-to-cell variability in mRNA counts that can be accounted for by volume is
2
hm|V i
=
2
a+bV
= b2
2
V,
(15)
which using (14) can be written
2
hm|V i
= bCov(m, V ).
(16)
Volume-corrected noise measure N m defined as
N m :=
2
m
2
hm|V i
hmi2
(17)
is obtained as follows using (16)
N m = CVm2
bCov(m, V )
= CVm2
hmi2
S
Cov(m, V )
,
hmihV i
(18)
where CVm2 represents the total variability in mRNA levels measured by its Coefficient of Variation
(CV ) squared and
S=
bhV i
bhV i
=
.
hmi
a + bhV i
(19)
Noise measure in a two-state promoter model
Consider two alleles, where each allele transitions independently between active and inactive states
with rates kon and kof f . We assume that the transcription rate from the active state increases
linearly with cell volume V . We first compute CVm2 (mRNA coefficient of variation squared) for
the case where transcription is independent of volume, and then extend it to the volume dependent
case.
Transcription rate independent of volume
Let the transcription rate from active state be km . Then, the steady-state first and second-order
moment of the mRNA level m is given by
hmi =
2Gon km
m
, hm2 i = hmi +
m (1
2(Gon
Gon )hmi2
+ hmi2 ,
m + kon )
(20)
where
Gon =
kon
kon + kof f
(21)
is the fraction of time an allele is in the active state, and m is the mRNA degradation rate. Note
that the factor two in (20) arises due to the presence of two alleles. This results in
CVm2 :=
hm2 i hmi2
1
Gon )
m (1
=
+
.
hmi2
hmi 2(Gon m + kon )
(22)
Transcription rate dependent on volume
We assume km = a + bV , where volume V is a random variable with mean hV i and variance
Based on (20),
hm|V i =
2Gon (a + bV )
m
, hm2 |V i = hm|V i +
Gon )hm|V i2
+ hm|V i2 .
2(Gon m + kon )
m (1
2
V.
(23)
Unconditioning on the volume we obtain
hmi =
2Gon (a + bhV i)
(24a)
m
hm2 i = hmi +
⌦
Gon ) hm|V i
2(Gon m + kon )
m (1
Using (23)
⌦
hm|V i
2
↵
=
*✓
2Gon (a + bV )
m
◆2 +
↵
2
⌦
↵
+ hm|V i2 .
= hmi2 (1 + S 2 CVV2 ),
(24b)
(25)
where the mean mRNA count hmi is given by (24a), S is given by (19) and CVV2 is the volume
CV 2 . Substituting (25) in (24b)
hm2 i =
m (1
Gon )hmi2 (1 + S 2 CVV2 )
+ hmi2 (1 + S 2 CVV2 ) + hmi.
2(Gon m + kon )
(26)
Above equation yields
CVm2 :=
hm2 i hmi2
1
=
+
2
hmi
hmi
Gon )(1 + S 2 CVV2 )
+ S 2 CVV2 .
2(Gon m + kon )
m (1
(27)
As expected, (27) reduces to (22) when CVV2 = 0. In (27), the first term represent Poissonian noise
in mRNA level due to random-birth death of individual mRNA molecules. The second term is the
noise contribution from stochastic promoter switching. Variation in mRNA levels due to cell-to-cell
di↵erences in cell volume is represented by the last term. Removing the last term, we obtain the
noise measure as
Nm =
Gon )(1 + S 2 CVV2 )
.
2(Gon m + kon )
m (1
1
+
hmi
(28)
Estimating promoter transition rates between active and inactive
states
Noise measures obtained from single-cell mRNA count and volume measurements are used to
estimate promoter transition rates kon and kof f using (28). To correct for measurement noise,
we take into account a 15% error in mRNA counting. From (21) and (28)
Gon =
kon
kon + kof f
Nm =
1
+
hmi
(29a)
Gon )(1 + S 2 CVV2 )
2
+ CVcount
,
2(Gon m + kon )
m (1
(29b)
Average promoter dwell-time (30) obtained from the noise measure (Eq. (29)) or the total mRNA
expression variability (Eq. (31)). mRNA half-lives and dwell times are reported in hours.
Gene
Gon
CVm2
N m/CVm2
ACTN4
GAPDH
EEF2
FTL
ICAM1
ACTA2
LUM
SUPTH5
0.65
0.65
0.75
0.3
0.05
0.2
0.15
0.3
0.14
0.23
0.18
0.58
0.8
1.18
0.7
0.12
0.4
0.17
0.17
0.32
0.8
0.9
0.75
0.58
mRNA
half-life
13
24
16
24
6.5
3
24
10
Ton , Tof f
from N m
6.1, 3.3
4.9, 2.6
3.2, 1.1
6.3, 14.6
0.5, 9.7
5.1, 20.3
7.8, 44.3
0.45, 1.1
Ton , Tof f
from CVm2
40.5, 21.8
331, 178
1443.6, 481.2
45.4, 105.9
0.8, 15.4
7.9, 31.6
12.7, 72.5
1.4, 3.3
2
where CVcount
= 0.152 = 0.0225 represents the mRNA counting error. In the above equations,
quantities N m, S (defined in (19)), CVV2 (volume CV 2 ), hmi, Gon are computed from data for a
given gene. Using mRNA half-life information from literature, rates kon and kof f can be estimated
by solving (29). The average promoter dwell time in the active and inactive state is given by
Ton =
1
kof f
, Tof f =
1
,
kon
(30)
respectively, and reported in Table I under the column “Ton , Tof f from N m”. Since there may
be other unaccounted sources of noise in gene expression, these dwell time estimates should be
considered an upper bound on their actual values.
We contrast the above estimates to the scenario where all the mRNA expression variability is
assumed to arises from transcriptional bursting. From (22), kon and kof f in that case would be
estimated by solving
kon
kon + kof f
1
Gon )
m (1
2
=
+
+ CVcount
.
hmi 2(Gon m + kon )
Gon =
CVm2
(31a)
(31b)
Since CVm2 > N m, dwell times obtained from (31) (see “Ton , Tof f from CVm2 ” in Table I) are
significantly larger than those obtained from (29). For example, using the GAPDH noise measure,
we estimate Ton = 4.9 hours. However, if one ignores the contribution of cell volume in driving
intercellular variation in GAPDH mRNA, the mean dwell time in the active state is obtained to be
13 14 days (331 hours) from (31).