Characterizing and predicting cell population behavior is critical to the efficiency and flexibility of manufacturing processes of cell-based products. Characterization is challenging because individual cells within populations demonstrate heterogeneous and dynamic characteristics, and it is often unclear what measurements provide meaningful and predictive information about important population indicators of critical activities such as proliferative capacity or differentiation potential.
Heterogeneity of cell populations indicates the many ways cells can process information.
We study heterogeneity in gene expression by inserting a gene for a fluorescent protein downstream of the gene we are interested in, and we examine large numbers of cells by imaging or flow cytometry. We think of the heterogeneous distributions of responses in a cell population (Panel A) as analogous to a potential energy landscape (Panel B).
This theoretical construct allows us to model biological networks as nonequilibrium thermodynamic systems. This approach provides a window into which are the most important characteristic measurands of the population. Quantitative imaging of individual cells over time allows us to determine the rate with which cells fluctuate in their expression of the probe.
We then use that dynamic information in a stochastic model to predict the rates of response of the population to perturbations (1).
Measuring the temporal expression of more than one fluorescent reporter simultaneously in individual cells can provide data that indicate strong correlations – causative relationships – between network components (2).
The relationships between transcription factors that control pluripotency and differentiation will likely produce a 2-dimensional landscape like this:
Quantifying temporal fluctuations of reporter molecules in Induced pluripotent stem cells requires segmenting cells that exist very close to one another in colonies and tracking them over long times as they divide. This requires imaging each cell at a rate of about every 2 minutes, making it necessary to minimize the use of fluorescence and to train AI models to recognize and track cells in bright field imaging. And of course, one needs to be able to sample thousands of cells within that time frame, resulting in very large volumes of data Advances in imaging and image analysis capabilities are under development in support of this data-driven theoretical approach.