To create comprehensive reference maps of all human cells—the fundamental units of life—as a basis for both understanding human health and diagnosing, monitoring, and treating disease.
Below is the list of papers that are part of the Human Cell Atlas.
These have been approved by the HCA Publication Committee (composed of HCA Organising Committee members). The HCA Publication Committee reviews submitted publications to check if they fit within HCA technical scope. This review by the HCA Publication Committee does not serve as peer review.
We encourage you to submit your paper for inclusion within the Human Cell Atlas publications, which can help your paper gain more visibility. Please see here for the Publication Review process.
Nature Biotechnology 2023;1-12
Assessment of single-cell gene expression (single-cell RNA sequencing) and adaptive immune receptor (AIR) sequencing (scVDJ-seq) has been invaluable in studying lymphocyte biology. Here we introduce Dandelion, a computational pipeline for scVDJ-seq analysis. It enables the application of standard V(D)J analysis workflows to single-cell datasets, delivering improved V(D)J contig annotation and the identification of nonproductive and partially spliced contigs. We devised a strategy to create an AIR feature space that can be used for both differential V(D)J usage analysis and pseudotime trajectory inference. The application of Dandelion improved the alignment of human thymic development trajectories of double-positive T cells to mature single-positive CD4/CD8 T cells, generating predictions of factors regulating lineage commitment. Dandelion analysis of other cell compartments provided insights into the origins of human B1 cells and ILC/NK cell development, illustrating the power of our approach. Dandelion is available at https://www.github.com/zktuong/dandelion . A computational pipeline enables differential V(D)J usage analysis and pseudotime trajectory inference from single-cell AIR sequencing.
Nature 2023;1-9
The relationship between the human placenta—the extraembryonic organ made by the fetus, and the decidua—the mucosal layer of the uterus, is essential to nurture and protect the fetus during pregnancy. Extravillous trophoblast cells (EVTs) derived from placental villi infiltrate the decidua, transforming the maternal arteries into high-conductance vessels1. Defects in trophoblast invasion and arterial transformation established during early pregnancy underlie common pregnancy disorders such as pre-eclampsia2. Here we have generated a spatially resolved multiomics single-cell atlas of the entire human maternal–fetal interface including the myometrium, which enables us to resolve the full trajectory of trophoblast differentiation. We have used this cellular map to infer the possible transcription factors mediating EVT invasion and show that they are preserved in in vitro models of EVT differentiation from primary trophoblast organoids3,4 and trophoblast stem cells5. We define the transcriptomes of the final cell states of trophoblast invasion: placental bed giant cells (fused multinucleated EVTs) and endovascular EVTs (which form plugs inside the maternal arteries). We predict the cell–cell communication events contributing to trophoblast invasion and placental bed giant cell formation, and model the dual role of interstitial EVTs and endovascular EVTs in mediating arterial transformation during early pregnancy. Together, our data provide a comprehensive analysis of postimplantation trophoblast differentiation that can be used to inform the design of experimental models of the human placenta in early pregnancy. A multiomics single-cell atlas of the human maternal–fetal interface including the myometrium, combining spatial transcriptomics data with chromatin accessibility, provides a comprehensive analysis of cell states as placental cells infiltrate the uterus during early pregnancy.
DOI: 10.1038/s41586-023-05869-0 arXiv: https://www.nature.com/articles/s41586-023-05869-0#citeas
bioRxiv 2023;2023.03.08.531713
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
Nature Biotechnology 2023;1-9
Transferring annotations of single-cell-, spatial- and multi-omics data is often challenging owing both to technical limitations, such as low spatial resolution or high dropout fraction, and to biological variations, such as continuous spectra of cell states. Based on the concept that these data are often best described as continuous mixtures of cells or molecules, we present a computational framework for the transfer of annotations to cells and their combinations (TACCO), which consists of an optimal transport model extended with different wrappers to annotate a wide variety of data. We apply TACCO to identify cell types and states, decipher spatiomolecular tissue structure at the cell and molecular level and resolve differentiation trajectories using synthetic and biological datasets. While matching or exceeding the accuracy of specialized tools for the individual tasks, TACCO reduces the computational requirements by up to an order of magnitude and scales to larger datasets (for example, considering the runtime of annotation transfer for 1 M simulated dropout observations). Annotation transfer from reference to new datasets is improved with a probabilistic approach.
bioRxiv 2023;2023.01.30.526202
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
bioRxiv 2023;2023.01.16.524211
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
Genome biology 2023;24;1;70
Single-cell RNA sequencing methods focusing on the 5'-end of transcripts can reveal promoter and enhancer activity and efficiently profile immune receptor repertoire. However, ultra-high-throughput 5'-end single-cell RNA sequencing methods have not been described. We introduce FIPRESCI, 5'-end single-cell combinatorial indexing RNA-Seq, enabling massive sample multiplexing and increasing the throughput of the droplet microfluidics system by over tenfold. We demonstrate FIPRESCI enables the generation of approximately 100,000 single-cell transcriptomes from E10.5 whole mouse embryos in a single-channel experiment, and simultaneous identification of subpopulation differences and T cell receptor signatures of peripheral blood T cells from 12 cancer patients.
PUBMED: 37024957 PMC: PMC10078054 DOI: 10.1186/s13059-023-02893-1
Nature Medicine 2022;1-11
Single-cell atlases promise to provide a ‘missing link’ between genes, diseases and therapies. By identifying the specific cell types, states, programs and contexts where disease-implicated genes act, we will understand the mechanisms of disease at the cellular and tissue levels and can use this understanding to develop powerful disease diagnostics; identify promising new drug targets; predict their efficacy, toxicity and resistance mechanisms; and empower new kinds of therapies, from cancer therapies to regenerative medicine. Here, we lay out a vision for the potential of cell atlases to impact the future of medicine, and describe how advances over the past decade have begun to realize this potential in common complex diseases, infectious diseases (including COVID-19), rare diseases and cancer. This Perspective outlines how cell atlases can provide the missing links between genes, diseases and therapies, with advances already being made in several fields, including COVID-19 and cancer.
bioRxiv 2022;2022.10.31.514182
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
bioRxiv 2022;2022.10.24.513487
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
bioRxiv 2022;2022.10.12.511898
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
bioRxiv 2022
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
bioRxiv 2022;2021.11.30.470655
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
bioRxiv 2022
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
Nature Communications 2022;13;1
Common variable immunodeficiency (CVID), the most prevalent symptomatic primary immunodeficiency, displays impaired terminal B-cell differentiation and defective antibody responses. Incomplete genetic penetrance and ample phenotypic expressivity in CVID suggest the participation of additional pathogenic mechanisms. Monozygotic (MZ) twins discordant for CVID are uniquely valuable for studying the contribution of epigenetics to the disease. Here, we generate a single-cell epigenomics and transcriptomics census of naïve-to-memory B cell differentiation in a CVID-discordant MZ twin pair. Our analysis identifies DNA methylation, chromatin accessibility and transcriptional defects in memory B-cells mirroring defective cell-cell communication upon activation. These findings are validated in a cohort of CVID patients and healthy donors. Our findings provide a comprehensive multi-omics map of alterations in naïve-to-memory B-cell transition in CVID and indicate links between the epigenome and immune cell cross-talk. Our resource, publicly available at the Human Cell Atlas, gives insight into future diagnosis and treatments of CVID patients. Common variable immunodeficiency (CVID) is the most prevalent primary immunodeficiency. Here the authors perform single-cell omics analyses in CVID-discordant monozygotic twins and show epigenetic and transcriptional alterations associated with activation in memory B cells.
bioRxiv 2022;2022.03.10.483747
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
DOI: 10.1101/2022.03.10.483747 arXiv: https://www.biorxiv.org/content/10.1101/2022.03.10.483747v1
Cell reports 2022;38;10;110467
Despite their importance in tissue homeostasis and renewal, human pituitary stem cells (PSCs) are incompletely characterized. We describe a human single nucleus RNA-seq and ATAC-seq resource from pediatric, adult, and aged postmortem pituitaries (snpituitaryatlas.princeton.edu) and characterize cell-type-specific gene expression and chromatin accessibility programs for all major pituitary cell lineages. We identify uncommitted PSCs, committing progenitor cells, and sex differences. Pseudotime trajectory analysis indicates that early-life PSCs are distinct from the other age groups. Linear modeling of same-cell multiome data identifies regulatory domain accessibility sites and transcription factors that are significantly associated with gene expression in PSCs compared with other cell types and within PSCs. We identify distinct deterministic mechanisms that contribute to heterogeneous marker expression within PSCs. These findings characterize human stem cell lineages and reveal diverse mechanisms regulating key PSC genes and cell type identity.
PUBMED: 35263594 PMC: PMC8957708 DOI: 10.1016/j.celrep.2022.110467
bioRxiv 2022
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
Nature Communications 2022;13;1;562
Under normal conditions, the most significant expansion and differentiation of the adult mammary gland occurs in response to systemic reproductive hormones during pregnancy and lactation to enable milk synthesis and secretion to sustain the offspring. However, human mammary tissue remodelling that takes place during pregnancy and lactation remains poorly understood due to the challenge of acquiring samples. We report here single-cell transcriptomic analysis of 110,744 viable breast cells isolated from human milk or non-lactating breast tissue, isolated from nine and seven donors, respectively. We found that human milk largely contains epithelial cells belonging to the luminal lineage and a repertoire of immune cells. Further transcriptomic analysis of the milk cells identified two distinct secretory cell types that shared similarities with luminal progenitors, but no populations comparable to hormone-responsive cells. Taken together, our data offers a reference map and a window into the cellular dynamics that occur during human lactation and may provide further insights on the interplay between pregnancy, lactation and breast cancer.
PUBMED: 35091553 PMC: PMC8799659 DOI: 10.1038/s41467-021-27895-0
Nature Genetics 2022
Single-cell transcriptomics has allowed unprecedented resolution of cell types/states in the human lung, but their spatial context is less well defined. To (re)define tissue architecture of lung and airways, we profiled five proximal-to-distal locations of healthy human lungs in depth using multi-omic single cell/nuclei and spatial transcriptomics (queryable at lungcellatlas.org ). Using computational data integration and analysis, we extend beyond the suspension cell paradigm and discover macro and micro-anatomical tissue compartments including previously unannotated cell types in the epithelial, vascular, stromal and nerve bundle micro-environments. We identify and implicate peribronchial fibroblasts in lung disease. Importantly, we discover and validate a survival niche for IgA plasma cells in the airway submucosal glands (SMG). We show that gland epithelial cells recruit B cells and IgA plasma cells, and promote longevity and antibody secretion locally through expression of CCL28, APRIL and IL-6. This new 'gland-associated immune niche' has implications for respiratory health.
Cancer cell 2022
Tumor behavior is intricately dependent on the oncogenic properties of cancer cells and their multi-cellular interactions. To understand these dependencies within the wider microenvironment, we studied over 270,000 single-cell transcriptomes and 100 microdissected whole exomes from 12 patients with kidney tumors, prior to validation using spatial transcriptomics. Tissues were sampled from multiple regions of the tumor core, the tumor-normal interface, normal surrounding tissues, and peripheral blood. We find that the tissue-type location of CD8 T cell clonotypes largely defines their exhaustion state with intra-tumoral spatial heterogeneity that is not well explained by somatic heterogeneity. De novo mutation calling from single-cell RNA-sequencing data allows us to broadly infer the clonality of stromal cells and lineage-trace myeloid cell development. We report six conserved meta-programs that distinguish tumor cell function, and find an epithelial-mesenchymal transition meta-program highly enriched at the tumor-normal interface that co-localizes with IL1B-expressing macrophages, offering a potential therapeutic target.
2022;376;6594
Understanding gene function and regulation in homeostasis and disease requires knowledge of the cellular and tissue contexts in which genes are expressed. Here, we applied four single-nucleus RNA sequencing methods to eight diverse, archived, frozen tissue types from 16 donors and 25 samples, generating a cross-tissue atlas of 209,126 nuclei profiles, which we integrated across tissues, donors, and laboratory methods with a conditional variational autoencoder. Using the resulting cross-tissue atlas, we highlight shared and tissue-specific features of tissue-resident cell populations; identify cell types that might contribute to neuromuscular, metabolic, and immune components of monogenic diseases and the biological processes involved in their pathology; and determine cell types and gene modules that might underlie disease mechanisms for complex traits analyzed by genome-wide association studies.
2022;376;6594
Molecular characterization of cell types using single-cell transcriptome sequencing is revolutionizing cell biology and enabling new insights into the physiology of human organs. We created a human reference atlas comprising nearly 500,000 cells from 24 different tissues and organs, many from the same donor. This atlas enabled molecular characterization of more than 400 cell types, their distribution across tissues, and tissue-specific variation in gene expression. Using multiple tissues from a single donor enabled identification of the clonal distribution of T cells between tissues, identification of the tissue-specific mutation rate in B cells, and analysis of the cell cycle state and proliferative potential of shared cell types across tissues. Cell type-specific RNA splicing was discovered and analyzed across tissues within an individual.
2022;376;6594
Despite their crucial role in health and disease, our knowledge of immune cells within human tissues remains limited. We surveyed the immune compartment of 16 tissues from 12 adult donors by single-cell RNA sequencing and VDJ sequencing generating a dataset of ~360,000 cells. To systematically resolve immune cell heterogeneity across tissues, we developed CellTypist, a machine learning tool for rapid and precise cell type annotation. Using this approach, combined with detailed curation, we determined the tissue distribution of finely phenotyped immune cell types, revealing hitherto unappreciated tissue-specific features and clonal architecture of T and B cells. Our multitissue approach lays the foundation for identifying highly resolved immune cell types by leveraging a common reference dataset, tissue-integrated expression analysis, and antigen receptor sequencing.
2022
Single-cell genomics studies have decoded the immune-cell composition of several human prenatal organs but were limited in understanding the developing immune system as a distributed network across tissues. We profiled nine prenatal tissues combining single-cell RNA sequencing, antigen-receptor sequencing, and spatial transcriptomics to reconstruct the developing human immune system. This revealed the late acquisition of immune effector functions by myeloid and lymphoid cell subsets and the maturation of monocytes and T cells prior to peripheral tissue seeding. Moreover, we uncovered system-wide blood and immune cell development beyond primary hematopoietic organs, characterized human prenatal B1 cells, and shed light on the origin of unconventional T cells. Our atlas provides both valuable data resources and biological insights that will facilitate cell engineering, regenerative medicine, and disease understanding.
Cell 2022;185;25;4841-4860.e25
We present a multiomic cell atlas of human lung development that combines single-cell RNA and ATAC sequencing, high-throughput spatial transcriptomics, and single-cell imaging. Coupling single-cell methods with spatial analysis has allowed a comprehensive cellular survey of the epithelial, mesenchymal, endothelial, and erythrocyte/leukocyte compartments from 5-22 post-conception weeks. We identify previously uncharacterized cell states in all compartments. These include developmental-specific secretory progenitors and a subtype of neuroendocrine cell related to human small cell lung cancer. Our datasets are available through our web interface (https://lungcellatlas.org). To illustrate its general utility, we use our cell atlas to generate predictions about cell-cell signaling and transcription factor hierarchies which we rigorously test using organoid models.
2022
The Human Cell Atlas (HCA) consortium aims to establish an atlas of all organs in the healthy human body at single-cell resolution to increase our understanding of basic biological processes that govern development, physiology and anatomy, and to accelerate diagnosis and treatment of disease. The lung biological network of the HCA aims to generate the Human Lung Cell Atlas as a reference for the cellular repertoire, molecular cell states and phenotypes, and the cell-cell interactions that characterise normal lung homeostasis in healthy lung tissue. Such a reference atlas of the healthy human lung will facilitate mapping the changes in the cellular landscape in disease. The discovAIR project is one of six pilot actions for the HCA funded by the European Commission in the context of the H2020 framework program. DiscovAIR aims to establish the first draft of an integrated Human Lung Cell Atlas, combining single-cell transcriptional and epigenetic profiling with spatially resolving techniques on matched tissue samples, as well as including a number of chronic and infectious diseases of the lung. The integrated Lung Cell Atlas will be available as a resource for the wider respiratory community, including basic and translational scientists, clinical medicine, and the private sector, as well as for patients with lung disease and the interested lay public. We anticipate that the Lung Cell Atlas will be the founding stone for a more detailed understanding of the pathogenesis of lung diseases, guiding the design of novel diagnostics and preventive or curative interventions.
2022;101;11
Oral and craniofacial tissues are uniquely adapted for continuous and intricate functioning, including breathing, feeding, and communication. To achieve these vital processes, this complex is supported by incredible tissue diversity, variously composed of epithelia, vessels, cartilage, bone, teeth, ligaments, and muscles, as well as mesenchymal, adipose, and peripheral nervous tissue. Recent single cell and spatial multiomics assays-specifically, genomics, epigenomics, transcriptomics, proteomics, and metabolomics-have annotated known and new cell types and cell states in human tissues and animal models, but these concepts remain limitedly explored in the human postnatal oral and craniofacial complex. Here, we highlight the collaborative and coordinated efforts of the newly established Oral and Craniofacial Bionetwork as part of the Human Cell Atlas, which aims to leverage single cell and spatial multiomics approaches to first understand the cellular and molecular makeup of human oral and craniofacial tissues in health and to then address common and rare diseases. These powerful assays have already revealed the cell types that support oral tissues, and they will unravel cell types and molecular networks utilized across development, maintenance, and aging as well as those affected in diseases of the craniofacial complex. This level of integration and cell annotation with partner laboratories across the globe will be critical for understanding how multiple variables, such as age, sex, race, and ancestry, influence these oral and craniofacial niches. Here, we 1) highlight these recent collaborative efforts to employ new single cell and spatial approaches to resolve our collective biology at a higher resolution in health and disease, 2) discuss the vision behind the Oral and Craniofacial Bionetwork, 3) outline the stakeholders who contribute to and will benefit from this network, and 4) outline directions for creating the first Human Oral and Craniofacial Cell Atlas.
Nature 2021
It is not fully understood why COVID-19 is typically milder in children1–3. To examine differences in response to SARS-CoV-2 infection in children and adults, we analysed paediatric and adult COVID-19 patients and healthy controls (total n=93) using single-cell multi-omic profiling of matched nasal, tracheal, bronchial and blood samples. In healthy paediatric airways, we observed cells already in an interferon-activated state, that upon SARS-CoV-2 infection was further induced especially in airway immune cells. We postulate that higher paediatric innate interferon-responses restrict viral replication and disease progression. The systemic response in children was characterised by increases in naive lymphocytes and a depletion of natural killer cells, while in adults cytotoxic T cells and interferon-stimulated subpopulations were significantly increased. We provide evidence that dendritic cells initiate interferon signaling in early infection, and identify novel epithelial cell states that associate with COVID-19 and age. Our matching nasal and blood data showed a strong interferon response in the airways with the induction of systemic interferon-stimulated populations, which were massively reduced in paediatric patients. Together, we provide several mechanisms that explain the milder clinical syndrome observed in children.
Nature Genetics 2021;53;12
The endometrium, the mucosal lining of the uterus, undergoes dynamic changes throughout the menstrual cycle in response to ovarian hormones. We have generated dense single-cell and spatial reference maps of the human uterus and three-dimensional endometrial organoid cultures. We dissect the signaling pathways that determine cell fate of the epithelial lineages in the lumenal and glandular microenvironments. Our benchmark of the endometrial organoids reveals the pathways and cell states regulating differentiation of the secretory and ciliated lineages both in vivo and in vitro. In vitro downregulation of WNT or NOTCH pathways increases the differentiation efficiency along the secretory and ciliated lineages, respectively. We utilize our cellular maps to deconvolute bulk data from endometrial cancers and endometriotic lesions, illuminating the cell types dominating in each of these disorders. These mechanistic insights provide a platform for future development of treatments for common conditions including endometriosis and endometrial carcinoma. Single-cell and spatial transcriptomic profiling of the human endometrium highlights pathways governing the proliferative and secretory phases of the menstrual cycle. Analyses of endometrial organoids show that WNT and NOTCH signaling modulate differentiation into the secretory and ciliated epithelial lineages, respectively.
bioRxiv 2021
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
Nature Cell Biology 2021;23;11
The Human Reference Atlas (HRA) aims to map all of the cells of the human body to advance biomedical research and clinical practice. This Perspective presents collaborative work by members of 16 international consortia on two essential and interlinked parts of the HRA: (1) three-dimensional representations of anatomy that are linked to (2) tables that name and interlink major anatomical structures, cell types, plus biomarkers (ASCT+B). We discuss four examples that demonstrate the practical utility of the HRA. In this Perspective, Börner et al. discuss initiatives by 16 consortia to construct a Human Reference Atlas (HRA) comprising reference organs linked to tables that name major anatomical structures, cell types, plus biomarkers (ASCT+B) and present examples of HRA usage.
Nature Cell Biology 2021;23;11
Massive single-cell profiling efforts have accelerated our discovery of the cellular composition of the human body while at the same time raising the need to formalize this new knowledge. Here, we discuss current efforts to harmonize and integrate different sources of annotations of cell types and states into a reference cell ontology. We illustrate with examples how a unified ontology can consolidate and advance our understanding of cell types across scientific communities and biological domains. In this Perspective, Teichmann and colleagues present ongoing efforts from consortia of the Human Cell Atlas to harmonize and integrate data sources into a reference cell ontology and the contributions of cell ontologies to discovery.
bioRxiv 2021
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
bioRxiv 2021
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
Nature Communications 2021;12;1
Feature selection (marker gene selection) is widely believed to improve clustering accuracy, and is thus a key component of single cell clustering pipelines. Existing feature selection methods perform inconsistently across datasets, occasionally even resulting in poorer clustering accuracy than without feature selection. Moreover, existing methods ignore information contained in gene-gene correlations. Here, we introduce DUBStepR (Determining the Underlying Basis using Stepwise Regression), a feature selection algorithm that leverages gene-gene correlations with a novel measure of inhomogeneity in feature space, termed the Density Index (DI). Despite selecting a relatively small number of genes, DUBStepR substantially outperformed existing single-cell feature selection methods across diverse clustering benchmarks. Additionally, DUBStepR was the only method to robustly deconvolve T and NK heterogeneity by identifying disease-associated common and rare cell types and subtypes in PBMCs from rheumatoid arthritis patients. DUBStepR is scalable to over a million cells, and can be straightforwardly applied to other data types such as single-cell ATAC-seq. We propose DUBStepR as a general-purpose feature selection solution for accurately clustering single-cell data. Cell-type-specific genes are often strongly correlated in expression - an informative yet underexplored property of single-cell data. Here, the authors leverage gene expression correlations to develop DUBStepR, a feature selection method for accurately clustering single-cell data.
Nature 2021
Haematopoiesis in the bone marrow (BM) maintains blood and immune cell production throughout postnatal life. Haematopoiesis first emerges in human BM at 11–12 weeks after conception1,2, yet almost nothing is known about how fetal BM (FBM) evolves to meet the highly specialized needs of the fetus and newborn. Here we detail the development of FBM, including stroma, using multi-omic assessment of mRNA and multiplexed protein epitope expression. We find that the full blood and immune cell repertoire is established in FBM in a short time window of 6–7 weeks early in the second trimester. FBM promotes rapid and extensive diversification of myeloid cells, with granulocytes, eosinophils and dendritic cell subsets emerging for the first time. The substantial expansion of B lymphocytes in FBM contrasts with fetal liver at the same gestational age. Haematopoietic progenitors from fetal liver, FBM and cord blood exhibit transcriptional and functional differences that contribute to tissue-specific identity and cellular diversification. Endothelial cell types form distinct vascular structures that we show are regionally compartmentalized within FBM. Finally, we reveal selective disruption of B lymphocyte, erythroid and myeloid development owing to a cell-intrinsic differentiation bias as well as extrinsic regulation through an altered microenvironment in Down syndrome (trisomy 21). A single-cell atlas of human fetal bone marrow in healthy fetuses and fetuses with Down syndrome provides insight into developmental haematopoiesis in humans and the transcription and functional differences that occur in Down syndrome.
Nature 2021;597;7875;196-205
The Human Developmental Cell Atlas (HDCA) initiative, which is part of the Human Cell Atlas, aims to create a comprehensive reference map of cells during development. This will be critical to understanding normal organogenesis, the effect of mutations, environmental factors and infectious agents on human development, congenital and childhood disorders, and the cellular basis of ageing, cancer and regenerative medicine. Here we outline the HDCA initiative and the challenges of mapping and modelling human development using state-of-the-art technologies to create a reference atlas across gestation. Similar to the Human Genome Project, the HDCA will integrate the output from a growing community of scientists who are mapping human development into a unified atlas. We describe the early milestones that have been achieved and the use of human stem-cell-derived cultures, organoids and animal models to inform the HDCA, especially for prenatal tissues that are hard to acquire. Finally, we provide a roadmap towards a complete atlas of human development. This Perspective outlines the Human Developmental Cell Atlas initiative, which uses state-of-the-art technologies to map and model human development across gestation, and discusses the early milestones that have been achieved.
Nucleic acids research 2021;49;15;8505-8519
The transcriptomic diversity of cell types in the human body can be analysed in unprecedented detail using single cell (SC) technologies. Unsupervised clustering of SC transcriptomes, which is the default technique for defining cell types, is prone to group cells by technical, rather than biological, variation. Compared to de-novo (unsupervised) clustering, we demonstrate using multiple benchmarks that supervised clustering, which uses reference transcriptomes as a guide, is robust to batch effects and data quality artifacts. Here, we present RCA2, the first algorithm to combine reference projection (batch effect robustness) with graph-based clustering (scalability). In addition, RCA2 provides a user-friendly framework incorporating multiple commonly used downstream analysis modules. RCA2 also provides new reference panels for human and mouse and supports generation of custom panels. Furthermore, RCA2 facilitates cell type-specific QC, which is essential for accurate clustering of data from heterogeneous tissues. We demonstrate the advantages of RCA2 on SC data from human bone marrow, healthy PBMCs and PBMCs from COVID-19 patients. Scalable supervised clustering methods such as RCA2 will facilitate unified analysis of cohort-scale SC datasets.
2021;597;7875
The cellular landscape of the human intestinal tract is dynamic throughout life, developing in utero and changing in response to functional requirements and environmental exposures. Here, to comprehensively map cell lineages, we use single-cell RNA sequencing and antigen receptor analysis of almost half a million cells from up to 5 anatomical regions in the developing and up to 11 distinct anatomical regions in the healthy paediatric and adult human gut. This reveals the existence of transcriptionally distinct BEST4 epithelial cells throughout the human intestinal tract. Furthermore, we implicate IgG sensing as a function of intestinal tuft cells. We describe neural cell populations in the developing enteric nervous system, and predict cell-type-specific expression of genes associated with Hirschsprung's disease. Finally, using a systems approach, we identify key cell players that drive the formation of secondary lymphoid tissue in early human development. We show that these programs are adopted in inflammatory bowel disease to recruit and retain immune cells at the site of inflammation. This catalogue of intestinal cells will provide new insights into cellular programs in development, homeostasis and disease.
PUBMED: 34497389 PMC: PMC8426186 DOI: 10.1038/s41586-021-03852-1
Nature Biotechnology 2021
Children have reduced severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection rates and a substantially lower risk for developing severe coronavirus disease 2019 compared with adults. However, the molecular mechanisms underlying protection in younger age groups remain unknown. Here we characterize the single-cell transcriptional landscape in the upper airways of SARS-CoV-2-negative (n = 18) and age-matched SARS-CoV-2-positive (n = 24) children and corresponding samples from adults (n = 44), covering an age range of 4 weeks to 77 years. Children displayed higher basal expression of relevant pattern recognition receptors such as MDA5 (IFIH1) and RIG-I (DDX58) in upper airway epithelial cells, macrophages and dendritic cells, resulting in stronger innate antiviral responses upon SARS-CoV-2 infection than in adults. We further detected distinct immune cell subpopulations including KLRC1 (NKG2A)+ cytotoxic T cells and a CD8+ T cell population with a memory phenotype occurring predominantly in children. Our study provides evidence that the airway immune cells of children are primed for virus sensing, resulting in a stronger early innate antiviral response to SARS-CoV-2 infection than in adults. Single-cell sequencing reveals pre-activated immunity as important for milder COVID-19 symptoms in children.
2021;24;8
The human neonatal cerebellum is one-fourth of its adult size yet contains the blueprint required to integrate environmental cues with developing motor, cognitive and emotional skills into adulthood. Although mature cerebellar neuroanatomy is well studied, understanding of its developmental origins is limited. In this study, we systematically mapped the molecular, cellular and spatial composition of human fetal cerebellum by combining laser capture microscopy and SPLiT-seq single-nucleus transcriptomics. We profiled functionally distinct regions and gene expression dynamics within cell types and across development. The resulting cell atlas demonstrates that the molecular organization of the cerebellar anlage recapitulates cytoarchitecturally distinct regions and developmentally transient cell types that are distinct from the mouse cerebellum. By mapping genes dominant for pediatric and adult neurological disorders onto our dataset, we identify relevant cell types underlying disease mechanisms. These data provide a resource for probing the cellular basis of human cerebellar development and disease.
PUBMED: 34140698 PMC: PMC8338761 DOI: 10.1038/s41593-021-00872-y
2021;144;4
Cellular diversity of the lung endothelium has not been systematically characterized in humans. We provide a reference atlas of human lung endothelial cells (ECs) to facilitate a better understanding of the phenotypic diversity and composition of cells comprising the lung endothelium.
PUBMED: 34030460 PMC: PMC8300155 DOI: 10.1161/CIRCULATIONAHA.120.052318
bioRxiv 2021
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
Nature Communications 2021;12;1;4314
Patients with chronic lung disease (CLD) have an increased risk for severe coronavirus disease-19 (COVID-19) and poor outcomes. Here, we analyze the transcriptomes of 611,398 single cells isolated from healthy and CLD lungs to identify molecular characteristics of lung cells that may account for worse COVID-19 outcomes in patients with chronic lung diseases. We observe a similar cellular distribution and relative expression of SARS-CoV-2 entry factors in control and CLD lungs. CLD AT2 cells express higher levels of genes linked directly to the efficiency of viral replication and the innate immune response. Additionally, we identify basal differences in inflammatory gene expression programs that highlight how CLD alters the inflammatory microenvironment encountered upon viral exposure to the peripheral lung. Our study indicates that CLD is accompanied by changes in cell-type-specific gene expression programs that prime the lung epithelium for and influence the innate and adaptive immune responses to SARS-CoV-2 infection.
PUBMED: 34262047 PMC: PMC8280215 DOI: 10.1038/s41467-021-24467-0
Trends in genetics 2021;37;7;625-630
Comprehensively characterizing the cellular composition and organization of tissues has been a long-term scientific challenge that has limited our ability to study fundamental and clinical aspects of human physiology. The Human Cell Atlas (HCA) is a global collaborative effort to create a reference map of all human cells as a basis for both understanding human health and diagnosing, monitoring, and treating disease. Many aspects of the HCA are analogous to the Human Genome Project (HGP), whose completion presents a major milestone in modern biology. To commemorate the HGP's 20-year anniversary of completion, we discuss the launch of the HCA in light of the HGP, and highlight recent progress by the HCA consortium.
2021;21
Single cell (sc) analyses of key embryonic, fetal and adult stages were performed to generate a comprehensive single cell atlas of all the corneal and adjacent conjunctival cell types from development to adulthood.
PUBMED: 33865984 PMC: PMC8343164 DOI: 10.1016/j.jtos.2021.03.010
Nature Communications 2021;12;1;3545
Multiplexed fluorescence in situ hybridization techniques have enabled cell-type identification, linking transcriptional heterogeneity with spatial heterogeneity of cells. However, inaccurate cell segmentation reduces the efficacy of cell-type identification and tissue characterization. Here, we present a method called Spot-based Spatial cell-type Analysis by Multidimensional mRNA density estimation (SSAM), a robust cell segmentation-free computational framework for identifying cell-types and tissue domains in 2D and 3D. SSAM is applicable to a variety of in situ transcriptomics techniques and capable of integrating prior knowledge of cell types. We apply SSAM to three mouse brain tissue images: the somatosensory cortex imaged by osmFISH, the hypothalamic preoptic region by MERFISH, and the visual cortex by multiplexed smFISH. Here, we show that SSAM detects regions occupied by known cell types that were previously missed and discovers new cell types.
PUBMED: 34112806 PMC: PMC8192952 DOI: 10.1038/s41467-021-23807-4
Nature Protocols 2021;16;6;2749-2764
Single-cell transcriptomics can profile thousands of cells in a single experiment and identify novel cell types, states and dynamics in a wide variety of tissues and organisms. Standard experimental protocols and analysis workflows have been developed to create single-cell transcriptomic maps from tissues. This tutorial focuses on how to interpret these data to identify cell types, states and other biologically relevant patterns with the objective of creating an annotated map of cells. We recommend a three-step workflow including automatic cell annotation (wherever possible), manual cell annotation and verification. Frequently encountered challenges are discussed, as well as strategies to address them. Guiding principles and specific recommendations for software tools and resources that can be used for each step are covered, and an R notebook is included to help run the recommended workflow. Basic familiarity with computer software is assumed, and basic knowledge of programming (e.g., in the R language) is recommended. This tutorial provides guidelines for interpreting single-cell transcriptomic maps to identify cell types, states and other biologically relevant patterns.
Research Square 2021
Gonadal development is a complex process that involves sex determination followed by divergent maturation into ovaries or testes. Historically, limited tissue accessibility and lack of reliable in vitro models have impeded our understanding of human gonadogenesis, despite its importance in gonada...
Nature Communications 2021;12;1;2677
To provide a multi-omics resource and investigate transcriptional regulatory mechanisms, we profile the transcriptome, chromatin accessibility, and methylation status of over 70,000 single nuclei (sn) from adult mouse pituitaries. Paired snRNAseq and snATACseq datasets from individual animals highlight a continuum between developmental epigenetically-encoded cell types and transcriptionally-determined transient cell states. Co-accessibility analysis-based identification of a putative Fshb cis-regulatory domain that overlaps the fertility-linked rs11031006 human polymorphism, followed by experimental validation illustrate the use of this resource for hypothesis generation. We also identify transcriptional and chromatin accessibility programs distinguishing each major cell type. Regulons, which are co-regulated gene sets sharing binding sites for a common transcription factor driver, recapitulate cell type clustering. We identify both cell type-specific and sex-specific regulons that are highly correlated with promoter accessibility, but not with methylation state, supporting the centrality of chromatin accessibility in shaping cell-defining transcriptional programs. The sn multi-omics atlas is accessible at snpituitaryatlas.princeton.edu.
PUBMED: 33976139 PMC: PMC8113460 DOI: 10.1038/s41467-021-22859-w
2021;12;1
Single-cell RNA-Seq (scRNA-seq) is invaluable for studying biological systems. Dimensionality reduction is a crucial step in interpreting the relation between cells in scRNA-seq data. However, current dimensionality reduction methods are often confounded by multiple simultaneous technical and biological variability, result in "crowding" of cells in the center of the latent space, or inadequately capture temporal relationships. Here, we introduce scPhere, a scalable deep generative model to embed cells into low-dimensional hyperspherical or hyperbolic spaces to accurately represent scRNA-seq data. ScPhere addresses multi-level, complex batch factors, facilitates the interactive visualization of large datasets, resolves cell crowding, and uncovers temporal trajectories. We demonstrate scPhere on nine large datasets in complex tissue from human patients or animal development. Our results show how scPhere facilitates the interpretation of scRNA-seq data by generating batch-invariant embeddings to map data from new individuals, identifies cell types affected by biological variables, infers cells' spatial positions in pre-defined biological specimens, and highlights complex cellular relations.
PUBMED: 33953202 PMC: PMC8099904 DOI: 10.1038/s41467-021-22851-4
Nature Medicine 2021;27;5;904-916
Analysis of human blood immune cells provides insights into the coordinated response to viral infections such as severe acute respiratory syndrome coronavirus 2, which causes coronavirus disease 2019 (COVID-19). We performed single-cell transcriptome, surface proteome and T and B lymphocyte antigen receptor analyses of over 780,000 peripheral blood mononuclear cells from a cross-sectional cohort of 130 patients with varying severities of COVID-19. We identified expansion of nonclassical monocytes expressing complement transcripts (CD16C1QA/B/C) that sequester platelets and were predicted to replenish the alveolar macrophage pool in COVID-19. Early, uncommitted CD34 hematopoietic stem/progenitor cells were primed toward megakaryopoiesis, accompanied by expanded megakaryocyte-committed progenitors and increased platelet activation. Clonally expanded CD8 T cells and an increased ratio of CD8 effector T cells to effector memory T cells characterized severe disease, while circulating follicular helper T cells accompanied mild disease. We observed a relative loss of IgA2 in symptomatic disease despite an overall expansion of plasmablasts and plasma cells. Our study highlights the coordinated immune response that contributes to COVID-19 pathogenesis and reveals discrete cellular components that can be targeted for therapy.
PUBMED: 33879890 PMC: PMC8121667 DOI: 10.1038/s41591-021-01329-2
Nature Medicine 2021;27;5;892-903
Despite signs of infection-including taste loss, dry mouth and mucosal lesions such as ulcerations, enanthema and macules-the involvement of the oral cavity in coronavirus disease 2019 (COVID-19) is poorly understood. To address this, we generated and analyzed two single-cell RNA sequencing datasets of the human minor salivary glands and gingiva (9 samples, 13,824 cells), identifying 50 cell clusters. Using integrated cell normalization and annotation, we classified 34 unique cell subpopulations between glands and gingiva. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) viral entry factors such as ACE2 and TMPRSS members were broadly enriched in epithelial cells of the glands and oral mucosae. Using orthogonal RNA and protein expression assessments, we confirmed SARS-CoV-2 infection in the glands and mucosae. Saliva from SARS-CoV-2-infected individuals harbored epithelial cells exhibiting ACE2 and TMPRSS expression and sustained SARS-CoV-2 infection. Acellular and cellular salivary fractions from asymptomatic individuals were found to transmit SARS-CoV-2 ex vivo. Matched nasopharyngeal and saliva samples displayed distinct viral shedding dynamics, and salivary viral burden correlated with COVID-19 symptoms, including taste loss. Upon recovery, this asymptomatic cohort exhibited sustained salivary IgG antibodies against SARS-CoV-2. Collectively, these data show that the oral cavity is an important site for SARS-CoV-2 infection and implicate saliva as a potential route of SARS-CoV-2 transmission.
PUBMED: 33767405 PMC: PMC8240394 DOI: 10.1038/s41591-021-01296-8
Nature 2021;595;7865;114-119
Respiratory failure is the leading cause of death in patients with severe SARS-CoV-2 infection1,2, but the host response at the lung tissue level is poorly understood. Here we performed single-nucleus RNA sequencing of about 116,000 nuclei from the lungs of nineteen individuals who died of COVID-19 and underwent rapid autopsy and seven control individuals. Integrated analyses identified substantial alterations in cellular composition, transcriptional cell states, and cell-to-cell interactions, thereby providing insight into the biology of lethal COVID-19. The lungs from individuals with COVID-19 were highly inflamed, with dense infiltration of aberrantly activated monocyte-derived macrophages and alveolar macrophages, but had impaired T cell responses. Monocyte/macrophage-derived interleukin-1β and epithelial cell-derived interleukin-6 were unique features of SARS-CoV-2 infection compared to other viral and bacterial causes of pneumonia. Alveolar type 2 cells adopted an inflammation-associated transient progenitor cell state and failed to undergo full transition into alveolar type 1 cells, resulting in impaired lung regeneration. Furthermore, we identified expansion of recently described CTHRC1+ pathological fibroblasts3 contributing to rapidly ensuing pulmonary fibrosis in COVID-19. Inference of protein activity and ligand–receptor interactions identified putative drug targets to disrupt deleterious circuits. This atlas enables the dissection of lethal COVID-19, may inform our understanding of long-term complications of COVID-19 survivors, and provides an important resource for therapeutic development. Lung samples collected soon after death from COVID-19 are used to provide a single-cell atlas of SARS-CoV-2 infection and the ensuing molecular changes.
Nature 2021;595;7865;107-113
COVID-19, which is caused by SARS-CoV-2, can result in acute respiratory distress syndrome and multiple organ failure1–4, but little is known about its pathophysiology. Here we generated single-cell atlases of 24 lung, 16 kidney, 16 liver and 19 heart autopsy tissue samples and spatial atlases of 14 lung samples from donors who died of COVID-19. Integrated computational analysis uncovered substantial remodelling in the lung epithelial, immune and stromal compartments, with evidence of multiple paths of failed tissue regeneration, including defective alveolar type 2 differentiation and expansion of fibroblasts and putative TP63+ intrapulmonary basal-like progenitor cells. Viral RNAs were enriched in mononuclear phagocytic and endothelial lung cells, which induced specific host programs. Spatial analysis in lung distinguished inflammatory host responses in lung regions with and without viral RNA. Analysis of the other tissue atlases showed transcriptional alterations in multiple cell types in heart tissue from donors with COVID-19, and mapped cell types and genes implicated with disease severity based on COVID-19 genome-wide association studies. Our foundational dataset elucidates the biological effect of severe SARS-CoV-2 infection across the body, a key step towards new treatments. Single-cell analysis of lung, heart, kidney and liver autopsy samples shows the molecular and cellular changes and immune response resulting from severe COVID-19 infection.
bioRxiv 2021;2021.04.04.438388
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
Nature Communications 2021;12;1;1660
In less than nine months, the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) killed over a million people, including >25,000 in New York City (NYC) alone. The COVID-19 pandemic caused by SARS-CoV-2 highlights clinical needs to detect infection, track strain evolution, and identify biomarkers of disease course. To address these challenges, we designed a fast (30-minute) colorimetric test (LAMP) for SARS-CoV-2 infection from naso/oropharyngeal swabs and a large-scale shotgun metatranscriptomics platform (total-RNA-seq) for host, viral, and microbial profiling. We applied these methods to clinical specimens gathered from 669 patients in New York City during the first two months of the outbreak, yielding a broad molecular portrait of the emerging COVID-19 disease. We find significant enrichment of a NYC-distinctive clade of the virus (20C), as well as host responses in interferon, ACE, hematological, and olfaction pathways. In addition, we use 50,821 patient records to find that renin-angiotensin-aldosterone system inhibitors have a protective effect for severe COVID-19 outcomes, unlike similar drugs. Finally, spatial transcriptomic data from COVID-19 patient autopsy tissues reveal distinct ACE2 expression loci, with macrophage and neutrophil infiltration in the lungs. These findings can inform public health and may help develop and drive SARS-CoV-2 diagnostic, prevention, and treatment strategies.
PUBMED: 33712587 PMC: PMC7954844 DOI: 10.1038/s41467-021-21361-7
Nature Reviews Rheumatology 2021;17;5;247-248
The Human Cell Atlas (HCA) project aims to map tissues and organs during development, maturation and pathology at single cell resolution. The musculoskeletal HCA network is a community for fostering collaboration and shared expertise to help develop the therapeutic approaches needed to address the high global burden of musculoskeletal disorders.
Nature Medicine 2021;27;3;546-559
Angiotensin-converting enzyme 2 (ACE2) and accessory proteases (TMPRSS2 and CTSL) are needed for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) cellular entry, and their expression may shed light on viral tropism and impact across the body. We assessed the cell-type-specific expression of ACE2, TMPRSS2 and CTSL across 107 single-cell RNA-sequencing studies from different tissues. ACE2, TMPRSS2 and CTSL are coexpressed in specific subsets of respiratory epithelial cells in the nasal passages, airways and alveoli, and in cells from other organs associated with coronavirus disease 2019 (COVID-19) transmission or pathology. We performed a meta-analysis of 31 lung single-cell RNA-sequencing studies with 1,320,896 cells from 377 nasal, airway and lung parenchyma samples from 228 individuals. This revealed cell-type-specific associations of age, sex and smoking with expression levels of ACE2, TMPRSS2 and CTSL. Expression of entry factors increased with age and in males, including in airway secretory cells and alveolar type 2 cells. Expression programs shared by ACE2+TMPRSS2+ cells in nasal, lung and gut tissues included genes that may mediate viral entry, key immune functions and epithelial–macrophage cross-talk, such as genes involved in the interleukin-6, interleukin-1, tumor necrosis factor and complement pathways. Cell-type-specific expression patterns may contribute to the pathogenesis of COVID-19, and our work highlights putative molecular pathways for therapeutic intervention. An integrated analysis of over 100 single-cell and single-nucleus transcriptomics studies illustrates severe acute respiratory syndrome coronavirus 2 viral entry gene coexpression patterns across different human tissues, and shows association of age, smoking status and sex with viral entry gene expression in respiratory cell populations.
Gastroenterology 2021;160;4;1330-1344.e11
Molecular evidence of cellular heterogeneity in the human exocrine pancreas has not been yet established because of the local concentration and cascade of hydrolytic enzymes that can rapidly degrade cells and RNA upon pancreatic resection. We sought to better understand the heterogeneity and cellular composition of the pancreas in neonates and adults in healthy and diseased conditions using single-cell sequencing approaches.
Cell 2021;184;3;810-826.e23
Development of the human intestine is not well understood. Here, we link single-cell RNA sequencing and spatial transcriptomics to characterize intestinal morphogenesis through time. We identify 101 cell states including epithelial and mesenchymal progenitor populations and programs linked to key morphogenetic milestones. We describe principles of crypt-villus axis formation; neural, vascular, mesenchymal morphogenesis, and immune population of the developing gut. We identify the differentiation hierarchies of developing fibroblast and myofibroblast subtypes and describe diverse functions for these including as vascular niche cells. We pinpoint the origins of Peyer's patches and gut-associated lymphoid tissue (GALT) and describe location-specific immune programs. We use our resource to present an unbiased analysis of morphogen gradients that direct sequential waves of cellular differentiation and define cells and locations linked to rare developmental intestinal disorders. We compile a publicly available online resource, spatio-temporal analysis resource of fetal intestinal development (STAR-FINDer), to facilitate further work.
PUBMED: 33406409 PMC: PMC7864098 DOI: 10.1016/j.cell.2020.12.016
Nature Biotechnology 2021;39;2;149-153
Science 2021;371;6527
The skin confers biophysical and immunological protection through a complex cellular network established early in embryonic development. We profiled the transcriptomes of more than 500,000 single cells from developing human fetal skin, healthy adult skin, and adult skin with atopic dermatitis and psoriasis. We leveraged these datasets to compare cell states across development, homeostasis, and disease. Our analysis revealed an enrichment of innate immune cells in skin during the first trimester and clonal expansion of disease-associated lymphocytes in atopic dermatitis and psoriasis. We uncovered and validated in situ a reemergence of prenatal vascular endothelial cell and macrophage cellular programs in atopic dermatitis and psoriasis lesional skin. These data illustrate the dynamism of cutaneous immunity and provide opportunities for targeting pathological developmental programs in inflammatory skin diseases.
PUBMED: 33479125 PMC: PMC7611557 DOI: 10.1126/science.aba6500
Nature Biotechnology 2021;39;1;13-17
PUBMED: 33384458 PMC: PMC7801253 DOI: 10.1038/s41587-020-00762-x
2021;184;15
The oral mucosa remains an understudied barrier tissue. This is a site of rich exposure to antigens and commensals, and a tissue susceptible to one of the most prevalent human inflammatory diseases, periodontitis. To aid in understanding tissue-specific pathophysiology, we compile a single-cell transcriptome atlas of human oral mucosa in healthy individuals and patients with periodontitis. We uncover the complex cellular landscape of oral mucosal tissues and identify epithelial and stromal cell populations with inflammatory signatures that promote antimicrobial defenses and neutrophil recruitment. Our findings link exaggerated stromal cell responsiveness with enhanced neutrophil and leukocyte infiltration in periodontitis. Our work provides a resource characterizing the role of tissue stroma in regulating mucosal tissue homeostasis and disease pathogenesis.
PUBMED: 34129837 PMC: PMC8359928 DOI: 10.1016/j.cell.2021.05.013
2021
Numerous studies have provided single-cell transcriptome profiles of host responses to SARS-CoV-2 infection. Critically lacking however is a datamine that allows users to compare and explore cell profiles to gain insights and develop new hypotheses. To accomplish this, we harmonized datasets from COVID-19 and other control condition blood, bronchoalveolar lavage, and tissue samples, and derived a compendium of gene signature modules per cell type, subtype, clinical condition, and compartment. We demonstrate approaches to probe these via a new interactive web portal (http://toppcell.cchmc.org/COVID-19). As examples, we develop three hypotheses: (1) a multicellular signaling cascade among alternatively differentiated monocyte-derived macrophages whose tasks include T cell recruitment and activation; (2) novel platelet subtypes with drastically modulated expression of genes responsible for adhesion, coagulation and thrombosis; and (3) a multilineage cell activator network able to drive extrafollicular B maturation via an ensemble of genes strongly associated with risk for developing post-viral autoimmunity.
PUBMED: 34127975 PMC: PMC8202427 DOI: 10.1101/2021.06.07.447287
BMC bioinformatics 2021;22;1;186
Clustering is a crucial step in the analysis of single-cell data. Clusters identified in an unsupervised manner are typically annotated to cell types based on differentially expressed genes. In contrast, supervised methods use a reference panel of labelled transcriptomes to guide both clustering and cell type identification. Supervised and unsupervised clustering approaches have their distinct advantages and limitations. Therefore, they can lead to different but often complementary clustering results. Hence, a consensus approach leveraging the merits of both clustering paradigms could result in a more accurate clustering and a more precise cell type annotation.
PUBMED: 33845760 PMC: PMC8042883 DOI: 10.1186/s12859-021-04028-4
2021
Monogenic forms of inflammatory bowel disease (IBD) illustrate the essential roles of individual genes in pathways and networks safeguarding immune tolerance and gut homeostasis.
bioRxiv 2021
The Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) virus has infected over 115 million people and caused over 2.5 million deaths worldwide. Yet, the molecular mechanisms underlying the clinical manifestations of COVID-19, as well as what distinguishes them from common seasonal influenza virus and other lung injury states such as Acute Respiratory Distress Syndrome (ARDS), remains poorly understood. To address these challenges, we combined transcriptional profiling of 646 clinical nasopharyngeal swabs and 39 patient autopsy tissues, matched with spatial protein and expression profiling (GeoMx) across 357 tissue sections. These results define both body-wide and tissue-specific (heart, liver, lung, kidney, and lymph nodes) damage wrought by the SARS-CoV-2 infection, evident as a function of varying viral load (high vs. low) during the course of infection and specific, transcriptional dysregulation in splicing isoforms, T cell receptor expression, and cellular expression states. In particular, cardiac and lung tissues revealed the largest degree of splicing isoform switching and cell expression state loss. Overall, these findings reveal a systemic disruption of cellular and transcriptional pathways from COVID-19 across all tissues, which can inform subsequent studies to combat the mortality of COVID-19, as well to better understand the molecular dynamics of lethal SARS-CoV-2 infection and other viruses.
PUBMED: 33758858 PMC: PMC7987017 DOI: 10.1101/2021.03.08.434433
EMBO molecular medicine 2021;13;4;e12871
The correspondence of cell state changes in diseased organs to peripheral protein signatures is currently unknown. Here, we generated and integrated single-cell transcriptomic and proteomic data from multiple large pulmonary fibrosis patient cohorts. Integration of 233,638 single-cell transcriptomes (n = 61) across three independent cohorts enabled us to derive shifts in cell type proportions and a robust core set of genes altered in lung fibrosis for 45 cell types. Mass spectrometry analysis of lung lavage fluid (n = 124) and plasma (n = 141) proteomes identified distinct protein signatures correlated with diagnosis, lung function, and injury status. A novel SSTR2+ pericyte state correlated with disease severity and was reflected in lavage fluid by increased levels of the complement regulatory factor CFHR1. We further discovered CRTAC1 as a biomarker of alveolar type-2 epithelial cell health status in lavage fluid and plasma. Using cross-modal analysis and machine learning, we identified the cellular source of biomarkers and demonstrated that information transfer between modalities correctly predicts disease status, suggesting feasibility of clinical cell state monitoring through longitudinal sampling of body fluid proteomes.
PUBMED: 33650774 PMC: PMC8033531 DOI: 10.15252/emmm.202012871
Investigative ophthalmology & visual science 2021;62;6;18
Retinoblastoma (Rb) is a malignant neoplasm arising during retinal development from mutations in the RB1 gene. Loss or inactivation of both copies of RB1 results in initiation of retinoblastoma tumors; however, additional genetic changes are needed for the continued growth and spread of the tumor. Ex vivo research has shown that in humans, retinoblastoma may initiate from RB1-depleted cone precursors. Notwithstanding, it has not been possible to assess the full spectrum of clonal types within the tumor itself in vivo and the molecular changes occurring at the cells of origin, enabling their malignant conversion. To overcome these challenges, we have performed the first single cell (sc) RNA- and ATAC-Seq analyses of primary tumor tissues, enabling us to dissect the transcriptional and chromatin accessibility heterogeneity of proliferating cone precursors in human Rb tumors.
2021;18;11
Charting an organs' biological atlas requires us to spatially resolve the entire single-cell transcriptome, and to relate such cellular features to the anatomical scale. Single-cell and single-nucleus RNA-seq (sc/snRNA-seq) can profile cells comprehensively, but lose spatial information. Spatial transcriptomics allows for spatial measurements, but at lower resolution and with limited sensitivity. Targeted in situ technologies solve both issues, but are limited in gene throughput. To overcome these limitations we present Tangram, a method that aligns sc/snRNA-seq data to various forms of spatial data collected from the same region, including MERFISH, STARmap, smFISH, Spatial Transcriptomics (Visium) and histological images. Tangram can map any type of sc/snRNA-seq data, including multimodal data such as those from SHARE-seq, which we used to reveal spatial patterns of chromatin accessibility. We demonstrate Tangram on healthy mouse brain tissue, by reconstructing a genome-wide anatomically integrated spatial map at single-cell resolution of the visual and somatomotor areas.
PUBMED: 34711971 PMC: PMC8566243 DOI: 10.1038/s41592-021-01264-7
2021;3;4
Single-cell RNA sequencing (scRNA-seq) revolutionized our understanding of disease biology. The promise it presents to also transform translational research requires highly standardized and robust software workflows. Here, we present the toolkit , which streamlines scRNA-seq analyses and their use to deconvolute bulk RNA-seq data according to current best practices. Beyond a standard workflow covering quality control, filtering, and clustering, two complementary modules, utilizing hierarchical cell signatures and supervised machine learning, automate cell annotation and provide harmonized nomenclatures. Subsequently, the gene expression profiles can be employed to estimate cell type proportions in bulk transcriptomics data. Using multiple, diverse scRNA-seq datasets, some stemming from highly heterogeneous tumor tissue, we show how aids acceleration, interoperability, reusability and interpretability of scRNA-seq data analyses, meeting crucial demands in translational research and beyond.
PUBMED: 34761219 PMC: PMC8573822 DOI: 10.1093/nargab/lqab102
Nature Biotechnology 2020;39;6;705-716
In coronavirus disease 2019 (COVID-19), hypertension and cardiovascular diseases are major risk factors for critical disease progression. However, the underlying causes and the effects of the main anti-hypertensive therapies—angiotensin-converting enzyme inhibitors (ACEIs) and angiotensin receptor blockers (ARBs)—remain unclear. Combining clinical data (n = 144) and single-cell sequencing data of airway samples (n = 48) with in vitro experiments, we observed a distinct inflammatory predisposition of immune cells in patients with hypertension that correlated with critical COVID-19 progression. ACEI treatment was associated with dampened COVID-19-related hyperinflammation and with increased cell intrinsic antiviral responses, whereas ARB treatment related to enhanced epithelial–immune cell interactions. Macrophages and neutrophils of patients with hypertension, in particular under ARB treatment, exhibited higher expression of the pro-inflammatory cytokines CCL3 and CCL4 and the chemokine receptor CCR1. Although the limited size of our cohort does not allow us to establish clinical efficacy, our data suggest that the clinical benefits of ACEI treatment in patients with COVID-19 who have hypertension warrant further investigation. Single-cell analysis reveals how anti-hypertensive drugs affect the risk of severe disease in patients with COVID-19 who have hypertension.
Developmental cell 2020;55;6;771-783.e5
Human gut development requires the orchestrated interaction of differentiating cell types. Here, we generate an in-depth single-cell map of the developing human intestine at 6-10 weeks post-conception. Our analysis reveals the transcriptional profile of cycling epithelial precursor cells; distinct from LGR5-expressing cells. We propose that these cells may contribute to differentiated cell subsets via the generation of LGR5-expressing stem cells and receive signals from surrounding mesenchymal cells. Furthermore, we draw parallels between the transcriptomes of ex vivo tissues and in vitro fetal organoids, revealing the maturation of organoid cultures in a dish. Lastly, we compare scRNA-seq profiles from pediatric Crohn's disease epithelium alongside matched healthy controls to reveal disease-associated changes in the epithelial composition. Contrasting these with the fetal profiles reveals the re-activation of fetal transcription factors in Crohn's disease. Our study provides a resource available at www.gutcellatlas.org, and underscores the importance of unraveling fetal development in understanding disease.
PUBMED: 33290721 PMC: PMC7762816 DOI: 10.1016/j.devcel.2020.11.010
Immunity 2020;53;6;1296-1314.e9
Temporal resolution of cellular features associated with a severe COVID-19 disease trajectory is needed for understanding skewed immune responses and defining predictors of outcome. Here, we performed a longitudinal multi-omics study using a two-center cohort of 14 patients. We analyzed the bulk transcriptome, bulk DNA methylome, and single-cell transcriptome (>358,000 cells, including BCR profiles) of peripheral blood samples harvested from up to 5 time points. Validation was performed in two independent cohorts of COVID-19 patients. Severe COVID-19 was characterized by an increase of proliferating, metabolically hyperactive plasmablasts. Coinciding with critical illness, we also identified an expansion of interferon-activated circulating megakaryocytes and increased erythropoiesis with features of hypoxic signaling. Megakaryocyte- and erythroid-cell-derived co-expression modules were predictive of fatal disease outcome. The study demonstrates broad cellular effects of SARS-CoV-2 infection beyond adaptive immune cells and provides an entry point toward developing biomarkers and targeted treatments of patients with COVID-19.
PUBMED: 33296687 PMC: PMC7689306 DOI: 10.1016/j.immuni.2020.11.017
American journal of respiratory and critical care medicine 2020;202;12;1636-1645
The respiratory tract constitutes an elaborate line of defense that is based on a unique cellular ecosystem. We aimed to investigate cell population distributions and transcriptional changes along the airways by using single-cell RNA profiling. We have explored the cellular heterogeneity of the human airway epithelium in 10 healthy living volunteers by single-cell RNA profiling. A total of 77,969 cells were collected at 35 distinct locations, from the nose to the 12th division of the airway tree. The resulting atlas is composed of a high percentage of epithelial cells (89.1%) but also immune (6.2%) and stromal (4.7%) cells with distinct cellular proportions in different regions of the airways. It reveals differential gene expression between identical cell types (suprabasal, secretory, and multiciliated cells) from the nose (, , ) and tracheobronchial (, ) airways. By contrast, cell-type-specific gene expression is stable across all tracheobronchial samples. Our atlas improves the description of ionocytes, pulmonary neuroendocrine cells, and brush cells and identifies a related population of -positive cells. We also report the association of with dividing cells that are reminiscent of previously described mouse "hillock" cells and with squamous cells expressing and . Robust characterization of a single-cell cohort in healthy airways establishes a valuable resource for future investigations. The precise description of the continuum existing from the nasal epithelium to successive divisions of the airways and the stable gene expression profile of these regions better defines conditions under which relevant tracheobronchial proxies of human respiratory diseases can be developed.
Nature Machine Intelligence 2020;2;12
Recent advances in single-cell RNA sequencing have driven the simultaneous measurement of the expression of thousands of genes in thousands of single cells. These growing datasets allow us to model gene sets in biological networks at an unprecedented level of detail, in spite of heterogeneous cell populations. Here, we propose a deep neural network model that is a hybrid of matrix factorization and variational autoencoders, which we call restricted latent variational autoencoder (resVAE). The model uses weights as factorized matrices to obtain gene sets, while class-specific inputs to the latent variable space facilitate a plausible identification of cell types. This artificial neural network model seamlessly integrates functional gene set inference, experimental covariate effect isolation, and static gene identification, which we conceptually demonstrate here for four single-cell RNA sequencing datasets. The wealth of data generated by single-cell RNA sequencing can be used to identify gene sets across cells, as well as to identify specific cells. Lukassen and colleagues propose a method combining matrix factorization and variational auto encoders that can capture both cross-cell and cell-specific information.
Nature 2020;588;7838;466-472
Cardiovascular disease is the leading cause of death worldwide. Advanced insights into disease mechanisms and therapeutic strategies require a deeper understanding of the molecular processes involved in the healthy heart. Knowledge of the full repertoire of cardiac cells and their gene expression profiles is a fundamental first step in this endeavour. Here, using state-of-the-art analyses of large-scale single-cell and single-nucleus transcriptomes, we characterize six anatomical adult heart regions. Our results highlight the cellular heterogeneity of cardiomyocytes, pericytes and fibroblasts, and reveal distinct atrial and ventricular subsets of cells with diverse developmental origins and specialized properties. We define the complexity of the cardiac vasculature and its changes along the arterio-venous axis. In the immune compartment, we identify cardiac-resident macrophages with inflammatory and protective transcriptional signatures. Furthermore, analyses of cell-to-cell interactions highlight different networks of macrophages, fibroblasts and cardiomyocytes between atria and ventricles that are distinct from those of skeletal muscle. Our human cardiac cell atlas improves our understanding of the human heart and provides a valuable reference for future studies.
PUBMED: 32971526 PMC: PMC7681775 DOI: 10.1038/s41586-020-2797-4
Kidney international 2020;98;6;1502-1518
COVID-19 morbidity and mortality are increased via unknown mechanisms in patients with diabetes and kidney disease. SARS-CoV-2 uses angiotensin-converting enzyme 2 (ACE2) for entry into host cells. Because ACE2 is a susceptibility factor for infection, we investigated how diabetic kidney disease and medications alter ACE2 receptor expression in kidneys. Single cell RNA profiling of kidney biopsies from healthy living donors and patients with diabetic kidney disease revealed ACE2 expression primarily in proximal tubular epithelial cells. This cell-specific localization was confirmed by in situ hybridization. ACE2 expression levels were unaltered by exposures to renin-angiotensin-aldosterone system inhibitors in diabetic kidney disease. Bayesian integrative analysis of a large compendium of public -omics datasets identified molecular network modules induced in ACE2-expressing proximal tubular epithelial cells in diabetic kidney disease (searchable at hb.flatironinstitute.org/covid-kidney) that were linked to viral entry, immune activation, endomembrane reorganization, and RNA processing. The diabetic kidney disease ACE2-positive proximal tubular epithelial cell module overlapped with expression patterns seen in SARS-CoV-2-infected cells. Similar cellular programs were seen in ACE2-positive proximal tubular epithelial cells obtained from urine samples of 13 hospitalized patients with COVID-19, suggesting a consistent ACE2-coregulated proximal tubular epithelial cell expression program that may interact with the SARS-CoV-2 infection processes. Thus SARS-CoV-2 receptor networks can seed further research into risk stratification and therapeutic strategies for COVID-19-related kidney damage.
PUBMED: 33038424 PMC: PMC7543950 DOI: 10.1016/j.kint.2020.09.015
Science 2020;370;6518
The gene expression program underlying the specification of human cell types is of fundamental interest. We generated human cell atlases of gene expression and chromatin accessibility in fetal tissues. For gene expression, we applied three-level combinatorial indexing to >110 samples representing 15 organs, ultimately profiling ~4 million single cells. We leveraged the literature and other atlases to identify and annotate hundreds of cell types and subtypes, both within and across tissues. Our analyses focused on organ-specific specializations of broadly distributed cell types (such as blood, endothelial, and epithelial), sites of fetal erythropoiesis (which notably included the adrenal gland), and integration with mouse developmental atlases (such as conserved specification of blood cells). These data represent a rich resource for the exploration of in vivo human gene expression in diverse tissues and cell types.
PUBMED: 33184181 PMC: PMC7780123 DOI: 10.1126/science.aba7721
Science 2020;370;6518
The chromatin landscape underlying the specification of human cell types is of fundamental interest. We generated human cell atlases of chromatin accessibility and gene expression in fetal tissues. For chromatin accessibility, we devised a three-level combinatorial indexing assay and applied it to 53 samples representing 15 organs, profiling ~800,000 single cells. We leveraged cell types defined by gene expression to annotate these data and cataloged hundreds of thousands of candidate regulatory elements that exhibit cell type-specific chromatin accessibility. We investigated the properties of lineage-specific transcription factors (such as POU2F1 in neurons), organ-specific specializations of broadly distributed cell types (such as blood and endothelial), and cell type-specific enrichments of complex trait heritability. These data represent a rich resource for the exploration of in vivo human gene regulation in diverse tissues and cell types.
PUBMED: 33184180 PMC: PMC7785298 DOI: 10.1126/science.aba7612
Nature Medicine 2020;26;10;1509-1511
The Human Cell Atlas has been undergoing a massive effort to support global scientific equity. The co-leaders of its Equity Working Group share some lessons learned in the process.
2020;182;6
The enteric nervous system (ENS) coordinates diverse functions in the intestine but has eluded comprehensive molecular characterization because of the rarity and diversity of cells. Here we develop two methods to profile the ENS of adult mice and humans at single-cell resolution: RAISIN RNA-seq for profiling intact nuclei with ribosome-bound mRNA and MIRACL-seq for label-free enrichment of rare cell types by droplet-based profiling. The 1,187,535 nuclei in our mouse atlas include 5,068 neurons from the ileum and colon, revealing extraordinary neuron diversity. We highlight circadian expression changes in enteric neurons, show that disease-related genes are dysregulated with aging, and identify differences between the ileum and proximal/distal colon. In humans, we profile 436,202 nuclei, recovering 1,445 neurons, and identify conserved and species-specific transcriptional programs and putative neuro-epithelial, neuro-stromal, and neuro-immune interactions. The human ENS expresses risk genes for neuropathic, inflammatory, and extra-intestinal diseases, suggesting neuronal contributions to disease.
PUBMED: 32888429 PMC: PMC8358727 DOI: 10.1016/j.cell.2020.08.003
Experimental & molecular medicine 2020;52;9;1409-1418
The human body consists of 37 trillion single cells represented by over 50 organs that are stitched together to make us who we are, yet we still have very little understanding about the basic units of our body: what cell types and states make up our organs both compositionally and spatially. Previous efforts to profile a wide range of human cell types have been attempted by the FANTOM and GTEx consortia. Now, with the advancement in genomic technologies, profiling the human body at single-cell resolution is possible and will generate an unprecedented wealth of data that will accelerate basic and clinical research with tangible applications to future medicine. To date, several major organs have been profiled, but the challenges lie in ways to integrate single-cell genomics data in a meaningful way. In recent years, several consortia have begun to introduce harmonization and equity in data collection and analysis. Herein, we introduce existing and nascent single-cell genomics consortia, and present benefits to necessitate single-cell genomic consortia in a regional environment to achieve the universal human cell reference dataset.
PUBMED: 32929222 PMC: PMC8080593 DOI: 10.1038/s12276-020-0409-x
Circulation 2020;142;7;708-710
PUBMED: 32795091 PMC: PMC7424896 DOI: 10.1161/CIRCULATIONAHA.120.047911
Nature Communications 2020;11;1;4025
Droplet-based high throughput single cell sequencing techniques tremendously advanced our insight into cell-to-cell heterogeneity. However, those approaches only allow analysis of one extremity of the transcript after short read sequencing. In consequence, information on splicing and sequence heterogeneity is lost. To overcome this limitation, several approaches that use long-read sequencing were introduced recently. Yet, those techniques are limited by low sequencing depth and/or lacking or inaccurate assignment of unique molecular identifiers (UMIs), which are critical for elimination of PCR bias and artifacts. We introduce ScNaUmi-seq, an approach that combines the high throughput of Oxford Nanopore sequencing with an accurate cell barcode and UMI assignment strategy. UMI guided error correction allows to generate high accuracy full length sequence information with the 10x Genomics single cell isolation system at high sequencing depths. We analyzed transcript isoform diversity in embryonic mouse brain and show that ScNaUmi-seq allows defining splicing and SNVs (RNA editing) at a single cell level.
PUBMED: 32788667 PMC: PMC7423900 DOI: 10.1038/s41467-020-17800-6
Nature Biotechnology 2020;38;8;970-979
To investigate the immune response and mechanisms associated with severe coronavirus disease 2019 (COVID-19), we performed single-cell RNA sequencing on nasopharyngeal and bronchial samples from 19 clinically well-characterized patients with moderate or critical disease and from five healthy controls. We identified airway epithelial cell types and states vulnerable to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection. In patients with COVID-19, epithelial cells showed an average three-fold increase in expression of the SARS-CoV-2 entry receptor ACE2, which correlated with interferon signals by immune cells. Compared to moderate cases, critical cases exhibited stronger interactions between epithelial and immune cells, as indicated by ligand–receptor expression profiles, and activated immune cells, including inflammatory macrophages expressing CCL2, CCL3, CCL20, CXCL1, CXCL3, CXCL10, IL8, IL1B and TNF. The transcriptional differences in critical cases compared to moderate cases likely contribute to clinical observations of heightened inflammatory tissue damage, lung injury and respiratory failure. Our data suggest that pharmacologic inhibition of the CCR1 and/or CCR5 pathways might suppress immune hyperactivation in critical COVID-19. Single-cell analysis of COVID-19 patient samples identifies activated immune pathways that correlate with severe disease.
2020;38;6
The scale and capabilities of single-cell RNA-sequencing methods have expanded rapidly in recent years, enabling major discoveries and large-scale cell mapping efforts. However, these methods have not been systematically and comprehensively benchmarked. Here, we directly compare seven methods for single-cell and/or single-nucleus profiling-selecting representative methods based on their usage and our expertise and resources to prepare libraries-including two low-throughput and five high-throughput methods. We tested the methods on three types of samples: cell lines, peripheral blood mononuclear cells and brain tissue, generating 36 libraries in six separate experiments in a single center. To directly compare the methods and avoid processing differences introduced by the existing pipelines, we developed scumi, a flexible computational pipeline that can be used with any single-cell RNA-sequencing method. We evaluated the methods for both basic performance, such as the structure and alignment of reads, sensitivity and extent of multiplets, and for their ability to recover known biological information in the samples.
PUBMED: 32341560 PMC: PMC7289686 DOI: 10.1038/s41587-020-0465-8
2020;181;5
There is pressing urgency to understand the pathogenesis of the severe acute respiratory syndrome coronavirus clade 2 (SARS-CoV-2), which causes the disease COVID-19. SARS-CoV-2 spike (S) protein binds angiotensin-converting enzyme 2 (ACE2), and in concert with host proteases, principally transmembrane serine protease 2 (TMPRSS2), promotes cellular entry. The cell subsets targeted by SARS-CoV-2 in host tissues and the factors that regulate ACE2 expression remain unknown. Here, we leverage human, non-human primate, and mouse single-cell RNA-sequencing (scRNA-seq) datasets across health and disease to uncover putative targets of SARS-CoV-2 among tissue-resident cell subsets. We identify ACE2 and TMPRSS2 co-expressing cells within lung type II pneumocytes, ileal absorptive enterocytes, and nasal goblet secretory cells. Strikingly, we discovered that ACE2 is a human interferon-stimulated gene (ISG) in vitro using airway epithelial cells and extend our findings to in vivo viral infections. Our data suggest that SARS-CoV-2 could exploit species-specific interferon-driven upregulation of ACE2, a tissue-protective mediator during lung injury, to enhance infection.
PUBMED: 32413319 PMC: PMC7252096 DOI: 10.1016/j.cell.2020.04.035
The EMBO journal 2020;39;10;e105114
The SARS-CoV-2 pandemic affecting the human respiratory system severely challenges public health and urgently demands for increasing our understanding of COVID-19 pathogenesis, especially host factors facilitating virus infection and replication. SARS-CoV-2 was reported to enter cells via binding to ACE2, followed by its priming by TMPRSS2. Here, we investigate ACE2 and TMPRSS2 expression levels and their distribution across cell types in lung tissue (twelve donors, 39,778 cells) and in cells derived from subsegmental bronchial branches (four donors, 17,521 cells) by single nuclei and single cell RNA sequencing, respectively. While TMPRSS2 is strongly expressed in both tissues, in the subsegmental bronchial branches ACE2 is predominantly expressed in a transient secretory cell type. Interestingly, these transiently differentiating cells show an enrichment for pathways related to RHO GTPase function and viral processes suggesting increased vulnerability for SARS-CoV-2 infection. Our data provide a rich resource for future investigations of COVID-19 infection and pathogenesis.
PUBMED: 32246845 PMC: PMC7232010 DOI: 10.15252/embj.20105114
2020;21;1
Robust protocols and automation now enable large-scale single-cell RNA and ATAC sequencing experiments and their application on biobank and clinical cohorts. However, technical biases introduced during sample acquisition can hinder solid, reproducible results, and a systematic benchmarking is required before entering large-scale data production. Here, we report the existence and extent of gene expression and chromatin accessibility artifacts introduced during sampling and identify experimental and computational solutions for their prevention.
PUBMED: 32393363 PMC: PMC7212672 DOI: 10.1186/s13059-020-02032-0
Nature Medicine 2020;26;5;792-802
Single-cell genomics is essential to chart tumor ecosystems. Although single-cell RNA-Seq (scRNA-Seq) profiles RNA from cells dissociated from fresh tumors, single-nucleus RNA-Seq (snRNA-Seq) is needed to profile frozen or hard-to-dissociate tumors. Each requires customization to different tissue and tumor types, posing a barrier to adoption. Here, we have developed a systematic toolbox for profiling fresh and frozen clinical tumor samples using scRNA-Seq and snRNA-Seq, respectively. We analyzed 216,490 cells and nuclei from 40 samples across 23 specimens spanning eight tumor types of varying tissue and sample characteristics. We evaluated protocols by cell and nucleus quality, recovery rate and cellular composition. scRNA-Seq and snRNA-Seq from matched samples recovered the same cell types, but at different proportions. Our work provides guidance for studies in a broad range of tumors, including criteria for testing and selecting methods from the toolbox for other tumors, thus paving the way for charting tumor atlases.
PUBMED: 32405060 PMC: PMC7220853 DOI: 10.1038/s41591-020-0844-1
bioRxiv 2020;2020.04.10.022103
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
Nature Medicine 2020;26;5;681-687
We investigated SARS-CoV-2 potential tropism by surveying expression of viral entry-associated genes in single-cell RNA-sequencing data from multiple tissues from healthy human donors. We co-detected these transcripts in specific respiratory, corneal and intestinal epithelial cells, potentially explaining the high efficiency of SARS-CoV-2 transmission. These genes are co-expressed in nasal epithelial cells with genes involved in innate immunity, highlighting the cells’ potential role in initial viral infection, spread and clearance. The study offers a useful resource for further lines of inquiry with valuable clinical samples from COVID-19 patients and we provide our data in a comprehensive, open and user-friendly fashion at www.covid19cellatlas.org. An analysis of single-cell transcriptomics datasets from different tissues shows that ACE2 and TMPRSS2 are co-expressed in respiratory, corneal and intestinal epithelial cell populations, and that respiratory expression of ACE2 is associated with genes involved in innate immunity.
bioRxiv 2020;2020.04.07.007237
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
Nature Biotechnology 2020;38;6;747-755
Single-cell RNA sequencing (scRNA-seq) is the leading technique for characterizing the transcriptomes of individual cells in a sample. The latest protocols are scalable to thousands of cells and are being used to compile cell atlases of tissues, organs and organisms. However, the protocols differ substantially with respect to their RNA capture efficiency, bias, scale and costs, and their relative advantages for different applications are unclear. In the present study, we generated benchmark datasets to systematically evaluate protocols in terms of their power to comprehensively describe cell types and states. We performed a multicenter study comparing 13 commonly used scRNA-seq and single-nucleus RNA-seq protocols applied to a heterogeneous reference sample resource. Comparative analysis revealed marked differences in protocol performance. The protocols differed in library complexity and their ability to detect cell-type markers, impacting their predictive value and suitability for integration into reference cell atlases. These results provide guidance both for individual researchers and for consortium projects such as the Human Cell Atlas. A multicenter study compares 13 commonly used single-cell RNA-seq protocols.
Nature 2020;581;7808;303-309
Single-cell analysis is a valuable tool for dissecting cellular heterogeneity in complex systems1. However, a comprehensive single-cell atlas has not been achieved for humans. Here we use single-cell mRNA sequencing to determine the cell-type composition of all major human organs and construct a scheme for the human cell landscape (HCL). We have uncovered a single-cell hierarchy for many tissues that have not been well characterized. We established a ‘single-cell HCL analysis’ pipeline that helps to define human cell identity. Finally, we performed a single-cell comparative analysis of landscapes from human and mouse to identify conserved genetic networks. We found that stem and progenitor cells exhibit strong transcriptomic stochasticity, whereas differentiated cells are more distinct. Our results provide a useful resource for the study of human biology. Single-cell RNA sequencing is used to generate a dataset covering all major human organs in both adult and fetal stages, enabling comparison with similar datasets for mouse tissues.
The Journal of experimental medicine 2020;217;3
Plasmacytoid dendritic cells (pDCs) produce type I interferon (IFN-I) and are traditionally defined as being BDCA-2+CD123+. pDCs are not readily detectable in healthy human skin, but have been suggested to accumulate in wounds. Here, we describe a CD1a-bearing BDCA-2+CD123int DC subset that rapidly infiltrates human skin wounds and comprises a major DC population. Using single-cell RNA sequencing, we show that these cells are largely activated DCs acquiring features compatible with lymph node homing and antigen presentation, but unexpectedly express both BDCA-2 and CD123, potentially mimicking pDCs. Furthermore, a third BDCA-2-expressing population, Axl+Siglec-6+ DCs (ASDC), was also found to infiltrate human skin during wounding. These data demonstrate early skin infiltration of a previously unrecognized CD123intBDCA-2+CD1a+ DC subset during acute sterile inflammation, and prompt a re-evaluation of previously ascribed pDC involvement in skin disease.
Nature Immunology 2020;21;3;343-353
Gastrointestinal microbiota and immune cells interact closely and display regional specificity; however, little is known about how these communities differ with location. Here, we simultaneously assess microbiota and single immune cells across the healthy, adult human colon, with paired characterization of immune cells in the mesenteric lymph nodes, to delineate colonic immune niches at steady state. We describe distinct helper T cell activation and migration profiles along the colon and characterize the transcriptional adaptation trajectory of regulatory T cells between lymphoid tissue and colon. Finally, we show increasing B cell accumulation, clonal expansion and mutational frequency from the cecum to the sigmoid colon and link this to the increasing number of reactive bacterial species.
PUBMED: 32066951 PMC: PMC7212050 DOI: 10.1038/s41590-020-0602-z
Science 2020;367;6480
The thymus provides a nurturing environment for the differentiation and selection of T cells, a process orchestrated by their interaction with multiple thymic cell types. We used single-cell RNA sequencing to create a cell census of the human thymus across the life span and to reconstruct T cell differentiation trajectories and T cell receptor (TCR) recombination kinetics. Using this approach, we identified and located in situ CD8αα T cell populations, thymic fibroblast subtypes, and activated dendritic cell states. In addition, we reveal a bias in TCR recombination and selection, which is attributed to genomic position and the kinetics of lineage commitment. Taken together, our data provide a comprehensive atlas of the human thymus across the life span with new insights into human T cell development.
PUBMED: 32079746 PMC: PMC7611066 DOI: 10.1126/science.aay3224
Scientific reports 2020;10;1;1535
A comprehensive reference map of all cell types in the human body is necessary for improving our understanding of fundamental biological processes and in diagnosing and treating disease. High-throughput single-cell RNA sequencing techniques have emerged as powerful tools to identify and characterize cell types in complex and heterogeneous tissues. However, extracting intact cells from tissues and organs is often technically challenging or impossible, for example in heart or brain tissue. Single-nucleus RNA sequencing provides an alternative way to obtain transcriptome profiles of such tissues. To systematically assess the differences between high-throughput single-cell and single-nuclei RNA-seq approaches, we compared Drop-seq and DroNc-seq, two microfluidic-based 3' RNA capture technologies that profile total cellular and nuclear RNA, respectively, during a time course experiment of human induced pluripotent stem cells (iPSCs) differentiating into cardiomyocytes. Clustering of time-series transcriptomes from Drop-seq and DroNc-seq revealed six distinct cell types, five of which were found in both techniques. Furthermore, single-cell trajectories reconstructed from both techniques reproduced expected differentiation dynamics. We then applied DroNc-seq to postmortem heart tissue to test its performance on heterogeneous human tissue samples. Our data confirm that DroNc-seq yields similar results to Drop-seq on matched samples and can be successfully used to generate reference maps for the human cell atlas.
PUBMED: 32001747 PMC: PMC6992778 DOI: 10.1038/s41598-020-58327-6
medRxiv 2020
Despite signs of infection, the involvement of the oral cavity in COVID-19 is poorly understood. To address this, single-cell RNA sequencing data-sets were integrated from human minor salivary glands and gingiva to identify 11 epithelial, 7 mesenchymal, and 15 immune cell clusters. Analysis of SARS-CoV-2 viral entry factor expression showed enrichment in epithelia including the ducts and acini of the salivary glands and the suprabasal cells of the mucosae. COVID-19 autopsy tissues confirmed in vivo SARS-CoV-2 infection in the salivary glands and mucosa. Saliva from SARS-CoV-2-infected individuals harbored epithelial cells exhibiting expression and SARS-CoV-2 RNA. Matched nasopharyngeal and saliva samples found distinct viral shedding dynamics and viral burden in saliva correlated with COVID-19 symptoms including taste loss. Upon recovery, this cohort exhibited salivary antibodies against SARS-CoV-2 proteins. Collectively, the oral cavity represents a robust site for COVID-19 infection and implicates saliva in viral transmission.
PUBMED: 33140061 PMC: PMC7605572 DOI: 10.1101/2020.10.26.20219089
2019;21;1
The Human Cell Atlas is a large international collaborative effort to map all cell types of the human body. Single-cell RNA sequencing can generate high-quality data for the delivery of such an atlas. However, delays between fresh sample collection and processing may lead to poor data and difficulties in experimental design.
PUBMED: 31892341 PMC: PMC6937944 DOI: 10.1186/s13059-019-1906-x
Cell 2019;179;7;1455-1467
Understanding the genetic and molecular drivers of phenotypic heterogeneity across individuals is central to biology. As new technologies enable fine-grained and spatially resolved molecular profiling, we need new computational approaches to integrate data from the same organ across different individuals into a consistent reference and to construct maps of molecular and cellular organization at histological and anatomical scales. Here, we review previous efforts and discuss challenges involved in establishing such a common coordinate framework, the underlying map of tissues and organs. We focus on strategies to handle anatomical variation across individuals and highlight the need for new technologies and analytical methods spanning multiple hierarchical scales of spatial resolution.
PUBMED: 31835027 PMC: PMC6934046 DOI: 10.1016/j.cell.2019.11.019
Nature 2019;575;7783;512-518
Liver cirrhosis is a major cause of death worldwide and is characterized by extensive fibrosis. There are currently no effective antifibrotic therapies available. To obtain a better understanding of the cellular and molecular mechanisms involved in disease pathogenesis and enable the discovery of therapeutic targets, here we profile the transcriptomes of more than 100,000 single human cells, yielding molecular definitions for non-parenchymal cell types that are found in healthy and cirrhotic human liver. We identify a scar-associated TREM2CD9 subpopulation of macrophages, which expands in liver fibrosis, differentiates from circulating monocytes and is pro-fibrogenic. We also define ACKR1 and PLVAP endothelial cells that expand in cirrhosis, are topographically restricted to the fibrotic niche and enhance the transmigration of leucocytes. Multi-lineage modelling of ligand and receptor interactions between the scar-associated macrophages, endothelial cells and PDGFRα collagen-producing mesenchymal cells reveals intra-scar activity of several pro-fibrogenic pathways including TNFRSF12A, PDGFR and NOTCH signalling. Our work dissects unanticipated aspects of the cellular and molecular basis of human organ fibrosis at a single-cell level, and provides a conceptual framework for the discovery of rational therapeutic targets in liver cirrhosis.
PUBMED: 31597160 PMC: PMC6876711 DOI: 10.1038/s41586-019-1631-3
Nature 2019;574;7778;365-371
Definitive haematopoiesis in the fetal liver supports self-renewal and differentiation of haematopoietic stem cells and multipotent progenitors (HSC/MPPs) but remains poorly defined in humans. Here, using single-cell transcriptome profiling of approximately 140,000 liver and 74,000 skin, kidney and yolk sac cells, we identify the repertoire of human blood and immune cells during development. We infer differentiation trajectories from HSC/MPPs and evaluate the influence of the tissue microenvironment on blood and immune cell development. We reveal physiological erythropoiesis in fetal skin and the presence of mast cells, natural killer and innate lymphoid cell precursors in the yolk sac. We demonstrate a shift in the haemopoietic composition of fetal liver during gestation away from being predominantly erythroid, accompanied by a parallel change in differentiation potential of HSC/MPPs, which we functionally validate. Our integrated map of fetal liver haematopoiesis provides a blueprint for the study of paediatric blood and immune disorders, and a reference for harnessing the therapeutic potential of HSC/MPPs.
PUBMED: 31597962 PMC: PMC6861135 DOI: 10.1038/s41586-019-1652-y
The EMBO journal 2019;38;18;e100811
The retina is a specialized neural tissue that senses light and initiates image processing. Although the functional organization of specific retina cells has been well studied, the molecular profile of many cell types remains unclear in humans. To comprehensively profile the human retina, we performed single-cell RNA sequencing on 20,009 cells from three donors and compiled a reference transcriptome atlas. Using unsupervised clustering analysis, we identified 18 transcriptionally distinct cell populations representing all known neural retinal cells: rod photoreceptors, cone photoreceptors, Müller glia, bipolar cells, amacrine cells, retinal ganglion cells, horizontal cells, astrocytes, and microglia. Our data captured molecular profiles for healthy and putative early degenerating rod photoreceptors, and revealed the loss of MALAT1 expression with longer post-mortem time, which potentially suggested a novel role of MALAT1 in rod photoreceptor degeneration. We have demonstrated the use of this retina transcriptome atlas to benchmark pluripotent stem cell-derived cone photoreceptors and an adult Müller glia cell line. This work provides an important reference with unprecedented insights into the transcriptional landscape of human retinal cells, which is fundamental to understanding retinal biology and disease.
PUBMED: 31436334 PMC: PMC6745503 DOI: 10.15252/embj.2018100811
2019;573;7772
Elucidating the cellular architecture of the human cerebral cortex is central to understanding our cognitive abilities and susceptibility to disease. Here we used single-nucleus RNA-sequencing analysis to perform a comprehensive study of cell types in the middle temporal gyrus of human cortex. We identified a highly diverse set of excitatory and inhibitory neuron types that are mostly sparse, with excitatory types being less layer-restricted than expected. Comparison to similar mouse cortex single-cell RNA-sequencing datasets revealed a surprisingly well-conserved cellular architecture that enables matching of homologous types and predictions of properties of human cell types. Despite this general conservation, we also found extensive differences between homologous human and mouse cell types, including marked alterations in proportions, laminar distributions, gene expression and morphology. These species-specific features emphasize the importance of directly studying human brain.
PUBMED: 31435019 PMC: PMC6919571 DOI: 10.1038/s41586-019-1506-7
bioRxiv 2019;455451
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
bioRxiv 2019;721258
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
Nature 2019;572;7768;199-204
The human liver is an essential multifunctional organ. The incidence of liver diseases is rising and there are limited treatment options. However, the cellular composition of the liver remains poorly understood. Here we performed single-cell RNA sequencing of about 10,000 cells from normal liver tissue from nine human donors to construct a human liver cell atlas. Our analysis identified previously unknown subtypes of endothelial cells, Kupffer cells, and hepatocytes, with transcriptome-wide zonation of some of these populations. We show that the EPCAM population is heterogeneous, comprising hepatocyte-biased and cholangiocyte populations as well as a TROP2 progenitor population with strong potential to form bipotent liver organoids. As a proof-of-principle, we used our atlas to unravel the phenotypic changes that occur in hepatocellular carcinoma cells and in human hepatocytes and liver endothelial cells engrafted into a mouse liver. Our human liver cell atlas provides a powerful resource to enable the discovery of previously unknown cell types in normal and diseased livers.
PUBMED: 31292543 PMC: PMC6687507 DOI: 10.1038/s41586-019-1373-2
Nature Communications 2019;10;1;2907
Single-nucleus RNA-seq (snRNA-seq) enables the interrogation of cellular states in complex tissues that are challenging to dissociate or are frozen, and opens the way to human genetics studies, clinical trials, and precise cell atlases of large organs. However, such applications are currently limited by batch effects, processing, and costs. Here, we present an approach for multiplexing snRNA-seq, using sample-barcoded antibodies to uniquely label nuclei from distinct samples. Comparing human brain cortex samples profiled with or without hashing antibodies, we demonstrate that nucleus hashing does not significantly alter recovered profiles. We develop DemuxEM, a computational tool that detects inter-sample multiplets and assigns singlets to their sample of origin, and validate its accuracy using sex-specific gene expression, species-mixing and natural genetic variation. Our approach will facilitate tissue atlases of isogenic model organisms or from multiple biopsies or longitudinal samples of one donor, and large-scale perturbation screens.
PUBMED: 31266958 PMC: PMC6606589 DOI: 10.1038/s41467-019-10756-2
Nature Medicine 2019;25;7;1153-1163
Human lungs enable efficient gas exchange and form an interface with the environment, which depends on mucosal immunity for protection against infectious agents. Tightly controlled interactions between structural and immune cells are required to maintain lung homeostasis. Here, we use single-cell transcriptomics to chart the cellular landscape of upper and lower airways and lung parenchyma in healthy lungs, and lower airways in asthmatic lungs. We report location-dependent airway epithelial cell states and a novel subset of tissue-resident memory T cells. In the lower airways of patients with asthma, mucous cell hyperplasia is shown to stem from a novel mucous ciliated cell state, as well as goblet cell hyperplasia. We report the presence of pathogenic effector type 2 helper T cells (TH2) in asthmatic lungs and find evidence for type 2 cytokines in maintaining the altered epithelial cell states. Unbiased analysis of cell–cell interactions identifies a shift from airway structural cell communication in healthy lungs to a TH2-dominated interactome in asthmatic lungs. Single-cell transcriptomics reveals immune and stromal compartment remodeling, including the enrichment of unique populations of epithelial cells and CD4+ T cells, in asthmatic lungs
American journal of respiratory and critical care medicine 2019;199;12;1517-1536
The contributions of diverse cell populations in the human lung to pulmonary fibrosis pathogenesis are poorly understood. Single-cell RNA sequencing can reveal changes within individual cell populations during pulmonary fibrosis that are important for disease pathogenesis. To determine whether single-cell RNA sequencing can reveal disease-related heterogeneity within alveolar macrophages, epithelial cells, or other cell types in lung tissue from subjects with pulmonary fibrosis compared with control subjects. We performed single-cell RNA sequencing on lung tissue obtained from eight transplant donors and eight recipients with pulmonary fibrosis and on one bronchoscopic cryobiospy sample from a patient with idiopathic pulmonary fibrosis. We validated these data using RNA hybridization, immunohistochemistry, and bulk RNA-sequencing on flow-sorted cells from 22 additional subjects. We identified a distinct, novel population of profibrotic alveolar macrophages exclusively in patients with fibrosis. Within epithelial cells, the expression of genes involved in Wnt secretion and response was restricted to nonoverlapping cells. We identified rare cell populations including airway stem cells and senescent cells emerging during pulmonary fibrosis. We developed a web-based tool to explore these data. We generated a single-cell atlas of pulmonary fibrosis. Using this atlas, we demonstrated heterogeneity within alveolar macrophages and epithelial cells from subjects with pulmonary fibrosis. These results support the feasibility of discovery-based approaches using next-generation sequencing technologies to identify signaling pathways for targeting in the development of personalized therapies for patients with pulmonary fibrosis.
PUBMED: 30554520 PMC: PMC6580683 DOI: 10.1164/rccm.201712-2410OC
Developmental cell 2019;49;1;10-29
Single-cell gene expression analyses of mammalian tissues have uncovered profound stage-specific molecular regulatory phenomena that have changed the understanding of unique cell types and signaling pathways critical for lineage determination, morphogenesis, and growth. We discuss here the case for a Pediatric Cell Atlas as part of the Human Cell Atlas consortium to provide single-cell profiles and spatial characterization of gene expression across human tissues and organs. Such data will complement adult and developmentally focused HCA projects to provide a rich cytogenomic framework for understanding not only pediatric health and disease but also environmental and genetic impacts across the human lifespan.
PUBMED: 30930166 PMC: PMC6616346 DOI: 10.1016/j.devcel.2019.03.001
2019;37;4
Single-cell RNA sequencing studies of differentiating systems have raised fundamental questions regarding the discrete versus continuous nature of both differentiation and cell fate. Here we present Palantir, an algorithm that models trajectories of differentiating cells by treating cell fate as a probabilistic process and leverages entropy to measure cell plasticity along the trajectory. Palantir generates a high-resolution pseudo-time ordering of cells and, for each cell state, assigns a probability of differentiating into each terminal state. We apply our algorithm to human bone marrow single-cell RNA sequencing data and detect important landmarks of hematopoietic differentiation. Palantir's resolution enables the identification of key transcription factors that drive lineage fate choice and closely track when cells lose plasticity. We show that Palantir outperforms existing algorithms in identifying cell lineages and recapitulating gene expression trends during differentiation, is generalizable to diverse tissue types, and is well-suited to resolving less-studied differentiating systems.
PUBMED: 30899105 PMC: PMC7549125 DOI: 10.1038/s41587-019-0068-4
Nature Medicine 2018;24;12;1867-1876
Multiple myeloma, a plasma cell malignancy, is the second most common blood cancer. Despite extensive research, disease heterogeneity is poorly characterized, hampering efforts for early diagnosis and improved treatments. Here, we apply single cell RNA sequencing to study the heterogeneity of 40 individuals along the multiple myeloma progression spectrum, including 11 healthy controls, demonstrating high interindividual variability that can be explained by expression of known multiple myeloma drivers and additional putative factors. We identify extensive subclonal structures for 10 of 29 individuals with multiple myeloma. In asymptomatic individuals with early disease and in those with minimal residual disease post-treatment, we detect rare tumor plasma cells with molecular characteristics similar to those of active myeloma, with possible implications for personalized therapies. Single cell analysis of rare circulating tumor cells allows for accurate liquid biopsy and detection of malignant plasma cells, which reflect bone marrow disease. Our work establishes single cell RNA sequencing for dissecting blood malignancies and devising detailed molecular characterization of tumor cells in symptomatic and asymptomatic patients.
Nature Communications 2018;9;1;4383
The liver is the largest solid organ in the body and is critical for metabolic and immune functions. However, little is known about the cells that make up the human liver and its immune microenvironment. Here we report a map of the cellular landscape of the human liver using single-cell RNA sequencing. We provide the transcriptional profiles of 8444 parenchymal and non-parenchymal cells obtained from the fractionation of fresh hepatic tissue from five human livers. Using gene expression patterns, flow cytometry, and immunohistochemical examinations, we identify 20 discrete cell populations of hepatocytes, endothelial cells, cholangiocytes, hepatic stellate cells, B cells, conventional and non-conventional T cells, NK-like cells, and distinct intrahepatic monocyte/macrophage populations. Together, our study presents a comprehensive view of the human liver at single-cell resolution that outlines the characteristics of resident cells in the liver, and in particular provides a map of the human hepatic immune microenvironment.
PUBMED: 30348985 PMC: PMC6197289 DOI: 10.1038/s41467-018-06318-7
arXiv.org 2018
The Human Cell Atlas (HCA) will be made up of comprehensive reference maps of all human cells - the fundamental units of life - as a basis for understanding fundamental human biological processes and diagnosing, monitoring, and treating disease. It will help scientists understand how genetic variants impact disease risk, define drug toxicities, discover better therapies, and advance regenerative medicine. A resource of such ambition and scale should be built in stages, increasing in size, breadth, and resolution as technologies develop and understanding deepens. We will therefore pursue Phase 1 as a suite of flagship projects in key tissues, systems, and organs. We will bring together experts in biology, medicine, genomics, technology development and computation (including data analysis, software engineering, and visualization). We will also need standardized experimental and computational methods that will allow us to compare diverse cell and tissue types - and samples across human communities - in consistent ways, ensuring that the resulting resource is truly global. This document, the first version of the HCA White Paper, was written by experts in the field with feedback and suggestions from the HCA community, gathered during recent international meetings. The White Paper, released at the close of this yearlong planning process, will be a living document that evolves as the HCA community provides additional feedback, as technological and computational advances are made, and as lessons are learned during the construction of the atlas.
Cell 2018;174;4;999-1014.e22
The mammalian nervous system executes complex behaviors controlled by specialized, precisely positioned, and interacting cell types. Here, we used RNA sequencing of half a million single cells to create a detailed census of cell types in the mouse nervous system. We mapped cell types spatially and derived a hierarchical, data-driven taxonomy. Neurons were the most diverse and were grouped by developmental anatomical units and by the expression of neurotransmitters and neuropeptides. Neuronal diversity was driven by genes encoding cell identity, synaptic connectivity, neurotransmission, and membrane conductance. We discovered seven distinct, regionally restricted astrocyte types that obeyed developmental boundaries and correlated with the spatial distribution of key glutamate and glycine neurotransmitters. In contrast, oligodendrocytes showed a loss of regional identity followed by a secondary diversification. The resource presented here lays a solid foundation for understanding the molecular architecture of the mammalian nervous system and enables genetic manipulation of specific cell types.
PUBMED: 30096314 PMC: PMC6086934 DOI: 10.1016/j.cell.2018.06.021
bioRxiv 2018;385328
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
Nature 2018;563;7731;347-353
During early human pregnancy the uterine mucosa transforms into the decidua, into which the fetal placenta implants and where placental trophoblast cells intermingle and communicate with maternal cells. Trophoblast-decidual interactions underlie common diseases of pregnancy, including pre-eclampsia and stillbirth. Here we profile the transcriptomes of about 70,000 single cells from first-trimester placentas with matched maternal blood and decidual cells. The cellular composition of human decidua reveals subsets of perivascular and stromal cells that are located in distinct decidual layers. There are three major subsets of decidual natural killer cells that have distinctive immunomodulatory and chemokine profiles. We develop a repository of ligand-receptor complexes and a statistical tool to predict the cell-type specificity of cell-cell communication via these molecular interactions. Our data identify many regulatory interactions that prevent harmful innate or adaptive immune responses in this environment. Our single-cell atlas of the maternal-fetal interface reveals the cellular organization of the decidua and placenta, and the interactions that are critical for placentation and reproductive success.
PUBMED: 30429548 PMC: PMC7612850 DOI: 10.1038/s41586-018-0698-6
eLife 2017;6
The recent advent of methods for high-throughput single-cell molecular profiling has catalyzed a growing sense in the scientific community that the time is ripe to complete the 150-year-old effort to identify all cell types in the human body. The Human Cell Atlas Project is an international collaborative effort that aims to define all human cell types in terms of distinctive molecular profiles (such as gene expression profiles) and to connect this information with classical cellular descriptions (such as location and morphology). An open comprehensive reference map of the molecular state of cells in healthy human tissues would propel the systematic study of physiological states, developmental trajectories, regulatory circuitry and interactions of cells, and also provide a framework for understanding cellular dysregulation in human disease. Here we describe the idea, its potential utility, early proofs-of-concept, and some design considerations for the Human Cell Atlas, including a commitment to open data, code, and community.
Nature 2017;550;7677;451-453