The HCA Ingestion Service will provide the single point of entry for all HCA data. This includes raw data and metadata for projects, experiments, and samples submitted by investigators, as well as derived analyses and quality metrics automatically generated from running vetted secondary analysis pipelines.
Researchers will submit data through one of several data brokers that act as links between labs and the single Ingestion Service API. Brokers might include user-facing websites or other web services. Some may target specific geographical regions for upload efficiency, and some may provide domain- or lab-specific handling or formatting — e.g. data from image-based transcriptomics may require different handling than single cell RNA sequencing. Staging systems in cloud storage will also be developed to enable faster uploads. Upon submission, the Ingestion Service will perform basic quality assurance, and then deposit the data into the Data Store.