Data from: Reference transcriptomics of porcine peripheral immune cells created through bulk and single-cell RNA sequencing

Metadata Updated: December 2, 2025

This dataset contains files reconstructing single-cell data presented in 'Reference transcriptomics of porcine peripheral immune cells created through bulk and single-cell RNA sequencing' by Herrera-Uribe & Wiarda et al. 2021. Samples of peripheral blood mononuclear cells (PBMCs) were collected from seven pigs and processed for single-cell RNA sequencing (scRNA-seq) in order to provide a reference annotation of porcine immune cell transcriptomics at enhanced, single-cell resolution. Analysis of single-cell data allowed identification of 36 cell clusters that were further classified into 13 cell types, including monocytes, dendritic cells, B cells, antibody-secreting cells, numerous populations of T cells, NK cells, and erythrocytes. Files may be used to reconstruct the data as presented in the manuscript, allowing for individual query by other users. Scripts for original data analysis are available at https://github.com/USDA-FSEPRU/PorcinePBMCs_bulkRNAseq_scRNAseq. Raw data are available at https://www.ebi.ac.uk/ena/browser/view/PRJEB43826.

Funding for this dataset was also provided by NRSP8: National Animal Genome Research Program (https://www.nimss.org/projects/view/mrp/outline/18464).


Resources in this dataset:


  • Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells 10X Format.

    File Name: PBMC7_AllCells.zip

    Resource Description: Zipped folder containing PBMC counts matrix, gene names, and cell IDs. Files are as follows:

    • matrix of gene counts* (matrix.mtx.gz)
    • gene names (features.tsv.gz)
    • cell IDs (barcodes.tsv.gz)

    *The ‘raw’ count matrix contains gene counts obtained after ambient RNA removal. Ambient RNA removal was run with non-integer count estimation enabled, so most gene counts in this matrix are non-integer values. They should still be treated as raw, unnormalized data that require further normalization/transformation.
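    As an illustration of the "further normalization/transformation" step, the sketch below applies per-cell log-normalization (scale each cell to a common total, then take log(1 + x)) to non-integer counts. This is one common choice and is not confirmed here as the procedure used in the manuscript; the function name, the scale factor of 10,000, and the example gene names and values are assumptions made for illustration.

```python
import math


def log_normalize(cell_counts, scale_factor=10_000):
    """Scale one cell's counts to a common total, then apply log(1 + x).

    cell_counts maps gene name -> raw (possibly non-integer) count.
    scale_factor is an assumed default, not a value taken from the dataset.
    """
    total = sum(cell_counts.values())
    if total == 0:
        # An empty cell stays all-zero instead of dividing by zero.
        return {gene: 0.0 for gene in cell_counts}
    return {
        gene: math.log1p(count / total * scale_factor)
        for gene, count in cell_counts.items()
    }


# Hypothetical cell with non-integer, ambient-corrected counts.
cell = {"GeneA": 2.5, "GeneB": 0.5, "GeneC": 7.0}
normalized = log_normalize(cell)
```

    Non-integer inputs need no special handling here, since the arithmetic is the same for integer and fractional counts.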

    Data can be read into R with the Seurat function Read10X(), pointed at the unzipped folder.
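    For readers working outside R, the three files can also be parsed directly. The following is a minimal sketch using only the Python standard library. It assumes the files follow the usual 10X layout (a Matrix Market coordinate file with genes as rows and cells as columns, 1-based indices) and that the feature name of interest sits in the first tab-separated column of features.tsv.gz. The function name read_10x and the gene_column parameter are hypothetical.

```python
import gzip
from pathlib import Path


def read_10x(folder, gene_column=0):
    """Return (genes, barcodes, counts) from a 10X-style folder.

    counts maps (gene_index, cell_index) -> float, using 0-based indices.
    gene_column selects which column of features.tsv.gz to keep (assumed 0).
    """
    folder = Path(folder)

    with gzip.open(folder / "features.tsv.gz", "rt") as fh:
        genes = [line.rstrip("\n").split("\t")[gene_column]
                 for line in fh if line.strip()]

    with gzip.open(folder / "barcodes.tsv.gz", "rt") as fh:
        barcodes = [line.strip() for line in fh if line.strip()]

    counts = {}
    dims = None
    with gzip.open(folder / "matrix.mtx.gz", "rt") as fh:
        for line in fh:
            if line.startswith("%") or not line.strip():
                continue  # Matrix Market header and comment lines
            if dims is None:
                # First data line: number of rows, columns, stored entries.
                dims = tuple(int(x) for x in line.split())
                continue
            row, col, value = line.split()
            # Values are parsed as floats because the counts are non-integer.
            counts[(int(row) - 1, int(col) - 1)] = float(value)

    if dims is None:
        raise ValueError("matrix.mtx.gz has no size line")
    if dims[0] != len(genes) or dims[1] != len(barcodes):
        raise ValueError("matrix dimensions do not match features/barcodes")
    return genes, barcodes, counts


# Example call, assuming the archive was unzipped to ./PBMC7_AllCells:
# genes, barcodes, counts = read_10x("PBMC7_AllCells")
```

    A dictionary of non-zero entries keeps the sketch dependency-free. For the full dataset, a sparse-matrix library would be a more practical container.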


  • Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells Metadata.

    File Name: PBMC7_AllCells_meta.csv

    Resource Description: .csv file containing metadata for cells included in the final dataset. Metadata columns include:

    • nCount_RNA = the number of transcripts detected in a cell
    • nFeature_RNA = the number of genes detected in a cell
    • Loupe = cell barcodes; correspond to the cell IDs found in the .h5Seurat and 10X formatted objects for all cells
    • prcntMito = percent mitochondrial reads in a cell
    • Scrublet = doublet probability score assigned to a cell
    • seurat_clusters = cluster ID assigned to a cell
    • PaperIDs = sample ID for a cell
    • celltypes = cell type ID assigned to a cell
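    As a quick check on the metadata columns listed above, cells can be tallied per cell type, cluster, or sample. The sketch below uses only the Python standard library and assumes the .csv file has a header row carrying the column names exactly as listed (celltypes, seurat_clusters, PaperIDs); the function name cells_per_group is hypothetical.

```python
import csv
from collections import Counter


def cells_per_group(meta_csv, column="celltypes"):
    """Count cells per value of one metadata column.

    column may be any header listed above, for example "celltypes",
    "seurat_clusters", or "PaperIDs".
    """
    with open(meta_csv, newline="") as fh:
        reader = csv.DictReader(fh)
        if column not in (reader.fieldnames or []):
            raise KeyError(f"column {column!r} not found in {meta_csv}")
        return Counter(row[column] for row in reader)


# Example call, assuming the file sits in the working directory:
# for cell_type, n in cells_per_group("PBMC7_AllCells_meta.csv").most_common():
#     print(cell_type, n)
```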

  • Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells PCA Coordinates.

    File Name: PBMC7_AllCells_PCAcoord.csv

    Resource Description: .csv file containing first 100 PCA coordinates for cells.


  • Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells t-SNE Coordinates.

    File Name: PBMC7_AllCells_tSNEcoord.csv

    Resource Description: .csv file containing t-SNE coordinates for all cells.


  • Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells UMAP Coordinates.

    File Name: PBMC7_AllCells_UMAPcoord.csv

    Resource Description: .csv file containing UMAP coordinates for all cells.


  • Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - CD4 T Cells t-SNE Coordinates.

    File Name: PBMC7_CD4only_tSNEcoord.csv

    Resource Description: .csv file containing t-SNE coordinates for only CD4 T cells (clusters 0, 3, 4, 28). A dataset of only CD4 T cells can be re-created from the PBMC7_AllCells.h5Seurat, and t-SNE coordinates used in publication can be re-assigned using this .csv file.


  • Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - CD4 T Cells UMAP Coordinates.

    File Name: PBMC7_CD4only_UMAPcoord.csv

    Resource Description: .csv file containing UMAP coordinates for only CD4 T cells (clusters 0, 3, 4, 28). A dataset of only CD4 T cells can be re-created from the PBMC7_AllCells.h5Seurat, and UMAP coordinates used in publication can be re-assigned using this .csv file.


  • Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - Gamma Delta T Cells UMAP Coordinates.

    File Name: PBMC7_GDonly_UMAPcoord.csv

    Resource Description: .csv file containing UMAP coordinates for only gamma delta T cells (clusters 6, 21, 24, 31). A dataset of only gamma delta T cells can be re-created from the PBMC7_AllCells.h5Seurat, and UMAP coordinates used in publication can be re-assigned using this .csv file.


  • Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - Gamma Delta T Cells t-SNE Coordinates.

    File Name: PBMC7_GDonly_tSNEcoord.csv

    Resource Description: .csv file containing t-SNE coordinates for only gamma delta T cells (clusters 6, 21, 24, 31). A dataset of only gamma delta T cells can be re-created from the PBMC7_AllCells.h5Seurat, and t-SNE coordinates used in publication can be re-assigned using this .csv file.
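    The subset-and-reassign idea described for the CD4 and gamma delta T cell resources can be sketched without R: select barcodes by cluster ID from the metadata file, then look up their published coordinates. The cluster IDs below come from the resource descriptions; everything else is an assumption, including the function names, the layout of the coordinate files (first column holds the cell barcode, remaining columns are numeric, one header row), and that seurat_clusters values are stored as plain integers.

```python
import csv

# Cluster IDs as listed in the resource descriptions.
CD4_CLUSTERS = {"0", "3", "4", "28"}
GD_CLUSTERS = {"6", "21", "24", "31"}


def barcodes_in_clusters(meta_csv, clusters):
    """Return the set of barcodes (Loupe column) whose cluster is in clusters."""
    with open(meta_csv, newline="") as fh:
        return {row["Loupe"] for row in csv.DictReader(fh)
                if row["seurat_clusters"].strip() in clusters}


def load_coordinates(coord_csv):
    """Map barcode -> list of floats; assumes barcode is the first column."""
    with open(coord_csv, newline="") as fh:
        reader = csv.reader(fh)
        next(reader)  # skip the header row (assumed present)
        return {row[0]: [float(v) for v in row[1:]] for row in reader if row}


def coordinates_for_clusters(meta_csv, coord_csv, clusters):
    """Return (matched, missing) for the cells in the given clusters.

    matched maps barcode -> coordinates; missing lists selected barcodes
    that have no entry in the coordinate file.
    """
    wanted = barcodes_in_clusters(meta_csv, clusters)
    coords = load_coordinates(coord_csv)
    matched = {bc: coords[bc] for bc in wanted if bc in coords}
    missing = sorted(wanted - set(coords))
    return matched, missing


# Example call, assuming both files sit in the working directory:
# cd4, missing = coordinates_for_clusters(
#     "PBMC7_AllCells_meta.csv", "PBMC7_CD4only_UMAPcoord.csv", CD4_CLUSTERS)
```

    A non-empty missing list would suggest that the barcode format differs between the two files, or that the layout assumed above does not hold.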


  • Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - Gene Annotation Information.

    File Name: UnfilteredGeneInfo.txt

    Resource Description: .txt file containing gene nomenclature information used to assign gene names in the dataset. 'Name' column corresponds to the name assigned to a feature in the dataset.


  • Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells H5Seurat.

    File Name: PBMC7.tar

    Resource Description: .h5Seurat object of all cells in the PBMC dataset. The file needs to be untarred, then read into R using the SeuratDisk function LoadH5Seurat().
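The untar step can be done with any standard tool. As a minimal sketch in Python (standard library only), the helper below unpacks an archive and reports any .h5Seurat files it contained. The function name untar_h5seurat and the destination folder are hypothetical, and the internal layout of PBMC7.tar is not described on this page.

```python
import tarfile
from pathlib import Path


def untar_h5seurat(archive, dest="."):
    """Extract a tar archive and return the member names ending in .h5Seurat."""
    dest = Path(dest)
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive) as tar:
        names = tar.getnames()
        # Use the safer "data" extraction filter where this Python supports it.
        kwargs = {"filter": "data"} if hasattr(tarfile, "data_filter") else {}
        tar.extractall(dest, **kwargs)
    return [name for name in names if name.lower().endswith(".h5seurat")]


# Example call, assuming the archive sits in the working directory:
# print(untar_h5seurat("PBMC7.tar", dest="PBMC7"))
```

The extracted .h5Seurat file is an HDF5-based container, so after untarring it can be loaded in R as described above or inspected with a third-party HDF5 reader.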

Access & Use Information

Public: This dataset is intended for public access and use. License: Creative Commons Attribution Share-Alike

Dates

Metadata Created Date March 30, 2024
Metadata Updated Date December 2, 2025

Metadata Source

Harvested from USDA JSON

Additional Metadata

Resource Type Dataset
Publisher Agricultural Research Service
Identifier 10.15482/USDA.ADC/1522411
Data Last Modified 2025-11-21
Public Access Level public
Bureau Code 005:18
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id abbbebbe-9777-4599-929d-3c0c6371a643
Harvest Source Id d3fafa34-0cb9-48f1-ab1d-5b5fdc783806
Harvest Source Title USDA JSON
License https://creativecommons.org/licenses/by-sa/4.0/
Program Code 005:040
Source Datajson Identifier True
Source Hash b59dc368a90211cbbfe7a52c6e98e675a32b4914d02a7c7231a76527926de40e
Source Schema Version 1.1
Spatial {"type": "Polygon", "coordinates": -93.7353515625, 41.937402745616, -93.7353515625, 42.116943355208, -93.482666015625, 42.116943355208, -93.482666015625, 41.937402745616, -93.7353515625, 41.937402745616}
Temporal 2018-01-01/2018-01-01
