Skip to content

In-tron/X-Cell

 
 

Repository files navigation

X-Cell

A diffusion language model for genome-scale perturbation prediction across diverse cellular contexts

License: CC BY-NC-SA 4.0 Model Dataset Paper Docs


Status: Model weights and inference code coming soon. The Python API, model weights, and tutorials are under active development. Star/watch this repository for release updates.

X-Cell Architecture

X-Cell predicts genome-scale transcriptional responses to genetic perturbations across diverse cellular contexts. Trained on X-Atlas/Pisces (25.6M perturbed single cells, 7 CRISPRi Perturb-seq screens), X-Cell integrates multi-modal biological priors through cross-attention and generalizes zero-shot to unseen cell types and perturbations.

Highlights

  • Diffusion LM with iterative inference-time refinement
  • Multi-modal priors via cross-attention (ESM-2, STRING, GenePT, DepMap, JUMP-Cell Painting, scGPT)
  • X-Cell Mini (55M) — a compact model for single-GPU perturbation prediction
  • 5× higher Pearson Δ than the next-best method on held-out perturbations
  • Zero-shot generalization to unseen cell types, confirmed on primary human T cells and melanocyte progenitors

Installation

pip install xcell

Quick Start

import anndata as ad
from xcell import XCell

# Load pretrained X-Cell Mini
model = XCell.from_pretrained("Xaira-Therapeutics/X-Cell", variant="mini")

# Predict from an AnnData object
adata = ad.read_h5ad("control_cells.h5ad")
predictions = model.predict(adata, perturbation="BRCA1")

# Or from one or more .h5ad paths
predictions = model.predict(
    ["screen1.h5ad", "screen2.h5ad"],
    perturbation="BRCA1",
)

See the documentation for full examples.

License

This project is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Model

Model Parameters HuggingFace
X-Cell Mini 55M Xaira-Therapeutics/X-Cell

Data

X-Atlas/Pisces is available at Xaira-Therapeutics/X-Atlas-Pisces.

Citation

@article{xcell2026,
  title   = {X-Cell: Scaling Causal Perturbation Prediction Across Diverse
             Cellular Contexts via Diffusion Language Models},
  year    = {2026},
}

See LICENSE for details.

About

X-Cell: a diffusion language model for genome-scale perturbation prediction

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 100.0%