AllenInstitute/BG-macaque-dilatedCNN

Dilated CNN trained on macaque (Macaca mulatta) basal ganglia chromatin accessibility data across 58 cell type tracks. Part of the DNA sequence modeling resources associated with the basal ganglia cell atlas package. Published in Johansen, Fu et al., 2025.

Model Details

Field Value
Architecture Dilated CNN
Species Macaca mulatta (macaque)
Brain region Basal ganglia
Number of tracks 58
Framework PyTorch Lightning
Checkpoint format .ckpt

Background

This model learns cis-regulatory sequence logic underlying basal ganglia cell type specialization from snATAC-seq-derived accessibility profiles. It can be used to score candidate enhancer sequences, run in silico mutagenesis, or guide sequence design for specific cell types.

For broader context on the associated multiomic studies and the consensus cell type taxonomy used to define tracks, see the BG Cell Atlas sequence modeling repository.

Availability

This repository is currently private. The enhancer-designer codebase is an internal Allen Institute tool with a public release planned in the near future.

To request early access, please contact Kasia Kedzierska at kasia.kedzierska@alleninstitute.org.

Usage

from enhancerdesigner.models import load_model

model = load_model("AllenInstitute/BG-macaque-dilatedCNN")

Target Tracks

The full list of 58 tracks is accessible programmatically after loading:

model = load_model("AllenInstitute/BG-macaque-dilatedCNN")
print(model.target_names)

Citation

@article{Johansen2025,
  title   = {Cross-species consensus atlas of the primate basal ganglia},
  author  = {Johansen, Nelson and Fu, Yuanyuan and others},
  journal = {bioRxiv},
  year    = {2025},
  doi     = {10.64898/2025.12.15.694496}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support