AstroPT Euclid: Imaging, Metadata, Models & Embeddings
Collection
A complete suite of Euclid Q1 galaxy data, metadata, pre-trained AstroPT models, and pre-computed embeddings for large-scale astronomical ML. • 9 items • Updated
Pre-trained AstroPT model for multi-band image analysis using Euclid VIS and NISP imaging.
This is a pre-trained checkpoint for the AstroPT framework, trained on multi-band imaging from the Euclid Q1 dataset. The model combines visible (VIS) and near-infrared (NISP: Y, J, H) bands for robust multi-wavelength astronomical image analysis.
Citation: Euclid Collaboration: Siudek, M et al. 2025 (arXiv:2503.15312)
import torch
from pathlib import Path
# Load model checkpoint
model_path = "astropt/090M/ckpt.pt"
device = "cuda" if torch.cuda.is_available() else "cpu"
# Load state dict
checkpoint = torch.load(model_path, map_location=device)
# Initialize your model architecture here
# model.load_state_dict(checkpoint)
from datasets import load_dataset
import torch
# Load dataset
dataset = load_dataset(
"msiudek/astroPT_euclid_dataset",
split="train_batch_1",
streaming=True
)
# Run inference
model.eval()
with torch.no_grad():
for sample in dataset:
# Stack 4 bands: VIS + Y + J + H
images = torch.stack([
torch.tensor(sample['VIS_image'], dtype=torch.float32),
torch.tensor(sample['NISP_Y_image'], dtype=torch.float32),
torch.tensor(sample['NISP_J_image'], dtype=torch.float32),
torch.tensor(sample['NISP_H_image'], dtype=torch.float32),
]) # [4, 224, 224]
images = images.unsqueeze(0) # [1, 4, 224, 224]
# Get embeddings
embeddings = model(images)
For inference code, training scripts, and tutorials, visit the AstroPT GitHub Repository.
@article{Siudek2025,
title={AstroPT: Astronomical Physics Transformers for Multi-modal Learning},
author={Siudek, M and others},
journal={Euclid Collaboration},
eprint={2503.15312},
archivePrefix={arXiv},
year={2025},
url={https://ui.adsabs.harvard.edu/abs/2025arXiv250315312E/abstract}
}
CC-BY-4.0
Last Updated: December 2025
Model Version: 1.0