Training Domain
ISIC 2019 samples are split into train and validation partitions using deterministic seeding configured in YAML.
Cross-dataset skin lesion classification under dataset shift
Methods
LesionShiftAI uses a shared pipeline across model families so cross-model comparisons remain tied to the same preprocessing, splitting, and evaluation protocol.
Data Flow
ISIC 2019 samples are split into train and validation partitions using deterministic seeding configured in YAML.
HAM10000 is held out as an external test domain and never used for model selection.
Shared image transforms and DataLoader settings ensure consistent feature-space assumptions across all pipelines.
Models
ResNet50 backbone for single-model benchmarking with direct validation-to-external transfer measurement.
Five fold-specific ResNet50 CNN members are trained and merged via mean malignancy probability to test robustness under shift.
ViT-B16 initialized from pretrained weights with warmup and minimum-learning-rate control for stable fine-tuning.
ViT-L16 initialized from pretrained weights to test higher-capacity transfer under the same protocol.
Evaluation
validation - external test per metric.