Ransomware Detection

GIN · internal_only

Leave-One-Family-OutMar 5, 2026

32f2e82e8a584eb5b31fdfb7a4b01fc2

Description

Full leave-one-family-out (LOFO) evaluation of GIN on internal_only. Trains 6 separate models, each time holding out one ransomware family entirely, to systematically measure whether the baseline can detect ransomware families not seen during training.

Conclusion

Main result of the baseline study. Mean malware recall across held-out families is just 18.1%, with wipelocker, blackroselucy, and filecoder at 0% recall. Confirms that the GIN baseline memorises family-specific structural patterns and does not generalise to unseen families. Family-aware evaluation is the honest benchmark for this thesis.

Mean Test Metrics (across holdouts)

Accuracy

70.7%

F1 Macro

48.6%

F1 Malware

17.2%

Precision

17.4%

Recall

18.1%

AUROC

85.1%

Best Val Loss

0.1119

Training Time

6849.8000s

Summed Confusion Matrix (all holdouts)

Pred BenignPred Malware
Actual Benign39852
Actual Malware16350

Configuration

Hidden Dim128
Num Layers3
Dropout0.5
Batch Size4
Learning Rate0.001
Weight Decay0.0001
Max Epochs200
ES Patience20
ES Min Epochs100
LR Patience10
LR Factor0.5
Mixed PrecisionYes
Random Seed42
Epochs Trained111