A collection of computational biology, deep learning, and population genetics work — each built with reproducibility and software engineering discipline in mind.
// 01
✓ Model trained — 100% validation accuracyMulti-branch CNN with attention mechanisms trained on 3,000 forward-time SLiM simulations to distinguish local adaptation from genetic drift. Achieves 100% validation accuracy in ~27 epochs — directly targeting the scenarios where classical QST–FST comparisons structurally fail.
// 02
CompleteProduction-grade Convolutional Variational Autoencoder for unsupervised cell phenotype classification. Learns a compressed latent space from cell images without any labels, then projects it with t-SNE to reveal biological clusters. Built with CI/CD, Poetry, and Pytest from day one — software engineering discipline beyond academic standards.
// 03
✓ Thesis validated — UNIL 2025Large-scale Python/R simulation framework using SLiM 5.0 to stress-test the QST–FST evolutionary test across 1–1,000 QTLs, four trait architectures, and two population structures. 500 replicates per condition on SLURM HPC. Key finding: the test breaks down under stepping-stone structure regardless of polygenicity.