Since May 2025, I have been a postdoctoral researcher in bioinformatics at the Autorité de Sûreté Nucléaire et de Radioprotection (ASNR) , in collaboration with the CEA and EDF. My research focuses on studying the impact of low radiation doses on the development of thyroid cancer using multi-omics data.
From March 2023 to March 2025, I was a postdoctoral researcher at the BioSTM lab, Faculté de Pharmacie de Paris, Université Paris Cité under the supervision of Marie Verbanck . My research focuses on developing machine learning methods to explore pleiotropy in human genetic architecture and create a comprehensive map of pleiotropic effects. Before that, I have been a PhD Student in Computational Biology at CBIO team in MINES ParisTech and Institut Curie . I was working under the supervision of Chloé-Agathe Azencott.
About me
I defended my PhD on July 13th, 2022, and my thesis manuscript is online here . The main goal of my thesis is to develop a stable framework for biomarker discovery in multi-locus Genome-Wide Association Studies (GWAS) using Machine Learning, essentially feature selection models to deal with high-dimensional data. The aim is to find an association between the genotype and the phenotype, particularly in breast cancer. I addressed key challenges such as genetic population stratification, linkage disequilibrium patterns clustering, and the stability of the feature selection... In 2017, I obtained my engineering diploma in embedded systems at the National Engineering School of Sousse (ENISo), Tunisia. I later completed a Master's degree in Intelligent Communicating Systems in 2018 at the same school. I did an internship at the East Paris Institute of Chemistry and Materials Science (ICMPE) in Paris, where I applied Machine learning techniques to discover new chemical compounds for hydrogen storage. This project was supervised by Jean-Claude Crivello and Nataliya Sokolovoska .
Research Interests
My research lies at the intersection of machine learning, statistical genetics and computational biology. I am particularly interested in developing interpretable machine learning and statistical methods for the analysis of high-dimensional multi-omics data to better understand complex diseases.
- Machine Learning for Computational Biology
- Statistical Genetics and Genome-Wide Association Studies (GWAS)
- Multi-omics Data Integration
- Graphical Models and Network Inference
- Feature Selection in High-dimensional Data
- Pleiotropy and Human Genetic Architecture
Talks and Posters
- 30 June-1 July, 2026. Latent Differential Graphical model for Multi-Tissue and Multi-Omics integration to model molecular interaction networks under multiple Radiation Exposure groups. JOBIM 2026, Strasbourg (France) [Talk]
- January 30, 2025. Machine Learning and Statistical Methods for Identifying Causal Variants While Accounting for Linkage Disequilibrium and Population Stratification. AMIB(IO) seminar, Laboratoire Interdisciplinaire des Sciences du Numérique (LISN), Orsay. [Invited talk]
- March 17, 2022. Multi-task group Lasso for Genome-Wide Association Studies. The Junior Seminar in Artificial Intelligence and Digital Healthcare, Paris (France). [Invited talk]
- February 3, 2022. Multi-task group Lasso for Genome-Wide Association Studies in diverse populations. Institut Curie, Paris (France). [Talk]
- January 6, 2022. Multi-task group Lasso for Genome-Wide Association Studies in diverse populations. Pacific Symposium on Biocomputing (PSB 2022), Hawaii (USA). [Talk]
- October 25, 2021. Multi-task group Lasso for Genome-Wide Association Studies in diverse populations. NutriOmics seminar. [Invited talk]
- July 29, 2021. Multi-task group Lasso for admixed populations in Genome-Wide Association Studies. Intelligent Systems for Molecular Biology (ISMB 2021), MLCSB COSI. [Talk] [Poster]
- January 28, 2021. Stable Multi-task feature selection approach for Genome-Wide Association Studies. Institut Curie, Paris (France). [Talk]
- March 11, 2020. Multi-task group Lasso correcting for population stratification in Genome-Wide Association Studies. Institut Curie, Paris (France).[Talk]
- January 23, 2020. Multi-task group lasso for Genome-Wide Association Studies. Statistical Methods for Post Genomic Data (SMPGD 2020), Institut Pasteur, Paris (France). [Poster]
Publications
- Ory. C., Jouannet. C., Panunzi. L., Nouira. A. et al., Post-Chornobyl thyroid papillary carcinomas display distinct past 131I exposure and radiation-associated carcinogenesis molecular signatures at low and high thyroid doses. Scientific Reports 2026 [Full paper].
- Nouira. A., Tournaire. M., Favre Moiron. M., Verbanck M., Deriving LD-adjusted GWAS summary statistics through linkage disequilibrium deconvolution. preprint 2026. [Full paper].
- Nouira. A., Azencottt. C-A, Sparse multitask group Lasso for genome-wide association studies. PLOS Computational Biology 2025. [Full paper].
- Tournaire. M., Nouira. A., Favre Moiron. M., Rozenholc. Y., Verbanck M., Inferring genetic variant networks by leveraging pleiotropy shows trait relationships drive massive pleiotropy in GWAS. preprint 2024. [Full paper].
- Nouira. A., Azencottt. C-A, Multi-task group lasso for Genome-Wide Association Studies in diverse populations. Pacific Symposium on Biocomputing (PSB) 2022. [Full paper] [Supplementary Materials]
- Nouira. A , Sokolovska. N, Crivello. J-C, CrystalGAN: Learning to Discover Crystallographic Structures with Generative Adversarial Networks. AAAI Spring Symposium: Combining Machine Learning with Knowledge Engineering 2019.
GitHub Repositories
- Multi-DiffNet : Latent Differential Graphical model for Multi-Tissue and Multi-Omics integration.
- SMuGLasso : Sparse Multitask group lasso for Genome-Wide Association Studies in diverse populations.
- MuGLasso_GWAS : Multitask group lasso for Genome-Wide Association Studies in diverse populations.
- GWAS-admixed-population-simulator : Simulating case-control admixed-population GWAS data in PLINK format.
- CrystalGAN : Learning to Discover Crystallographic Structures with Generative Adversarial Networks.
- Crystal-tools : Useful tools for the preprocessing of CrystalGAN.
Teaching
- November, 2021: Teaching assistant. Introduction to Machine Learning: Support Vector Machines at MINES ParisTech.
- March, 2021 : Teaching assistant. Workshops of Large-Scale Machine Learning and Data Mining 2021 at MINES ParisTech.
- November, 2019: Teaching assistant. Introduction to Machine Learning: Support Vector Machines at MINES ParisTech.
- March, 2019 : Teaching assistant. Workshops of Large-Scale Machine Learning and Data Mining 2019 at MINES ParisTech.