Selected Publications

(2024). Post-clustering difference testing: valid inference and practical considerations. Computational Statistics & Data Analysis, 107916.

Preprint R package Article

(2023). Doubly-robust evaluation of high-dimensional surrogate markers. Biostatistics, 24(4):985-999.

Preprint Article R package

(2023). ChatGPT and beyond with artificial intelligence (AI) in health: Lessons to be learned. Joint Bone Spine, 90(5):105607.


(2023). Modéliser la COVID-19: de la population à l'individu. Interstices.


(2022). On the potential benefits of entropic regularization for smoothing Wasserstein estimators. arXiv 2210.06934.

Preprint PDF

(2021). ATLAS: An automated association test using probabilistically linked health records with application to genetic studies. JAMIA, 28(12):2582-2592.

Preprint R package

(2021). Bayesian Mixture Models for Cytometry Data Analysis. WIREs Comp. Stat. 13(4):e1535.

PDF Article

(2021). Automatic phenotyping of electronical health record: PheVis algorithm. Journal of Biomedical Informatics, 117:103746.

Preprint PDF Article R package

(2020). Realistic and Robust Reproducible Research for Biostatistics. Preprints 2020060002.

Preprint PDF

(2019). Vers une recherche reproductible : Faire évoluer ses pratiques. Bordeaux : Urfist de Bordeaux. ISBN : 979-10-97595-05-0.

PDF Source Document Bookdown

(2019). Sequential Dirichlet process mixture of skew t-distributions for model-based clustering of flow cytometry data. Ann. Appl. Stat., 13(1):638-660.

Preprint PDF R package Article

(2018). cytometree: a binary tree algorithm for automatic gating in cytometry analysis. Cytom. A, 93(11):1132-1140.

Preprint Article R package

(2015). Time-Course Gene Set Analysis for Longitudinal Gene Expression Data. PLoS Comput Biol, 11(6):e1004310.

PDF Code Article R package

Recent Publications

More Publications

(2023). Identification of early gene expression profiles associated with long-lasting antibody responses to the Ebola vaccine Ad26. ZEBOV/MVA-BN-Filo. Cell Reports, 42(9):113101.

PDF Article

(2023). Using population based Kalman estimator to model COVID-19 epidemics in France: estimating the burden of SARS-CoV-2 and the effects of NPI. International of Biostatistics, in press.

Preprint PDF

(2022). High–temporal resolution profiling reveals distinct immune trajectories following the first and second doses of COVID-19 mRNA vaccines. Science Advances, 8(45): eabp9961.

Preprint PDF Article

(2022). T-cell immunogenicity, gene expression profile and safety of four heterologous prime-boost combinations of HIV vaccine candidates in healthy volunteers - results of the randomized multi-arm phase I/II ANRS VRI01 trial. Journal of Immunology, 208 (12): 2663–2674.


Recent Posts

More Posts

I recently updated my set-up, and because I use a High-Performance cluster from my University (kudos to avakas) to run various simulations and analyses, I have MPI and Rmpi installed on my laptop in order to test my scripts before submitting them to the big cluster. So I installed openmpi from homebrew very easily: brew update brew install open-mpi But then I had extensive trouble installing the Rmpi package…


I just released a new package on CRAN. It’s called NPflow, it performs Dirichlet process mixture of multivariate normal, skew-normal or skew t-distributions modeling, you should check it out. I was a little worried because the check from Travis CI was returning a NOTE. And even though the NOTEs seem like mild problems, “you should strive to eliminate all NOTEs” before submitting to CRAN ! Preparing for an email exchange with a member of the R core team, I wrote the following in the submission comments:


After a bumpy road, along which I kept in mind Jeff Leak’s own worst (recent) experience, we finally got our article on Time-Course Gene Set Analysis for Longitudinal Gene Expression Data published in PLoS Computational Biology, a very nice journal ! I am really happy about it, don’t hesitate to check it out ! And there is the TcGSA R package that goes with it.


Useful tips