Research
Publications
Publications in journals with peer-reviewing
J5. Definition and Exploration of Realistic Chemical Spaces Using the Connectivity and Cyclic Features of ChEMBL and ZINC
2023 Digital Discovery




J4. Scalable estimator of the diversity for de novo molecular generation resulting in a more robust QM dataset (OD9) and a more efficient molecular optimization
2021 Journal of Cheminformatics




J3. EvoMol: a flexible and interpretable evolutionary algorithm for unbiased de novo molecular generation
2020 Journal of Cheminformatics



J2. MixONat, a Software for the Dereplication of Mixtures Based on 13C NMR Spectroscopy
2020 Analytical Chemistry



J1. Dataset’s chemical diversity limits the generalizability of machine learning prediction
2019 Journal of Cheminformatics



Publications in conferences with peer-reviewing
C2. Surrogate‑Based Black‑Box Optimization Method for Costly Molecular Properties
2021 IEEE 33rd International Conference on Tools with Artificial Intelligence (ICTAI)


C1. Des réseaux de neurones pour prédire des distances interatomiques extraites d’une base de données ouverte de calculs en chimie quantique
2019 Extraction et Gestion des connaissances, EGC 2019, Metz, France (National-level conference)
🥇 Award of the best application article
🔗 URL : RNTI ↕️ Conference Ranks : Rank C ERA
Publications of book chapters with peer-reviewing
B2. Goal‑directed generation of new molecules by AI methods" (Chapter 2)
2022 Computational and Data‑Driven Chemistry Using Artificial Intelligence

B1. Predicting Interatomic Distances of Molecular Quantum Chemistry Calculations" (long version of [C1])
2022 Advances in Knowledge Discovery and Management: Volume 9. Studies in Computational Intelligence. Springer International Publishing

Publications in conference workshops with light peer-reviewing
W1. Génération d’explications contre‑factuelles pour la chimie moléculaire
2022 Workshop EXPLAIN’AI hosted at EGC 2022, Blois, France (National-level conference)

Posters
Po1. Surrogate-based black-box framework to optimize electronic properties for de novo organic molecular materials
2021 4th RSC-BMCS / RSC-CICAG Artificial Intelligence in Chemistry Symposium (Virtual international event)
Also presented at 2022 Symposium GDR Madics (National research event in France)
🔗 URL : poster
Preprints
P1. WebXAII: An Open-Source Web Framework to Study Human-XAI Interaction
2025 arXiv:2506.14777 [cs.HC]


PhD thesis
My Ph.D thesis manuscript entitled "Combinatorial search lead by machine learning for molecular chemistry" (2022) is available on

Contributions to research projects
XAI and Human-XAI interaction (postdoctoral work, current)
I am studying the interaction of human operators and XAI techniques in order to assess the impact of XAI on human-machine collaboration. There has been a lot of work in the domain in the last decade, but it is yet unclear how XAI techniques compare to each other, and only very few of them have demonstrated a beneficial effect at helping a human end-user solve a task. I am working on the evaluation of various techniques in a controlled setting, in experiments involving human participants. As a first step, I lead the developement of a web-interface which can embody experimental protocols and collect participants' answers for human-XAI studies [P1].
Optimization and machine learning for molecular discovery (PhD work, 2019-2022)
The purpose of this research project is to develop methods for the automatic generation of molecules that satisfy desired properties, with a focus on the chemistry of organic molecular materials. I proposed an evolutionary algorithm named EvoMol, which is designed to be interpretable and generic so that it can be used in various subdomains of chemistry [J3]. As evolutionary algorithms tend to generate unrealistic molecules, I also worked on a filter-based approach which favors the generation of realistic molecules [J5].
Most properties of interest in the field of organic molecular materials depend on costly quantum chemistry computations (DFT calculations). This motivates the use of machine learning algorithms as fast estimators of these properties. I worked on machine learning methods predicting the DFT-optimized geometry of molecules, which is closely related to the electronic properties of interest [C1, B1]. I worked with a postdoctoral researcher to measure the importance of chemical diversity in the training datasets of machine learning models. We demonstrated that a lack of chemical diversity in the training data (including in a state-of-the-art dataset) can signifantly impair model performance [J1]. We further proposed an efficient method based on EvoMol to maximize various measures of chemical diversity, which we used to obtain a large and diverse dataset of molecules [J4].
I also proposed to combine an optimization method with a machine learning model, in the form of a surrogate-based black-box optimization method. I showed that this approach is more efficient than an evolutionary search for the optimization of a costly electronic property [C2, Po1]. The use of ML models for molecular chemistry raises questions about their interpretability. I proposed an approach based on EvoMol to generate counterfactual explanations to any binary classification model of molecules [W1].
I also published a review of the state of the art of the field of de novo molecular generation [B2].
Dereplication in vegetal-based chemistry (2019)
I worked with a group of scientists in vegetal-based chemistry during my MSc studies. My role was to improve a pre-existing tool using NMR spectrum for the identification of compounds in a mixture. I performed a refactoring of the source code and I formalized the matching algorithm and improved its efficiency [J2].