Reaction informatics and graph-based molecular transformations#
This direction develops graph-based and mathematically structured representations for chemical transformations, including reaction curation, rebalancing, graph-based rule extraction, partial ITS formulations, and benchmark-oriented datasets. The main scientific aim is to preserve chemical meaning while enabling scalable analysis and machine learning.
Key themes#
reaction curation and rebalancing
graph-based reaction rules and template extraction
chemically faithful reaction representations
benchmark construction and dataset quality
emerging work toward mechanistic transition graphs and electron-flow reasoning
Representative outputs include work on reaction rebalancing, SynTemp, SynKit, formal studies of partial ITS graphs, and the SynRXN benchmark. Together, these contributions aim to make reaction data more reliable, more interpretable, and more reusable for downstream modeling.
Why it matters#
Reaction data are only as useful as their representations and curation quality. Structured reaction modeling makes it easier to integrate heterogeneous data, train interpretable models, and create reusable knowledge bases for synthesis planning, mechanistic reasoning, and molecular design.