Multi-Modal Chemical Data to Enable AlphaChem
An open dataset that links multiple modes of chemical characterization, integrating existing databases that are currently siloed or behind paywalls.
The dataset should include structure, synthetic route, NMR/ IR spectra, and bioactivity to enable truly holistic chemical AI that can predict synthesis routes and spectra based on structure.
Resources (1)
PubChem
Initiative