Published May 3, 2025 | Version v2
Model Open

RSGPT for synthetic data

Creators

Description

We proposed the Retro Synthesis Generative Pre-Trained Transformer (RSGPT), a retrosynthesis planning model that leverages the LLaMA2 architecture. This record provides frags_dic.pkl, which is a part of the {Reactants in the template: [Matched molecular fragments]} mapping that we use for reference in the data generation stage, so that you can use your own templates and molecular fragment libraries to generate data efficiently. Our submolecules from the PubChem, ChEMBL, and Enamine datasets are provided in the .txt files.
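As a minimal sketch of how these files might be inspected, the Python below loads a pickled mapping and streams a fragment file. It rests on assumptions the description does not confirm: that frags_dic.pkl is a standard Python pickle holding a dict whose keys are template reactants and whose values are lists of matched fragments, and that each .txt file stores one fragment per line. The function names load_fragment_dict, summarize, and iter_fragments are hypothetical helpers, not part of RSGPT.

```python
import pickle


def load_fragment_dict(path):
    """Load a pickled {template reactant: [matched fragments]} mapping.

    Assumption: the file is a plain pickle containing a dict.
    Unpickling can execute arbitrary code, so only load files you trust.
    """
    with open(path, "rb") as fh:
        data = pickle.load(fh)
    if not isinstance(data, dict):
        raise TypeError(f"expected a dict, got {type(data).__name__}")
    return data


def summarize(frags):
    """Return (number of keys, total number of matched fragments)."""
    return len(frags), sum(len(values) for values in frags.values())


def iter_fragments(path, limit=None):
    """Yield non-empty lines from a fragment file without loading it whole.

    Assumption: one fragment per line. Streaming matters because the
    largest file in this record is several gigabytes.
    """
    count = 0
    with open(path, "r", encoding="utf-8") as fh:
        for line in fh:
            fragment = line.strip()
            if not fragment:
                continue
            if limit is not None and count >= limit:
                break
            yield fragment
            count += 1
```

A call such as `summarize(load_fragment_dict("frags_dic.pkl"))` would report how many template reactants and matched fragments the file holds, and `list(iter_fragments("chembl_frg.txt", limit=5))` would show the first few entries, which is a quick way to check the assumed layout before building on it.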

Files

chembl_frg.txt

Files (3.7 GB)

Checksum Size
md5:9957fa6b976cd4ae385c53016c633366 47.2 MB
md5:ae4f1a7e733b72b00831f8bf2774cfcd 20.8 MB
md5:817fc887221c327b6db062f9a85f1e4a 111.6 MB
md5:aa571973767c4dd4f5fe361f1db25e13 3.5 GB
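
The md5 values in the listing can be used to check that a download completed intact. The sketch below uses Python's standard hashlib module and reads the file in chunks so that the 3.5 GB file does not need to fit in memory. The listing does not show which checksum belongs to which file, so the pairing is left to the caller. The function names md5_of_file and matches_listing are hypothetical.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a file against a listed value such as 'md5:9957fa6b...'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

For example, `matches_listing("chembl_frg.txt", "md5:...")` with the appropriate value from the listing returns True only when the local copy is byte-identical to the published file.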

Additional details

Dates

Created
2025-04-29