File: README.md

Source code of EFGs, a complete and accurate implementation of Ertl's functional group (FG) detection algorithm.

For an RDKit molecule, it provides four outputs:
a) a PNG binary string with an image of the molecule with color-highlighted
functional groups;
b) a list of sets of atom indices (idx), each set corresponding to a
functional group;
c) a list of canonicalized pseudo-SMILES strings for the full functional
groups;
d) a list of labeled RDKit mol objects, one for each full functional group.
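
As an illustration of how output (b) might be consumed, the sketch below builds a lookup from each atom index to the functional group(s) that contain it. It assumes output (b) is a plain Python list of sets of integers; the helper name `atom_to_groups` and the example indices are hypothetical and are not part of efgs.py.

```python
from typing import Dict, List, Set


def atom_to_groups(fg_atom_sets: List[Set[int]]) -> Dict[int, List[int]]:
    """Map each atom index to the positions of the groups that contain it.

    Hypothetical helper: assumes the input has the shape of output (b),
    i.e. one set of atom indices per detected functional group.
    """
    lookup: Dict[int, List[int]] = {}
    for group_pos, atom_indices in enumerate(fg_atom_sets):
        # Sort so the insertion order of the lookup is deterministic
        for atom_idx in sorted(atom_indices):
            lookup.setdefault(atom_idx, []).append(group_pos)
    return lookup


# Made-up atom index sets for illustration only (not real EFGs output)
example_fg_atom_sets = [{1, 2}, {4, 5}]
example_lookup = atom_to_groups(example_fg_atom_sets)
```

An atom that belongs to no functional group is simply absent from the lookup, so `example_lookup.get(0, [])` yields an empty list for it.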

efgs.py: code of the get_dec_fgs function and its auxiliary functions

try_efgs.py: example code that runs get_dec_fgs in Python on a pandas DataFrame

ch33query.sql: SQL query used to extract ChEMBL33 bioactive compounds

See the ChemRxiv preprint for full details on the implementation:
EFGs: A Complete and Accurate Implementation of Ertl’s Functional Group Detection Algorithm in RDKit
https://chemrxiv.org/engage/chemrxiv/article-details/67543afe7be152b1d03a820c