ChemAxon 587f88acea
23-09-2004 14:14:47
When using the standardizer I figured out another problem by accident. When applying a second standardize on an already standardized smiles string there are still changes be made on double bond chirality. I first thought the reason might be a standardize.xml file which has certain reaction tags that supply as output their own input... but I was wrong as my reaction tags shouldn't affect chirality and even when leaving only the aromatize action tag in the config file I get this phenomenon on certain structures. Basically this does not affect the structure but produces two distinct differing strings and makes it hard to handle them in a DBMS.
Examples:
This problem occurs pretty often. With the datasets I am working about. 1.5% of the molcules are affected.
Any ideas to avoid ?
cheers
Friedemann
Examples:
Code: |
origin:: O=C\1C4=C(NC1=C\3NC2=CC=CC=C2C3=O)C=CC=C4 normalized once: O=C1\C(Nc2ccccc12)=C3/Nc4ccccc4C3=O normalized twice: O=C1C(\Nc2ccccc12)=C3/Nc4ccccc4C3=O smiles = origin:: C2(=CC=C(NC1=CC=CC=C1)C=C2)\C(C3=CC(=C(C=C3)N)C)=C/4C=CC(/C=C4)=N\C5=CC=CC=C5 normalized once: Cc1cc(ccc1N)C(\c2ccc(Nc3ccccc3)cc2)=C4/C=CC(\C=C4)=N/c5ccccc5 normalized twice: Cc1cc(ccc1N)\C(c2ccc(Nc3ccccc3)cc2)=C4/C=CC(\C=C4)=N/c5ccccc5 |
This problem occurs pretty often. With the datasets I am working about. 1.5% of the molcules are affected.
Any ideas to avoid ?
cheers
Friedemann