Canonicalisation algorithm changes and the JChem Cartridge

User 6f58eb8616

30-07-2009 10:37:55

Hi,


I have just got the new JChem Java API ( V 5.2.3_1) as there is a couple of fixes in there, previously I was running V 5.1.3_2.  I notice that the Canonicalisation algorithm has changed eg:


input SMILES: CN1CC[C@]23[C@H]4Oc5c3c(C[C@@H]1[C@@H]2C=C[C@@H]4O)ccc5O


Canonicalised with V 5.2.3_1: CN1CC[C@]23[C@H]4Oc5c(O)ccc(C[C@@H]1[C@@H]2C=C[C@@H]4O)c35                 


Canonicalised with V 5.1.3_2: CN1CC[C@@]23[C@H]4Oc5c2c(C[C@@H]1[C@@H]3C=C[C@@H]4O)ccc5O


 


Obviously all the canonical forms are equivalent but my question is do we need to upgrade the SMILES stored in our database  running the JChem Cartidge ?  As we use the Java API to create SQL queries is it possible that any of the Chemaxon functions (eg: jc_compare, jc_equals, jc_compare, jc_tanimoto etc) will not work in exactly the same way beacause we will be including SMILES strings in a different canonical form (generated by the new API) ?


 


Hope this makes sense.


 


Thanks in advance.

ChemAxon a3d59b832c

30-07-2009 11:30:20

Hi Derek,


This change of smiles output does not effect the cartridge operators and functions.


(At each release we carefully check what is changing, and we force regeneration if the changes have any effect.)


 


However, if you run direct string comparisons of the generated canonical smiles strings,
then yes, those have to be regenerated.


 


Best regards,


Szabolcs

User 6f58eb8616

31-07-2009 18:13:29

ok thats great, just thought I'd check.


 


All the best


 


Derek