searching a smile using PMapper

User a52f97d128

01-03-2006 16:52:22

Dear All,





I use PMapper to map this molecule (jchem3.0.13_pre1):


COc1cc2nnnc2cc1C(=O)NCC3CCCN3CC=C


using this smile :


[nX2H0V3]





The three nitrogens of the triazole are returned, but it seems to me that one of them has an implicit H. Does PMapper takes the implicit H into account ?


If not, is there I way for me to distinguish the nitrogens of the triazole ring ?





many thanks,





Vincent

User a52f97d128

01-03-2006 17:35:12

Re-dear All,





Looking deeper the pb it seems both jchem and marvin never see any implicit H on the nitrogen of any pyrrol ring.





eg : COc1cc2c(CCNC(C)=O)cnc2cc1Cl


When I ask marvin (marvinView 3.5.7) to display explicitly the hydrogens, no hydrogen appears on the aromaric N.





Thanks,





Vincent

ChemAxon efa1591b5a

02-03-2006 07:29:17

Dear Vincent,





you my find the discussion at this link http://www.chemaxon.com/forum/ftopic936.html useful. If not, get back to this forum.





Regards,


Miklos

ChemAxon 25dcd765a3

02-03-2006 10:21:22

Hi Vincent,





Just a short reply why the following SMILES has no implicit H on the aromatic N:


Code:
COc1cc2c(CCNC(C)=O)cnc2cc1Cl



The problem is the following:


The N in the pyrrol should be written as:


c1cc[nH]c1. (With implicit H, explicitly written.)


Let's take an example where the situation is not obvious (kekule form):


Code:
CC1=CNC=N1



If you aromatize the molecule, the implicit H should be written explicitly to avoid ambiguity:


Cc1c[nH]cn1 (if the implicit H is not written explicitly it is not possible to tell which aromatic N should get the implicit H).


So to generalise the situation we use the following rules is SMILES:


Aromatic N with no implicit H should be written as: n,


aromatic N with implicit H should be written as [nH].





I agree, that in some situation (like the one you've presented) it is possible to detect that an aromatic N has implicit H, but this would slow down the import definitely. We would like to avoid this.


Moreover I have also checked that Daylight gives an error if you would like to convert the molecule c1ccnc1 to canonical SMILES (http://www.daylight.com/daycgi_tutorials/cansmi.cgi):


Code:
ERROR (CANSMI): Can't interpret SMILES "c1ccnc1"

ChemAxon a3d59b832c

02-03-2006 10:30:58

Dear Vincent,





As Miklos pointed out, one reason for the loss of H may be that in molfiles the impicit hydrogens on aromatic nitrogens cannot be specified, and your smiles may come from such a source.





We have made a non-standard solution for this which is available since Marvin 4.0.2 and JChem 3.1.2. See details here:


http://www.chemaxon.com/marvin/doc/user/mol-csmol-doc.html#implicith





Best regards,


Szabolcs

ChemAxon fb166edcbd

02-03-2006 16:59:36

Here is a workaround that transforms the implicit H-s on pyrrole-type rings:


http://www.chemaxon.com/forum/ftopic148.html





Here is an application of this to your case:





Code:



standardize -c "[n:1]1nncc1>>[H:2][n:1]1nncc1" "COc1cc2nnnc2cc1C(=O)NCC3CCCN3CC=C" COc1cc2[nH]nnc2cc1C(=O)NCC3CCCN3CC=C








or with dearomatization:





Code:



standardize -c "[n:1]1nncc1>>[H:2][n:1]1nncc1..dearomatize" "COc1cc2nnnc2cc1C(=O)NCC3CCCN3CC=C"


COC1=C(C=C2N=NNC2=C1)C(=O)NCC3CCCN3CC=C








I have tested this with JChem3.1.5 but it may work with JChem3.0.x as well.