User 779e37e0e6
12-11-2015 02:53:10
Hi,
I am running supersructure search, and in my debugging, I have realized that using smiles and standard inchi for earch operations sometimes return different results, especially when considering the stereochemistry.
For instance, I have the following SMARTS patterns for L- and D-alpha amino acids
L-: [$([H][C@@;X4]([#6;!$(C(=O)O)])([#7;A;X3,X4+;!$([N]~[!#6]);!$([N]*!@[#7,#8,#15,#16])])[#6;X3]([#8;A;X2H1,X1-])=[O;X1]),$([#6][C@@;X4]([#7;A;X3,X4+;!$([N]~[!#6]);!$([N]*!@[#7,#8,#15,#16])])([#6][!#1!#6])[#6;X3]([#8;A;X2H1,X1-])=[O;X1]),$([#6]-[#6][C@;X4]([#7;A;X3,X4+;!$([N]~[!#6]);!$([N]*!@[#7,#8,#15,#16])])([#6]-[#1,!#6])[#6;X3]([#8;A;X2H1,X1-])=[O;X1])]
D-: [$([H][C@;X4]([#6;!$(C(=O)O)])([#7;A;X3,X4+;!$([N]~[!#6]);!$([N]*!@[#7,#8,#15,#16])])[#6;X3]([#8;A;X2H1,X1-])=[O;X1]),$([#6][C@;X4]([#7;A;X3,X4+;!$([N]~[!#6]);!$([N]*!@[#7,#8,#15,#16])])([#6][!#1!#6])[#6;X3]([#8;A;X2H1,X1-])=[O;X1]),$([#6]-[#6][C@@;X4]([#7;A;X3,X4+;!$([N]~[!#6]);!$([N]*!@[#7,#8,#15,#16])])([#6]-[#1,!#6])[#6;X3]([#8;A;X2H1,X1-])=[O;X1])]
Standard Inchis are standard and generated using the function: MolExporter.exportToFormat(molecule, "inchi:SAbs"), where molecue is an instance of the class Molecule.
Here are the results for L-Proline and D-Glutamine:
[H][C@]1(CCCN1)C(O)=O (L-proline)
- Using smiles: l-alpha-amino-acid_2
-using inchi: d-alpha-amino-acid_2
[H][C@@](N)(CCC(O)=N)C(O)=O (D-glutamine)
- using smiles: d-alpha-amino-acid_2
- using smiles: l-alpha-amino-acid_2
Could you please help me out here? What could be the cause.
Regards,
MrYan