define new aminoacids

User 0f28873a29

15-01-2009 13:35:20

hi people:





Well, I have a peptide sequence with a modification in one of the aminoacids (Serine) and I want to convert this peptides to a smile format. To do this, I need the definition of the Serine in Smarts format, and with this, define a new Serine with the modification. In the picture attached bellow I show the problem. Well, the questions is :





How can I defined the modification of the new aminoacid in marts fomat as a custom aminoacid for (SERINE (S),Trionine(T), Tyrosine (Y))?





Where are the file with the definitions of aminoacids in the Jchem Instalation?





Thanks in advance





Yasset.

ChemAxon d9100dae53

19-01-2009 12:00:35

Dear Yasset,





Peptide definitions can't be modified yet, so if you want to modifiy a peptide, you have to do it manually after import.





The easiest way to do it, to define a custom template in the mskecth, and then, you can change the default group to this by one click.

User 0f28873a29

19-01-2009 13:22:52

With chemaxon we should defined new aminoacids as:





  X(Hcy)    X(Hcy)      [SX2H1][CH2][CH2][C@HH1]([NX3])C=O    5     6





  X(1-foo)   X(1-foo)    [SX2H1][CH2][C@HH1]([NX3])C=O          4      5


X(b) X(b) [CH3][CH2][CH2][CH2][CH2][C@HH1]([NX3])C=O 7 8


(This example is from the documentation)











Well a need to defined my need aminoacids as a modification of the (SERINE (S),Trionine(T), Tyrosine (Y)). Then I need the standard SMARTS definition of these aminoacids to re-defined with a modification. My idea is to add a phophorilatyion modification to OH group.

ChemAxon d9100dae53

19-01-2009 15:08:37

Yes, you can define a new aminoacid, but can't modify the original one.





So, you can create a new peptide, with the given format. eg.:





X(Sph)    X(Sph)    [#7][C@@H](COP(O)([#8-])=O)[C]=O    1    9











To use it, you need to put it into the the custom_aminoacids.dict file. If you have no custom peptide definitions yet, this file doesn't exist, so you need to create it to your chemaxon home directory. (This is .chemaxon (UNIX) or chemaxon directory in user's home (MS Windows).)





I hope, could help





Suti

User 0f28873a29

19-01-2009 15:21:21

Thanks for all:





   I need the original definitions only for a guide to re-defined in a new file named custom_aminoacids.dict. I don't know the definition in SMARTs format the (SERINE (S),Trionine(T), Tyrosine (Y)) and for these reason I need the definition from chemaxon original file. I try with the option convert from peptide sequence to SMARTs but it don't work.





Thanks in advance.

ChemAxon d9100dae53

20-01-2009 08:46:41

The original dictionary is attached. (Format is the same, as the custom dictionary.)





How did you tried to convert the peptides to SMARTS?





Kind regards,


Suti

User 0f28873a29

20-01-2009 15:10:33

Hi Suti:





Finally I defined my custom_aminoacids.dict file in the chemaxon directory (attach below). But when I run the command





            molconvert smiles lolo.seq -o lolo.smi 





the program show me the error :





chemaxon.formats.MolFormatException: Could not find aminacid in peptide string H-S-K-K-K-K-S-X(SerP)-T-T-K-OH


        at chemaxon.marvin.io.formats.peptide.PeptideReader.convert(PeptideReader.java:127)


        at chemaxon.marvin.io.formats.peptide.PeptideImport.readMol(PeptideImport.java:102)


        at chemaxon.marvin.io.MRecordImporter.readDoc(MRecordImporter.java:170)


        at chemaxon.formats.MolImporter.readDoc(MolImporter.java:702)


        at chemaxon.formats.MolConverter.readDoc(MolConverter.java:792)


        at chemaxon.formats.MolConverter.convert0(MolConverter.java:706)


        at chemaxon.formats.MolConverter.convert(MolConverter.java:658)


        at chemaxon.formats.MolConverter.main(MolConverter.java:1517)


lolo.seq:1: error: SKKKKSX(SerP)TTK : error in molecule file





The file lolo.seq contain sequence SKKKKSX(SerP)TTK





Thanks in advance.

ChemAxon d9100dae53

20-01-2009 15:36:19

This is a problem with the converter, and will be fixed at next release.





The first letter inside X(...) must be uppercased, the other letters must be lowercased.


So, if you change X(SerP) to X(Serp), it will work.





Thanks for report.

User 0f28873a29

20-01-2009 16:32:38

Hi Suti:





   Thanks for all, finally the program convert sequences to smiles format. But the definition is wrong because   the new peptides are in wrong form.  





I think that the error is in the index of the:





  1. the number of the backbone N in the SMARTS string (the third atom


    for Ala in the first line of the example)



  2. the number of the backbone C next to the acyl oxygen (fourth atom


    for Ala in the first line of example).






My idea is obtain a SERINE (S),Trionine(T), Tyrosine (Y)) in a phosphorilatyon form. My definition is attached in the previous post as a custom_aminoacids.dict file.





You might check the file?





Thanks for all.

ChemAxon d9100dae53

20-01-2009 19:08:44

The SMARTS of your peptides must be correct, but the attachpoints you defined are not represents the same atom as at the original peptides.


If I am right, you want to keep the attachpoints. In this case, I think the correct atomnumbers are:





X(Serp)    X(Serp)    [NX3][C@@H]([CH2]OP(O)([#8-])=O)C=O    1       9


X(Thrp)    X(Thrp)    [CH3][C@@H](OP(O)([#8-])=O)[C@H]([N;X3])C=O    9       10


X(Tyrp)    X(Tyrp)    [NX3][C@@H]([CH2]C1=[CH1][CH1]=C(OP(O)([#8-])=O)[CH1]=[CH1]1)C=O    1       15





(The atoms contains to the attachpoints shown bold.)





Kind regards,





Suti

User 0f28873a29

20-01-2009 20:09:35

Thakns for all Suti..