User dc11dd6805
15-02-2012 16:52:09
Hi
I am pretty new to the field of cheminformatics, I wondered if anyone has knowledge of how ambiguous structures of lipids can be/are represented and treated. To illustrate what I mean:
take the example of glycerophospholipids, which have a "head group" (such as
1,2-diacyl-sn-glycero-3-phosphocholine, see CHEBI:57643) and two tails (R1 + R2). Often biologists specify the general composition of the tails, but not the precise structure. For instance, they might know that the tails (R1 + R2 in the structure) together have 32 carbons, and no double bonds, and would refer to the entity like this: PC(32:0). However they might not be able to specify if the compound is PC(18:0/14:0) (two tails of 18 and 14 carbons) or PC(16:0/16:0) (two tails of 16 carbons) etc. The precise length of each tail may not be known.
I wondered if anyone out there knows of functionalities or methods which have been developed to treat and store such entities. For instance, a method that allows you to load the base structure of the head group (with Rs), and subsequently specify that together the R1 and R2 groups have a certain number of carbon atoms and double bonds, without specifying exactly what each of the R groups actually are. From that information, a formula and mass could be generated (but of course, no structure).
Thanks and Best Regards, Alan