User 677b9c22ff
04-11-2008 11:14:08
Hi,
would it make sense to uniquify (crate canonical results) from a given Markush library generation if no stereochemistry is involved?
For the attached file (Naphthalene with all R1 and R1 = H,F,Cl,Br,I cxcalc would generate 390625 results but after
aromatization and creating unique smiles the number of true results would be 98125 molecules.
cxcalc markushenumerationcount -m true napthlaene-markush-2.mol
cxcalc enumerations -f smiles napthlaene-markush-2.mol > naphthalene-h-f-cl-br-i-markush.smi
molconvert smiles:+a naphthalene-h-f-cl-br-i-markush.smi -o naphthalene-h-f-cl-br-i-markush-aroma.smi
Remove duplicates (2 options)
A) TEXTPAD sort and delete duplicate SMILES (6 seconds!)
B) Import into Instant-JChem with option: Remove duplicates
(2,143s) on Core 2Duo 2GHz with slow HD.
Result with A) and B) = 98125 molecules.
Cheers
Tobias
Inspiration: Depth-First
would it make sense to uniquify (crate canonical results) from a given Markush library generation if no stereochemistry is involved?
For the attached file (Naphthalene with all R1 and R1 = H,F,Cl,Br,I cxcalc would generate 390625 results but after
aromatization and creating unique smiles the number of true results would be 98125 molecules.
cxcalc markushenumerationcount -m true napthlaene-markush-2.mol
cxcalc enumerations -f smiles napthlaene-markush-2.mol > naphthalene-h-f-cl-br-i-markush.smi
molconvert smiles:+a naphthalene-h-f-cl-br-i-markush.smi -o naphthalene-h-f-cl-br-i-markush-aroma.smi
Remove duplicates (2 options)
A) TEXTPAD sort and delete duplicate SMILES (6 seconds!)
B) Import into Instant-JChem with option: Remove duplicates
(2,143s) on Core 2Duo 2GHz with slow HD.
Result with A) and B) = 98125 molecules.
Cheers
Tobias
Inspiration: Depth-First