An aromatization problem, nested jc_molconvert problem

22-06-2006 04:46:20

With the conversion functions/operators (jc_molconvert, jc_standardize, jc_evaluate_x), you will lose no functionality when using functions instead of operators anyway.

22-06-2006 12:33:29

With the conversion functions/operators (jc_molconvert, jc_standardize, jc_evaluate_x), you will lose no functionality when using functions instead of operators anyway.

22-06-2006 17:19:22

For this statement, the optimizer will most probably select a plan whereby strtable will be accessed using full-table scan and jc_molweight will be called once for each row in the table.

Assume that strtable has a jc_idxtype index on the smiles structure column. One of the extra pieces of information which will be provided for the function-mode implementation of jc_molweight for each row processed is the rowid of the current row of strtable. During indexing with jc_idxtype, molweight values for the structures in the smiles column were precomputed and stored in the index table for strtable.smiles (with the rowids of strtable as the primary key of the indextable). Using the rowid for the current row, jc_molweight can retrieve and simply return the precomputed value in the index table -- instead of computing the molweight on-the-fly.

If strtable.smiles is not indexed, the rowid parameter passed to the callback interface by Oracle will be null and the molweight will be computed on-the-fly, using the value in the smiles colunm in the current row.

While the above SQL statement takes less than 3 seconds to execute on my machine (assuming that strtable contains 1k smiles), the statement

22-06-2006 20:49:51

In the past we were actually discussing this internally. There are several reasons:

1. (Conceptual reason:) It seems to be misleading if a function called aromatization removed some aromatic bonds, even if they were not correctly formulated.

2. (Practical reason:) It is more efficient to not call dearomatization inside aromatization.

3. (Flexibility of representation) If someone wants to represent a ring in aromatic form which otherwise is not considered aromatic by neither of our methods, then aromatization may ruin this information. For example, an antiaromatic transition state may be represented by aromatic notation for theoretical studies, although it is not a stable form.

We may introduce new aromatization options which would fix aromatization as you suggest. Do you think it would be useful for you?

Best regards,

Szabolcs