User 61e6d0ff7a
12-07-2014 16:58:10
I have a question regarding the different representations of the hashed fingerprints.
It was my understanding that using option -D in generatemd always results in all hash codes present in molecule, while using options -2 in a folded version of the fingerprint (specified with parameter -f ).
For the ECFP generatemd behaves like I would expect it to do: If using option -D, always the same list of hash codes is produced (independent of specified fngerprint length), while the bitstring representations are different.
But the situation is different for the CF descriptor: The total number of produced integer hash codes with option -D varies with the specified fingerprint length. But the output is the same if called multiple times with the same length parameter. Why does the fingerprint length influence the generated hash codes?