Very large files (GByte, millions of molecules) with mview

11-11-2008 07:32:16

Unfortunately, if you move the file pointer to 50% of the file size, then you have about 99.9% chance that what you find there is NOT the beginning of a molecule record. Molecule importer modules fail to read anything if you try to start reading at the middle of a record. Some tricky solutions may be found in case of single-line formats (like SMILES) or SDF, to look for the nearest end of line or for "$$$$", but there is a huge number of other formats.

cheers,

Peter

11-11-2008 08:29:26

"Pre-reading" is more than 3 times faster in 5.1.3 than in 5.1.2. An additional improvement was also implemented, but currently only in the development branch (not sure yet whether it will appear in 5.1.4): "pre-reading" starts automatically and immediately when you open the file.

Peter