NEEMP:files: Difference between revisions
No edit summary |
|||
Line 5: | Line 5: | ||
=SDF file= | =SDF file= | ||
'''SDF''' file contains MOL records for each molecule separated by line consisting only of four dollar signs, i.e. $$$$. Each MOL record can be either in '''V2000''' or '''V3000''' version. The latter one is required for molecules with more than 999 atoms or bonds. For further reference on MOL format, see specification: [http://c4.cabrillo.edu/404/ctfile.pdf]. | '''SDF''' file contains MOL records for each molecule separated by line consisting only of four dollar signs, i.e. $$$$. Each MOL record can be either in '''V2000''' or '''V3000''' version. The latter one is required for molecules with more than 999 atoms or bonds. For further reference on MOL format, see specification: [http://c4.cabrillo.edu/404/ctfile.pdf CT file formats]. | ||
For convenience '''NEEMP''' has a support for reading '''SDF''' files which are compressed using ''gzip'' method. This might be useful when working with large databases of molecules. | For convenience '''NEEMP''' has a support for reading '''SDF''' files which are compressed using ''gzip'' method. This might be useful when working with large databases of molecules. |
Revision as of 10:53, 29 May 2016
NEEMP uses 3 types of input files. SDF files contain structural information about the molecules, i.e. positions of atoms and bonding information; CHG files contain previously computed ab-initio charges; and PAR files store the list of NEEMP's EEM parameters.
The following sections illustrate the details of each file type. Examples of these input files can also be found in the examples directory.
SDF file
SDF file contains MOL records for each molecule separated by line consisting only of four dollar signs, i.e. $$$$. Each MOL record can be either in V2000 or V3000 version. The latter one is required for molecules with more than 999 atoms or bonds. For further reference on MOL format, see specification: CT file formats.
For convenience NEEMP has a support for reading SDF files which are compressed using gzip method. This might be useful when working with large databases of molecules.