Difference between revisions of "Input files"
Revision as of 09:35, 28 December 2020
File content determination via file name extensions
lmt automatically detects the format of input files from the filename extension. Supported extensions are:
- ".csv" for ordinary comma-separated values ASCII text files
- ".blkcsv" for comma-separated values ASCII text files in block format
- ".bin" for binary files in block format
".csv" files
".csv" files may contain several comment lines, but only at the top of the file. The comment character is "#".
The type of the file content is determined by its prospective use, that is
- the data file is supposed to contain only real/float numbers, which are converted to integers if required,
- a file containing an ordinary pedigree is supposed to contain only integer numbers,
- a file containing a missing value indicator matrix is supposed to contain only character strings.
Data file
lmt accepts only a single file containing the actual data. A data file in ".csv" format must follow these formatting rules:
- the file must have at least one comment line, and the last comment line must contain the column names separated by commas,
- the column names
  - must be alphanumeric only
  - must not be quoted
  - must be unique
- below the header, the data file must contain only numeric values, with a dot (".") as the decimal separator.
An example for a data file with three columns is shown below.
#y,mu,id
25.0,1,5
33.1,1,6
36.0,1,7
28.3,1,8
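The formatting rules for a ".csv" data file can be checked before handing the file to lmt. The following is a minimal Python sketch, not part of lmt: the function name read_data_file is hypothetical, and it additionally assumes that blank lines may be skipped and that every data row has as many fields as there are column names, neither of which is stated above.

```python
def read_data_file(lines):
    """Check the data-file rules; return (column_names, rows).

    Hypothetical helper for illustration only, not part of lmt.
    """
    # Assumption: blank lines are ignored.
    lines = [ln.strip() for ln in lines if ln.strip()]

    # Count the comment lines at the top of the file.
    n_comments = 0
    while n_comments < len(lines) and lines[n_comments].startswith("#"):
        n_comments += 1
    if n_comments == 0:
        raise ValueError("at least one comment line with the column names is required")

    # The last comment line holds the comma-separated column names.
    names = lines[n_comments - 1][1:].split(",")
    for name in names:
        # Quoted or empty names fail this test as well.
        if not (name.isascii() and name.isalnum()):
            raise ValueError(f"column name {name!r} is not alphanumeric")
    if len(set(names)) != len(names):
        raise ValueError("column names must be unique")

    rows = []
    for line in lines[n_comments:]:
        if line.startswith("#"):
            raise ValueError("comment lines are allowed at the top only")
        fields = line.split(",")
        # Assumption: each row has one value per column name.
        if len(fields) != len(names):
            raise ValueError(f"expected {len(names)} values, got {len(fields)}")
        # float() is more permissive than the rules (it accepts e.g. "1e3"),
        # but it rejects a decimal comma and non-numeric text.
        rows.append([float(f) for f in fields])
    return names, rows


example = ["#y,mu,id", "25.0,1,5", "33.1,1,6", "36.0,1,7", "28.3,1,8"]
names, rows = read_data_file(example)
```

For a file on disk, the same function can be called with an open file object, for example read_data_file(open("data.csv")), where "data.csv" is a placeholder name.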
Co-variance matrices must be supplied as full, square, symmetric matrices.
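As an illustration of this requirement, the sketch below checks that a matrix, already read into a list of rows, is full, square and symmetric. It is not part of lmt: the function name and the tol parameter are hypothetical, and how lmt itself reads or validates the matrix is not covered here.

```python
def is_full_square_symmetric(matrix, tol=0.0):
    """Return True if matrix (a list of rows) is non-empty, square and symmetric.

    Hypothetical helper for illustration only; tol is an assumed
    absolute tolerance for comparing mirrored elements.
    """
    n = len(matrix)
    if n == 0:
        return False
    # Full and square: every row must hold exactly n values.
    if any(len(row) != n for row in matrix):
        return False
    # Symmetric: element (i, j) must match element (j, i).
    return all(
        abs(matrix[i][j] - matrix[j][i]) <= tol
        for i in range(n)
        for j in range(i + 1, n)
    )
```

A matrix given only as its lower or upper triangle fails the row-length test, which matches the requirement that the full matrix be supplied.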