Difference between revisions of "Input files"

From Linear Mixed Models Toolbox
Jump to navigation Jump to search
Line 14: Line 14:
*a file containing a missing value indicator matrix is supposed to contain only character strings.
*a file containing a missing value indicator matrix is supposed to contain only character strings.


=== Data file ===
=== data file ===


{{lmt}} accepts only a single file containing the actual data. A data file in ".csv" format must follow the following formatting rules:
{{lmt}} accepts only a single file containing the actual data. A data file in ".csv" format must follow the following formatting rules:
Line 31: Line 31:
  28.3,1,8
  28.3,1,8


=== co-variance matrix files ===


 
Co-variance matrix files contain full squared symmetric matrices, for instance
Co-variance matrices must be supplied as full squared symmetric matrices.
1.5,0.8
0.8,2.1

Revision as of 09:37, 28 December 2020

File content determination via file name extensions

lmt automatically detects the format of input files by the filename extension. Supported extensions are

  • ".csv" for ordinary comma separated values ascii text files
  • ".blkcsv" for comma separated value ascii text files in block format
  • ".bin" for binary files in block format

".csv" files

".csv" files may contain several commented lines at the top only where the comment character is "#".

The type of the file content is determined by its prospective use, that is

  • the data file is supposed to contain only real/float numbers which are transferred to integer if required,
  • a file containing an ordinary pedigree is supposed to contain only integer numbers,
  • a file containing a missing value indicator matrix is supposed to contain only character strings.

data file

lmt accepts only a single file containing the actual data. A data file in ".csv" format must follow the following formatting rules:

  • file must have at least one commented line where the last commented line must containing the column names separated by comma,
  • the column names
    • must be alpha-numeric only
    • must not be quoted
    • must be unique
  • below the header the data file must contain only numeric values where the decimal separator is a dot(".").

An example for a data file with three columns is shown below.

#y,mu,id
25.0,1,5
33.1,1,6
36.0,1,7
28.3,1,8

co-variance matrix files

Co-variance matrix files contain full squared symmetric matrices, for instance

1.5,0.8
0.8,2.1