Difference between revisions of "Input files"

From Linear Mixed Models Toolbox
Revision as of 09:35, 28 December 2020

File content determination via file name extensions

lmt automatically detects the format of input files from their filename extension. Supported extensions are:

  • ".csv" for ordinary comma-separated values ASCII text files
  • ".blkcsv" for comma-separated values ASCII text files in block format
  • ".bin" for binary files in block format
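
The extension rule above can be illustrated with a minimal Python sketch. This is not lmt's own code: the function name `detect_format` and the table `SUPPORTED_FORMATS` are hypothetical, and treating the extension as case-sensitive is an assumption.

```python
from pathlib import Path

# Mapping taken from the list above; the descriptions are paraphrased.
# Both names in this sketch are hypothetical, not part of lmt.
SUPPORTED_FORMATS = {
    ".csv": "ordinary comma-separated values ASCII text",
    ".blkcsv": "comma-separated values ASCII text in block format",
    ".bin": "binary in block format",
}


def detect_format(filename: str) -> str:
    """Return a description of the format implied by the filename extension."""
    suffix = Path(filename).suffix  # text from the last dot onwards, e.g. ".csv"
    if suffix not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported extension {suffix!r} in {filename!r}")
    return SUPPORTED_FORMATS[suffix]
```

For example, `detect_format("data.csv")` returns the description for ordinary CSV files, while `detect_format("data.txt")` raises `ValueError`.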

".csv" files

".csv" files may contain several commented lines at the top only where the comment character is "#".

The type of the file content is determined by its intended use, that is:

  • the data file is supposed to contain only real/float numbers, which are converted to integers if required,
  • a file containing an ordinary pedigree is supposed to contain only integer numbers,
  • a file containing a missing value indicator matrix is supposed to contain only character strings.
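
As a rough illustration of these two points (comments at the top only, content type chosen by intended use), the following Python sketch separates the leading "#" lines from the body and converts each field according to a role. The function `read_csv_rows` and the role labels "data", "pedigree" and "missing" are hypothetical. Skipping blank lines is an assumption, and the later float-to-integer conversion of data values is not modelled.

```python
import io


def read_csv_rows(stream, role):
    """Split leading '#' comment lines from the body and convert body fields.

    `role` is a hypothetical label for the intended use of the file.
    """
    converters = {"data": float, "pedigree": int, "missing": str}
    convert = converters[role]
    comments, rows = [], []
    in_header = True
    for line in stream:
        line = line.rstrip("\r\n")
        if not line:
            continue  # assumption: blank lines are ignored in this sketch
        if line.startswith("#"):
            if not in_header:
                raise ValueError("comment line found below the top of the file")
            comments.append(line[1:])
            continue
        in_header = False
        rows.append([convert(field) for field in line.split(",")])
    return comments, rows
```

Calling `read_csv_rows(io.StringIO("#sire,dam\n1,2\n"), "pedigree")` yields the comment text `"sire,dam"` and one row of integers.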

Data file

lmt accepts only a single file containing the actual data. A data file in ".csv" format must follow these formatting rules:

  • the file must have at least one commented line, and the last commented line must contain the column names separated by commas,
  • the column names
    • must be alpha-numeric only
    • must not be quoted
    • must be unique
  • below the header, the data file must contain only numeric values, with a dot (".") as the decimal separator.

An example for a data file with three columns is shown below.

#y,mu,id
25.0,1,5
33.1,1,6
36.0,1,7
28.3,1,8
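
The formatting rules for a data file can be checked before running lmt with a small script. The Python sketch below is one possible checker, not part of lmt. The function name `check_data_file` is hypothetical. Requiring every row to have as many fields as there are column names, and accepting exponent notation, are assumptions that go beyond the rules stated above.

```python
import re


def check_data_file(text):
    """Return (column_names, rows) or raise ValueError on a rule violation."""
    lines = [ln for ln in text.splitlines() if ln.strip()]

    # Count the commented lines at the top of the file.
    n_comments = 0
    while n_comments < len(lines) and lines[n_comments].startswith("#"):
        n_comments += 1
    if n_comments == 0:
        raise ValueError("data file needs at least one commented line")

    # The last commented line carries the column names.
    names = lines[n_comments - 1][1:].split(",")
    for name in names:
        # Quoted names fail here as well, since quotes are not alpha-numeric.
        if not re.fullmatch(r"[A-Za-z0-9]+", name):
            raise ValueError(f"column name {name!r} is not alpha-numeric")
    if len(set(names)) != len(names):
        raise ValueError("column names must be unique")

    # Numeric values with "." as decimal separator (exponent is an assumption).
    number = re.compile(r"[+-]?(\d+(\.\d*)?|\.\d+)([eE][+-]?\d+)?")
    rows = []
    for ln in lines[n_comments:]:
        fields = ln.split(",")
        if len(fields) != len(names):
            raise ValueError(f"expected {len(names)} fields in line {ln!r}")
        for field in fields:
            if not number.fullmatch(field):
                raise ValueError(f"non-numeric value {field!r} in line {ln!r}")
        rows.append([float(field) for field in fields])
    return names, rows
```

Applied to the example file above, the function returns the column names `y`, `mu` and `id` together with four rows of numbers.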


Covariance matrices must be supplied as full, square, symmetric matrices.
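
A matrix can be checked against this requirement before it is written to a file. The Python sketch below reads "full" as meaning that both triangles are stored, which is an interpretation rather than something stated on this page. The function name `is_full_square_symmetric` and the tolerance parameter `tol` are hypothetical.

```python
def is_full_square_symmetric(matrix, tol=0.0):
    """Check that a list-of-rows matrix is square and symmetric.

    `tol` is the largest absolute difference allowed between
    matrix[i][j] and matrix[j][i].
    """
    n = len(matrix)
    if n == 0 or any(len(row) != n for row in matrix):
        return False  # not square, so a triangle or some entries are missing
    return all(
        abs(matrix[i][j] - matrix[j][i]) <= tol
        for i in range(n)
        for j in range(i + 1, n)
    )
```

For instance, the rows `[2.0, 0.5]` and `[0.5, 1.0]` pass the check, whereas a lower-triangle layout such as `[2.0]` and `[0.5, 1.0]` does not.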