|  |   | 
| (110 intermediate revisions by 2 users not shown) | 
| Line 2: | Line 2: | 
|  | 
 |  | 
 | 
|  | The <b>L</b>inear mixed <b>M</b>odels <b>T</b>oolbox ({{lmt}}) is a stand-alone single executable software for for large scale linear mixed model analysis. |  | The <b>L</b>inear mixed <b>M</b>odels <b>T</b>oolbox ({{lmt}}) is a stand-alone single executable software for for large scale linear mixed model analysis. | 
|  | It is the successor of DMU, the well-known and
 |  | 
|  | widely used software package for linear mixed model analysis developed and maintained
 |  | 
|  | by Per Madsen and Just Jensen.
 |  | 
|  | 
 |  | 
 | 
|  | Since the early days of software development instatistics andquantitative genetics
 |  | {{lmt}} supports all models commonly used in genetic evaluation and has various options to handle genomic markers. | 
|  | time hasmoved on in terms of what programming languages are capable of and therefore
 |  | 
|  | DMU has been given a thorough overhaul.
 |  | 
|  | 
 |  | 
 | 
|  | One result of the overhaul is the new name, {{lmt}},resulting from the difficulty to translate
 |  | {{lmt}} has been used successfully for genetic evaluation data sets with >>200k genotyped animals, >>15m animals, >>500m equations. | 
|  | the acronym DMU into something which is generally meaningful throughout time. For
 |  | 
|  | those who prefer the acronym DMU,they may refer to {{lmt}} as <b>DMU-next</b>.
 |  | 
|  | 
 |  | 
 | 
|  | The second area of the overhaul is the parameter file interface. {{lmt}}now comes with
 |  | {{lmt}} is only available for 64 bit Linux operation systems, is run from the Linux command line, and uses an [https://www.w3schools.com/xml/ xml] style parameter file which is supposed to allow for an easy understanding by the user. Further using [https://www.w3schools.com/xml/ xml] comes with support for automated commenting, uncommenting, indentation, code-folding and syntax highlighting by almost every editor, | 
|  | an xml style parameter file which is supposed to allow for a much easier understanding |  | thus easing to follow the structure of the parameter file even if it spans several tens of lines of code. | 
|  | by the user. Further using xml comes with support for automated commenting, un- |  | 
|  | commenting, indentation, code-folding and syntax highlighting by almost every editor,
 |  | 
|  | thus easing to follow the structure of the parameter file even if it spans several tens of |  | 
|  | lines of code. |  | 
|  | 
 |  | 
 | 
|  | The third area ofthe overhaul is the program structure. DMU was structured into
 |  | == Conditions of use == | 
|  | several programs (<i>DMU1, DMU4, DMU5, DMUAI, RJMC</i>). In contrast, {{lmt}}is meant
 |  | {{lmt}} can be used by the scientific community free of charge, but users must credit {{lmt}} | 
|  | to provide thefunctionalities all those programs via a single parameter file and a single
 |  | in any publications. | 
|  | executable.
 |  | Commercial users must obtain the explicit approval of the author before using {{lmt}} and must credit {{lmt}} in any publication in scientific journals. | 
|  |   |  | If {{lmt}} cannot be credited via citation the author must become a co-author. | 
|  | While {{lmt}} is finally meant to be a full scale successor ofDMU, it does not yet provide
 |  | 
|  | all its functionalities in some areas, in others it already provides more. More specifi-
 |  | 
|  | cally, there no REML facilities available yet, butlarge scale linear mixed model solving
 |  | 
|  | provides Single-Step-T-BLUP facilities, uploading of genotypes and building of genomic
 |  | 
|  | relationship matrices on the fly etc etc.
 |  | 
|  |   |  | 
|  | ==Supported features ==
 |  | 
|  |   |  | 
|  | === Supported operations ===
 |  | 
|  |   |  | 
|  | Currently {{lmt}}support the following operations on linear mixed models:
 |  | 
|  |   |  | 
|  | *Solving for BLUP and BLUE solutions conditional on supplied variances for random and fixed factor, respectively;
 |  | 
|  | *Gibbs sampling of variance components insingle pass and blocked mode;
 |  | 
|  | *MC-EM-REML estimation of variance components
 |  | 
|  | *Sampling elements of theinverse of themixed model coefficient matrix
 |  | 
|  |   |  | 
|  | === Supported factors and variables ===
 |  | 
|  |   |  | 
|  | {{lmt}} supports |  | 
|  | *fixed
 |  | 
|  | *random factors
 |  | 
|  | *classification variables
 |  | 
|  | *continuous co-variables, which can be nested. For continuous co-variables {{lmt}}support user-defined polynomials and hard coded [https://en.wikipedia.org/wiki/Legendre_polynomials Legendre polynomials] up to order 6.
 |  | 
|  | *genetic group co-variables
 |  | 
|  |   |  | 
|  | All classification and co-variables can be associated to a fixed or random factor.
 |  | 
|  |   |  | 
|  | === Supported variance structures ===
 |  | 
|  |   |  | 
|  | For random factor {{lmt}}supports variance structures of
 |  | 
|  | *structure [https://en.wikipedia.org/wiki/Kronecker_product $$\Gamma\otimes\Sigma$$], where $$\Sigma$$ is an dense symmetric positive definite matrix, and
 |  | 
|  | *$$\Theta_L(\Gamma\otimes I_{\Sigma})\Theta_L^{'}$$, where $$\Theta$$ is symmetric positive definite [https://en.wikipedia.org/wiki/Block_matrix#Block_diagonal_matrices block-diagonal matrix] of $$n$$ symmetric positive definite martices $$\Sigma_i, i=1,..,n$$, $$\Theta_L$$ is thelower [https://en.wikipedia.org/wiki/Cholesky_decomposition Cholesky factor] of $$\Theta$$ and $$I_{\Sigma}$$ is an identity matrix of dimension $$\Sigma_i$$.
 |  | 
|  |   |  | 
|  | $$\Gamma$$ is
 |  | 
|  | *an [https://en.wikipedia.org/wiki/Identity_matrix identity matrix]
 |  | 
|  | *an arbitrary positive definite [https://en.wikipedia.org/wiki/Diagonal_matrix diagonal matrix]
 |  | 
|  | *apedigree-based numerator relationship matrix $$A$$
 |  | 
|  | *a pedigree- and genotype-based relationship matrix $$H$$
 |  | 
|  | *a user-defined(u.d.) symmetric, positive definite matrix of which inverse is supplied
 |  | 
|  | **as a sparse upper-triangular matrix stored in [https://en.wikipedia.org/wiki/Sparse_matrix#Compressed_sparse_row_(CSR,_CRS_or_Yale_format) csr format]
 |  | 
|  | **as a dense matrix
 |  | 
|  | *a co-variance matrix of a selected auto-regressive process
 |  | 
|  |   |  | 
|  |   |  | 
|  | === Supported linear mixed model solvers ===
 |  | 
|  | 
 |  | 
 | 
|  | {{lmt}} supports
 |  | == How to get it == | 
|  | 
 |  | 
 | 
|  | *a direct solver requiring to explicitly build the linear mixed model equations left-hand-side coefficient matrix($$C$$)
 |  | {{lmt}} can be obtained '''on request''' from the [mailto:vinzent.boerner@qgg.au.dk author]. | 
|  | *an iteration-on-data pre-conditioned gradient solver which '''does not''' require $$C$$
 |  | 
|  |   |  | 
|  | === Supported features related to genomic data ===
 |  | 
|  |   |  | 
|  | *direct use of genomic marker data
 |  | 
|  | *building of genomic relationship matrices($$G$$) from supplied genomic data
 |  | 
|  | *uploading of a u.d. $$G$$
 |  | 
|  | *adjustment of $$G$$ to $$A_{gg}$$
 |  | 
|  | *solving Single-Step-G-BLUP models
 |  | 
|  | *sampling variances for Single-Step-G-BLUP models
 |  | 
|  | *solving Single-Step-T-BLUP models
 |  | 
|  | *solving Single-Step-SNP-BLUP models
 |  | 
|  | *all Single-Step models can be run from "bottom-up", that is the user supplies the genotypes and all necessary ingredients(e.g. $$G$$) are built on the fly.
 |  | 
|  |   |  | 
|  | === Supported pedigree types===
 |  | 
|  |   |  | 
|  | *ordinary pedigrees
 |  | 
|  | *probabilistic pedigrees with an unlimited number of parent pairs per individual
 |  | 
|  | *genetic group pedigrees
 |  | 
|  | *meta-founders
 |  | 
|  |   |  | 
|  | == File formats for data input ==
 |  | 
|  |   |  | 
|  | {{lmt}} automatically detects the format of input files by the [https://en.wikipedia.org/wiki/Filename_extension filename extension]. Supported extensions are |  | 
|  | *".csv" for ordinary [https://en.wikipedia.org/wiki/Comma-separated_values comma separated values] ascii text files
 |  | 
|  | *".blkcsv" for comma separated value ascii text files in block format
 |  | 
|  | *".bin" for binary files in block format
 |  | 
|  |   |  | 
|  | ".csv" files may contain commented lines at the top only where the comment character is "#".
 |  | 
|  |   |  | 
|  | The type of the file content is determined by its prospective use, that is
 |  | 
|  | *the data file is supposed to contain only real/float numbers which are transferred to integer if required,
 |  | 
|  | *a file containing an ordinary pedigree is supposed to contain only integer numbers,
 |  | 
|  | *a file containing a missing value indicator matrix is supposed to contain only character strings.
 |  | 
|  |   |  | 
|  | {{lmt}} accepts only single file containing the actual data where that file '''must'''have a commented line containing thecolumn header. Co-variance matrices must be supplied as full squared symmetric matrices.
 |  | 
|  |   |  | 
|  | == Disclaimer ==
 |  | 
|  |   |  | 
|  | {{lmt}} is under ongoing development and many of its features have been tested only a few
 |  | 
|  | times on a limited number of models and data sets.Thus, the users uses {{lmt}} completely
 |  | 
|  | on his/her own risk. This also applies to any decisions made based on the results provided
 |  | 
|  | by {{lmt}}.
 |  | 
|  |   |  | 
|  | == Conditions of use ==
 |  | 
|  |   |  | 
|  | {{lmt}} can be used by the scientific community free of charge, but users must credit {{lmt}}
 |  | 
|  | in any publications.Commercial users must obtain the explicit approval of the author
 |  | 
|  | before using {{lmt}} and must credit {{lmt}} in any publication in scientific journals.
 |  | 
|  | 
 |  | 
 | 
|  | == Feedback and support == |  | == Feedback and support == | 
| Line 131: | Line 25: | 
|  | However, the author appreciates feedback about the program functionality, possible aborts (segmentation faults), usability of output and comprehensiveness of the manual. |  | However, the author appreciates feedback about the program functionality, possible aborts (segmentation faults), usability of output and comprehensiveness of the manual. | 
|  | 
 |  | 
 | 
|  |  | For feedback, wish list, questions and support contact [mailto:vinzent.boerner@qgg.au.dk vinzent.boerner@qgg.au.dk](infrequently checked) or [mailto:vinzent.boerner@gmx.de vinzent.boerner@gmx.de](frequently checked). | 
|  | 
 |  | 
 | 
|  |   |  | == [[supported features| Supported features]] == | 
|  | * [http://localhost/mediawiki/index.php/Run_It Run It]
 |  | == [[Algorithms|Algorithms]] == | 
|  | * [http://localhost/mediawiki/index.php/InputFileFormats Input file formats]
 |  | == [[Parameterfile1| Parameter file terminology]] == | 
|  | * [http://localhost/mediawiki/index.php/Parameterfile1 Parameter file terminologypart 1]
 |  | == [[linear mixed models in lmt| Linear mixed models in lmt]] == | 
|  | * [http://localhost/mediawiki/index.php/Jump_Start Jump Start]
 |  | == [[Genomic data in lmt| Genomic data lmt]] == | 
|  |  | == [[File formats|File formats]] == | 
|  |  | == [[Input files|Input files]] == | 
|  |  | == [[Output files|Output files]] == | 
|  |  | == [[Run_It|How to run it]] == | 
|  |  | == [[Examples|Examples]] == | 
|  |  | == [[Parameter file elements| Parameter file elements]] == |