Modules ===== .. _modules: DMU ------------ .. code-block:: DMU Reading a driver file (to be described later) containing description of which type of analysis to perform, input data, model, variance structures, prior variances and covariances and a number of optional parameters (see description of the driver file). All effect ID’s in data and additional input files are recoded and checked for consistency across the different input data files. Basen on the ITASK specified on the $ANALYSIS line, The modules DMU4, DMU5, DMUAI or RJMC are executed. DMU4 ------------ .. code-block:: DMU4 This module can be used to predict future outcomes of random effects (e.g. breeding values) and to estimate fixed effects. The multiple trait mixed model equations are set up and solved using techniques, that requires that all non-zeros of the whole system is stored in memory of the computer. The multiple trait mixed model equations can be solved by seven different iterative methods using various forms of adaptive relaxation techniques from the subroutine package ITPACK (Kicaid et al., 1982), or by direct methods based on FSPAK (Perez-Enciso et al., 1994) or LAPACK subroutines. The optimum solver depends on the model used, the amount of data, and the data structure. For sparse system, FSPAK and ITPACK solvers are the most efficient. The ITPACK solver requires less memory then FSPAK solvers. Among the FSPAK solvers, method 8 has the smallest and method 9 the largest memory requirement. Time requirements for the FSPAK solvers are less for method 9, followed by 10 and 8. If a direct method cannot be used due to memory requirement,ITPAK method 1 (JCG) is a good choice. For dens system as occurs in SNP- and G-BLUP models, the dens solver is the most efficient in terms of computer time. If run on a SMP (multi core/CPU) computer, the solving step is parallelized over all available CPU’s/cores, or on the number of CPU’s/cores specified by setting the environment variable MKL_NUM_THREADS=n, where n is the number of CPU’s/cores to use. If solutions are obtained by FSPAK or for dense systems by LAPACK, standard error of estimate for fixed effects and standard error of prediction for random effects in the model are also computed. Irrespectively of the solver used, standard error of estimate/prediction and correlation among estimates/predictions of selected fixed or random effects in the model(s) can be obtained. Such information can be used to test various hypotheses. DMU5 ------------ .. code-block:: DMU5 This module can be used to solve the multiple trait mixed model equations based on iteration on data. Since the explicit construction of the system of equations is avoided, it is possible to solve much larger systems by this module than by the DMU4. The DMU5 module solves the model based on processing/reading i the data file in each round of iteration. The iterative solver is based on the Preconditioned Conjugate Gradient method. DMUAI ------------ .. code-block:: DMUAI This module can be used for estimation of (co)variance components using Average Information REstricted Maximum Likelihood (AI-REML) (Jensen et al. 1997). The algorithm is based on the use of Average Information (AI) as second differentials of the likelihood function. The AI is obtained by averaging the information matrices based on observed and expected information. The module can also use Expectation Maximization (EM) to maximize the restricted likelihood function. It is only the part of Likelihood that depends on random effect that is maximized. The full Likelihood also contains a factor (constant) that depends on data and fixed effects. Therefore, the reported “-2LogL + constant” value can only be used to compare models using the exact same data and fixed effects. Asymptotic standard errors of estimated (co)variance components are obtained from the Average Information matrix. If parameters in the form of interclass correlations and correlations between random effects (e.g. genetic correlations) the standard errors of these parameter estimates are computed based on a Taylor series approximation. AI-REML can yield updates of the parameter vector outside the parameter space. To overcome this problem different methods are implemented in DMUAI. The methods are: AI, but combining AI and EM if an update goes outside the parameter space. EM based on a algorithm by Robin Thompson EM based on an algorithm by Esa Mäntysaari AI, but with step halving if an update goes outside the parameter space. DMUAI can use sparse computation based on FSPAK subroutines or dens computation based on LAPACK subroutines. As for DMU4, the dens computation can use parallel computations on SMP (multi core/CPU) computer. The default is to parallelize over all over all available CPU’s/cores, but can be limited by setting the environment variable MKL_NUM_THREADS=n, where n is the number of CPU’s/cores to use. RJMC ------------ .. code-block:: RJMC Module for single and multiple trait Markov Chain Monte Carlo (MCMC) based estimation of location and dispersion parameters. The implementation is based on iteration on data techniques and can handle Gaussian, binary, ordered categorical, 2-component mixture, zero inflated sequential binary traits and reaction norm models with unknown environmental gradient. Sampling of genetic dispersion parameters can be restricted so only information on parental animals is used. This approach is especially useful for cross-sectional binary traits (Ødegård et al. (2010)).