Driverfile

The interface to DMU is based on a driver file in ASCII format. The information is organized in sections
defined by keywords. Keywords marked with * are mandatory.

$COMMENT

Specify that the following lines are comments to be put on page 1 of the listing file.

$COMMENT
Followed by up to 10 lines with up to 80 characters.

$ANALYSES

Specify type of analyses and method to use.

$ANALYSE task method scaling test_prt

where: task =  1 ->  REML  estimation if (co)variances components using DMUAI.
                     2 -> RJMC.
                    11 -> BLUE AND BLUP using DMU4.
                    12 -> BLUE AND BLUP using DMU5.

       method = method to use.

          For task = 1 (REML) method can be:

            Sparse computation
               1:  AI, but combining AI and EM if an update goes outside the parameter space (the default).
               2:  EM based on an algorithm by Robin Thompson.
               3:  EM based on an algorithm by Esa Mäntysaari.
               4:  AI, but with step halving if an update goes outside the parameter space.

            Dense computation
              31:  AI, but combining AI and EM if an update goes outside the parameter space.
              32:  EM based on an algorithm by Robin Thompson.
              33:  EM based on an algorithm by Esa Mäntysaari.
              34:  AI, but with step halving if an update goes outside the parameter space.

          For task = 2 (RJMCL) method must be 0.

            Additional parameters for RJMC must be specified in the section $RJMC.

          For task = 11 (BLUP) method can be:

            Sparse computation
               1:  Jacobi Conjugate Gradient (JCG).
               2:  Jacobi Semi-Iteration (JSI).
               3:  Successive Overrelaxation (SOR).
               4:  Symmetric SOR Conjugate Gradient (SSORCG).
               5:  Symmetric SOR Semi-Iteration (SSORSI).
               6:  Reduced System Conjugate Gradient (RSCG).
               7:  Reduced System Semi-Iteration (RSSI).
               8:  FSPAK – Prediction error calculated one by one.
               9:  FSPAK – Prediction error calculated from a sparse inverse of the MME speed optimized.
              10:  FSPAK – Prediction error calculated from a sparse.  inverse of the MME memory optimized.
              21:  PARDISO – Parallel solver on multi CPU and/or multi Core computers.
                   Prediction error is not computed.

            Dense computation
              30:  MME build as dens matrix and solve using LAPACK subroutines.
                   This require that MME have full rank.

          For task = 12 method can be:

               2:  Preconditioned Conjugate Gradient.

       scaling:  = 0:  No scaling of data prior to computation.
                 = 1:  Data are scaled by the specified residual variance ($PRIOR) before computations.
                        This can help convergency in multitrait models, where the traits are on
                        different scale.

                        Estimated parameters and effects are scaled back to the original units.

       test_prt  =  0:  Standard. Yield minimum amount of output
                    1:  Standard output plus lists of all class levels and with number of observations
                    2:  As 1 plus additional test output.
                        WARNING: this option may generate large volumes of output.

$DATA

Description of data file.

$DATA FMT (#int,#real,miss) fn [fn2]

where:  FMT  = ASCII or BINARY
        #int   = no. of integer variables
        #real  = no. of real variables
        miss   = reals below this value are regarded as missing
        fn     = name of the data files. Starting with "/" => full path and name
                 Otherwise relative to current directory
        fn2    = if specified, integer part is in fn, and real part is in fn2

Format of data file(s) see: section DATA FILE(S).

$VARIABLE

Specifies names for the variables in the data set. The names can be up to 8 character long.
If not specified variables are named I1-I#int and R1-R#real.

$VARIABLE
   Followed by lines with names for all integer and real input variables in the data set.
   Variable names can be specified as individual names or as a indexed group of variable
   names using the following syntax:

     SNP[1:45000]

     This will create 45000 variable names: SNP1, SNP2, ..., SNP45000

$MODEL

Specifies the model or a file containing the model specifications.

$MODEL [fn]

where   fn = name of file containing the model description.
             Starting with "/" => full path and name otherwise relative to current directory.

        Otherwise model directives are read from lines following the $MODEL keyword.

 Format of model directives see: section MODEL DIRECTIVES.

$GLMM

Specifies that a trait is modeled by a Generalized Linear Mixed Model.

$GLMM trait VARF=vf LINK=lf [OFFSET=ri] [CF=x]
where trait = trait number (sequence) as specified in the $MODEL section
         vf = the variance function
         lf = The link function
      Optional an offset and a correction factor can be specified.
         ri = real input number for the offset variable
          x = real constant to add to data in order to avoid singularities in the initial
              iteration (default value = 0.5)

Implemented variance functions:
   NORMAL, POISSON, BINOMIAL, GAMMA and INVGAUSSIAN

Implemented link functions:
   IDENTITY, LOG, EXPONENTIAL, RECIPROCAL, LOGIT, PROBIT, COMPLOG and LOGLOG

$GLMM_PRED

Specifies iteration parameters for prediction in models with trait(s) modeled via GLMM (works only for DMU4).

In order to improve convergency, GLMM_PRED starte with a GLM run on the fixed part of the model.
When this has converged, GLMM iteration on the full model is started.

$GLMM_PRED Round_Fixed CD_Fixed Round_Full CD_Full

Where Round_Fixed = Max. no. of GLMM iterations on the fixed part of the model (integer > 1)
         CD_Fixed = Convergence criteria for the fixed part of the model (real < 1.0)
       Round_Full = Max. Nr. of GLMM iterations on the full model
          CD_Full = Convergence criteria for the full model

$REDUCE

Used to merge random factors across trait, e.g. a random factor could be defined to have same
effect on several traits.

$REDUCE
   Followed by a line per trait.
   Each line contain as many numbers as there are random factors in the model
   (Co-variance matrices except the residual).

   For each random factor, the position in the (co)variance matrix must be specified,
   "0" indicate that this random factor is not in the model for the trait.

   Example: 4 trait and 3 random factors
      1    1    1
      1    2    1
      1    3    2
      1    4    2

   The first random factor is specified to have the same effect on each trait. The second random factor is
   specified to have different effect on each trait. The third random factor is specified to have the same
   effect on trait 1 and 2, and a different on trait 3 and 4.

$VAR_STR

Specify (co)variance structure for random factors.

$VAR_STR r_factor type <options>

where : r_factor = structure number, used to associate (co)variance.
                              structure to random effects in the model section.
        type     =  PED, DOM, COR, GREL, PGMIX, PGMIX_S, PGMIX_R, ABS_QTL or GROUP.

Options for type = PED

         method   = method for forming A-1 (1, 2, 3,  or 6).

         [ RAM ]  = if specified, a Reduced Animal Model relationship is used for sampling genetic
                    dispersion parameters This works only for RJMC (task = 2).

         FMT      = ASCII or BINARY

         fn       = name of the pedigree file. Starting with "/" => full path  and name
                    otherwise relative to current directory.

         If method = 6, the PHG’s can be treated as fixed or random. In the latter case, the  method (6)
                        must be followed by the word “RANDOM” and a real number. The (co)-variance matrix
                        multiplied by this number is added to the diagonal element/block for PHG’s.

Options for type = DOM

         ass_rf  = Random effect with the corresponding the corresponding  pedigree structure.
         FMT     = ASCII or BINARY.

         fn      = name of the  file with elements of the inverse dominance matrix.
                   Starting with "/" => full path and name otherwise relative to current directory.

Options for type = COR

         FMT     = ASCII or BINARY.

         fn      = name of the  file with elements of the inverse co-variance matrixi.
                   Starting with "/" => full path and name otherwise relative to current directory.

Options for type = GREL

         FMT     = ASCII or BINARY

         fn      = name of the  file with elements of the inverse co-variance matrix.
                   Starting with "/" => full path and name otherwise relative to current directory.

        This option is for situations where the correlation structure is dens as in the case of genomic
        relationship It utilize dense matrix operation and can use parallel computation and is therefore
        much faster the COR option.

 Options for type = PGMIX

         method   = method for forming A-1 (1, 2, 3, 4 or 6)

         FMT      = ASCII or BINARY

         fn       = name of the pedigree  file.
                    Starting with "/" => full path  and name otherwise relative to current directory.

         typed    = name of the file with ID’s of typed animals.
                    Starting with "/" => full path  and name otherwise relative to current directory.

         G-mat    = name of the  file with elements of the genomic relationship matrix.
                    Starting with "/" => full path and name otherwise relative to current directory.

         w.w      = optional weight to put on additive relationship matrix when forming the combined
                    relationship matrix (see Christensen and Lund, 2010).

         G-ADJUST = Adjust elements in the genomic relationship so Average of diagonal elements and
                    average of off-diagonal elements equal the same average in the additive relationship
                    for the typed animals.

Options for type  = PGMIX_S

                    Same format as PGMIX, but saves the part added to the inverse additive relationship
                    matrix to construct the H-1 matrix. This is for facilitating the use of the same H-1
                    for several analysis.

Options for type = PGMIX_R

                    Same format as PGMIX, but reuse the saves part to added to the inverse additive
                    relationship matrix to construct the H-1 matrix.

Options for type = ABS_QTL

         ass_rf  = Random effect with the same structure as the QTL effect (This will typically be the
                   random effect with a pedigree structure).

         FMT     = ASCII or BINARY.

         fn      = name of the data files.
                   Starting with "/" => full path and name otherwise relative to current directory.


Options for type = GROUP

                   This is for handling data with stratified heterogeneous residual (co)variance.

        Int. no. = Integer input no. for the variable relating observations to heterogeneous residual
                   variance strata (group)


Format of variance structure file see: VARIANCE STRUCTURE

$VAR_REST

Specification of restrictions on (co)-variance matrices.

$VAR_REST type options.

where type = type of restriction.

Type of restriction:

   VAR     = Variance components are kept as specified as prior.
   COV     = Co-variance components are kept as specified as prior.
   COR     = Correlation is kept as specified as prior.
   V_RATIO = Variance ratios are kept as specified by the priors.

 To keep all variances in a co-variance matrix at the values specified in the $PRIOR section of the
 driver file, include the following line in the driver file:

 $VAR_REST VAR rf_no ALL

 Where rf_no = random factor number = co-variance matrix number.

         ALL = all variances should be kept at the specified value.

 If only some of the variances should be kept at the specified value:

 $VAR_REST VAR rf_no E r_no(1) r_no(2) ... r_no(n)

 where  rf_no = random factor number = co-variance matrix number.
            E = only some of the variances that should be kept at the specified value.

      r_no(x) = row/column for the variance the keep constant ( 1 <= x  <= the dimension of
                the co-variance matrix).

To keep all co-variances in a co-variance matrix at the specified value by the following line:

$VAR_REST COV rf_no ALL

If only some co-variance components should be kept at the specified value specification of both
row and column are needed, so use:

$VAR_REST COV rf_no E r_no(1),c_no(1) r_no(2),c_no(2) ... r_no(n),c_no(n)

Correlations can also be kept at the values specified by the priors. It is specified in the same
way as keeping co-variance component constant, except that the key word “COV” is replaced by “COR”.

$VAR_REST COR rf_no ALL
$VAR_REST COR rf_no E r_no(1),c_no(1) r_no(2),c_no(2) ... r_no(n),c_no(n)

Variance ratios can also be kept at the values specified by the priors.
This requires specifications of which variance to restrict and the function of variances that should
be kept constant (=the value specified by the $PRIOR section).

This only works for functions of (co)-variance matrices of equal dimensions.

If all variance ratios are to be kept constant specify:

$VAR_REST V_RATIO rest_rf_no n_num rfn_no(1) .. rfn_no(n_num) n_den rfd_no(1) ..rfd_no(n_den) ALL

where rest_rf_no  = the number for the random factor ((co)variance matrix to restrict)
           n_num  = number of co-variance matrices in the numerator
       rfn_no(i)  = (co)-variance matrix i in the numerator  (i=1,… , n_num)
           n_den  = number of co-variance matrices in the denominator
       rfd_no(j)  = (co)-variance matrix j in the denominator (j=1,.. , n_den)

If only variance ratios for some of the elements are to be kept constant, “ALL” should be replaced by “E”,
and the row/column number for the element(s) to impose restrictions on should be specified:

$VAR_REST V_RATIO rest_rf_no n_num rfn_no(1) .. rfn_no(n_num) n_den rfd_no(1) .. rfd_no(n_den)
          E r_no(1) r_no(2)..r_no(n)

$MIXTURE

Can only be used with the gibbs sampler (rjmc) module. It specifies that at least one trait is modelled
as a mixture of two distributions

$MIXTURE int. no.

where : int_ no. = integer input no. for the variable contaning the initial guess for which distribution
                   (1 or 2) the observation belongs to. This variable is updated in each round of the
                   Gibbs sampler

$PRIOR

Specifies priors / starting values / true values for (co)variance components or a file containing
priors / starting values / true values.
If not specified, an identity matrix is assumed for all (co)variance matrices for the model.

For task = 11 and 12 all non-zero (co)-variance components must be specified.

$PRIOR [fn]

where   fn  = Starting with "/" => full path and name Otherwise relative to current directory
              If specified, priors are read from fn

              Otherwise priors are read from lines Following the $PRIOR keyword

Format for priors see :VARIANCES AND COVARIANCES

$PRECOND

Specifies the layout of the preconditioned matrix used by DMU5.

The structure is defined for the following 3 parts of MME:
 1) Fixed over all regressions
 2) Fixed nested regressions
 3) Fixed classification effects

$PRECOND  a b c

Where: a, b and c describes the structure for each of the 3 groups.

Legal combinations are:

  S  S S ->   All fixed effects across all traits as one block

  F F F ->    Fixed overall regressions: Full block across traits
                          Fixed nested regressions: Full block across traits
                          Fixed class effects:   Full block across traits

  F T T ->   Fixed overall regressions: Full block across traits
                         Fixed nested regressions: Trait block
                         Fixed class effects:  Trait block

  F D D -> Fixed overall regressions:  Full block across traits
                         Fixed nested regressions: Diagonal
                         Fixed class effects:  Diagonal

  T T T ->  Fixed overall regressions: Trait block
                         Fixed nested regressions: Trait block
                         Fixed class effects:  Trait block

  T D D -> Fixed overall regressions: Trait block
                         Fixed nested regressions: Diagonal
                         Fixed class effects:  Diagonal

  D D D ->Fixed overall regressions: Diagonal
                         Fixed nested regressions: Diagonal
                         Fixed class effects:  Diagonal

$SOLUTION

Specify that the final solution vector is stored on disk and the fist max 250 solutions
for each effect is printed to standard output. For the "free" version of DMU, only solutions for
1000 levels for random factors for which there are specified an variance structure are stored.
In order to have the fulle solution vector, a license is needed

$SOL_COV

Compute co-variances and correlations matrix between solutions to selected equations. The solutions,
co-variances and correlations are stored in an ascii file with filename SOL_STD.

The facility is only available in DMAUI and DMU4 when using direct solvers.

The format is explained in the output to standard out.

$SOL_PEV n_eq eq_1 eq_2 eq_3 ….  Eq_neq

Where n_eq is the number of selected equations and eq_1 eq_2 eg_3 … eq_n_eq are the equation number
for which the (co-variances and correlations are required.

$SOL_PEV

Compute and store the within random effect level solutions vector and the corresponding diagonal block
of the inverse coefficient matrix (lower triangle format) in a file. The output is stored in an ascii
file with filename SOL_PEV#, where # is the random effect number (rf_no).

The format depends on the actual model and is explained in the output to standard out.

The facility is only available in DMAUI and DMU4 when using direct solvers.

For the "free" version of DMU, only solutions for 1000 levels for random factors for which there are
specified an variance structure are stored. In order to have the fulle solution vector, a license is needed.


$SOL_PEV r_factor

where r_factor = structure number, used to associate (co)variance structure to random effects in the
                 model section

$RESIDUAL

Specifies that residuals should be computed and stored in a file.

Computation of residuals is only implemented in DMUAI, DMU4 and DMU5 (Task 1, 11 and 12).

$RESIDUALS FMT

where:  FMT   = ASCII or BINARY.

The content and format depends on the actual model and is explained in the output to standard out.

$DMU4

Specifies optional input to DMU4 or a file containing the optional input.

$DMU4 [fn]

 where   fn    = name of file contaning the optional input to DMU4.
                 Starting with "/" => full path and name Otherwise relative to current directory.

 Otherwise input is read from lines following the $DMU4 keyword.

 Format of input see: OPTIONAL DMU4 INPUT

$DMU5

Specifies optional input to DMU5 or a file containing the optional input.

$DMU5 [fn]

 where   fn    = name of file contaning the optional input to DMU5.
                 Starting with "/" => full path and name Otherwise relative to current directory.

 Otherwise input is read from lines following the $DMU5 keyword.

 Format of input see: OPTIONAL DMU5 INPUT

$DMUAI

Specifies optional input to DMUAI or a file containing the optional input.

$DMUAI [fn]

 where   fn    = name of file contaning the optional input to DMUAI.
                 Starting with "/" => full path and name Otherwise relative to current directory.

 Otherwise input is read from lines following the $DMUAI keyword.

 Format of input see: OPTIONAL DMUAI INPUT.

$RJMC

Specifies optional input to RJMC or a file containing the optional input.

$RJMC [fn]

 where   fn    = name of file contaning the optional input to RJMC.
                 Starting with "/" => full path and name Otherwise relative to current directory.

 Otherwise input is read from lines following the $RJMC keyword.

 Format of input see: OPTIONAL RJMC.