Hi Doug and Nick,
thanks for your quick answer. I think that the
assumption of linearity can often be made in genetics and in large GWAS studies
the coding is done as suggested by Nick. However, Nick's solution does not suit
this particular case since one of the homozygous groups has less than 10
subjects which, in my opinion, might skew our results. Therefore we would
like to lump homozygotes and the carriers together and have 2 groups
(instead of 3).
Unfortunately I and Stefan Brauns do not understand
your reply, Doug. We were suggesting to include the binary variable for genotype
as a "variable" ( covariate) instead of a "class" (factor) in the FSGD. This
would enable us to get around the problem of "small cell sizes". Why do
you say this is not possible with an
FSGD?
We would just specify
Variables age SNP
and SNP would be 0 and 1 instead of 0, 1 and 2 (as
suggested by Nick)
This is how the header would look
like:
GroupDescriptorFile 1
Title
rs8216888_status
MeasurementName thickness
Class
SCZMALEMGH
Class SCZFEMALEMGH
Class
HCMALEMGH
Class HCFEMALEMGH
Class
SCZMALEIA
Class SCZFEMALEIA
Class
HCMALEIA
Class HCFEMALEIA
Class
SCZMALEUMN
Class SCZFEMALEUMN
Class
HCMALEUMN
Class HCFEMALEUMN
Class
SCZMALEUNM
Class SCZFEMALEUNM
Class
HCMALEUNM
Class HCFEMALEUNM
Variables age
SNP
In contrast, if we would include it as a class - we
would have 32 instead of 16 classes and then "Variables age"
The question is if it violates some assumptions if
we specify a "variables" (covariate) which is binary.
Many thanks, Stefan
Message: 6
Date: Wed, 02 Dec 2009 13:18:20 -0500
From: Douglas N
Greve <
greve@nmr.mgh.harvard.edu>
Subject:
Re: [Freesurfer] "dummy variable" in mri-glmfit
To: Stefan Brauns <
stefan.brauns@googlemail.com>
Cc:
freesurfer <
freesurfer@nmr.mgh.harvard.edu>
Message-ID:
<
4B16AF6C.9080103@nmr.mgh.harvard.edu>
Content-Type:
text/plain; charset=UTF-8; format=flowed
Do you mean having just another
column in your design matrix with 0s and
1s? You can do this, but not with
an FSGD. You'll have to supply your
own matrix. An easy way to do this would
be to run mri_glmfit with and
FSGD without the genotype. This will create a
matrix Xg.dat in the
output dir, then just modify that matrix and pass it to
a new call to
mri_glmfit
doug
Stefan Brauns wrote:
> Hi
there,
>
> we would like to test the effect of a binary variable
(genotype
> = carrier vs. homozygous) on cortical thicknes in mri-glmfit.
Since we
> are also controlling for gender and aquisition site (4 sites)
we
> already have 16 groups. In order to control for age as a covariate
we
> need at least 2 subjects per group to be able to estimate an age
slope.
>
> If we include the aforementioned binary variable
(genotype) as a
> factor (two different "groups"), we would have 32
groups and
> unfortunately not enough subjects per group.
>
>
Is it possible to include binary variables ("dummy variable" coded as
> 0
and 1) such as genotype or gender as covariates (slope), in order to
>
reduce the number of groups and examine the effect on thickness? In
>
simple regression this would not affect the results - what would we
>
expect here?
>
> Many thanks,
>
>
Stefan
>