Misplaced Pages

Standardized mean of a contrast variable

Article snapshot taken from[REDACTED] with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
The topic of this article may not meet Misplaced Pages's general notability guideline. Please help to demonstrate the notability of the topic by citing reliable secondary sources that are independent of the topic and provide significant coverage of it beyond a mere trivial mention. If notability cannot be shown, the article is likely to be merged, redirected, or deleted.
Find sources: "Standardized mean of a contrast variable" – news · newspapers · books · scholar · JSTOR (July 2011) (Learn how and when to remove this message)

In statistics, the standardized mean of a contrast variable (SMCV or SMC), is a parameter assessing effect size. The SMCV is defined as mean divided by the standard deviation of a contrast variable. The SMCV was first proposed for one-way ANOVA cases and was then extended to multi-factor ANOVA cases.

Background

Consistent interpretations for the strength of group comparison, as represented by a contrast, are important.

When there are only two groups involved in a comparison, SMCV is the same as the strictly standardized mean difference (SSMD). SSMD belongs to a popular type of effect-size measure called "standardized mean differences" which includes Cohen's d {\displaystyle d} and Glass's δ . {\displaystyle \delta .}

In ANOVA, a similar parameter for measuring the strength of group comparison is standardized effect size (SES). One issue with SES is that its values are incomparable for contrasts with different coefficients. SMCV does not have such an issue.

Concept

Suppose the random values in t groups represented by random variables G 1 , G 2 , , G t {\displaystyle G_{1},G_{2},\ldots ,G_{t}} have means μ 1 , μ 2 , , μ t {\displaystyle \mu _{1},\mu _{2},\ldots ,\mu _{t}} and variances σ 1 2 , σ 2 2 , , σ t 2 {\displaystyle \sigma _{1}^{2},\sigma _{2}^{2},\ldots ,\sigma _{t}^{2}} , respectively. A contrast variable V {\displaystyle V} is defined by

V = i = 1 t c i G i , {\displaystyle V=\sum _{i=1}^{t}c_{i}G_{i},}

where the c i {\displaystyle c_{i}} 's are a set of coefficients representing a comparison of interest and satisfy i = 1 t c i = 0 {\displaystyle \sum _{i=1}^{t}c_{i}=0} . The SMCV of contrast variable V {\displaystyle V} , denoted by λ {\displaystyle \lambda } , is defined as

λ = E ( V ) stdev ( V ) = i = 1 t c i μ i Var ( i = 1 t c i G i ) = i = 1 t c i μ i i = 1 t c i 2 σ i 2 + 2 i = 1 t j = i c i c j σ i j {\displaystyle \lambda ={\frac {\operatorname {E} (V)}{\operatorname {stdev} (V)}}={\frac {\sum _{i=1}^{t}c_{i}\mu _{i}}{\sqrt {{\text{Var}}\left(\sum _{i=1}^{t}c_{i}G_{i}\right)}}}={\frac {\sum _{i=1}^{t}c_{i}\mu _{i}}{\sqrt {\sum _{i=1}^{t}c_{i}^{2}\sigma _{i}^{2}+2\sum _{i=1}^{t}\sum _{j=i}c_{i}c_{j}\sigma _{ij}}}}}

where σ i j {\displaystyle \sigma _{ij}} is the covariance of G i {\displaystyle G_{i}} and G j {\displaystyle G_{j}} . When G 1 , G 2 , , G t {\displaystyle G_{1},G_{2},\ldots ,G_{t}} are independent,

λ = i = 1 t c i μ i i = 1 t c i 2 σ i 2 . {\displaystyle \lambda ={\frac {\sum _{i=1}^{t}c_{i}\mu _{i}}{\sqrt {\sum _{i=1}^{t}c_{i}^{2}\sigma _{i}^{2}}}}.}

Classifying rule for the strength of group comparisons

The population value (denoted by λ {\displaystyle \lambda } ) of SMCV can be used to classify the strength of a comparison represented by a contrast variable, as shown in the following table. This classifying rule has a probabilistic basis due to the link between SMCV and c-probability.

Effect type Effect subtype Thresholds for negative SMCV Thresholds for positive SMCV
Extra large Extremely strong λ 5 {\displaystyle \lambda \leq -5} λ 5 {\displaystyle \lambda \geq 5}
Very strong 5 < λ 3 {\displaystyle -5<\lambda \leq -3} 5 > λ 3 {\displaystyle 5>\lambda \geq 3}
Strong 3 < λ 2 {\displaystyle -3<\lambda \leq -2} 3 > λ 2 {\displaystyle 3>\lambda \geq 2}
Fairly strong 2 < λ 1.645 {\displaystyle -2<\lambda \leq -1.645} 2 > λ 1.645 {\displaystyle 2>\lambda \geq 1.645}
Large Moderate 1.645 < λ 1.28 {\displaystyle -1.645<\lambda \leq -1.28} 1.645 > λ 1.28 {\displaystyle 1.645>\lambda \geq 1.28}
Fairly moderate 1.28 < λ 1 {\displaystyle -1.28<\lambda \leq -1} 1.28 > λ 1 {\displaystyle 1.28>\lambda \geq 1}
Medium Fairly weak 1 < λ 0.75 {\displaystyle -1<\lambda \leq -0.75} 1 > λ 0.75 {\displaystyle 1>\lambda \geq 0.75}
Weak 0.75 < λ < 0.5 {\displaystyle -0.75<\lambda <-0.5} 0.75 > λ > 0.5 {\displaystyle 0.75>\lambda >0.5}
Very weak 0.5 λ < 0.25 {\displaystyle -0.5\leq \lambda <-0.25} 0.5 λ > 0.25 {\displaystyle 0.5\geq \lambda >0.25}
Small Extremely weak 0.25 λ < 0 {\displaystyle -0.25\leq \lambda <0} 0.25 λ > 0 {\displaystyle 0.25\geq \lambda >0}
No effect λ = 0 {\displaystyle \lambda =0}

Statistical estimation and inference

The estimation and inference of SMCV presented below is for one-factor experiments. Estimation and inference of SMCV for multi-factor experiments has also been discussed.

The estimation of SMCV relies on how samples are obtained in a study. When the groups are correlated, it is usually difficult to estimate the covariance among groups. In such a case, a good strategy is to obtain matched or paired samples (or subjects) and to conduct contrast analysis based on the matched samples. A simple example of matched contrast analysis is the analysis of paired difference of drug effects after and before taking a drug in the same patients. By contrast, another strategy is to not match or pair the samples and to conduct contrast analysis based on the unmatched or unpaired samples. A simple example of unmatched contrast analysis is the comparison of efficacy between a new drug taken by some patients and a standard drug taken by other patients. Methods of estimation for SMCV and c-probability in matched contrast analysis may differ from those used in unmatched contrast analysis.

Unmatched samples

Consider an independent sample of size n i {\displaystyle n_{i}} ,

Y i = ( Y i 1 , Y i 2 , , Y i n i ) {\displaystyle Y_{i}=\left(Y_{i1},Y_{i2},\ldots ,Y_{in_{i}}\right)}

from the i th ( i = 1 , 2 , , t ) {\displaystyle i^{\text{th}}(i=1,2,\ldots ,t)} group G i {\displaystyle G_{i}} . Y i {\displaystyle Y_{i}} 's are independent. Let Y ¯ i = 1 n i j = 1 n i Y i j {\displaystyle {\bar {Y}}_{i}={\frac {1}{n_{i}}}\sum _{j=1}^{n_{i}}Y_{ij}} ,

s i 2 = 1 n i 1 j = 1 n i ( Y i j Y ¯ i ) 2 , {\displaystyle s_{i}^{2}={\frac {1}{n_{i}-1}}\sum _{j=1}^{n_{i}}\left(Y_{ij}-{\bar {Y}}_{i}\right)^{2},}
N = i = 1 t n i {\displaystyle N=\sum _{i=1}^{t}n_{i}}

and

MSE  = 1 N t i = 1 t ( n i 1 ) s i 2 . {\displaystyle {\text{MSE }}={\frac {1}{N-t}}\sum _{i=1}^{t}\left(n_{i}-1\right)s_{i}^{2}.}

When the t {\displaystyle t} groups have unequal variance, the maximal likelihood estimate (MLE) and method-of-moment estimate (MM) of SMCV ( λ {\displaystyle \lambda } ) are, respectively

λ ^ MLE  = i = 1 t c i Y ¯ i i = 1 t n i 1 n i c i 2 s i 2 {\displaystyle {\hat {\lambda }}_{\text{MLE }}={\frac {\sum _{i=1}^{t}c_{i}{\bar {Y}}_{i}}{\sqrt {\sum _{i=1}^{t}{\frac {n_{i}-1}{n_{i}}}c_{i}^{2}s_{i}^{2}}}}}

and

λ ^ MM = i = 1 t c i Y ¯ i i = 1 t c i 2 s i 2 . {\displaystyle {\hat {\lambda }}_{\text{MM}}={\frac {\sum _{i=1}^{t}c_{i}{\bar {Y}}_{i}}{\sqrt {\sum _{i=1}^{t}c_{i}^{2}s_{i}^{2}}}}.}

When the t {\displaystyle t} groups have equal variance, under normality assumption, the uniformly minimal variance unbiased estimate (UMVUE) of SMCV ( λ {\displaystyle \lambda } ) is

λ ^ UMVUE = K N t i = 1 t c i Y ¯ i i = 1 t MSE  c i 2 {\displaystyle {\hat {\lambda }}_{\text{UMVUE}}={\sqrt {\frac {K}{N-t}}}{\frac {\sum _{i=1}^{t}c_{i}{\bar {Y}}_{i}}{\sqrt {\sum _{i=1}^{t}{\text{MSE }}c_{i}^{2}}}}}

where K = 2 ( Γ ( N t 2 ) ) 2 ( Γ ( N t 1 2 ) ) 2 {\displaystyle K={\frac {2\left(\Gamma \left({\frac {N-t}{2}}\right)\right)^{2}}{\left(\Gamma \left({\frac {N-t-1}{2}}\right)\right)^{2}}}} .

The confidence interval of SMCV can be made using the following non-central t-distribution:

T = i = 1 t c i Y ¯ i i = 1 t MSE  c i 2 / n i noncentral  t ( N t , b λ ) {\displaystyle T={\frac {\sum _{i=1}^{t}c_{i}{\bar {Y}}_{i}}{\sqrt {\sum _{i=1}^{t}{\text{MSE }}c_{i}^{2}/n_{i}}}}\sim {\text{noncentral }}t(N-t,b\lambda )}

where b = i = 1 t c i 2 i = 1 t c i 2 / n i . {\displaystyle b={\sqrt {\frac {\sum _{i=1}^{t}c_{i}^{2}}{\sum _{i=1}^{t}c_{i}^{2}/n_{i}}}}.}

Matched samples

In matched contrast analysis, assume that there are n {\displaystyle n} independent samples ( Y 1 j , Y 2 j , , Y t j ) {\displaystyle \left(Y_{1j},Y_{2j},\cdots ,Y_{tj}\right)} from t {\displaystyle t} groups ( G i {\displaystyle G_{i}} 's), where i = 1 , 2 , , t ; j = 1 , 2 , , n {\displaystyle i=1,2,\cdots ,t;j=1,2,\cdots ,n} . Then the j th {\displaystyle j^{\text{th}}} observed value of a contrast V = i = 1 t c i G i {\displaystyle V=\sum _{i=1}^{t}c_{i}G_{i}} is v j = i = 1 t c i Y i {\displaystyle v_{j}=\sum _{i=1}^{t}c_{i}Y_{i}} .

Let V ¯ {\displaystyle {\bar {V}}} and s V 2 {\displaystyle s_{V}^{2}} be the sample mean and sample variance of the contrast variable V {\displaystyle V} , respectively. Under normality assumptions, the UMVUE estimate of SMCV is

λ ^ UMVUE = K n 1 V ¯ s V {\displaystyle {\hat {\lambda }}_{\text{UMVUE}}={\sqrt {\frac {K}{n-1}}}{\frac {\bar {V}}{s_{V}}}}

where K = 2 ( Γ ( n 1 2 ) ) 2 ( Γ ( n 2 2 ) ) 2 . {\displaystyle K={\frac {2\left(\Gamma \left({\frac {n-1}{2}}\right)\right)^{2}}{\left(\Gamma \left({\frac {n-2}{2}}\right)\right)^{2}}}.}

A confidence interval for SMCV can be made using the following non-central t-distribution:

T = V ¯ s V / n noncentral  t ( n 1 , n λ ) . {\displaystyle T={\frac {\bar {V}}{s_{V}/{\sqrt {n}}}}\sim {\text{noncentral }}t\left(n-1,{\sqrt {n}}\lambda \right).}

See also

References

  1. ^ Zhang XHD (2011). Optimal High-Throughput Screening: Practical Experimental Design and Data Analysis for Genome-scale RNAi Research. Cambridge University Press. ISBN 978-0-521-73444-8.
  2. ^ Zhang XHD (2009). "A method for effectively comparing gene effects in multiple conditions in RNAi and expression-profiling research". Pharmacogenomics. 10: 345–58. doi:10.2217/14622416.10.3.345. PMID 20397965.
  3. ^ Zhang XHD (2010). "Assessing the size of gene or RNAi effects in multifactor high-throughput experiments". Pharmacogenomics. 11: 199–213. doi:10.2217/PGS.09.136. PMID 20136359.
  4. Rosenthal R, Rosnow RL, Rubin DB (2000). Contrasts and Effect Sizes in Behavioral Research. Cambridge University Press. ISBN 0-521-65980-9.
  5. Huberty CJ (2002). "A history of effect size indices". Educational and Psychological Measurement. 62: 227–40. doi:10.1177/0013164402062002002.
  6. Kirk RE (1996). "Practical significance: A concept whose time has come". Educational and Psychological Measurement. 56: 746–59. doi:10.1177/0013164496056005002.
  7. Cohen J (1962). "The statistical power of abnormal-social psychological research: A review". Journal of Abnormal and Social Psychology. 65: 145–53. doi:10.1037/h0045186. PMID 13880271.
  8. Glass GV (1976). "Primary, secondary, and meta-analysis of research". Educational Researcher. 5: 3–8. doi:10.3102/0013189X005010003.
  9. Steiger JH (2004). "Beyond the F test: Effect size confidence intervals and tests of close fit in the analysis of variance and contrast analysis". Psychological Methods. 9: 164–82. doi:10.1037/1082-989x.9.2.164. PMID 15137887.
Categories:
Standardized mean of a contrast variable Add topic