TY - JOUR
T1 - Is it necessary to transform nutrient variables prior to statistical analyses?
AU - Millns, Helen
AU - Woodward, Mark
AU - Bolton-smith, Caroline
N1 - Funding Information:
The SHHS was funded by the Scottish Office Home and Health Department. H. M. received funding from the Medical Research Council.
PY - 1995/2/1
Y1 - 1995/2/1
N2 - The distributions of the intakes of many nutrients are skewed, yet this is often overlooked when standard statistical analyses are applied to nutrient data. The nutrient intakes of 5,123 men and 5,236 women, recorded by food frequency questionnaire in the Scottish Heart Health Study, were transformed to achieve approximately symmetric distributions. Power transformations were chosen using letter value analyses. A letter value analysis uses selected order statistics and their position around the median to assess symmetry. The effect that each transformation had on a comparison of nutrient intakes between those with and without prevalent coronary heart disease was determined from t tests on the untransformed and transformed variable. The effects of the logarithm and square root transformation and of the optimum Box-Cox transformation were also determined, and the results were compared with the nonparametric Mann-Whitney test. The conclusion of whether or not to reject the null hypotheses often varied, depending on the transformation and test used. The nonparametric test usually gave a conclusion similar to that of the t test on the letter value-transformed data, the Box-Cox-transformed variable, and after either the logarithm or square root transformation of the data, but not always both. The results from the untransformed variable were sometimes very different. Failure to account for skewness in nutrient variables may thus lead to spurious conclusions.
AB - The distributions of the intakes of many nutrients are skewed, yet this is often overlooked when standard statistical analyses are applied to nutrient data. The nutrient intakes of 5,123 men and 5,236 women, recorded by food frequency questionnaire in the Scottish Heart Health Study, were transformed to achieve approximately symmetric distributions. Power transformations were chosen using letter value analyses. A letter value analysis uses selected order statistics and their position around the median to assess symmetry. The effect that each transformation had on a comparison of nutrient intakes between those with and without prevalent coronary heart disease was determined from t tests on the untransformed and transformed variable. The effects of the logarithm and square root transformation and of the optimum Box-Cox transformation were also determined, and the results were compared with the nonparametric Mann-Whitney test. The conclusion of whether or not to reject the null hypotheses often varied, depending on the transformation and test used. The nonparametric test usually gave a conclusion similar to that of the t test on the letter value-transformed data, the Box-Cox-transformed variable, and after either the logarithm or square root transformation of the data, but not always both. The results from the untransformed variable were sometimes very different. Failure to account for skewness in nutrient variables may thus lead to spurious conclusions.
KW - Models
KW - Nutrition assessment
KW - Statistical
UR - http://www.scopus.com/inward/record.url?scp=0028813976&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0028813976&partnerID=8YFLogxK
U2 - 10.1093/oxfordjournals.aje.a117427
DO - 10.1093/oxfordjournals.aje.a117427
M3 - Article
C2 - 7840099
AN - SCOPUS:0028813976
SN - 0002-9262
VL - 141
SP - 251
EP - 262
JO - American Journal of Epidemiology
JF - American Journal of Epidemiology
IS - 3
ER -