mental health of child and adolescent: multiple testing problem에 대한 solution (4/4) - non-parametric - permutation test

fsl에서 permutation method를 사용하기 위한 도구로 사용되는 것은 randomise 툴을 사용하는 것입니다.

Randomise 시에 추천되는 옵션은 TFCE (threshold-Free cluster enhancement)입니다. Cluster-based thresholding은 voxel-based보다는 더 민감도가 높지만 이를 통한 결과들이 initial thresholding의 level에 아주 의존적인데, 이 level을 선택하는 것에 대한 원칙이나 객관적인 방법이 없다는 약점이 있습니다. 따라서 대안으로 사용되는 TFCE의 경우에는 cluster-based thresholding에 유사하나 일반적으로 더 강력하고도 임의로 initial cluster-forming threshold를 설정할 필요가 없습니다. 이 옵션을 preprocessed data에 사용하기 위해서는 --T2 옵션을 추가하면 됩니다. 아래에 구체적인 실례를 들어서 설명을 하여 보도록 하겠습니다.

randomise 옵션 사용하기:

자료에서의 노이즈가 simple distribution을 따르지 않거나 자료를 summarize하기 위하여 non-standard statistics가 사용될 때 null distribution을 알 수 없게 된다. 이렇게 null distribution이 알려지지 않을 때에 permutation method(다른 말로 randomization method)는 statistic map들에 대한 inference (thresholding)를 위하여 사용된다. Randomise는 standard GLM 디자인 setup을 사용하여 modeling과 inference를 가능하게 하여 주는 단순한 permutation program이다. Randomise의 수식은 다음과 같다.

randomise -i <4D_input_data> -o <output_rootname> -d design.mat -t design.con -m <mask_image> -n 500 -D -v 5 --T2 -V

여기에서 –i 파라미터 뒤의 4D_input_data를 기반으로 하여 -o 파라미터 뒤의 output_rootname으로 test statistic image를 만들어 내게 된다.

design.mat와 design.con은 design matrix와 필요한 contrast들의 명단을 포함하는 text file들이다.

–n 옵션은 randomise가 테스트할 null distribution을 구축하기 위하여 보통 500회(5000회를 추천)의 permutation을 수행하도록 하는 것이다. 이 옵션을 빼면 자동으로 5000회의 permutation을 수행한다.

-D 옵션은 randomise가 데이터를 demean하도록 하는 것인데 이 것은 design matrix에서 mean을 모델링하지 않을 때에 필요한 옵션이다. –D 옵션의 사용 유무는 design matrix를 어떻게 짜는가에 달려있다. 예를 들어서, mean predictor가 design matrix안에 포함이 되어 있다면, -D를 사용하지 않는다. 그러나 만약에 mean predictor가 design matrix안에 포함되지 않는다면 그 때에는 –D 옵션을 꼭 사용하여 주어서 mean이 data (와 design)에서 제거가 되도록 하여 주어야 한다.

-v 옵션: You can potentially improve the estimation of the variance that feeds into the final "t" statistic image by using the variance smoothing option -v <number> where you need to specify the spatial extent of the smoothing in mm

--T2 옵션은 TFCE를 사용하기 위한 옵션이다. (주의: These optimizations are different for different "dimensionality" of your data; for normal, 3D data (such as in an FSL-VBM analysis), you should use just the -T option, while for TBSS analyses (that is in effect on the mostly "2D" white matter skeleton), you should use the --T2 option)

-V 옵션: a deprecated flag (중요도가 떨어지는 기호, verbose mode를 활성화시키기 위해서 사용하는 옵션이지만 이미 randomise에서는 초기치로 잡혀 있다. 따라서 생략 가능)

mental health of child and adolescent

2013년 11월 23일 토요일

multiple testing problem에 대한 solution (4/4) - non-parametric - permutation test - randomise in fsl

댓글 없음:

댓글 쓰기