FreeSurfer experts,
I need to make the process of each one brain 10-20 times faster somehow - one way should be parallelization approach.
Currently I'm trying to add OpenMP parallelization to time-consuming part of the source code, especially mri_ca_register and mri_em_register.
Not completed yet, but at this point of time, I can not see the speed-up in proportion to the number of CPU cores; it's just x2.5 speed-up using 8 or 16 cores.
I'm afraid there might be fundamental limitations in algorithm and/or implementation of the code. Should I proceed with this work?
Any advice, help or comment would be appreciated.
Akio