Dear Jerome,
Let me chime in with my opinion: the results of the Schoemaker et al. paper are based on hippocampus/amygdala segmentations from FreeSurfer 4.4.
The hippocampal subfields segmentation module from the development version you used should be more precise than the 4.4 version, and the anatomical landmarks are well specified in the paper by Iglesias et al., 2015:
https://www.ncbi.nlm.nih.gov/pubmed/25936807
I think this paper is a good basis for your arguments to the reviewer.
Concerning the reliability of the estimates for particular hippocampal subfields, I think it also depends on the resolution of your input images and on whether you used an additional high-resolution T2 scan. The effect of image resolution on the reliability of the results is also well discussed in the above-mentioned paper, I think.
Regards,
Antonin Skoch