Content area
Clinical target volume (CTV) delineation is essential in cervical cancer brachytherapy (BT), as accurate and automatic segmentation can improve treatment effect for patients with locally advanced disease while reducing the workload of clinicians. We propose BCTVNet (BT CTV),, a 3D hybrid neural network that integrates convolutional neural network (CNN) and transformer branches to combine strong local feature extraction with global context modeling. The 3D architecture enables the model to capture spatial relationships across slices, which is crucial for accurately identifying CTV boundaries in volumetric computed tomography (CT) data. In addition, a 3D contrast limited adaptive histogram equalization preprocessing step is applied to enhance the local contrast of soft tissues, improving anatomical structure visibility and facilitating boundary recognition. Experiments on a private BT CT dataset of 95 patients show that BCTVNet achieves superior performance compared with popular CNN-based and transformer-based segmentation models, reaching a Dice similarity coefficient (DSC) of 83.23% and a Hausdorff distance 95th percentile of 3.53 mm. Evaluation on the publicly available SegTHOR dataset further confirms its strong generalizability, achieving the highest average score among all compared methods, with a DSC of 87.09%. Multiple ablation experiments verify the effectiveness of both the hybrid architecture and the adaptive preprocessing strategy. These results demonstrate that BCTVNet provides accurate and stable CTV delineation, making it a reliable tool for clinical BT and a valuable approach for wider medical image segmentation tasks.