Abstract

Background

Incidence of premenopausal breast cancer (BC) has risen in recent years, though most existing BC prediction models are not generalizable to young women due to underrepresentation of this age group in model development.

Methods

Using questionnaire-based data from 19 prospective studies harmonized within the Premenopausal Breast Cancer Collaborative Group (PBCCG), representing 783,830 women, we developed a premenopausal BC risk prediction model. The data were split into training (2/3) and validation (1/3) datasets with equal distribution of cohorts in each. In the training dataset variables were chosen from known and hypothesized risk factors: age, age at menarche, age at first birth, parity, breastfeeding, height, BMI, young adulthood BMI, recent weight change, alcohol consumption, first-degree family history of BC, and personal history of benign breast disease (BBD). Hazard ratios (HR) and 95% confidence intervals (CI) were estimated by Cox proportional hazards regression using age as time scale, stratified by cohort. Given that complete information on all risk factors was not available in all cohorts, coefficients were estimated separately in groups of cohorts with the same available covariate information, adjusted to account for the correlation between missing and non-missing variables and meta-analyzed. Absolute risk of BC (in situ or invasive) within 5 years, was determined using country-, age-, and birth cohort-specific incidence rates. Discrimination (area under the curve, AUC) and calibration (Expected/Observed, E/O) were evaluated in the validation dataset. We compared our model with a literature-based model for women < 50 years (iCARE-Lit).

Results

Selected model risk factors were age at menarche, parity, height, current and young adulthood BMI, family history of BC, and personal BBD history. Predicted absolute 5-year risk ranged from 0% to 5.7%. The model overestimated risk on average [E/O risk = 1.18 (1.14–1.23)], with underestimation of risk in lower absolute risk deciles and overestimation in upper absolute risk deciles [E/O 1st decile = 0.59 (0.58–0.60); E/O 10th decile = 1.48 (1.48–1.49)]. The AUC was 59.1% (58.1–60.1%). Performance was similar to the iCARE-Lit model.

Conclusion

In this prediction model for premenopausal BC, the relative contribution of risk factors to absolute risk was similar to existing models for overall BC. The discriminatory ability was nearly identical (< 1% difference in AUC) to the existing iCARE-Lit model developed in women under 50 years. The inability to improve discrimination highlights the need to investigate additional predictors to better understand premenopausal BC risk.

Details

Title
Development and validation of a risk prediction model for premenopausal breast cancer in 19 cohorts
Author
Brantley, Kristen D; Jones, Michael E; Tamimi, Rulla M; Rosner, Bernard A; Kraft, Peter; Nichols, Hazel B; Katie M. O’Brien; Adami, Hans-Olov; Aizpurua, Amaia; Berrington de Gonzalez, Amy; Blot, William J; Braaten, Tonje; Chen, Yu; Jessica Clague DeHart; Dossus, Laure; Elias, Sjoerd
Pages
1-12
Section
Research
Publication year
2025
Publication date
2025
Publisher
BioMed Central
ISSN
1465-5411
e-ISSN
1465542X
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3201562402
Copyright
© 2025. This work is licensed under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.