Oncotarget

Clinical Research Papers:

Difference between observed and expected number of involved lymph nodes reflects the metastatic potential of breast cancer independent to intrinsic subtype

PDF |  HTML  |  How to cite

Oncotarget. 2015; 6:16686-16697. https://doi.org/10.18632/oncotarget.3647

Metrics: PDF 2472 views  |   HTML 1922 views  |   ?  

Ke-Da Yu _, Yi-Zhou Jiang and Zhi-Ming Shao

Abstract

Ke-Da Yu1, Yi-Zhou Jiang1, Zhi-Ming Shao1

1Department of Breast Surgery, Cancer Center and Cancer Institute, Shanghai Medical College, Fudan University, Shanghai, P.R. China

Correspondence to:

Ke-Da Yu, e-mail: [email protected]

Keywords: breast cancer, metastatic potential, lymph node

Received: February 15, 2015     Accepted: March 24, 2015     Published: April 10, 2015

ABSTRACT

Purpose: Poor prognosis associated with metastasis in breast cancer patients highlights the critical need to develop an effective evaluation model for metastatic potential (MP). We hypothesized that MP could be also indicated by primary tumor size and involved lymph nodes (LNs).

Methods: The expected number of involved LNs is defined as tumor size (cm) divided by 1.5. The effect of the surrogate for MP (defined as difference between the number of observed and expected involved LNs) on breast cancer-specific survival (BCSS) was investigated in the first cohort from SEER (n = 108,814). Validation was performed in another SEER cohort (n = 50,414) and a third cohort (n = 3,755).

Results: MP is an independent predictor for BCSS in the overall population [hazard ratio (HR) for high MP: 2.92; 95% confidence interval (CI): 2.80–3.03] and in subgroups. The effect of surrogate for MP on survival was independent to intrinsic subtype, with adjusted HRs of 3.46 (95%CI, 2.02–5.93), 2.30 (95%CI, 1.64–3.24), 4.05 (95%CI, 2.85–5.76), and 1.45 (95%CI, 1.04–2.03) in luminal-A, luminal-B, triple-negative, and HER2-positive subtypes, respectively.

Conclusion: Difference between the observed and expected number of involved LNs serves as an indicator for MP, which is independent to intrinsic subtype and could predict survival. Our findings need further validation.


INTRODUCTION

Breast cancer mortality is typically linked with distant metastasis, which is the most lethal type of recurrence and practically undetectable at the time of diagnosis. [1, 2] The poor prognosis associated with metastasis highlights the critical need to better understand the biology of breast cancer metastasis and to develop an accurate evaluation model for metastatic potential (MP) after surgery for patients with operable disease.

With the development of intrinsic molecular subtype, clinicians use the surrogate of intrinsic subtype to determine MP. Intrinsic subtype correlates well with differences in prognosis, tumor aggressiveness, and response to available therapies. [35] Compared with luminal-like (luminal-A and luminal-B) tumors, HER2-positive and triple negative breast cancer (TNBC) are more likely to exhibit higher MP and are associated with markedly worse survival. Unfortunately, it is difficult for clinicians to discriminate different levels of MP within one subtype. Of note, emerging evidence demonstrates that very small tumors with extensive lymph node (LN) involvement can exhibit highly aggressive behavior compared with larger ones; [6] similarly, in the absence of LN involvement, large tumor size can be a surrogate for biologically indolent disease. [7] Consequently, the integrated understandings of LN involvement with tumor size might provide some useful information on tumor biology independent to intrinsic subtype.

We hypothesized that, within a certain subtype, the MP of breast cancer could be determined by the difference between the number of observed and expected involved LNs. First, we measured the quantitative ratio between tumor size and the number of involved LNs in the overall population. Using this ratio, we determined the expected number of involved LNs based on a given tumor size. The difference between the observed and expected numbers of involved LNs might serve as a surrogate for MP, which is proved closely associated with distance disease-free survival (DDFS) and breast cancer-specific survival (BCSS). To perform high-powered statistical analysis, we used the National Cancer Institute’s Surveillance, Epidemiology and End Results (SEER) cancer database for testing and validation. We further validated our findings in another independent cohort from Fudan University Shanghai Cancer Center (FDUSCC).

MATERIALS AND METHODS

The first cohort from SEER for model establishment and the survival test

In the first cohort, we selected female patients with invasive breast cancer from the SEER database (released in Nov 2012) from Jan-1 1997 to Dec-31 2006. Patients diagnosed after 2006 were excluded to ensure an adequate follow-up time.

Initially, we identified 111,321 patients according to the following inclusion criteria: female, age of diagnosis between 18 and 74 years, surgical treatment with either mastectomy or breast-conserving surgery, AJCC TNM stages I-III, pathologically confirmed invasive ductal carcinoma, at least four axillary LNs dissected, unilateral cancer, known time of diagnosis, breast cancer as the first and only cancer diagnosis, known number of involved LNs, and known tumor size. The following information was also obtained if available: estrogen receptor (ER) and progesterone receptor (PgR) status, histological grade, race, and use of radiotherapy. A few cases with borderline values of ER/PgR were treated as positive because, according to current standards, ER and PgR status is considered positive if there are at least 1% positive nuclei. [8] Although SEER provides HER2 status from 2010 and the subtype of each case could be determined after that time, the follow-up time for survival is inadequate yet. SEER did not provide information on chemotherapy and endocrine therapy. There were very few cases with extreme values of tumor size (1.3% of cases were larger than 8.0 cm) and numbers of positive LNs (1.0% of cases had more than 18). To minimize the influence of extreme values, we excluded these cases. In total, 108,814 cases composed the first cohort.

The primary study outcome was BCSS. The cause of death was categorized as breast cancer-specific or non-breast cancer-related. BCSS was calculated from the date of diagnosis to the date of breast cancer death. Patients who died from other causes were censored at the date of death.

The second cohort from SEER for survival validation

Using the criteria above, we selected 50,414 patients with invasive ductal breast cancer from SEER between Jan-1 1990 and Dec-31 1997 to validate the preliminary findings in the first cohort. Patients diagnosed before 1990 were excluded because of the lack of hormone receptor data. Because early cases might exhibit different distributions in tumor stage compared with those at present, [9] we did not use this set for linear regression to calculate the ratio of tumor size to the number of involved LNs.

The third cohort from FDUSCC for survival validation

To further validate the findings from the SEER dataset, to determine a direct relationship between the MP category and distant metastasis, and especially to test the performance of MP category in a certain subtype, we used the data from 3,755 consecutive patients diagnosed with operable unilateral breast cancer between Jan-1 1998 and Dec-31 2006 at FDUSCC. This is a well-characterized series of patients, whose clinicopathologic and follow-up information were maintained on a prospective basis. [10] Patients’ treatments were based on St. Gallen consensus. [11, 12] The cut-off for ER or PgR positivity was ≥ 10% of tumor cells with nuclear staining. Pathologic HER2 status was defined according to ASCO/CAP guidelines. [13] The primary treatment for all of these patients was surgery. Intrinsic subtypes (luminal-A, luminal-B, TNBC, and HER2-positive) were determined according to the clinicopathologic criteria recommended by the St Gallen panelists. [14] In brief, luminal-A is ER/PgR positive, HER2 negative, and Ki-67 low (< 14%); luminal-B is ER/PgR positive, and HER2 positive or Ki-67 high; TNBC is ER, PgR, and HER2 negative; HER2-positive is ER/PgR negative and HER2 positive. Because information on Ki-67 was not available in earlier cases, we used grade to capture cell proliferation, as suggested by von Minckwitz et al. [15]

The outcomes of interest were DDFS, which was calculated from the date of diagnosis to the date of first distant metastasis. To determine distant relapse events, isolated local recurrence was further followed until a metastasis event. The research protocol of this part of our study was reviewed and approved by the Ethical Committee and Institutional Review Board of FDUSCC. All patients provided written informed consent.

The basic characteristics of the patients in the three cohorts are presented in Table 1.

Table 1: Characteristics of patients from three cohorts

SEER set (1998–2006)
N = 108,814

SEER set (1990–1997)
N = 50,414

FDUSCC set (1998–2006)
N = 3,755

Characteristic

No.

%

No.

%

No.

%

Median follow-up, months

92

174

79

 IQR

64–118

116–204

60–106

Patient age, years

 ≤ 50

42,344

38.9

18,992

37.7

1,849

49.2

 > 50

66,470

61.1

31,422

62.3

1,906

50.8

Race

 White

87,162

80.4

41,788

83.1

0

0

 Black

11,591

10.7

4,139

8.2

0

0

 Others*

9,602

8.9

4,356

8.7

3,755

100.0

 Unknown

459

-

131

-

-

-

Lymph node status

 Negative

59,207

54.4

32,531

64.5

1,981

52.8

 Positive

49,607

45.6

17,883

35.5

1,774

47.2

Tumor size, mm

 0–20

65,315

60

32,832

65.1

1,623

43.2

 21–50

38,628

35.5

15,999

31.7

1,957

52.1

 ≥ 51

4,871

4.5

1,583

3.1

175

4.7

Grade

 I

15,663

14.8

5,453

12.8

102

3.3

 II

41,590

39.3

17,930

42

2,194

70

 III or UD

48,597

45.9

19,276

45.2

838

26.7

 Unknown

2,964

-

7,755

-

621

-

ER status

 Negative

26,839

27.5

11,578

27

1,192

37.5

 Positive

70,637

72.5

31,227

73

1,985

62.5

 Unknown

11,338

-

7,609

-

578

PgR status

 Negative

35,609

37.1

14,225

34.1

1,366

43.3

 Positive

60,475

62.9

27,439

65.9

1,786

56.7

 Unknown

12730

-

8,750

-

603

-

HER2 status

 Negative

N.A.

N.A.

N.A.

N.A.

2,408

75.8

 Positive

N.A.

N.A.

N.A.

N.A.

769

24.2

 Unknown

N.A.

-

N.A.

-

578

-

Radiotherapy&

 No

47,653

45.3

26,794

54.4

2,273

61.6

 Yes

57,654

54.7

22,416

45.6

1,417

38.4

 Unknown

3,507

-

1,204

-

65

-

Adjuvant chemotherapy

 No

N.A.

N.A.

N.A.

N.A.

607

16.8

 Yes

N.A.

N.A.

N.A.

N.A.

3,014

83.2

 Unknown

N.A.

-

N.A.

-

134

-

Abbreviations: ER, estrogen receptor; FDUSCC, Fudan University Shanghai Cancer Center; HER2, human epidermal growth factor receptor-2; IQR, inter-quartile range (from 75th percentile to the 25th percentile); N.A., not applicable; PgR, progesterone receptor; SEER, Surveillance, Epidemiology and End-Results registry; UD, undifferentiated

*Including American Indian, Alaska Native, Asian, and Pacific Islander

&In FDUSCC set, radiotherapy is in adjuvant setting.

Statistics

The median follow-up times were 92, 174, and 79 months for the first, second, and third cohort, respectively; we therefore reported the 8-year, 15-year, and 6-year rates of survival, respectively.

In the first set, we performed an exploratory analysis of the relationship between tumor size and the number of involved LNs (both as continuous variables). Because we could not rule out a nonlinear relationship, we regressed the number of involved LNs on tumor size using nonparametric regression based on either locally weighted scatterplot smoothing (LOWESS; the command “lowess” in Stata) [16] or Kernel-weighted local polynomial smoothing method (the command “lpoly” in Stata). [17] If a linear relationship was revealed, linear regression was employed to determine the quantitative relationship between tumor size and the observed number of involved LNs. Linear regression was performed with or without adjustments for other clinicopathologic factors. By this procedure, the expected (predicted) number of positive LNs of each case could be calculated. The continuous variable of the difference between the observed and expected number of involved LNs was used as the surrogate for MP. The nonlinear effect of continuous values of the surrogate of MP on BCSS was assessed using a B-spline transformation with evenly spaced knots. High MP was defined as a difference between the observed number and expected number (the former minus the latter) greater than or equal to 1; otherwise, the MP was considered to be low/normal.

Survival curves were constructed using the Kaplan-Meier method, and the univariate survival difference was determined by log-rank test. Survival time was estimated using life-table method. Adjusted hazard ratios (HR) with 95% confidence interval (CI) were calculated using the Cox proportional hazards models. All of the statistical analyses were performed using Stata v.10.0 (Stata Corporation, College Station, TX). Two-sided P < 0.05 was considered statistically significant.

RESULTS

A linear relationship between tumor size and the number of involved axillary LNs in the first cohort from SEER

Clinicians use the TNM staging system, which provides a description of the extent and spread of a tumor, to determine the severity of disease. Specifically, the TNM stage is determined by tumor size, LN involvement, and whether cancer has metastasized. Generally, larger tumors are positively associated with greater numbers of involved LNs. To explore the potential relationship (either linear or nonlinear) between tumor size and the number of involved LNs, the continuous number of positive LNs was regressed on the continuous tumor size (cm) using LOWESS (Figure 1A) or the local polynomial smoothing method (Figure 1B). Both methods consistently suggested a monotonic linear relationship between tumor size and LN involvement in the overall population. Subsequently, linear regression was used to predict the number of positive LNs according to tumor size. The regression formula was Y (number of involved LNs) = 0.70* × (tumor size) + 0.09 (P < 0.001 for the coefficient, P < 0.001 for the regression model), indicating that the number of involved LNs increased by 0.70 for each centimeter increase in size. In other words, for each LN involved, tumor size should increase by 1.43-cm. After adjusting for age, year of diagnosis, race, grade, and ER status, the coefficient was 1.66 (P < 0.001), i.e., when tumor size progressed by 1.52-cm, one more LN would be involved. Notably, race had no effect on the correlation between tumor size and the number of involved LNs (P = 0.29). To achieve feasible utility, we use the approximate value of 1.50 to calculate the expected number of involved LNs for a given tumor size. The agreement between the approximate value of 1.50 and the original value of 1.52 was measured, and the Cohen’s Kappa coefficient was 0.95, indicating perfect agreement. [18] Therefore, the ratio of 1.5 was an acceptable value for feasible use. We calculated the expected number of involved LNs for each case. For instance, a patient with a 1.5-cm breast tumor theoretically had 1 (1.5/1.5) positive node; when the tumor further progressed to 3.0-cm with no treatment, theoretically 2 (3.0/1.5) regional LNs should be involved.

Relation between tumor size and number of involved lymph nodes based on univariate nonparametric smoothing method using LOWESS

Figure 1: Relation between tumor size and number of involved lymph nodes based on univariate nonparametric smoothing method using LOWESS A. or local polynomial smoothing B.

The difference between the observed number and the expected number of involved LNs serves as a significant surrogate for MP

Because the ratio of tumor size to the number of involved LNs was approximate 1.5 in the overall population, we calculated the expected number according to tumor size for each case. The relationship between the continuous value for numerical difference (observed value minus expected value) and 8-year BCSS is illustrated in Figure 2A, which reveals a pattern of comparable 8-year BCSS when the value for the numerical difference is less than 1, with a peak at the value of -1; when this value is greater than 1, the increasing value is related to remarkably decreasing BCSS. Accordingly, we determined that, if a patient’s numerical difference (observed number minus expected value) was greater than or equal to 1, she was assigned to the high MP group; otherwise, the patient was assigned to the low/normal MP group.

We then studied the prognostic value of categorical MP. In univariate analysis, a survival difference was noted between the high MP and low/normal MP groups, with 8-year BCSS rates of 74.1% (95% CI, 73.4–74.7%) and 91.1% (95% CI, 90.9–91.3%), respectively (Figure 2B). The unadjusted HR for high MP was 3.20 (95% CI, 3.09–3.32) relative to low/normal MP (Table 2). In multivariate analysis using the Cox model, after adjustment for other prognostic indicators, the HR of high MP was 2.92 (95% CI, 2.80–3.03; Table 2). Moreover, the prognostic significance of this surrogate of MP persisted in each subgroup stratified by other prognostic factors (Figure 2C), even in each subgroup stratified by tumor size (Table 2).

Effect of surrogates for metastastic potential (MP) on breast cancer-specific survival (BCSS).

Figure 2: Effect of surrogates for metastastic potential (MP) on breast cancer-specific survival (BCSS). A. Relationship between the continuous values of MP (determined by the difference between the number of observed and expected involved LNs) and 8-year BCSS. We chose a value of 1 as the cutoff between high and low/normal MP. B. Kaplan-Meier curves for patients by categorical MP. C. Forest plot of multivariate analysis. The hazard ratios (HRs) with 95% confidence intervals (CI) for the patients with high MP, compared with those with low/normal MP, were assessed using the Cox regression model by adjusting for all other prognostic factors listed. The diamond denotes the HR of each subgroup. An HR > 1.0 indicates higher risk for breast cancer-specific mortality in the high MP group and vice versa.

Table 2: Univariate and multivariate analysis of survival in three cohorts

Validation of MP in the second cohort from SEER

To validate the prognostic effect of the surrogate of MP on breast cancer survival, we chose an additional 50,414 cases from the SEER registry from 1990 to 1997. The high and low/normal MP groups demonstrated 15-year BCSS rates of 51.8% (95 CI, 50.7–52.9%) and 83.1% (95% CI, 82.7–83.5%) (Figure 3A), respectively, with an unadjusted HR of 3.62 (95% CI, 3.48–3.76) and an adjusted HR of 3.25 (95% CI, 3.11–3.41) (Table 2).

Validation of MP in the third cohort from FDUSCC

In this cohort, the endpoint of survival analysis was distant metastasis. Consistent with observations in SEER, higher MP was associated with an increased risk of distant metastasis (Kaplan-Meier curves shown in Figure 3B; adjusted HR for DDFS was 2.60; 95% CI, 2.11–3.18). Because information concerning the intrinsic subtype was available in our dataset, we analyzed each subtype separately and found that MP was a prognostic factor independent of the subtype (Table 2). In luminal-A, luminal-B, TNBC, and HER2-positive subtypes, the high MP group had an adjusted HR of 3.46 (95% CI, 2.02–5.93), 2.30 (95% CI, 1.64–3.24), 4.05 (95% CI, 2.85–5.76), and 1.45 (95% CI, 1.04–2.03), respectively, when compared with low MP. The value of adjusted HR for HER2-positive group was relative lower than those in other subtypes, which might be because about 50% cases of HER2-positive cases received adjuvant trastuzumab.

DISCUSSION

In the present study, we sought to determine whether there was a clinicopathologic surrogate for the MP of breast cancer. We hypothesized that the difference between the expected number of involved LNs according to tumor size and the actual number of involved LNs might serve as an indicator for MP and thus could predict survival. Using a large-population cohort from SEER, we identified a linear relationship between tumor size and the number of involved LNs, and we subsequently established a prediction model for the expected number of involved LNs at a given tumor size. By this procedure, we classified patients into a high MP or low/normal MP group according to the difference between the observed and expected number. After adjustment for other prognostic factors, patients with high MP had a higher likelihood of death from breast cancer compared with those with low/normal MP. The prognostic effect was subsequently successfully validated in other two cohorts and proved to be independent to intrinsic subtypes. To the best of our knowledge, the influence of differences between the observed and expected numbers of involved LNs on MP has not yet been proposed.

Kaplan-Meier curves for the second cohort from SEER

Figure 3: Kaplan-Meier curves for the second cohort from SEER A. and the third cohort from FDUSCC B. stratified by categorical MP, respectively. P values for Kaplan-Meier curves in both plots are < 0.0001 by log-rank test.

The conventional view of cancer spread is that cancer gains metastatic ability through the accumulation of mutations as the tumor grows to a large size. [3] In overall population, it is an indisputable fact that size of the primary tumor is positively related to LN involvement, suggesting that MP evolves as the tumor grows. [19] However, there is increasing awareness of tumor biology in predicting patient outcome. A growing body of literature demonstrates that distant metastasis could be, to some extent, determined by the intrinsic biology of breast cancer rather than the local disease severity. [20, 21] Clinically, the abnormal relationship between tumor size and involved LNs suggests varied tumor biology. [6, 7] Currently, there are limited numbers of clinicopathologic markers to assess the MP of breast cancer. Conventional TNM staging does not work well to determine MP, and the intrinsic subtype cannot further discern MP subgroups within one subtype.

We developed a quantitative marker, the numerical difference between the observed and expected numbers of involved LNs, to reflect different levels of MP. The patients with values less than -3 (mainly T3 tumor with negative nodes) had slightly lower BCSS compared with those with value of -3 to 1. In contrast, once the value exceeded 1, the survival curve began to decrease in a monotonic pattern. For a feasible evaluation, we arbitrarily divided patients into two classes, high or low/normal MP with a cutoff value at a numerical difference of 1. It should be noted that our model might have limited predictive capability in larger tumors, as tumors with numerical differences of -5 to -3 exhibit comparably poor survival relative to tumors exhibiting values of 1 to 3. In contrast, our model is excellent for the personalized evaluation of MP in T1–2 tumors with extensive LN involvement. For instance, a patient with a 1-cm tumor and 10 positive LNs and another patient with a 2.5 cm tumor and 10 positive LNs share the same pathological TNM stage. However, the value for the numerical difference (observed number minus expected number) differs. Our model predicts better survival for the patient with the 2.5 cm tumor and 10 positive LNs. In SEER set between 1998 and 2006, we identified 12 cases with 1-cm tumors and 10 involved LNs and 70 cases with 2.5-cm tumors and 10 involved LNs. The actual survival outcomes demonstrate that 7 of 12 (58%) and 21 of 70 (30%) died from breast cancer in the former and latter groups, respectively, in accordance with the predicted MP levels.

Taken together, the surrogate marker of MP derived from tumor size and number of involved LNs provides us a simple but effective tool to determine the potential for distant metastasis. Notably, the linear relationship between tumor size and the number of involved LNs is independent of race, implying that our findings, originally from a western population, could be extrapolated to Asian and other populations. Indeed, successful validation in a Chinese population supports this assumption and warrants the worldwide use of this model.

Our study had several limitations. First, the SEER database lacks several important variables such as HER2 status, adjuvant chemotherapy, and recurrence type. We could not adjust for more confounding factors, nor could we directly investigate the effect of surrogates of MP on DDFS. Second, our study was limited to invasive ductal histology; thus, our findings cannot be extrapolated to other histology types. Moreover, our model should be used with caution for cases with large tumor sizes. The current model based on tumor size and involved number of nodes seems to have limited capability to predict the survival of very larger tumor with negative nodes disease. Although the potential for distant metastasis may not be large enough for large tumors with negative nodes, [7] the local tumor burden is heavy, and the likelihood of recurrence would be much higher, which could negatively affect survival. [22] Despite these limitations, our data represent the most robust evaluation of the effect of tumor size and the number of involved LNs on MP in breast cancer.

In conclusion, our study reveals that differences between the expected number of involved LNs according to tumor size and the observed number of involved LNs might serve as an indicator for MP, and this surrogate for MP could predict survival. We introduced a simple but effective tool to determine breast cancer MP by comprehensively understanding the relationship between tumor size and the number of involved LNs. A deeper understanding of the biology of breast cancer using common clinicopathologic factor-based tools would certainly help clinicians to predict distant metastasis and provide personalized systemic therapies.

ACKNOWLEDGMENTS

This work was supported by National Natural Science Foundation of China(81001169), Training Plan of Excellent Talents in Shanghai Municipality Health System (XYQ2013101), Research and Innovation Project of Shanghai Municipal Education Commission, Shanghai International Science and Technique Cooperation Foundation (12410707700), International S&T Cooperation Program of China (ISTCP No. 09), and the Shanghai Key Laboratory of Breast Cancer (12DZ2260100). The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author Contributions

Conceived and designed: KDY ZMS. Performed the research: KDY YZJ. Analyzed the data: KDY YZJ. Contributed reagents/materials/analysis tools: KDY YZJ ZMS. Wrote the paper: KDY YZJ ZMS. Approve the paper: KDY YZJ ZMS

CONFLICTS OF INTEREST

The authors have declared that no competing interests exist.

REFERENCES

1. Weigelt B, Peterse JL, van’t Veer LJ. Breast cancer metastasis: markers and models. Nat Rev Cancer. 2005; 5:591–602.

2. Rabbani SA, Mazar AP. Evaluating distant metastases in breast cancer: from biology to outcomes. Cancer Metastasis Rev. 2007; 26:663–674.

3. Perou CM, Sorlie T, Eisen MB, van de Rijn M, Jeffrey SS, Rees CA, Pollack JR, Ross DT, Johnsen H, Akslen LA, et al. Molecular portraits of human breast tumours. Nature. 2000; 406:747–752.

4. Sorlie T, Perou CM, Tibshirani R, Aas T, Geisler S, Johnsen H, Hastie T, Eisen MB, van de Rijn M, Jeffrey SS, et al. Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc Natl Acad Sci U S A. 2001; 98:10869–10874.

5. Sorlie T, Tibshirani R, Parker J, Hastie T, Marron JS, Nobel A, Deng S, Johnsen H, Pesich R, Geisler S, et al. Repeated observation of breast tumor subtypes in independent gene expression data sets. Proc Natl Acad Sci U S A. 2003; 100:8418–8423.

6. Wo JY, Chen K, Neville BA, Lin NU, Punglia RS. Effect of very small tumor size on cancer-specific mortality in node-positive breast cancer. J Clin Oncol. 2011; 29:2619–2627.

7. Yu KD, Jiang YZ, Chen S, Cao ZG, Wu J, Shen ZZ, Shao ZM. Effect of large tumor size on cancer-specific mortality in node-negative breast cancer. Mayo Clin Proc. 2012; 87:1171–1180.

8. Hammond ME, Hayes DF, Dowsett M, Allred DC, Hagerty KL, Badve S, Fitzgibbons PL, Francis G, Goldstein NS, Hayes M, et al. American Society of Clinical Oncology/College Of American Pathologists guideline recommendations for immunohistochemical testing of estrogen and progesterone receptors in breast cancer. J Clin Oncol. 2010; 28:2784–2795.

9. Anderson WF, Reiner AS, Matsuno RK, Pfeiffer RM. Shifting breast cancer trends in the United States. J Clin Oncol. 2007; 25:3923–3929.

10. Yu KD, Di GH, Wu J, Lu JS, Shen KW, Shen ZZ, Shao ZM. Development and trends of surgical modalities for breast cancer in China: a review of 16-year data. Ann Surg Oncol. 2007; 14:2502–2509.

11. Goldhirsch A, Glick JH, Gelber RD, Coates AS, Thurlimann B, Senn HJ. Meeting highlights: international expert consensus on the primary therapy of early breast cancer 2005. Ann Oncol. 2005; 16:1569–1583.

12. Goldhirsch A, Wood WC, Gelber RD, Coates AS, Thurlimann B, Senn HJ. Progress and promise: highlights of the international expert consensus on the primary therapy of early breast cancer 2007. Ann Oncol. 2007; 18:1133–1144.

13. Wolff AC, Hammond ME, Schwartz JN, Hagerty KL, Allred DC, Cote RJ, Dowsett M, Fitzgibbons PL, Hanna WM, Langer A, et al. American Society of Clinical Oncology/College of American Pathologists guideline recommendations for human epidermal growth factor receptor 2 testing in breast cancer. J Clin Oncol. 2007; 25:118–145.

14. Goldhirsch A, Wood WC, Coates AS, Gelber RD, Thurlimann B, Senn HJ. Strategies for subtypes—dealing with the diversity of breast cancer: highlights of the St. Gallen International Expert Consensus on the Primary Therapy of Early Breast Cancer 2011. Ann Oncol. 2011; 22:1736–1747.

15. von Minckwitz G, Untch M, Blohmer JU, Costa SD, Eidtmann H, Fasching PA, Gerber B, Eiermann W, Hilfrich J, Huober J, et al. Definition and impact of pathologic complete response on prognosis after neoadjuvant chemotherapy in various intrinsic breast cancer subtypes. J Clin Oncol. 2012; 30:1796–1804.

16. Cleveland WS. LOWESS: A program for smoothing scatterplots by robust locally weighted regression. Am Stat. 1981; 35:54.

17. Fan J. Design-adaptive nonparametric regression. J Am Stat Assoc. 1992; 87:998–1004.

18. Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977; 33:159–174.

19. Carter CL, Allen C, Henson DE. Relation of tumor size, lymph node status, and survival in 24,740 breast cancer cases. Cancer. 1989; 63:181–187.

20. Metzger-Filho O, Sun Z, Viale G, Price KN, Crivellari D, Snyder RD, Gelber RD, Castiglione-Gertsch M, Coates AS, Goldhirsch A, et al. Patterns of Recurrence and Outcome According to Breast Cancer Subtypes in Lymph Node-Negative Disease: Results From International Breast Cancer Study Group Trials VIII, and IX. J Clin Oncol. 2013; 31:3083–3090.

21. Voduc KD, Cheang MC, Tyldesley S, Gelmon K, Nielsen TO, Kennecke H. Breast cancer subtypes and the risk of local and regional relapse. J Clin Oncol. 2010; 28:1684–1691.

22. Darby S, McGale P, Correa C, Taylor C, Arriagada R, Clarke M, Cutter D, Davies C, Ewertz M, Godwin J, et al. Effect of radiotherapy after breast-conserving surgery on 10-year recurrence and 15-year breast cancer death: meta-analysis of individual patient data for 10,801 women in 17 randomised trials. Lancet. 2011; 378:1707–1716.


Creative Commons License All site content, except where otherwise noted, is licensed under a Creative Commons Attribution 4.0 License.
PII: 3647