Research Papers:

Nasopharyngeal carcinoma risk prediction via salivary detection of host and Epstein-Barr virus genetic variants

PDF |  HTML  |  Supplementary Files  |  Order a Reprint

Oncotarget. 2017; 8:95066-95074. https://doi.org/10.18632/oncotarget.11144

Metrics: PDF 895 views  |   HTML 1036 views  |   ?  

Qian Cui, Fu-Tuo Feng, Miao Xu, Wen-Sheng Liu, You-Yuan Yao, Shang-Hang Xie, Xi-Zhao Li, Zu-Lu Ye, Qi-Sheng Feng, Li-Zhen Chen, Jin-Xin Bei, Lin Feng, Qi-Hong Huang, Wei-Hua Jia, Su-Mei Cao, Ellen T. Chang, Weimin Ye, Hans-Olov Adami and Yi-Xin Zeng _


Qian Cui1, Fu-Tuo Feng1, Miao Xu1, Wen-Sheng Liu1, You-Yuan Yao1, Shang-Hang Xie1, Xi-Zhao Li1, Zu-Lu Ye1, Qi-Sheng Feng1, Li-Zhen Chen1, Jin-Xin Bei1, Lin Feng1, Qi-Hong Huang5, Wei-Hua Jia1, Su-Mei Cao1, Ellen T. Chang4, Weimin Ye1,2, Hans-Olov Adami2,3 and Yi-Xin Zeng1

1Sun Yat-sen University Cancer Center, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangzhou, P. R. China

2Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden

3Department of Epidemiology, Harvard School of Public Health, Boston, MA, USA

4Division of Epidemiology, Department of Health Research and Policy, Stanford University School of Medicine, Stanford, CA, USA

5Sihui Cancer Institute, Sihui, Guangdong, P. R. China

Correspondence to:

Yi-Xin Zeng, email: zengyx@sysucc.org.cn

Hans-Olov Adami, email: hadami@hsph.harvard.edu

Keywords: nasopharyngeal carcinoma, saliva, case-control, risk prediction

Received: April 16, 2016     Accepted: June 30, 2016     Published: August 09, 2016


Genetic susceptibility and Epstein-Barr virus (EBV) infection are important etiological factors in nasopharyngeal carcinoma (NPC). In this study, in southern China, where NPC is endemic, a single nucleotide polymorphism (SNP) in the EBV-encoded RPMS1 gene (locus 155391: G > A [G155391A]) and seven host SNPs (rs1412829, rs28421666, rs2860580, rs2894207, rs31489, rs6774494, and rs9510787) were confirmed to be significantly associated with NPC risk in 50 NPC cases versus 54 hospital-based controls with throat washing specimens and 1925 NPC cases versus 1947 hospital-based controls with buffy coat samples, respectively. We established a strategy to detect the NPC-associated EBV and host SNPs using saliva samples in a single test that is convenient, noninvasive, and cost-effective and displays good compliance. The potential utility of this strategy was tested by applying a risk prediction model integrating these EBV and host genetic variants to a population-based case-control study comprising 1026 incident NPC cases and 1148 controls. Receiver operating characteristic (ROC) curve analysis revealed an area under the curve of the NPC risk prediction model of 0.74 (95% CI: 0.71−0.76). Net reclassification improvement (NRI) analysis showed that inclusion of the EBV SNP significantly improved the discrimination ability of the model (NRI = 0.30, P < 0.001), suggesting the promising value of EBV characteristics for identifying high-risk NPC individuals in endemic areas. Taken together, we developed a promising NPC risk prediction model via noninvasive saliva sampling. This approach might serve as a convenient and effective method for screening the population with high-risk of NPC.

Nasopharyngeal carcinoma risk prediction <i>via</i> salivary detection of host and Epstein-Barr virus genetic variants | Cui | Oncotarget


Nasopharyngeal carcinoma (NPC) is a malignancy associated with Epstein-Barr virus (EBV). It is quite rare in most parts of the world, with incidence rates well below 1 per 100,000 person-years, but it is rather prevalent in southern China, southeast Asia and northern Africa [1]. Remarkably, it has a geographic distribution and strikingly high incidence in the Guangdong Province, southern China, with an incidence rates above 20 per 100,000. Moreover, NPC incidence has remained high for at least the last 30 years in Sihui County, Guangdong Province [24]. For early-stage NPC, radiotherapy is often curative [5], whilst patients diagnosed at advanced stages have poorer outcomes [68]. Thus prevention and early detection are key to reducing NPC burden.

Risk prediction models aim to identify individuals at high risk for certain diseases, such as cancer. Risk prediction models of 13 cancer sites, not including NPC, were compiled by the U.S. National Cancer Institute [9]. Risk factors such as patient demographics, behavioral characteristics, medical history, and genetic variants may be useful in identifying high-risk individuals for earlier or more frequent screening and for preventative cancer counseling [10].

Several genome-wide association studies (GWASs) of NPC recently have been published [1114]. The largest GWAS, based on subjects of southern Chinese descent, confirmed independent associations with three genetic variants in the human leukocyte antigen (HLA) region (rs2860580, rs2894207 and rs28421666 ), and also revealed three new susceptibility loci (rs1412829, rs6774494, and rs9510787) [13]. In a GWAS meta-analysis, we also identified an NPC risk locus in the CLPTM1L-TERT region on chromosome 5p15.33 (rs31489) [15]. EBV infection is an early, possibly initiating event in the development of NPC and is thought to play a critical role in transforming nasopharyngeal epithelial cells into invasive cancer cells [16, 17]. In endemic regions, high-risk EBV variants might exist that contribute to NPC risk.

In the current study, we aimed to construct a new risk prediction model of NPC based on both host and EBV genetic variants determined through saliva testing. Our objective was to develop a noninvasive and convenient method to identify individuals at high risk for NPC.


Association of a SNP in the EBV genome with NPC

Based on genotyping for RPMS1 using nested PCR and Sanger sequencing methods in study population 1, we found that the frequencies of G155391A were significantly higher in the 50 matched samples from NPC patients (84% in tumor biopsy samples and 82% in tumor washing samples) than that in 54 healthy throat washing samples (39%) (Table 1). The sex and age-adjusted ORs for NPC associated with the A vs. G genotype was 8.41 (95% CI = 3.13–22.62, P = 2.42 × 10−5) based on tumor biopsy samples, and 6.34 (95% CI = 2.52–15.98, P = 9.01 × 10−5) based on throat washing samples. These results confirmed the strong association of G155391A in RPMS1 of EBV with NPC risk in patients from Guangdong Province.

Table 1: Epstein-Barr virus RPMS1 G155391A variant in matched TB and TW samples from NPC patients and healthy donors (study population 1)



NPC patients

Healthy donors

TB (%)

TW (%)

TW (%)



42 (84%)

41 (82%)

21 (39%)*


7 (14%)

9 (18%)

29 (54%)


1 (2%)

0 (0%)

4 (7%)





Abbreviations: TB, tumor biopsy; TW, throat washing

*P value < 0.0001 for variant A vs. variant G in tumor biopsy or throat washing samples from NPC patients versus throat washing samples from healthy donors.

Association of the 7 host SNPs with NPC

All of the 7 host SNPs identified from GWAS (rs2860580, rs2894207, rs28421666, rs1412829, rs31489, rs6774494, and rs9510787) were highly significantly associated with NPC in study population 2 (Table 2). The ORs and P values were as follows: rs2860580, OR = 1.68, P = 7.62 × 10−25; rs2894207, OR = 1.57, P = 2.54 × 10−12 ; rs28421666, OR = 1.64, P = 4.16 × 10−11; rs1412829, OR = 1.23, P = 6.85 × 10−3 ; rs31489, OR = 1.26, P = 3.70 × 10−5; rs6774494, OR = 1.14, P = 7.69 × 10−3; and rs9510787, OR = 1.17, P = 9.23 × 10−4.

Table 2: Associations of host SNPs previously reported in genome-wide association studies with NPCrisk as measured in buffy coat samples (study population 2) or saliva samples (study population 3)





Buffy coat samples of 1925 cases and 1947 controls

Saliva samples of 1026 NPC cases and 1148 controls

P valueb

OR (95% CI)c

P valueb

OR (95% CI)c





7.62 ×10−25

1.68 (1.52, 1.85)

3.26 × 10−15

1.71 (1.50,1.96)





2.54 × 10−12

1.57 (1.39, 1.79 )

8.59 × 10−14

1.88 (1.59,2.22)





4.16 × 10−11

1.64 (1.41, 1.89 )

7.97 × 10−3

1.34 (1.08,1.67)





6.85 × 10−3

1.23 (1.06, 1.42)

3.75 × 10−5

1.52 (1.25,1.85)





3.70 × 10−5

1.26 (1.13,1.41)

7.62 × 10−3

1.22 (1.05,1.41)





7.69 × 10−3

1.14 (1.03, 1.25)

3.63 × 10−2

1.15 (1.01,1.30)





9.23 × 10−4

1.17 (1.07, 1.28)

5.92 × 10−2

1.13 (1.00,1.27)

Abbreviations: OR, odds ratio; CI, confidence interval; Chr.: chromosome

a. Risk allele/reference allele.

b. P values for trend were derived from Cochran-Armitage trend tests.

c. Adjusted for age and sex.

Saliva for simultaneous identification of host and EBV genetic variants

Paired saliva and buffy coat specimens of 103 controls from the study population 3 were used. The genotyping results for the seven host SNPs and one EBV variant in 103 buffy coat samples from study population 3 were compared with those from paired saliva samples. The seven host SNPs were successfully genotyped in all tested samples, and the concordance rates for the seven human SNPs in the paired saliva and buffy coat samples were all 100% (Table 3). The EBV variant was genotyped in 70.87% (73/103) of the saliva samples but only 9.71% (10/103) of the buffy coat samples (Table 3), suggesting that saliva is more enriched with EBV than buffy coats.

Table 3: Call rate and concordance rate of SNPgenotyping results using paired saliva and buffy coat samples (study population 3)


Call rate

Genotyped successfully in both saliva and buffy coat (n)

Concordance rate


Buffy coat




































EBV G155391A





Abbreviations: EBV: Epstein-Barr virus

Association of the host and EBV SNPs simultaneously identified in saliva with NPC risk

Based on simultaneous genotyping for host and EBV genetic variants in saliva samples from study population 3 (population-based case-control study), the three SNPs in the HLA region were significantly associated with NPC, i.e., rs2860580 (OR = 1.71, P = 3.26 × 10−15), rs2894207 (OR = 1.88, P = 8.59 × 10−14), and rs28421666 (OR = 1.34, P = 7.97 × 10−3) (Table 2), confirming GWAS results. Regarding the SNPs outside the HLA region, the previous associations of rs1412829 at CDKN2A/2B (OR = 1.52, P = 3.75 × 10−5), rs31489 at CLPTM1L-TERT (OR = 1.22, P = 7.62 × 10−3), and rs6774494 at MDS1-EVI1 (OR = 1.15, P = 3.63 × 10−2) with NPC were replicated (Table 2). At the TNFRSF19 locus, rs9510787 showed a borderline significant association with NPC (OR = 1.13, P = 5.92 × 10−2) consistent with previous findings [13].

G155391A was also significantly associated with NPC in study population 3 (OR = 5.74, 95% CI = 4.42–7.46, P = 2.56 × 10−39) (Table 4). Because NPC risk was also increased among those with the GA genotype, albeit nonsignificantly, we combined GA and G carriers for subsequent analyses.

Table 4: Association of variant G155391A in Epstein-Barr virus RPMS1 with NPC risk in saliva samples from population-based NPC cases and controls (study population 3)


Cases (%)

Controls (%)

P value*

OR (95% CI)*


92 (13%)

359 (45%)




610 (85%)

416 (52%)

2.56 ×10−39

5.74 (4.42,7.46)


15 (2%)

31 (4%)


1.89 (0.98,3.66)

Abbreviations: OR: odds ratio; CI: confidence interval

*Adjusted for age and sex.

Risk prediction models based on the host and EBV genetic variants

ROC curve analysis revealed that the area under the curve (AUC) of the NPC risk prediction model that included the seven host SNPs and the EBV variant was 0.74 (95% CI = 0.71–0.76). This combined model substantively improved the risk prediction performance compared with the models including only the seven host SNPs (AUC = 0.65, 95% CI = 0.62−0.68) or only the EBV variant (AUC = 0.67, 95% CI = 0.64−0.69) (Figure 1, Supplementary Table 1). Adding age and sex (which were matched) did not change the AUC. We also performed the ROC curves in males and females, the AUC were 0.74 and 0.75, respectively (Supplementary Figures).

Receiver-operating characteristic (ROC) curve analysis.

Figure 1: Receiver-operating characteristic (ROC) curve analysis. Areas under the ROC curves (AUCs) were used as measures of the predictive power of the risk-assessment models based on host and EBV genetic variants. The host genetic model factors were 7 SNPs: rs1412829, rs28421666, rs2860580, rs2894207, rs31489, rs6774494, and rs9510787. The EBV model factor was G155391A. The combined model factors were the EBV variant and the 7 host SNPs.

To determine whether the combined host and EBV model provides better classification results than the model based solely on host genetic markers, we calculated the NRI, setting the predicted risk threshold at 0.2 or 0.3 (Supplementary Table 2). Based on these calculations, the NRI was estimated to be 0.30 (P < 0.001), suggesting an improvement in prediction performance due to the inclusion of the EBV variant.


We developed and tested a risk prediction model incorporating host and EBV genetic variants as a tool to identify individuals at high risk for NPC in an endemic population. The AUC of the full model was 0.74 in the population-based case-control study and addition of the EBV variant clearly improved the performance of the risk prediction model over the use of host genetic variants alone. Moreover, we established a strategy to simultaneously detect host and EBV genetic variants using a small amount of DNA in saliva. These results represent an advance in NPC-related genotyping methodology and in the prediction of NPC development using a practical approach.

Previous studies demonstrated associations of host SNPs and certain EBV variants with risk of NPC, using DNA from buffy coat and throat washing samples, respectively. However, such genotyping information is difficult to obtain because the procedures for separately collecting and genotyping the buffy coat and throat washing samples are complex. Saliva is an efficient, cost-effective, and noninvasive approach to examining predictors or biomarkers of complex diseases, and compliance is high because self-collection of saliva is feasible, painless and non-invasive, thus could improve the donor care [18]. In addition, previous findings show that saliva may be a suitable source of human DNA for SNP analysis [19]. In many studies, infectious disease markers detected in blood are also found in saliva. For instance, viral DNA/RNA, antibodies, and viral antigens have been detected in saliva, and their levels strongly correlate with those in blood samples [20, 21]. We speculated that saliva might be a good source of both human and EBV DNA [22, 23]. Indeed, our results showed high concordance rates of host and EBV genotypes between blood and saliva, and high call rates of both host and EBV genetic variants in saliva samples. Therefore, our findings demonstrate that saliva is an appropriate material for simultaneously identifying host and EBV genetic variants, and that saliva sampling might be convenient for large-scale population risk prediction of NPC.

Eight of 103 paired samples were successfully examined using both saliva and buffy coat samples, and 6 of the samples displayed no difference; thus, the genotyping results for saliva did not completely correspond to those for buffy coat. The number of EBV genomes was previously estimated as a median value of only 7 using 106 B cells from the peripheral blood of healthy individuals [24]. Evidence also has indicated that the distribution and interchange of viral strains among peripheral blood mononuclear cells, plasma, and saliva are complex [25].

We confirmed the contributions of host genetic factors, particularly involving the HLA locus, to NPC etiology. Our independent analyses of the G155391A variant in EBV RPMS1 in tumor biopsy and throat washing from study population 1 and in saliva from study population 3 also consistently revealed its strong association with NPC. Thus, G155391A in EBV RPMS1 could serve as a valuable indicator for high risk of NPC in southern China. Furthermore, our results support that the EBV strain in tumor biopsy and throat washing may have the same origin. It has been demonstrated that EBV within intact oropharyngeal epithelium was derived from EBV-infected salivary cells through cell-to-cell contact [26]. Therefore, examining EBV strain in throat washing or saliva samples could replace the need for doing so in tumor biopsy.

Cancer risk prediction models may identify individuals at high risk who could benefit from targeted interventions. For instance, based on a risk prediction model [27, 28], an interactive risk assessment tool was designed to estimate a woman’s risk of developing invasive breast cancer (http://www.cancer.gov/bcrisktool/) to help guide prevention strategies [29]. In addition, because human papillomaviruses 16 and 18 cause most cases of cervical cancer [30, 31], vaccines are now available to prevent initial infection with these strains. Multiple etiological factors are believed to be involved in NPC development, including genetic susceptibility, EBV infection, and diet [32, 33]. Thus, we could potentially utilize our risk prediction model in regions with high NPC incidence to identify high-risk groups for prevention or screening strategies specific to this population.

Although EBV antibody and DNA levels in serum are considered sufficiently sensitive and specific for NPC screening [34], the EBV antibody level depends on the host immune response to EBV infection and changes over time. The first NPC predictive genetic model did not include EBV antibody titers as predictors [35]. Another study showed that combining the most significant host SNPs with EBV IgA antibody status, which is presently used as a biomarker for NPC, did not improve the AUC estimate for NPC diagnosis [36]. Our results also showed that detection of circulating antibodies (VCA and EBNA1 IgA) can identify NPC patients with good performance and no significant differences in EBV antibody levels in subgroups of subjects defined by different EBV genetic variants (data not shown). EBV seromarkers might appear to be useful only for early detection of NPC, whereas other risk predictors may be useful for long-term NPC risk prediction. A more frequent antibody-based screening could be applied among individuals who screen positive by the long-term prediction of DNA-based tests.

More efforts are required to further improve our NPC risk prediction model. Adding informative factors such as family history might improve the predictive ability, and a long-term prospective cohort study is needed to validate any risk prediction model. Given recent developments in the sequencing of the EBV genome, a more valuable approach would be to derive whole genome sequences to identify the possibility of more significant EBV strain variation contributing to NPC risk. Hence, a large prospective study including environmental factors, host genetic factors, EBV characteristics (EBV antibodies, EBV DNA levels and EBV variations), family history and their interactions would be expected to lead to the construction of a more comprehensive risk prediction model.

In summary, we established a strategy to determine host and NPC-associated EBV genetic variants via noninvasive saliva sampling. We further developed a promising NPC risk prediction model integrating host and EBV genetic variants. This approach might provide a convenient and effective method to identify individuals who are at high risk for NPC development, thus reducing the NPC burden in endemic populations.


Study population 1: EBV genotyping

Fifty-four histologically confirmed nasopharyngeal carcinoma (NPC) patients with individually matched samples from tumor biopsy (fresh or paraffin embedded) and throat washing were diagnosed at and enrolled from the Sun Yat-sen University Cancer Center in Guangzhou, China, between 2005 and 2007. The selection criteria for patients were Cantonese, not belonging to NPC family, without any close relative being NPC patient and newly diagnosed.

Sixty healthy controls with throat washing samples were enrolled from the physical examination center at the First Affiliated Hospital of Sun Yat-sen University. The selection criteria for control subjects included no individual history of cancer and matched to NPC cases by age, gender, residential region and ethnics.

Study population 2: Host genotyping

Buffy coat samples were collected from 2023 histologically confirmed NPC cases treated at the Sun Yat-sen University Cancer Center between 2005 and 2010 and from 2009 healthy controls free of NPC, other cancers, or infectious diseases identified from the physical examination center at the First Affiliated Hospital of Sun Yat-sen University. All subjects were residing in the province of Guangdong. The characteristics of these subjects are summarized in Supplementary Table 3.

Study population 3: Risk prediction model testing

Between 2010 and 2014 in Zhaoqing, Guangdong Province, 1026 population-based NPC cases were identified and enrolled through a rapid case ascertainment system involving a network of physicians in the area who diagnose or treat NPC. Through random selection from total population registries, 1148 population-based control subjects were identified and enrolled, with frequency matching to the cases by age, sex, and area of residence. Participation rates were approximately 81% among cases and 83% among controls.

Saliva samples were collected into vials containing an equal volume of prepared lysis buffer (50 mM Tris, pH 8.0, 50 mM EDTA, 50 mM sucrose, 100 mM NaCl, 1% SDS) as previously described [19]. Venous blood was collected simultaneously. Saliva specimens were provided by approximately 93% of participating cases and 89% of participating controls, while blood specimens were provided by approximately 98% and 83% of cases and controls, respectively. Each participant completed a detailed interview conducted face-to-face by a trained interviewer employing a structured questionnaire. The characteristics of these subjects are summarized in Supplementary Table 4.

Ethics statement

Each subject provided informed consent, and the institutional review boards of all participating institutions approved this collaborative study.

DNA extraction

DNA was isolated from tumor biopsy, throat washing, and buffy coat samples using a commercial DNA extraction kit (Qiagen, Germany). DNA from 200 μl of saliva or buffy coat samples in the population-based case-control study (study population 3) was automatically extracted using Chemagic STAR (Hamilton Robotics, Sweden) according to the manufacturer’s instructions. DNA concentration greater than 10 ng/μl was required.

Nested PCR and Sanger sequencing

To detect specific EBV genomic variants linked to NPC development, we previously sequenced several EBV-encoded genes, including EBNA1, LMP1, and the BamHI-A rightward transcripts (BARTs) family, in NPC cases and controls from Guangdong Province [37]. The most striking finding was a significant association between a single nucleotide polymorphism (SNP) in the EBV-encoded RPMS1 gene (locus 155391: G > A, referred to here as G155391A, resulting in the alternation of Asp to Asn) and NPC risk. RPMS1 encodes a major part of mRNA of the BARTs family and is regularly transcribed in NPC tissues [38]. Thus, G155391A in RPMS1 of EBV may represent a specific EBV variant in the NPC-endemic region of southern China that could serve as an indicator of high risk of NPC in this population.

For the present study, we interrogated the trend of nucleotide polymorphisms in RPMS1 were same in EBV DNA samples from tumor biopsy and throat washing. The RPMS1 fragment was amplified by nested polymerase chain reaction (PCR) using two primer pairs (RPMS1-1/RPMS1-2 and RPMS1-3/RPMS1-4; Supplementary Table 5) and PCR Master Mix (Promega). DNA from the EBV-positive C666 cell line and the absence of DNA were used as the positive and negative controls, respectively. The sequences of the PCR products were identified by Sanger sequencing.


Seven host SNPs previously identified in GWASs (rs2860580, rs2894207, rs28421666, rs9510787, rs6774494, rs1412829, and rs31489), together with 264 405 autosomal SNPs mostly enriched in the exonic regions, were genotyped in buffy coat samples from study population 2 using customized Illumina Infinium HumanExome BeadChips. For quality control of the exome-wide data, the SNP filtering criteria included remaining singleton autosomal SNPs, minor allele frequency above 1%, genotyping call rate above 98%, and no deviation from Hardy-Weinberg equilibrium (P ≥ 0.05 in controls); the sample filtering criteria included call rate above 98% for SNPs per sample. After further examination for relatedness and population structure, 1925 cases and 1947 controls were retained for association analysis of the seven GWAS-identified SNPs.

Simultaneous genotyping for the seven human SNPs and the single EBV variant was performed in saliva samples from study population 3 using the Agena Bioscience MassArray platform. The primers used are shown in Supplementary Table 6. Eighty-two samples were re-genotyped due to a bad spectrum or no allele information. After quality validation, based on call rate (100%) and no deviation from Hardy-Weinberg equilibrium (P ≥ 0.05 in controls), 1026 cases and 1148 controls were included for further analysis.

Statistical analysis

We used logistic regression analysis to estimate odds ratios (ORs) and 95% confidence intervals (CIs) for associations with NPC risk adjusting for age and sex, assuming an additive model for host genetic variants. For comparisons across SNPs, we used the ORs for the high-risk alleles rather than minor alleles. P values for trend were derived from Cochran-Armitage trend tests. All reported P values are two-sided.

Using a logistic regression method, we constructed risk prediction models based on the seven host genetic variants, the single EBV variant, or all eight variants in saliva samples from study population 3 in which the EBV variant was successfully detected. Receiver operating characteristic (ROC) curve analysis was used to evaluate the performance of the model.

To quantify discriminatory improvement for the model including the EBV variant and the seven host variants compared with the model including only the host variants, we computed the net reclassification improvement (NRI). The predicted risk threshold was set at 0.2 or 0.3, and we used a reclassification table to evaluate how accurately the two models assigned the individuals to the low, intermediate, and high-risk categories.

The statistical analyses were performed using SPSS (version 16.0) and R (version 2.14.0).


We thanked all the participants in our study and appreciated the contributions of the staffs from the biobank at the Sun Yat-sen University Cancer Center.


The authors declare no conflicts of interest.


This work was supported by the National Basic Research Program of China (2011CB504302), the National High Technology Research and Development Program of China (2012AA02A501), the US National Institutes of Health (R01CA115873-01), the National Natural Science Foundation of China (81222035) and National Natural Science Foundation of China (81430059).


1. Tang LL, Chen WQ, Xue WQ, He YQ, Zheng RS, Zeng YX, Jia WH. Global trends in incidence and mortality of nasopharyngeal carcinoma. Cancer Lett. 2016; 374:22–30.

2. Wei KR, Zheng RS, Zhang SW, Liang ZH, Ou ZX, Chen WQ. Nasopharyngeal carcinoma incidence and mortality in China in 2010. Chin J Cancer. 2014; 33:381–387.

3. Jia WH, Huang QH, Liao J, Ye W, Shugart YY, Liu Q, Chen LZ, Li YH, Lin X, Wen FL, Adami HO, Zeng Y, Zeng YX. Trends in incidence and mortality of nasopharyngeal carcinoma over a 20-25 year period (1978/1983-2002) in Sihui and Cangwu counties in southern China. BMC Cancer. 2006; 6:178.

4. Zhang LF, Li YH, Xie SH, Ling W, Chen SH, Liu Q, Huang QH, Cao SM. Incidence trend of nasopharyngeal carcinoma from 1987 to 2011 in Sihui County, Guangdong Province, South China: an age-period-cohort analysis. Chin J Cancer. 2015; 34:15.

5. Su SF, Han F, Zhao C, Chen CY, Xiao WW, Li JX, Lu TX. Long-term outcomes of early-stage nasopharyngeal carcinoma patients treated with intensity-modulated radiotherapy alone. Int J Radiat Oncol Biol Phys. 2012; 82:327–333.

6. Tham IW, Hee SW, Yeo RM, Salleh PB, Lee J, Tan TW, Fong KW, Chua ET, Wee JT. Treatment of nasopharyngeal carcinoma using intensity-modulated radiotherapy-the national cancer centre singapore experience. Int J Radiat Oncol Biol Phys. 2009; 75:1481–1486.

7. Wu F, Wang R, Lu H, Wei B, Feng G, Li G, Liu M, Yan H, Zhu J, Zhang Y, Hu K. Concurrent chemoradiotherapy in locoregionally advanced nasopharyngeal carcinoma: treatment outcomes of a prospective, multicentric clinical study. Radiother Oncol. 2014; 112:106–111.

8. Chen MY, Jiang R, Guo L, Zou X, Liu Q, Sun R, Qiu F, Xia ZJ, Huang HQ, Zhang L, Hong MH, Mai HQ, Qian CN. Locoregional radiotherapy in patients with distant metastases of nasopharyngeal carcinoma at diagnosis. Chin J Cancer. 2013; 32:604–613.

9. http://epi.grants.cancer.gov/cancer_risk_prediction/#overview.

10. Xiao Q, Liu ZJ, Tao S, Sun YM, Jiang D, Li HL, Chen H, Liu X, Lapin B, Wang CH, Zheng SL, Xu J, Wu ZY. Risk prediction for sporadic Alzheimer’s disease using genetic risk score in the Han Chinese population. Oncotarget. 2015; 6:36955–36964. doi: 10.18632/oncotarget.6271.

11. Ng CC, Yew PY, Puah SM, Krishnan G, Yap LF, Teo SH, Lim PV, Govindaraju S, Ratnavelu K, Sam CK, Takahashi A, Kubo M, Kamatani N, et al. A genome-wide association study identifies ITGA9 conferring risk of nasopharyngeal carcinoma. J Hum Genet. 2009; 54:392–397.

12. Tse KP, Su WH, Chang KP, Tsang NM, Yu CJ, Tang P, See LC, Hsueh C, Yang ML, Hao SP, Li HY, Wang MH, Liao LP, et al. Genome-wide association study reveals multiple nasopharyngeal carcinoma-associated loci within the HLA region at chromosome 6p21.3. Am J Hum Genet. 2009; 85:194–203.

13. Bei JX, Li Y, Jia WH, Feng BJ, Zhou G, Chen LZ, Feng QS, Low HQ, Zhang H, He F, Tai ES, Kang T, Liu ET, et al. A genome-wide association study of nasopharyngeal carcinoma identifies three new susceptibility loci. Nat Genet. 2010; 42:599–603.

14. Hildesheim A, Wang CP. Genetic predisposition factors and nasopharyngeal carcinoma risk: a review of epidemiological association studies, 2000-2011: Rosetta Stone for NPC: genetics, viral infection, and other environmental factors. Semin Cancer Biol. 2012; 22:107–116.

15. Bei JX, Su WH, Ng CC, Yu K, Chin YM, Lou PJ, Hsu WL, McKay JD, Chen CJ, Chang YSD, Chen LZ, Chen MY, Cui Q, et al. A GWAS meta-analysis and replication study identifies a novel locus within CLPTM1L/TERT associated with nasopharyngeal carcinoma in individuals of Chinese ancestry. Cancer Epidemiol Biomarkers Prev. 2015.

16. Pathmanathan R, Prasad U, Sadler R, Flynn K, Raab-Traub N. Clonal proliferations of cells infected with Epstein-Barr virus in preinvasive lesions related to nasopharyngeal carcinoma. N Engl J Med. 1995; 333:693–698.

17. Lo AK, Huang DP, Lo KW, Chui YL, Li HM, Pang JC, Tsao SW. Phenotypic alterations induced by the Hong Kong-prevalent Epstein-Barr virus-encoded LMP1 variant (2117-LMP1) in nasopharyngeal epithelial cells. Int J Cancer. 2004; 109: 919–925.

18. Xie Z, Chen X, Li J, Guo Y, Li H, Pan X, Jiang J, Liu H, Wu B. Salivary HOTAIR and PVT1 as novel biomarkers for early pancreatic cancer. Oncotarget. 2016; 7:25408–19. doi: 10.18632/oncotarget.8323.

19. Quinque D, Kittler R, Kayser M, Stoneking M, Nasidze I. Evaluation of saliva as a source of human DNA for population and association studies. Anal Biochem. 2006; 353:272–277.

20. Yoshizawa JM, Schafer CA, Schafer JJ, Farrell JJ, Paster BJ, Wong DT. Salivary biomarkers: toward future clinical and diagnostic utilities. Clin Microbiol Rev. 2013; 26:781–791.

21. Mazzanti CM, Lessi F, Armogida I, Zavaglia K, Franceschi S, Al Hamad M, Roncella M, Ghilli M, Boldrini A, Aretini P, Fanelli G, Marchetti I, Scatena C, et al. Human saliva as route of inter-human infection for mouse mammary tumor virus. Oncotarget. 2015; 6:18355–18363. doi: 10.18632/oncotarget.4567.

22. Goode MR, Cheong SY, Li N, Ray WC, Bartlett CW. Collection and extraction of saliva DNA for next generation sequencing. J Vis Exp. 2014;

23. Palser AL, Grayson NE, White RE, Corton C, Correia S, Ba Abdullah MM, Watson SJ, Cotten M, Arrand JR, Murray PG, Allday MJ, Rickinson AB, Young LS, et al. Genome diversity of Epstein-Barr virus from multiple tumor types and normal infection. J Virol. 2015; 89:5222–5237.

24. Wagner HJ, Kluter H, Kruse A, Bucsky P, Hornef M, Kirchner H. Determination of the number of Epstein-Barr virus genomes in whole blood and red cell concentrates. Transfus Med. 1995; 5:297–302.

25. Kwok H, Chan KW, Chan KH, Chiang AK. Distribution, Persistence and Interchange of Epstein-Barr Virus Strains among PBMC, Plasma and Saliva of Primary Infection Subjects. PLoS One. 2015; 10:e0120710.

26. Tugizov SM, Berline JW, Palefsky JM. Epstein-Barr virus infection of polarized tongue and nasopharyngeal epithelial cells. Nat Med. 2003; 9:307–314.

27. Gail MH, Brinton LA, Byar DP, Corle DK, Green SB, Schairer C, Mulvihill JJ. Projecting individualized probabilities of developing breast cancer for white females who are being examined annually. J Natl Cancer Inst. 1989; 81:1879–1886.

28. Projecting breast cancer probabilities. J Natl Cancer Inst. 1990; 82:879–882.

29. Vogel VG, Costantino JP, Wickerham DL, Cronin WM, Cecchini RS, Atkins JN, Bevers TB, Fehrenbacher L, Pajo ER Jr, Wade JL 3rd, Robidoux A, Margolese RG, James J, et al. Effects of tamoxifen vs raloxifene on the risk of developing invasive breast cancer and other disease outcomes: the NSABP Study of Tamoxifen and Raloxifene (STAR) P-2 trial. JAMA. 2006; 295:2727–2741.

30. Durst M, Gissmann L, Ikenberg H, zur Hausen H. A papillomavirus DNA from a cervical carcinoma and its prevalence in cancer biopsy samples from different geographic regions. Proc Natl Acad Sci U S A. 1983; 80:3812–3815.

31. Boshart M, Gissmann L, Ikenberg H, Kleinheinz A, Scheurlen W, zur Hausen H. A new type of papillomavirus DNA, its presence in genital cancer biopsies and in cell lines derived from cervical cancer. EMBO J. 1984; 3:1151–1157.

32. Gallicchio L, Matanoski G, Tao XG, Chen L, Lam TK, Boyd K, Robinson KA, Balick L, Mickelson S, Caulfield LE, Herman JG, Guallar E, Alberg AJ. Adulthood consumption of preserved and nonpreserved vegetables and the risk of nasopharyngeal carcinoma: a systematic review. Int J Cancer. 2006; 119:1125–1135.

33. Jia WH, Luo XY, Feng BJ, Ruan HL, Bei JX, Liu WS, Qin HD, Feng QS, Chen LZ, Yao SY, Zeng YX. Traditional Cantonese diet and nasopharyngeal carcinoma risk: a large-scale case-control study in Guangdong, China. BMC Cancer. 2010; 10:446.

34. Liu Z, Ji MF, Huang QH, Fang F, Liu Q, Jia WH, Guo X, Xie SH, Chen F, Liu Y, Mo HY, Liu WL, Yu YL, et al. Two Epstein-Barr virus-related serologic antibody tests in nasopharyngeal carcinoma screening: results from the initial phase of a cluster randomized controlled trial in Southern China. Am J Epidemiol. 2013; 177:242–250.

35. Ruan HL, Qin HD, Shugart YY, Bei JX, Luo FT, Zeng YX, Jia WH. Developing genetic epidemiological models to predict risk for nasopharyngeal carcinoma in high-risk population of China. PLoS One. 2013; 8:e56128.

36. Guo X, Winkler CA, Li J, Guan L, Tang M, Liao J, Deng H, de The G, Zeng Y, O’Brien SJ. Evaluation and integration of genetic signature for prediction risk of nasopharyngeal carcinoma in Southern China. Biomed Res Int. 2014; 2014:434072.

37. Feng FT, Cui Q, Liu WS, Guo YM, Feng QS, Chen LZ, Xu M, Luo B, Li DJ, Hu LF, Middeldorp JM, Ramayanti O, Tao Q, et al. A single nucleotide polymorphism in the Epstein-Barr virus genome is strongly associated with a high risk of nasopharyngeal carcinoma. Chin J Cancer. 2015; 34:563–572.

38. Li A, Zhang XS, Jiang JH, Wang HH, Liu XQ, Pan ZG, Zeng YX. Transcriptional expression of RPMS1 in nasopharyngeal carcinoma and its oncogenic potential. Cell Cycle. 2005; 4:304–309.

Creative Commons License All site content, except where otherwise noted, is licensed under a Creative Commons Attribution 3.0 License.
PII: 11144