Research Papers:

Epigenome-wide association study in hepatocellular carcinoma: Identification of stochastic epigenetic mutations through an innovative statistical approach

PDF |  HTML  |  Supplementary Files  |  How to cite

Oncotarget. 2017; 8:41890-41902. https://doi.org/10.18632/oncotarget.17462

Metrics: PDF 2411 views  |   HTML 3966 views  |   ?  

Davide Gentilini, Stefania Scala, Germano Gaudenzi, Paolo Garagnani, Miriam Capri, Matteo Cescon, Gian Luca Grazi, Maria Giulia Bacalini, Serena Pisoni, Alessandra Dicitore, Luisa Circelli, Sara Santagata, Francesco Izzo, Anna Maria Di Blasio, Luca Persani, Claudio Franceschi and Giovanni Vitale _


Davide Gentilini1, Stefania Scala2, Germano Gaudenzi3, Paolo Garagnani4,5, Miriam Capri4,5, Matteo Cescon6, Gian Luca Grazi7, Maria Giulia Bacalini8, Serena Pisoni1, Alessandra Dicitore1, Luisa Circelli9, Sara Santagata2, Francesco Izzo10, Anna Maria Di Blasio1, Luca Persani1,3, Claudio Franceschi4,5,8 and Giovanni Vitale1,3

1Istituto Auxologico Italiano IRCCS, Cusano Milanino, Milan, Italy

2Functional Genomics, Istituto Nazionale per lo Studio e la Cura dei Tumori, IRCCS Fondazione “G. Pascale”, Napoli, Italy

3Department of Clinical Sciences and Community Health (DISCCO), University of Milan, Milan, Italy

4Department of Experimental, Diagnostic and Specialty Medicine, Alma Mater Studiorum, University of Bologna, Bologna, Italy

5Interdepartmental Center "L. Galvani", University of Bologna, Bologna, Italy

6DIMEC-Department of General Surgery and Medicine Sciences, S. Orsola-Malpighi Hospital, Bologna, Italy.

7Regina Elena National Cancer Institute Via Elio Chianesi 53, Rome, Italy

8IRCCS Institute of Neurological Sciences, Bologna, Italy

9Department of Experimental Oncology, Laboratory of Molecular Biology and Viral Oncology, Istituto Nazionale per lo Studio e la Cura dei Tumori, Fondazione “G. Pascale”, Napoli, Italy

10Department of Surgical Oncology, Abdominal and Hepatobiliary Unit, Istituto Nazionale per lo Studio e la Cura dei Tumori, IRCCS Fondazione “ G. Pascale”, Napoli, Italy

Correspondence to:

Giovanni Vitale, email: [email protected]

Keywords: hepatocellular carcinoma, stochastic epigenetic mutation, epigenetics, DNA methylation, genome-wide

Received: January 27, 2017     Accepted: April 15, 2017     Published: April 27, 2017


Hepatocellular carcinoma (HCC) results from accumulation of both genetic and epigenetic alterations. We investigated the genome-wide DNA methylation profile in 69 pairs of HCC and adjacent non-cancerous liver tissues using the Infinium HumanMethylation 450K BeadChip array. An innovative analytical approach has been adopted to identify Stochastic Epigenetic Mutations (SEMs) in HCC.

HCC and peritumoral tissues showed a different epigenetic profile, mainly characterized by loss of DNA methylation in HCC. Total number of SEMs was significantly higher in HCC tumor (median: 77,370) than in peritumoral (median: 5,656) tissues and correlated with tumor grade. A significant positive association emerged between SEMs measured in peritumoral tissue and hepatitis B and/or C virus infection status. A restricted number of SEMs resulted to be shared by more than 90% of HCC tumor samples and never present in peritumoral tissue. This analysis allowed the identification of four epigenetically regulated candidate genes (AJAP1, ADARB2, PTPRN2, SDK1), potentially involved in the pathogenesis of HCC.

In conclusion, HCC showed a methylation profile completely deregulated and very far from adjacent non-cancerous liver tissues. The SEM analysis provided valuable clues for further investigations in understanding the process of tumorigenesis in HCC.


Hepatocellular carcinoma (HCC) is a common cancers and the second leading cause of cancer-related death among males in the world [1]. Indeed, advanced HCC not eligible for local therapies has limited chances of cure [2], notwithstanding a massive effort in identifying novel therapeutic targets and predictive markers [35]. The pathogenesis of HCC is multifactorial and includes several extrinsic and intrinsic factors. Chronic liver damage secondary to viral hepatitis, alcohol abuse or non-alcoholic steatohepatitis, represents the most critical determinant in the development of this tumor. Several recent studies have provided relevant insights in this field through the characterization of HCC by using whole genome/exome sequencing, genome-wide methylation arrays and RNA sequencing [68]. Although it is known that HCC results from accumulation of both genetic and epigenetic alterations, the molecular pathogenesis of this tumor is still not completely understood. Indeed, HCC is a complex and heterogeneous disease [9].

DNA methylation is one of the most widely characterized epigenetic modifications implicated in cancer development. Several studies evaluated the DNA methylation profile in HCC through genome-wide arrays [1014]. Most of these reports compared the mean methylation level calculated for each CpG site in human HCC with that in adjacent non-cancerous liver tissue, and correlated the tumor methylation profile with clinical and pathological parameters in order to obtain a deeper understanding of the pathogenesis of this tumor. While this approach allows to identify epigenetic alterations shared by a group of subjects and potentially associated with their phenotype, rare or stochastic epigenetic alterations, that are not shared among subjects and minimally affect the mean methylation level of the group, remain unexplored, although they may play a role in hepatocarcinogenesis. In addition, an analytical strategy based on comparisons of mean methylation values does not allow to process data obtained from a single subject [15].

Epigenetic mutations, or the more common term “epimutations” are heritable changes in gene activity not associated with a DNA mutation but rather with gain or loss of DNA methylation or other inheritable modifications of chromatin [16]. We have recently developed an analytical approach able to identify stochastic epigenetic mutations (SEMs) not shared among subjects. We applied this peculiar analytical strategy to investigate the relations between stochastic epigenetic alterations and aging [15].

In the present study, we investigated the genome-wide DNA methylation profile in HCC, adjacent non-cancerous liver tissue and normal liver tissue. A new analytical approach has been adopted to identify epimutations potentially involved in the development of HCC.


Clinical and pathological characteristics of HCC patients

Clinical and pathological characteristics of enrolled HCC patients are described in Table 1. The mean age at HCC diagnosis was 65.9 ± 8.8 years. Most patients were male (76.8%) and positive for HCV (59.5%). Among the 69 HCC patients, 87% have pathologically defined cirrhosis and 85.6% have tumors grade II.

Table 1: Clinical and pathological characteristics of 69 HCC patients


Number of cases (%)


< 60 years

17 (24.6)

> 60 years

52 (75.4)



53 (76.8)


16 (23.2)

Viral infection

No Viral Infection

15 (21.8)


9 (13)


41 (59.5)


4 (5.7)

Alcoholic liver disease


3 (4.4)


66 (95.6)

Non-alcoholic steatohepatitis


8 (11.5)


61 (88.5)



60 (87)


9 (13)

Child-Pugh class


57 (82.6)


12 (17.4)

Tumor Grade


3 (4.2)


59 (85.6)


7 (10.2)

Tumor Number


49 (71)


11 (15.9)


4 (5.7)


5 (7.4)

The complexity of variability of all the phenotypic traits has been reduced using Multiple Factor Analysis (Supplementary Figure 1). The first and second dimensions explained 18.29% and 16.63%, respectively, of the total variability of phenotypic traits.

Genome-wide methylation profiles in HCC tumor and adjacent non-cancerous tissues

A dimension reduction approach was used to visually inspect the dataset for signals in the methylation values. The Multi Dimensional Scaling Analysis was performed considering methylation signals from all CpG sites (Figure 1A) and in several genomic regions (tiling, genes, promoters and CpG Islands) (Figure 1B). This analysis showed a dramatic difference in methylation levels between HCC and peritumoral tissues. The Figure 2 shows hierarchical clustering of samples based on the top 1,000 CpG loci differentially methylated between HCC tumor and adjacent tissues. Excellent separation of tumor and peritumoral non-cancerous tissues was observed with only 3 misclassifications.

Exploratory analysis of methylation data.

Figure 1: Exploratory analysis of methylation data. Dimension reduction obtained by Multi Dimensional Scaling (MDS) is used to visually resume the complexity of methylation profile in HCC and peritumoral tissues at site level (A) and at regional level (B). Plots have been obtained using the first two dimensions.

Hierarchical cluster analysis.

Figure 2: Hierarchical cluster analysis. Hierarchical cluster analysis of the top 1000 significantly differentially methylated cpG sites between 69 pairs of HCC and adjacent non-cancerous liver tissues.

Paired differential methylation analysis between HCC vs peritumoral tissues was computed across the entire epigenome and selecting specific regions. The overall analysis identified 302,133 CpG sites out of 473,929 across the entire epigenome that significantly differed in methylation levels. Among all significant CpG sites, 36,615 probes resulted to have a delta greater than |0.30|. Differential methylation at the region level was computed based on a variety of metrix and combining methylation values inside tiling, genes, promoters and CpG Islands. Considering all the subgroups, the number of regions that resulted significantly different between the two sets and the number of the regions included in the array were: 105,827 and 131,743 for tiling, respectively; 25,572 and 29,592 for genes, respectively; 22,827 and 29,740 for promoters, respectively; 17,534 and 25,837 for CpG islands, respectively. Mean DNA methylation differences are represented as a volcano plot in Figure 3. Scatterplots for differential methylation at site and regional levels are shown in Supplementary Figure 2.

Differential methylation analysis of Illumina 450K data.

Figure 3: Differential methylation analysis of Illumina 450K data. Volcano plot showing differential methylation analysis of 69 paired HCC tumor and adjacent non-cancerous liver tissues at site level (A) and at regional level (B). The x-axis represents the mean DNA methylation difference (adjacent non-cancerous liver tissue – HCC), while the y-axis shows the – log10 of the p value for each CpG site (A) or genomic region (B).

These data indicate that aberrant DNA methylation in HCC is a very common event, covering the entire epigenome and mainly characterized by a loss of methylation in the tumor tissue. Considering that methylation differences between the two groups resulted extremely strong and involved the whole epigenome, a prioritization of significant results has been performed. The top 10 hypomethylated or hypermethylated genes extracted from this analysis and ranked by statistical significance are reported in Table 2.

Table 2: Top 10 ranked hypomethylated or hypermethylated genes in HCC tumor tissue obtained after prioritization of all differentially methylated probes

Stochastic epigenetic mutation analysis

For each sample, as described in the Material and Method section, the total number of SEMs was calculated using tissue from normal liver as reference. The number of SEMs resulted to be statistically significant higher in HCC (median: 77,370, IQR: 58,260–92,220) compared to adjacent non-cancerous tissue (median: 5,656, IQR: 2,068–7,431, p = 1.8 × 10-11) and normal liver tissue extracted from healthy subjects (median: 1,308, IQR: 854-2,148, p = 2 × 10-16) (Figure 4). Chromosomal distribution of SEMs in HCC samples and adjacent non-cancerous tissue has been reported in Supplementary Figure 3.

Exploration of SEMs in HCC.

Figure 4: Exploration of SEMs in HCC. Box Plots show distribution of SEMs number in normal, peritumoral and HCC tumor tissues.

Associations between SEMs and clinical and pathological characteristics

A multivariate regression analysis was performed in order to identify clinical and pathological traits potentially associated with SEMs. The number of SEMs identified in HCC and adjacent non-tumor tissue has been used as dependent variable, while specific phenotypic traits (viral infections, alcoholic liver disease, non-alcoholic steatohepatitis, cirrhosis, Child-Pugh class, tumor grade, tumor number and Multiple Factor Analysis dimensions) have been adopted as independent variables. All the multiple regression models have been adjusted for age.

Viral infection resulted significantly associated to the number of SEMs observed in peritumoral tissue. In the multivariate regression hepatitis B virus (HBV), hepatitis C virus (HCV) and HBV+HCV infection status resulted independently associated to the number of SEMs observed in peritumoral tissue (p = 8 × 10-5, p = 0.002 and p = 0.027, respectively) and, less pronounced, in tumor (p = 0.032, p = 0.034 and p = 0.075, respectively). Figure 5 shows the number of SEMs observed in peritumoral tissue (Figure 5A) and in HCC tumor tissue (Figure 5B) according to the viral infection status.

Number of SEMs is associated to viral infection.

Figure 5: Number of SEMs is associated to viral infection. Box Plots show distribution of SEMs in peritumoral tissue (A) and HCC tumor tissue (B) in relation to the HBV/HCV infection status.

Tumor grade resulted significantly associated to the number of SEMs observed in HCC tumor tissue (p = 0.02), but not in peritumoral tissue (p = 0.40). Figure 6 shows the number of SEMs observed in peritumoral tissue (Figure 6A) and in HCC tumor tissue (Figure 6B) according to the tumor grade.

Number of SEMs is associated to tumor grade.

Figure 6: Number of SEMs is associated to tumor grade. Box Plots showing distribution of SEMs in peritumoral tissue (A) and tumoral tissue (B) in relation to the tumor grade.

Complexity of clinical traits has been initially reduced using the Multiple Factor Analysis. The correlogram in Supplementary Figure 1 shows clinical traits associated to each dimension. The multiple regression analysis indicated that dimensions 1 (p = 2 × 10-7) and 5 (p = 1.6 × 10-5) resulted associated to SEMs observed in HCC tumor, while only dimension 3 (mainly collecting variability of age, HBV, HBV+HCV and non-alcoholic steatohepatitis) resulted significantly associated to the number of SEMs observed in HCC tumor and peritumoral tissues (p = 0.0003 and p = 0.015, respectively).

For other phenotypic traits the associations with the number of SEMs were not significant.

SEMs annotation analysis and candidate markers

Number of SEMs has been calculated for all samples. We identified 109,174 and 14,315 different SEMs in HCC tumor and adjacent non-tumor tissues, respectively. Among them 101,921 SEMs resulted present only in HCC and absent in adjacent non-cancerous tissues. We annotated genomic position of SEMs discovered in each sample and selected SEMs that were present in more than 90% of HCC samples and never present in adjacent non-cancerous tissues. Through this procedure a list of 4 genes (Table 3) was identified: Adherens Junctions Associated Protein 1 (AJAP1), Adenosine Deaminase RNA Specific B2 (ADARB2), Protein Tyrosine Phosphatase Receptor Type N2 (PTPRN2) and Sidekick Cell Adhesion Molecule 1 (SDK1).

Table 3: List of genes epimutated in more than 90% of HCC samples and never found in peritumoral tissues


DNA methylation is the main epigenetic mechanism adopted by cells to regulate gene expression, and alterations of this mechanism are involved in the pathogenesis of cancer. In this paper, we studied DNA methylation aberrations in HCC through the Illumina 450K DNA methylation array and adopting an innovative analysis devoted to the identification of epimutations.

We characterized DNA methylation profiles in paired HCC tumor and adjacent non-cancerous liver tissue samples from 69 patients. Visual inspection of DNA methylation levels, obtained through multi-dimensional scaling, revealed a marked difference between HCC tumor and peritumoral tissues. Also the differential methylation analysis confirmed substantial changes of DNA methylation mainly due to loss of methylation in HCC.

Nishida et al. [17] compared DNA-methylation profile extracted from 59 HCC with that from 58 adjacent non-cancerous liver tissues, using HumanMethylation450 BeadChip array. They identified 38,330 CpG sites differentially methylated with an effect greater than 15% between both groups of samples, of which 92% and 8% were respectively hypomethylated (mainly located within intergenic regions) and hypermethylated (predominantly observed within promoter regions and CpG islands) in HCC. A significant association between methylation profile and tumor size has been detected, particularly in HCV-positive patients. A predominant DNA hypomethylation in HCC has been also reported by several other studies and a specific subset of CpG sites correlated with HCV infection and liver cirrhosis [1013] .

In the present study we confirmed that HCC and adjacent non-cancerous tissues have a different epigenetic profile, indeed the number of differentially methylated probes resulted extremely high (n=302,133). In this condition it is difficult to identify candidate epigenetic signatures and to highlight specific biological pathways and functions, considering that epigenetic deregulation is widespread throughout the genome. Several studies selected the top hypermethylated or hypomethylated genes on the basis of p value or effect size [1013]. However it has to be considered that the identification of best associations, based only on deserving of statistical thresholds, may lead to inconclusive or biased results.

On reviewing the literature, we identified four studies [1013] that explored genome-wide DNA methylation profiles in HCC tumor and adjacent non-tumor tissues through Illumina’s 450K or 27K arrays and published a list of the top differentially methylated CpG sites. A total number of 161 probes resulted combining the lists from these four studies (Supplementary Table 1). Although, all these probes have been confirmed in the present study with a similar trend, only 8 probes have been reported by at least 2 out of 4 selected studies. This poor overlap suggests that the selection of candidate markers obtained by fixing arbitrary statistical thresholds might produce biased results.

In our study, we used an alternative approach to identify epigenetic markers, based on epigenetic mutations analysis, as previously applied in the field of aging [15]. In each sample, SEMs have been identified using normal liver tissue methylation data as reference population. We have previously reported that the number of SEMs increases exponentially with age, and for this reason all analyses have been adjusted, considering age as covariate. Interestingly, we observed a progressive increase in the number of SEMs from normal liver, peritumoral tissue (predominantly with chronic hepatitis/cirrhosis) to HCC tissue. In HCC the median number of SEMs resulted 13 times higher than in adjacent non-cancerous liver tissue. Moreover the number of SEMs observed in peritumoral tissues was 4-fold higher than that identified in normal liver tissues obtained from healthy subjects. The number of SEMs observed in peritumoral and tumoral tissues resulted also significantly higher in samples with HBV and HCV viral infection. Both HBV and HCV infections have a main role in the development of HCC through genetic and epigenetic alterations. It has been reported that aberrant DNA methylation may be induced by direct viral activity or indirectly through related chronic liver inflammation [18]. In addition, it is well known that viral infection is able to affect DNA methyl transferase expression in HCC [19].

We also focused the attention on other clinical-phenotypic traits and we observed that the number of SEMs detected in HCC tissue increased progressively with histopathological tumor grade, suggesting that tumor progression and aggressiveness correlate with the accumulation of epigenetic alterations. This represents an interesting finding, supporting the potential use of SEMs number as a prognostic marker in HCC.

Therefore, the extreme heterogeneity and complexity among HCC epigenomes makes difficult to identify specific epigenetic profiles associated to phenotypic traits. While, the number of SEMs emerged as a new variable with high sensitivity and potentially able to summarize the degree of epigenetic alterations with a simple score.

In order to identify novel candidate potential epidrivers, after the calculation of SEMs, we analysed genomic position of SEMs discovered in each sample. We performed this analysis both for HCC and peritumoral tissues. We identified a list of SEMs that were present in more than 90% of HCC samples and never present in peritumoral tissues. Four genes emerged from this analysis: AJAP1, ADARB2, PTPRN2, SDK1. These epimutations were characterized by hypermethylation at promoters level and concomitant hypomethylation at gene body level in HCC tumor tissues (Table 3). Interestingly, a similar epigenetic signature for AJAP1, ADARB2 and PTPRN2 has been previously reported in glioblastoma [2021]. Unfortunately, it was not possible using microarray-based method to understand if these epigenetic alterations were allele-specific.

AJAP1 is a component of adherent junctions. It interacts with E-cadherin–β-catenin complexes in polarized epithelial cells. The methylation of AJAP1 may lead to the release of β-catenin and the activation of Wnt signaling [22]. AJAP1 expression has been recently reported to be reduced in HCC compared with non-cancerous liver tissues and to be associated with hypermethylation of the AJAP1 promoter. Indeed, suppression of AJAP1 expression may represent a specific event that occurs in the final stage of the initiation of HCC [23]. Several evidences concerning the role of AJAP1 as a putative tumor suppressor genes regulated by promoter hypermethylation have been reported in cervical cancer, esophageal squamous cell carcinoma and gastric cancer [22, 24, 25]. In addition, it has been shown that AJAP1 expression was drastically reduced in glioblastoma tumors, and the loss of expression correlated with AJAP1 promoter hypermethylation and poorer survival. Interestingly, restoration of AJAP1 gene expression by transfection or demethylating agents decreased tumor cell proliferation and migration in glioblastoma cell lines [26].

ADARB2 encodes a member of the double-stranded RNA adenosine deaminase family of RNA-editing enzymes and plays a regulatory role in RNA editing. This may affect gene expression and function through the modulation of: mRNA translation, by changing codons and hence the amino acid sequence of proteins; pre-mRNA splicing, by altering splice site recognition sequences; RNA stability, by changing sequences involved in nuclease recognition; genetic stability in the case of RNA virus genomes, by changing sequences during viral RNA replication; and RNA structure-dependent activities, such as microRNA production or targeting or protein-RNA interactions [21]. The gene body region of ADARB2 resulted extremely hypomethylated in HCC tumor compared to adjacent tissues. Currently there are no data available concerning the potential role of ADARB2 in the development of HCC. However, a growing body of evidence indicates an important role of adenosine to inosine RNA editing, modulated through epigenetic mechanisms, in the pathogenesis and progression of cancer [21, 27]: 1) hypomethylation of ADARB2 has been reported as a marker of relapse in pediatric acute lymphoblastic leukemia; 2) RNA levels of the three ADAR family members (ADAR, ADARB1, and ADARB2) were significantly reduced in brain tumors, the reduction of ADARB2 correlated with the grade of malignancy of glioblastoma multiforme, the most aggressive of brain tumors, displaying a 99% decrease in ADARB2 RNA levels; 3) overexpression of ADAR and ADARB1 (an important paralog of ADARB2) in the U87 glioblastoma multiforme cell line significantly decreased proliferation rate.

PTPRN2 belongs to the protein tyrosine phosphatase family, it has an important role in vesicle-mediated secretory processes and exhibits a phosphatidylinositol phosphatase against phosphatidylinositol 4,5-diphosphate and phosphatidylinositol 3-phosphate [28]. PTPRN2 has been reported to be significant hypermethylated in squamous cell lung cancer and glioblastoma [20, 29]. PTPRN2 resulted in the top list of marker down methylated in HCC, as reported by Shen et al [10], and similarly to our results the hypomethylation of this gene was restricted to the gene body region.

SDK1 gene codes for a cell adhesion molecule with a potential role in cancer progression. This gene is responsive to androgens, resulted overexpressed and able to regulate cellular migration in prostate cancer [30]. Moreover, in whole-exome sequencing studies performed on adrenocortical carcinoma, this gene was affected by somatic mutations [31]. However, to the best of our knowledge there are not data regarding the role of this gene in the pathogenesis of HCC.

In summary, we confirmed that HCC showed a methylation profile completely deregulated and very far from adjacent non-cancerous liver tissues. An alternative genome-wide DNA methylation analytical approach, based on the identification of epimutations, represents the real novelty of this study. This statistical approach resulted a robust method to appreciate epigenetic variability among samples. Indeed, the number of SEMs observed in HCC tissue was strongly associated to several important phenotypic traits. In addition, this analysis allowed the identification of four epigenetically regulated candidate genes (AJAP1, ADARB2, PTPRN2, SDK1), potentially involved in the hepatocarcinogenesis. Future studies are required to further validate the differential expression of these genes at the RNA or protein level in HCC.


Samples, patients and phenotypes

Paired HCC tumor and adjacent non-cancerous liver tissue samples were obtained from 69 patients (16 females, 53 males) who underwent surgical resection from 2008 to 2015 at the National Cancer Institute of Naples. Samples were frozen immediately after surgical removal and stored at –80°C until DNA extraction. Demographic data (age and gender), information on viral infection (HBV and HCV), clinical and pathological characteristics (alcoholic liver disease, non-alcoholic steatohepatitis, cirrhosis, Child-Pugh class, tumor grade and tumor number) were obtained from the medical records. Written informed consent was obtained from all patients. This study was approved by the Ethics Committee of the Istituto Auxologico Italiano.

DNA preparation and infinium methylation 450K assay

DNA was extracted using the QIAamp DNA Mini kit (Qiagen GmbH,D-40724 Hilden, Germany), according to the manufacturer’s instructions. Bisulfite modification of 1 μg DNA was conducted using an EZ DNA Methylation Kit (Zymo Research) according to the manufacturer’s procedure. The Infinium Methylation 450K assay was performed according to Illumina’s standard protocol. Six HCC/adjacent non-tumor tissue pairs were processed on the same chip to avoid chip-to-chip variation. Quality control and quantification of DNA were performed before and after bisulfite conversion.

450k methylation data and populations description

Methylation data analysed in the study were obtained from different datasets: 1. The Case Population: 69 HCC tumor tissues. 2. The Control Population: 69 paired adjacent non-cancerous liver tissue samples. 3. The Reference Population: Methylation raw data from 47 liver biopsies from healthy heart-beating and brain death donors with an age spanning from 13 to 90 years [32] were obtained from the Istituto Auxologico Italiano Epigenetic Database. Donors were free from known viral hepatitis and liver cirrhosis. Methylation row data from 67 normal liver tissues were obtained from the public functional genomics data repository GEO. We selected one dataset containing methylation idat files classified as normal liver tissue (GSE61446). Using these two datasets we obtained a unique reference containing 114 subjects. This population represented the reference population used for the estimation of epigenetic mutations.

Data management, pre-processing, normalisation and quality control

Illumina Methylation 450K raw data were analysed using the RnBeads analysis software package [29]. Sites overlapping SNPs were firstly removed from the analysis (n = 4713) as well as probes on sex chromosomes (n = 11119). Probes and samples of highest impurity were removed from the dataset using the Greedycut algorithm provided in the RnBeads package. We have considered every β value to be unreliable when its corresponding detection p-value was not below the threshold (T = 0.05). The background was subtracted using the methylumi package (method “noob”) [33]. The signal intensity values were normalized using the SWAN normalization method, as implemented in the minfi package. In addition to CpG sites, 4 sets of genomic regions were covered in the analysis (tailing, genes, promoters and CpG Islands).

Differential methylation analysis and prioritization analysis

Paired Differential methylation analysis was conducted both at site and region level according to the sample groups. P-values were computed using the limma method for the site level analysis, while a combined p-value was calculated from all site p-values for the region-based analysis [33]. After this analysis results have been ordered depending on their relevance on HCC field. Lists of hypermethylated and hypomethylated genes have been prioritized using the Phenolyzer on line tool (http://phenolyzer.wglab.org/) and using “HEPATOCELLULAR CARCINOMA” and “HEPATIC” as key word.

Epimutation detection

In both HCC and adjacent liver tissue samples SEMs were identified, as previously described [15]. Briefly, after the pre-processing step, for all probes, the distribution and variability of methylation levels were studied using box and whiskers plots. Methylation levels of each sample were compared to those of normal liver tissues from a reference population. The same approach was applied also for each subject of the reference population. One by one each subject of the reference population was extracted and compared to the remaining N-1 subjects. At each locus, whenever the methylation level of one sample differed extremely from the rest of the population this outlier was considered as epimutated. Thus, for each locus, epimutated samples were identified as the extreme outliers when their methylation levels lay outside of Q1-(3 x IQR) and Q3+(3 x IQR). Finally, all epimutated loci were annotated in a new data matrix that allowed to calculate, for each sample, the total amount of epimutations and their genomic position. The box and whiskers plot analysis was conducted using boxplot function in the R car package and confirmed using the outlier function in the R outliers package. A validation of this analytical approach has been previously reported [15].

Statistical analysis

The “Shapiro.test” function provided in the R package “stats” was applied to test normality among variables. The “Wilcox.test” function provided in the R package “class” was used to test differences between cases and controls groups for all non-parametric data. Considering the presence of categorical variables, the Multiple Factor Analysis was performed using the Multiple Factor Analysis of mixt data approach and the “FAMD” function provided in the R package “FactoMineR”. The univariate and multivariate linear regressions were conducted using the “Generalised Linear Model” function provided in the R “base” package. Bonferroni’s correction was performed to correct for multiple testing.


HBV: hepatitis B virus; HCC: hepatocellular carcinoma; HCV: hepatitis C virus; IQR: Interquartile range; MDS: Multi Dimensional Scaling SEMs: stochastic epigenetic mutations.

Authors’ contributions

Gentilini D: contributed to acquisition of data, analysis and interpretation of data, drafting of the article and revising it for important contents and final approval of the version to be published. Vitale G: contributed to the conception and design of the study, acquisition of data, analysis and interpretation of data; drafting of the article and revising it for important contents and final approval of the version to be published. Franceschi C: contributed to acquisition of data and revising the article for important contents.Dicitore A: contributed to acquisition of data. Stefania S: contributed to acquisition of data. Gaudenzi G: contributed to acquisition of data. Garagnani P: contributed to acquisition of data and interpretation of data. Bacalini MG: contributed to acquisition of data and interpretation of data. Di Blasio AM: revising the article for important contents. Persani L: provided support and supervision of the study group and critical revision of the article. Capri M: contributed to acquisition of data. Cescon M: contributed to acquisition of data. Grazi GL: contributed to acquisition of data. Pisoni S: contributed to acquisition of data. Cirelli L: contributed to acquisition of data. Santagata S: contributed to acquisition of data. Izzo F: contributed to acquisition of data. All the authors approved the final version of the article.


All authors disclosed no potential conflicts.


This study was funded by the Italian Ministry of Education, University and Research (grant number: FIRB RBAP11884 M) to GV and FI. The study was also partially supported by funds of the Ricerca Corrente of Istituto Auxologico Italiano, Milan, Italy (code: 05C205_2012) to GV and PRIN2008 to GLG (grant number: 2008WRBYPE).


1. Torre LA, Siegel RL, Ward EM, Jemal A. Global Cancer Incidence and Mortality Rates and Trends–An Update. Cancer Epidemiol Biomarkers Prev. 2016; 25:16–27. doi: 10.1158/1055-9965.EPI-15-0578.

2. Prete SD, Montella L, Caraglia M, Maiorino L, Cennamo G, Montesarchio V, Piai G, Febbraro A, Tarantino L, Capasso E, Palmieri G, Guarrasi R, Bianco M, et al. Sorafenib plus octreotide is an effective and safe treatment in advanced hepatocellular carcinoma: multicenter phase II So.LAR. study. Cancer Chemother Pharmacol. 2010; 66:837–44. doi: 10.1007/s00280-009-1226-z.

3. Marra M, Sordelli IM, Lombardi A, Lamberti M, Tarantino L, Giudice A, Stiuso P, Abbruzzese A, Sperlongano R, Accardo M, Agresti M, Caraglia M, Sperlongano P. Molecular targets and oxidative stress biomarkers in hepatocellular carcinoma: an overview. J Transl Med. 2011; 9:171. doi: 10.1186/1479-5876-9-171.

4. Potenza N, Mosca N, Zappavigna S, Castiello F, Panella M, Ferri C, Vanacore D, Giordano A, Stiuso P, Caraglia M, Russo A. MicroRNA-125a-5p Is a Downstream Effector of Sorafenib in Its Antiproliferative Activity Toward Human Hepatocellular Carcinoma Cells. J Cell Physiol. 2017; 232:1907–13. doi: 10.1002/jcp.25744.

5. Stiuso P, Potenza N, Lombardi A, Ferrandino I, Monaco A, Zappavigna S, Vanacore D, Mosca N, Castiello F, Porto S, Addeo R, Prete SD, De Vita F, et al. MicroRNA-423-5p Promotes Autophagy in Cancer Cells and Is Increased in Serum From Hepatocarcinoma Patients Treated With Sorafenib. Mol Ther Nucleic Acids. 2015; 4:e233. doi: 10.1038/mtna.2015.8.

6. Huang Q, Lin B, Liu H, Ma X, Mo F, Yu W, Li L, Li H, Tian T, Wu D, Shen F, Xing J, Chen ZN. RNA-Seq analyses generate comprehensive transcriptomic landscape and reveal complex transcript patterns in hepatocellular carcinoma. PLoS One. 2011; 6:e26168. doi: 10.1371/journal.pone.0026168.

7. Inokawa Y, Inaoka K, Sonohara F, Hayashi M, Kanda M, Nomoto S. Molecular alterations in the carcinogenesis and progression of hepatocellular carcinoma: Tumor factors and background liver factors. Oncol Lett. 2016; 12:3662–8. doi: 10.3892/ol.2016.5141.

8. Schulze K, Imbeaud S, Letouze E, Alexandrov LB, Calderaro J, Rebouissou S, Couchy G, Meiller C, Shinde J, Soysouvanh F, Calatayud AL, Pinyol R, Pelletier L, et al. Exome sequencing of hepatocellular carcinomas identifies new mutational signatures and potential therapeutic targets. Nat Genet. 2015; 47:505–11. doi: 10.1038/ng.3252.

9. Dhanasekaran R, Bandoh S, Roberts LR. Molecular pathogenesis of hepatocellular carcinoma and impact of therapeutic advances. F1000Res. 2016; 5. doi: 10.12688/f1000research.6946.1.

10. Neumann O, Kesselmeier M, Geffers R, Pellegrino R, Radlwimmer B, Hoffmann K, Ehemann V, Schemmer P, Schirmacher P, Lorenzo Bermejo J, Longerich T. Methylome analysis and integrative profiling of human HCCs identify novel protumorigenic factors. Hepatology. 2012; 56:1817–27. doi: 10.1002/hep.25870.

11. Shen J, Wang S, Zhang YJ, Kappil M, Wu HC, Kibriya MG, Wang Q, Jasmine F, Ahsan H, Lee PH, Yu MW, Chen CJ, Santella RM. Genome-wide DNA methylation profiles in hepatocellular carcinoma. Hepatology. 2012; 55:1799–808. doi: 10.1002/hep.25569.

12. Shen J, Wang S, Zhang YJ, Wu HC, Kibriya MG, Jasmine F, Ahsan H, Wu DP, Siegel AB, Remotti H, Santella RM. Exploring genome-wide DNA methylation profiles altered in hepatocellular carcinoma using Infinium HumanMethylation 450 BeadChips. Epigenetics. 2013; 8:34–43. doi: 10.4161/epi.23062.

13. Song MA, Tiirikainen M, Kwee S, Okimoto G, Yu H, Wong LL. Elucidating the landscape of aberrant DNA methylation in hepatocellular carcinoma. PLoS One. 2013; 8:e55761. doi: 10.1371/journal.pone.0055761.

14. Zhang C, Li J, Huang T, Duan S, Dai D, Jiang D, Sui X, Li D, Chen Y, Ding F, Huang C, Chen G, Wang K. Meta-analysis of DNA methylation biomarkers in hepatocellular carcinoma. Oncotarget. 2016; 7:81255–67. doi: 10.18632/oncotarget.13221.

15. Gentilini D, Garagnani P, Pisoni S, Bacalini MG, Calzari L, Mari D, Vitale G, Franceschi C, Di Blasio AM. Stochastic epigenetic mutations (DNA methylation) increase exponentially in human aging and correlate with X chromosome inactivation skewing in females. Aging (Albany NY). 2015; 7:568–78. doi: 10.18632/aging.100792.

16. Oey H, Whitelaw E. On the meaning of the word 'epimutation'. Trends Genet. 2014; 30:519–20. doi: 10.1016/j.tig.2014.08.005.

17. Nishida N, Nishimura T, Nakai T, Chishina H, Arizumi T, Takita M, Kitai S, Yada N, Hagiwara S, Inoue T, Minami Y, Ueshima K, Sakurai T, et al. Genome-wide profiling of DNA methylation and tumor progression in human hepatocellular carcinoma. Dig Dis. 2014; 32:658–63. doi: 10.1159/000367982.

18. Hattori N, Ushijima T. Epigenetic impact of infection on carcinogenesis: mechanisms and applications. Genome Med. 2016; 8:10. doi: 10.1186/s13073-016-0267-2.

19. Hlady RA, Tiedemann RL, Puszyk W, Zendejas I, Roberts LR, Choi JH, Liu C, Robertson KD. Epigenetic signatures of alcohol abuse and hepatitis infection during human hepatocarcinogenesis. Oncotarget. 2014; 5:9425–43. doi: 10.18632/oncotarget.2444.

20. Lee EJ, Rath P, Liu J, Ryu D, Pei L, Noonepalle SK, Shull AY, Feng Q, Litofsky NS, Miller DC, Anthony DC, Kirk MD, Laterra J, et al. Identification of Global DNA Methylation Signatures in Glioblastoma-Derived Cancer Stem Cells. J Genet Genomics. 2015; 42:355–71. doi: 10.1016/j.jgg.2015.06.003.

21. Paz N, Levanon EY, Amariglio N, Heimberger AB, Ram Z, Constantini S, Barbash ZS, Adamsky K, Safran M, Hirschberg A, Krupsky M, Ben-Dov I, Cazacu S, et al. Altered adenosine-to-inosine RNA editing in human cancer. Genome Res. 2007; 17:1586–95. doi: 10.1101/gr.6493107.

22. Chen YC, Huang RL, Huang YK, Liao YP, Su PH, Wang HC, Chang CC, Lin YW, Yu MH, Chu TY, Lai HC. Methylomics analysis identifies epigenetically silenced genes and implies an activation of beta-catenin signaling in cervical cancer. Int J Cancer. 2014; 135:117–27. doi: 10.1002/ijc.28658.

23. Ezaka K, Kanda M, Sugimoto H, Shimizu D, Oya H, Nomoto S, Sueoka S, Tanaka Y, Takami H, Hashimoto R, Okamura Y, Yamada S, Fujii T, et al. Reduced Expression of Adherens Junctions Associated Protein 1 Predicts Recurrence of Hepatocellular Carcinoma After Curative Hepatectomy. Ann Surg Oncol. 2015; 22:S1499–507. doi: 10.1245/s10434-015-4695-9.

24. Matsusaka K, Kaneda A, Nagae G, Ushiku T, Kikuchi Y, Hino R, Uozaki H, Seto Y, Takada K, Aburatani H, Fukayama M. Classification of Epstein-Barr virus-positive gastric cancers by definition of DNA methylation epigenotypes. Cancer Res. 2011; 71:7187–97. doi: 10.1158/0008-5472.CAN-11-1349.

25. Tanaka H, Kanda M, Koike M, Iwata N, Shimizu D, Ezaka K, Sueoka S, Tanaka Y, Takami H, Hashimoto R, Tanaka C, Yamada S, Fujii T, et al. Adherens junctions associated protein 1 serves as a predictor of recurrence of squamous cell carcinoma of the esophagus. Int J Oncol. 2015; 47:1811–8. doi: 10.3892/ijo.2015.3167.

26. Lin N, Di C, Bortoff K, Fu J, Truszkowski P, Killela P, Duncan C, McLendon R, Bigner D, Gregory S, Adamson DC. Deletion or epigenetic silencing of AJAP1 on 1p36 in glioblastoma. Mol Cancer Res. 2012; 10:208–17. doi: 10.1158/1541-7786.MCR-10-0109.

27. Nordlund J, Backlin CL, Wahlberg P, Busche S, Berglund EC, Eloranta ML, Flaegstad T, Forestier E, Frost BM, Harila-Saari A, Heyman M, Jonsson OG, Larsson R, et al. Genome-wide signatures of differential DNA methylation in pediatric acute lymphoblastic leukemia. Genome Biol. 2013; 14:r105. doi: 10.1186/gb-2013-14-9-r105.

28. Caromile LA, Oganesian A, Coats SA, Seifert RA, Bowen-Pope DF. The neurosecretory vesicle protein phogrin functions as a phosphatidylinositol phosphatase to regulate insulin secretion. J Biol Chem. 2010; 285:10487–96. doi: 10.1074/jbc.M109.066563.

29. Anglim PP, Galler JS, Koss MN, Hagen JA, Turla S, Campan M, Weisenberger DJ, Laird PW, Siegmund KD, Laird-Offringa IA. Identification of a panel of sensitive and specific DNA methylation markers for squamous cell lung cancer. Mol Cancer. 2008; 7:62. doi: 10.1186/1476-4598-7-62.

30. Verone AR, Duncan K, Godoy A, Yadav N, Bakin A, Koochekpour S, Jin JP, Heemers HV. Androgen-responsive serum response factor target genes regulate prostate cancer cell migration. Carcinogenesis. 2013; 34:1737–46. doi: 10.1093/carcin/bgt126.

31. Juhlin CC, Goh G, Healy JM, Fonseca AL, Scholl UI, Stenman A, Kunstman JW, Brown TC, Overton JD, Mane SM, Nelson-Williams C, Backdahl M, Suttorp AC, et al. Whole-exome sequencing characterizes the landscape of somatic mutations and copy number alterations in adrenocortical carcinoma. J Clin Endocrinol Metab. 2015; 100:E493–502. doi: 10.1210/jc.2014-3282.

32. Capri M, Olivieri F, Lanzarini C, Remondini D, Borelli V, Lazzarini R, Graciotti L, Albertini MC, Bellavista E, Santoro A, Biondi F, Tagliafico E, Tenedini E, et al. Identification of miR-31-5p, miR-141-3p, miR-200c-3p, and GLT1 as human liver aging markers sensitive to donor-recipient age-mismatch in transplants. Aging Cell. 2017; 16:262–72. doi: 10.1111/acel.12549.

33. Assenov Y, Muller F, Lutsik P, Walter J, Lengauer T, Bock C. Comprehensive analysis of DNA methylation data with RnBeads. Nat Methods. 2014; 11:1138–40. doi: 10.1038/nmeth.3115.

Creative Commons License All site content, except where otherwise noted, is licensed under a Creative Commons Attribution 4.0 License.
PII: 17462