Abstract
Christophe Badie1, Agnieszka Blachowicz2, Zarko Barjaktarovic3, Rosemary Finnon1, Arlette Michaux4, Hakan Sarioglu5, Natalie Brown1, Grainne Manning1, M. Abderrafi Benotmane4, Soile Tapio3, Joanna Polanska2, Simon D. Bouffler1
1Radiation Effects Department, Centre for Radiation, Chemical and Environmental Hazards, Public Health England, Chilton, UK
2Faculty of Automatic Control, Electronics and Computer Science, Silesian University of Techology, Gliwice, Poland
3Helmholtz Zentrum München, German Research Center for Environmental Health GmbH, Radiation Proteomics Group, Institute of Radiation Biology, Neuherberg, Germany
4Radiobiology Unit, Institute for Environment, Health and Safety, Belgian Nuclear Research Centre (SCK•.CEN), Mol, Belgium
5Helmholtz Zentrum München, German Research Center for Environmental Health GmbH, Research Unit Protein Science, Neuherberg, Germany
Correspondence to:
Simon D. Bouffler, email: [email protected]
Keywords: ionising radiation, acute myeloid leukaemia, mouse, gene expression, protein expression
Received: February 11, 2016 Accepted: May 09, 2016 Published: May 26, 2016
ABSTRACT
A combined transcriptome and proteome analysis of mouse radiation-induced AMLs using two primary AMLs, cell lines from these primaries, another cell line and its in vivo passage is reported. Compared to haematopoietic progenitor and stem cells (HPSC), over 5000 transcriptome alterations were identified, 2600 present in all materials. 55 and 3 alterations were detected in the proteomes of the cell lines and primary/in vivo passage material respectively, with one common to all materials. In cell lines, approximately 50% of the transcriptome changes are related to adaptation to cell culture, and in the proteome this proportion was higher. An AML ‘signature’ of 17 genes/proteins commonly deregulated in primary AMLs and cell lines compared to HPSCs was identified and validated using human AML transcriptome data. This also distinguishes primary AMLs from cell lines and includes proteins such as Coronin 1, pontin/RUVBL1 and Myeloperoxidase commonly implicated in human AML. C-Myc was identified as having a key role in radiation leukaemogenesis. These data identify novel candidates relevant to mouse radiation AML pathogenesis, and confirm that pathways of leukaemogenesis in the mouse and human share substantial commonality.
INTRODUCTION
Ionising radiation is a known leukaemogen, evidence from studies of the Japanese atomic bomb survivors indicate that acute myeloid leukaemia (AML) predominates [eg 1]. Mouse models of radiation-induced AML are available and there is substantial knowledge of the mechanisms that drive leukaemogenesis [2]. Furthermore, the relationship with human myeloid leukaemogenesis is becoming clearer, with PU.1 deregulation through mutation in the mouse or other regulatory pathways in human playing an important role [2]. The majority of mouse radiation-induced AMLs carry a deletion of one allele of the Sfpi1 gene, the mouse homologue of human PU.1, and accompanying point mutations at codon 235 in the retained allele of the gene. Whether these two events are either necessary or sufficient for leukaemogenesis is not clear. There is at least one alternative pathway of leukaemogenesis in the mouse involving internal tandem duplication (ITD) mutations of Flt3 [3], indicating that direct Sfpi1 involvement is not necessary. Furthermore, some studies have suggested that point mutations are not rate limiting [4], possibly suggesting that Sfpi1 deletion and point mutation are not jointly sufficient.
There is a need therefore to identify other common alterations associated with radiation-induced AMLs and to examine their relevance to AML pathogenesis. In this study we present a combined transcriptome and proteome analysis of a small set of mouse radiation-induced AMLs, comparing primary material, cell lines and an in vivo passaged cell line. Array based transcriptomics was combined with mass spectrometry-based proteomics and subsequent bioinformatics to provide a comprehensive and in depth analysis of alterations associated with leukaemogenesis and adaptation to in vitro culture.
The analysis strongly indicated that primary material is preferred for the identification of modifications related to disease pathogenesis. A set of seventeen genes/proteins is identified that is commonly altereded in these AMLs, several of these genes/proteins have roles in human leukaemogenesis and the gene set is able to distinguish human AMLs from normal control cells.
RESULTS
The experimental materials available (Figure 1) and results obtained allow for the identification of transcriptome and proteome changes associated with two primary leukaemias (RF12-p, RF26-p), three AML cell lines (RF12-cl, RF26-cl, MLP3-cl) and an in vivo passaged cell line (MLP3-ivp). The samples included in the current analysis were first characterised for Sfpi1 status, given its important role in mouse radiation leukaemogenesis [2]. The primary AMLs and derived cell lines, RF12-p, RF26-p, RF12-cl and RF26-cl were found to be hemizygous for Sfpi1 and carry point mutations in the retained allele such that the protein carried the R235C amino acid substitution. The MLP3 cell line is known to be deleted for both Sfpi1 copies [5]. Figure 2 provides an unsupervised heat map of transcriptome modifications. This indicates that the two primary and one in vivo passaged AMLs show a high degree of similarity in the transcriptome modifications compared with the control sample, Lin-depleted bone marrow HPSCs. By contrast there are substantial differences in transcriptome between each cell line and its primary or in vivo passaged counterpart. The Venn diagrams (Figure 3A and 3B) summarise the findings; 5521 transcripts are commonly deregulated in the primary AMLs and in vivo passaged MLP3, 5220 transcripts are commonly deregulated in the three cell lines. While the number of transcripts deregulated in cell lines by comparison with primary and in vivo passaged material are similar, Figure 3C shows that only approximately half (2600) of the modifications are common in the two forms of material originating from the same primary AML, some 2620 modifications are specific to growth in in vitro culture. Supplementary Table S1 provides a full listing of the transcripts commonly altereded in AML cell lines, primary and in vivo passaged material.
Figure 1: Schematic showing the origin of AML materials used in the study. CBA/H mice were irradiated with 3 Gy x-rays, AMLs RF12, RF26 and MLP3 presented following several months latency. Spleen tissue from the mouse in which the AML arose was used as the source of primary AML material for RF12-p and RF26-p. These AMLs were also adapted to cell culture, to form RF12-cl and RF26-cl cell lines. Primary spleen tissue from MLP3 was not available but it was established as a cell line (MLP3-cl), the cell line was in vivo passaged to provide in vivo passaged MLP3 (MLP3-ivp).
Figure 2: Unsupervised heat map of transcriptomics analysis, showing clustering of the primary and in vivo passaged samples and the AML cell lines. A. normalisation procedure to eliminate transcripts below a specified noise threshold and above a specified variance threshold, led to construction of an unsupervised heatmap from 2152 transcripts B.
Figure 3: Transcriptome alterations in the primary and in vivo passaged samples, and cell lines. A. Venn diagram summarising the numbers of transcripts altered in the two primary AMLs, RF12-p and RF26-p and MLP3-ivp by comparison with control HPSCs. B. Venn diagram summarising the numbers of transcripts altered in the three AML cell lines by comparison with the control HPSCs. C. Venn diagram summarising the numbers of transcripts commonly and uniquely altered in the three AML cell lines and two primary plus in vivo passaged samples. A subset of 2600 transcripts was found to be commonly altered in all materials compared to the control HPSCs. D. Dice index analysis of the degree of similarity between samples; see text for further discussion.
Dice index analysis allows a quantitative overview of the degree of similarity between sets of samples – the closer to 1, the more similar groups of samples are. Figure 3D provides quantitative Dice index analysis for various comparisons of the deregulated transcripts. In 3-way comparisons between MLP3, RF12 and RF26, the primary/in vivo passaged materials are more different from cell lines (Dice = 0.4141) than either material is from control HPSCs, suggesting that caution is required in inferring changes in primary AML material from changes observed in cell lines. Pairwise comparisons tend to show a higher similarity between RF12-p and RF26-p than between MLP3-ivp and the RF12 & 26-p. In data not presented it has been observed that upregulated transcript changes tend to be more similar than down regulated transcripts.
To provide some insight into the functional significance of the transcriptional changes observed in AML materials, pathway analyses have been undertaken using the gene ontology and topGO database terms [6, 7]. This analysis identified significant representation of transcriptome changes in 28 GO terms in primary and in vivo passaged AMLs compared to control HPSCs. The comparison of cell lines and control indicated a much greater number of pathways being affected, represented by 110 GO terms, 24 of the 28 GO terms affected in primary (RF12-p, RF26-p) and in vivo passage (MLP3-ivp) material are also affected in the cell lines, these are listed in Table 1. The affected pathways included immune system processes, regulation of signalling, regulation of signal transduction, cytokine receptor activity and regulation of cell communication indicating substantial disruption of important regulatory processes in immune cells.
Table 1: List of the 24 pathways as defined by GO terms commonly affected in primary/in vivo passage and cell line AML materials compared to Lin-depleted bone marrow cells
# | GO_ID | Term name | ontology |
---|---|---|---|
1 | GO:0005623 | cell | CC |
2 | GO:0044464 | cell part | CC |
3 | GO:0008152 | metabolic process | BP |
4 | GO:0010033 | response to organic substance | BP |
5 | GO:0035556 | intracellular signal transduction | BP |
6 | GO:0005488 | binding | MF |
7 | GO:0043226 | organelle | CC |
8 | GO:0003824 | catalytic activity | MF |
9 | GO:0007275 | multicellular organismal development | BP |
10 | GO:0002376 | immune system process | BP |
11 | GO:0048583 | regulation of response to stimulus | BP |
12 | GO:0009966 | regulation of signal transduction | BP |
13 | GO:0044237 | cellular metabolic process | BP |
14 | GO:0004896 | cytokine receptor activity | MF |
15 | GO:0023051 | regulation of signaling | BP |
16 | GO:0051179 | localization | BP |
17 | GO:0070887 | cellular response to chemical stimulus | BP |
18 | GO:0051234 | establishment of localization | BP |
19 | GO:0032502 | developmental process | BP |
20 | GO:0048519 | negative regulation of biological process | BP |
21 | GO:0010035 | response to inorganic substance | BP |
22 | GO:0010646 | regulation of cell communication | BP |
23 | GO:0005737 | cytoplasm | CC |
24 | GO:0044459 | plasma membrane part | CC |
The proteomics analysis is more limited in the number of distinct proteins that were detectable, the average number being around distinct 1,000 proteins. The number of identified and deregulated proteins in AML cell lines and primary cells were as follows: MLP3-cl (773 identified; 67 deregulated), RF12-cl (993; 111), RF26-cl (906; 102), MLP3-ivp (917; 151), RF12-p (1153; 180) and RF26-p (1153; 165). Nonetheless, commonly deregulated proteins were identified in primary and in vivo passaged materials (Figure 4A), RF12-p and RF26-p were notably similar at the level of deregulated protein as indicated by a Dice index of 0.77 (Figure 4B), however, only three proteins were commonly deregulated amongst the primary and in vivo passaged samples, Tln1, Tuba4a and Fkbp5. Substantially greater similarity of protein modification in cell lines compared to HPSC controls was apparent (Figure 5) with 55 commonly deregulated proteins identified (Table 2). Only the protein Fkbp5 was found to be commonly deregulated in all materials, ie the three cell lines, in vivo passaged MLP3 and the primaries.
Figure 4: Proteomics analysis of primary and in vivo passaged samples. A. Venn diagrams showing degree of similarity between the three samples all by comparison with control Lin-depleted bone marrow HPSCs, and B. quantitative Dice indices and 95% confidence intervals for pairwise comparisons. Note that the diagram was prepared using only proteins with complete data. In case of proteins with 1 or 2 measurements missing, the k-neighbour algorithm was used for data imputation and the protein was included into the analysis.
Figure 5: Proteomics analysis of AML cell lines. A. Venn diagrams showing degree of similarity between the three samples all by comparison with control Lin-depleted bone marrow HPSCs, and B. quantitative Dice indices and 95% confidence intervals for pairwise comparisons.
The proteins altered in the cell lines included Thrombospondin 1 (Table 2). Western blot analysis was performed to confirm the mass spectrometry proteomics results (Figure 6). The results from cell lines are validated with Thps1 expression being around 1.5-2 fold lower compared to the control Lin-depleted bone marrow HPSCs. It can also be seen that in the primary AMLs and in vivo passaged MLP3 Thbs1 expression is approximately 2-fold higher than in the control. This suggests that growth conditions can influence the nature of expression changes observed.
Figure 6: Western blot analysis of Thrombospondin 1 expression in the primary and in vivo passaged samples A. and the AML cell lines B. Lower panels show the blots including Rad50 (Rad), a loading control, with samples loaded in the following order from left to right: control (‘C’, normal mouse bone marrow HPSCs), MLP3, RF12, RF26. The upper panels provide histograms for quantitation of expression by image analysis, controls ‘C’ represent expression levels in normal mouse bone marrow HSPC, *indicate statistical significance at p<0.05 level compared to control.
Table 2: Proteins differentially expressed in the AML cell lines by comparison with Lin-depleted control bone marrow HPSCs
Entrez ID | Gene Symbol | Ensembl_Protein_ID | mean SLR Cell line vs Control | ||
---|---|---|---|---|---|
MLP3 | RF12 | RF26 | |||
20832 | Ssr4 | ENSMUSP00000002090 | 1.01 | 1.71 | 1.38 |
67465 | Sf3a1 | ENSMUSP00000002198 | -3.63 | -3.02 | -4.15 |
26395 | Map2k1 | ENSMUSP00000005066 | 2.42 | 2.25 | 1.28 |
21672 | Prdx2 | ENSMUSP00000005292 | -4.14 | -3.35 | -3.21 |
11821 | Aprt | ENSMUSP00000006764 | 2.05 | 1.24 | 1.59 |
16549 | Khsrp | ENSMUSP00000007814 | -2.04 | -1.27 | -1.91 |
13035 | Ctsg | ENSMUSP00000015583 | 2.35 | 3.54 | 1.12 |
16993 | Lta4h | ENSMUSP00000016033 | 1.90 | 2.00 | 1.72 |
15078 | H3f3a | ENSMUSP00000016703 | -2.36 | -3.79 | -2.17 |
17523 | Mpo | ENSMUSP00000020779 | -6.47 | -4.47 | -1.85 |
16416 | Itgb3 | ENSMUSP00000021028 | -4.00 | -4.28 | -5.29 |
15212 | Hexb | ENSMUSP00000022169 | 2.02 | 2.23 | 4.34 |
20655 | Sod1 | ENSMUSP00000023707 | -1.77 | -2.62 | -1.86 |
16906 | Lmnb1 | ENSMUSP00000025486 | -1.25 | -1.19 | -1.59 |
67988 | Tmx3 | ENSMUSP00000025515 | 1.94 | 1.06 | 1.17 |
14156 | Fen1 | ENSMUSP00000025651 | -1.58 | -1.10 | -1.55 |
12359 | Cat | ENSMUSP00000028610 | -1.80 | -1.57 | -2.61 |
12349 | Car2 | ENSMUSP00000029078 | -8.43 | -6.64 | -6.64 |
67103 | Ptgr1 | ENSMUSP00000030069 | 3.06 | 2.87 | 3.97 |
17025 | Alad | ENSMUSP00000030090 | -3.98 | -3.31 | -4.09 |
11669 | Aldh2 | ENSMUSP00000031411 | 1.64 | 1.83 | 1.35 |
11745 | Anxa3 | ENSMUSP00000031447 | 1.32 | 1.85 | 1.70 |
11674 | Aldoa | ENSMUSP00000032934 | 2.16 | 1.94 | 1.99 |
12721 | Coro1a | ENSMUSP00000032949 | -1.35 | -2.95 | -6.51 |
12751 | Tpp1 | ENSMUSP00000033184 | -2.21 | -2.50 | -3.34 |
11739 | Slc25a4 | ENSMUSP00000034049 | 1.27 | 3.63 | 1.61 |
12306 | Anxa2 | ENSMUSP00000034756 | 4.09 | 2.39 | 2.13 |
320011 | Uggt1 | ENSMUSP00000037930 | 3.02 | 1.95 | 2.23 |
233016 | Blvrb | ENSMUSP00000043092 | -2.71 | -3.88 | -3.01 |
18950 | Pnp | ENSMUSP00000043926 | 2.47 | 2.20 | 1.36 |
18432 | Mybbp1a | ENSMUSP00000044827 | 2.56 | 3.00 | 4.45 |
21825 | Thbs1 | ENSMUSP00000044903 | -1.73 | -2.79 | -3.45 |
14870 | Gstp1 | ENSMUSP00000047790 | -1.79 | -2.80 | -1.32 |
56307 | Metap2 | ENSMUSP00000048285 | -1.50 | -1.40 | -2.06 |
14751 | Gpi1 | ENSMUSP00000049355 | 2.35 | 1.73 | 2.12 |
13861 | Epx | ENSMUSP00000050497 | -3.50 | -5.66 | -5.48 |
14104 | Fasn | ENSMUSP00000052872 | 3.31 | 3.26 | 2.09 |
80838 | Hist1h1a | ENSMUSP00000062030 | -2.22 | -1.98 | -2.73 |
12332 | Capg | ENSMUSP00000063389 | -1.43 | -4.74 | -1.64 |
236539 | Phgdh | ENSMUSP00000064755 | -1.47 | -1.61 | -1.03 |
66222 | Serpinb1a | ENSMUSP00000075690 | 2.31 | 2.55 | 1.91 |
13806 | Eno1 | ENSMUSP00000079727 | 2.45 | 2.16 | 1.73 |
18655 | Pgk1 | ENSMUSP00000080302 | 2.54 | 2.07 | 2.12 |
110208 | Pgd | ENSMUSP00000081141 | 2.34 | 3.99 | 1.49 |
17105 | Lyz2 | ENSMUSP00000089801 | -2.47 | -2.87 | -3.65 |
15288 | Hmbs | ENSMUSP00000095166 | -3.50 | -3.20 | -4.87 |
192176 | Flna | ENSMUSP00000098997 | -1.60 | -2.34 | -3.12 |
56431 | Dstn | ENSMUSP00000099461 | 2.96 | 2.98 | 2.13 |
11637 | Ak2 | ENSMUSP00000099664 | 1.66 | 1.03 | 1.17 |
13382 | Dld | ENSMUSP00000106481 | 1.08 | 4.12 | 1.24 |
12796 | Camp | ENSMUSP00000107653 | -2.64 | -5.08 | -3.04 |
108989 | Tpr | ENSMUSP00000112606 | 1.65 | 1.23 | 1.42 |
14229 | Fkbp5 | ENSMUSP00000116466 | 3.38 | 3.09 | 2.28 |
15275 | Hk1 | ENSMUSP00000118601 | 6.46 | 2.43 | 2.80 |
11983 | Atpif1 | ENSMUSP00000133099 | -1.42 | -2.34 | -2.23 |
To integrate the transcriptome and proteome results a multiomic analysis was undertaken. This required the following steps (i) Identification of the transcripts representing the proteins included in proteomics analysis, (ii) For those transcripts, identification of those which differentiate between the cell lines (RF12-cl, RF26-cl, MLP3-cl) and the primary/in vivo passaged materials (RF12-p, RF26-p, MLP3-ivp) and the cell lines at p≤0.05 level (Bonferroni corrected), (iii) Amongst these significantly affected transcripts, identification of the represented proteins where the expression level is 2 fold up- or down- regulated – this results in 67 proteins/genes being identified, (iv) Of the 67, identification of those where transcript and protein are either both up- or down- regulated in the three cell lines and the primary and in vivo passaged materials or those where transcript and protein are up and down regulated respectively or vice-versa. The latter category was included as the analysis indicated that in primary AMLs there is a negative correlation between gene and protein expression, similar inverse relationships have been observed previously [eg 8, 9, 10]. This results in a set of 17 genes/proteins being identified that are deregulated at the gene and protein level in all AML materials examined. These genes/proteins allow RF12-p and RF26-p to be distinguished from RF12-cl, RF26-cl and MLP3-cl and in vivo passaged cell line, MLP3-ivp (Figure 7A, Table 3). Several of the genes/proteins are part of a network (Figure 7B) and the affected pathways as defined by KEGG and PANTHER terms are given in Table 4. Many of the transcripts/proteins have been implicated in human leukaemogenesis and the affected pathways include acute myeloid leukaemia and other cancer related pathways plus growth regulatory signalling, glycolysis and apoptotic signalling.
Figure 7: Multiomics analysis of combined transcriptome and proteome data. A. heatmap showing relatedness between the samples, on the basis of these 17 transcripts/proteins it is possible to distinguish primary AMLs from cell lines. B. network analysis of the 17 identified proteins/genes (STRING db).
The top two networks identified using Ingenuity analysis, 1. Cellular Function and Maintenance, Cell Death and Survival, Cellular Compromise and 2. Connective Tissue Disorders, Inflammatory Disease, Skeletal and Muscular Disorders, in which most of the genes are present, can be combined (Figure 8). In these combined networks, MYC is major player, the overexpression of which was confirmed by Western blot analysis (Figure 9).
Figure 8: Merged ingenuity pathways for (i) cellular function and maintenance, cell death and survival, cellular compromise and (ii) connective tissue disorders, inflammatory disease, skeletal and muscular disorders. All of the 17 member signature components are in this merged network with the exception of Hist1h4h. Solid lines imply direct relationships between proteins; dotted lines imply indirect interactions. Relationships are primarily due to co-expression, but can also include phosphorylation/dephosphorylation, proteolysis, activation/deactivation, transcription, binding, inhibition, biochemical modification. The purple lines represent the connections between the two most significant networks that were merged into one.
Figure 9: Western blot analysis of c-myc expression in the primary and in vivo passaged samples and the AML cell lines. The upper panels show the blots including actin as loading control, with quantitation by image analysis in the histograms below. Lanes and bars marked ‘C’ represent the control normal mouse bone marrow HPSCs, *indicate statistical significance at p<0.05 level compared to control.
Table 3: Multiomics analysis of combined transcriptome and proteome data
Gene Symbol | Entrez GeneID | Gene Name |
---|---|---|
Hist1h1a | 80838 | histone cluster 1, H1a |
Anxa6 | 11749 | annexin A6 |
Ruvbl2 | 20174 | RuvB-like protein 2 |
Coro1a | 12721 | coronin, actin binding protein 1A |
Sri | 109552 | sorcin |
Mpo | 17523 | myeloperoxidase |
Sdhb | 67680 | succinate dehydrogenase complex, subunit B, iron sulfur (Ip) |
Lyz2 | 17105 | lysozyme 2 |
Map2k1 | 26395 | mitogen-activated protein kinase kinase 1 |
Lmnb1 | 16906 | lamin B1 |
Itgb3 | 16416 | integrin beta 3 |
Alad | 17025 | aminolevulinate, delta-, dehydratase |
Glrx | 93692 | glutaredoxin |
Hmbs | 15288 | hydroxymethylbilane synthase |
Aldoa | 11674 | aldolase A, fructose-bisphosphate |
Hist1h4h | 69386 | histone cluster 1, H4h |
Eno1 | 13806 | enolase 1, alpha non-neuron |
Identity of the 17 transcripts/proteins that are deregulated in all the AML materials analysed at both the gene and protein level.
To explore the wider use of the 17 gene/protein signature and its ability to distinguish other AML samples, an analysis of the expression of the genes in two human AML datasets was carried out. The human data sets were generated from analysis of bone marrow from four karyotypically normal AMLs compared to granulocyte macrophage progenitors (GMPs) from six healthy normal control donors and bone marrow from AMLs with monosomy of chromosome 7 compared to healthy control donor GMPs respectively. An overall summary of this analysis is shown as a heatmap (Figure 10). The 17 gene signature correctly clusters the human and mouse control samples and is able to discriminate each AML sample set from the control. Furthermore each AML group is distinct from each other. Mean centred box plots relating to each individual gene and sample are provided in Supplementary Figure S1.
Figure 10: Heatmap summarising normalised transcriptome changes in expression of the 17 member signature genes in the mouse AML materials (mAML) examined in the current analysis and in human AMLs from publically available datasets (hAML1 – karyotypically normal AMLs, hAML2 – monosomy 7 AMLs) plus respective controls (mCTRL, hCTRL1, hCTRL2).
DISCUSSION
This transcriptome and proteome analysis included within the study design materials from primary AMLs (RF12-p, RF26-p), derived cell lines (RF12-cl, RF26-cl), an independently derived and long-established AML cell line (MLP3-cl) and in vivo passaged MLP3 (MLP3-ivp). The datasets allowed identification of transcript and protein alterations associated with leukaemogenesis and those associated with adaptation to growth in vitro and the impact of an in vivo growth environment. The transcriptome analysis was more comprehensive (21266 transcripts) than the proteomic analysis (1005 proteins). The results obtained strongly indicate that primary AML material is best used to identify common changes of relevance to leukaemogenesis, adaptation of primary material to growth in vivo has substantial impacts on the transcriptome and proteome. Broadly speaking if cell lines are selected for analysis, only approximately 50% of detected changes may be of relevance or leukaemogenesis (see Figure 3). Further evidence that adaptation to cell culture has a significant effect on the transcriptome comes from the observation that primary and in vivo passaged materials are more similar to the control HPSC population (obtained directly from bone marrow without in vitro culture) than are the in vitro cultured cell lines (Figure 2). Further Dice index analysis of transcriptome data (Figure 3) and Western blotting (Figure 5) has shown that expression changes in cell lines can differ from those in primary material. The inclusion of the MLP3 cell line and MLP3-ivp was particularly informative. The Western analysis (Figure 6) indicates that the expression Thsp1 differs under the two growth conditions, indicating an important role for the microenvironment in determining gene and protein expression. The unsupervised transcriptome heatmap (Figure 1) clearly groups MLP3-ivp more closely to the primary AMLs RF12-p and RF26-p than its in vitro passage counterpart, MLP3-cl, adding further weight to the argument that growth microenvironment is an important determinant of gene expression levels, and that alterations are not necessarily cell intrinsic.
The transcriptome analysis identified 24 pathways that are commonly deregulated in all AML materials. The affected pathways (Table 1) include immune system processes, regulation of signalling, regulation of signal transduction, cytokine receptor activity and regulation of cell communication indicating substantial disruption of important regulatory processes in the AML cells.
Partly because of the more limited range of proteins analysed, the commonly deregulated proteins were fewer in number with just 3 being commonly deregulated in the three primary and in vivo passaged samples, Fkbp5, Tln1 and Tuba4a and just one, Fkbp5, commonly deregulated in all AML materials (cell lines, in vivo passaged MLP3 and primary AMLs). Fkbp5, FK506 binding protein 5, is an immunophillin that is most highly expressed in T-lymphocytes and the thymus. The FK506 binding proteins are involved in multiple cellular processes including protein folding, receptor signalling, protein trafficking, transcription, apoptosis and T-cell activation [11]. Over 200 regulatory microRNAs are predicted to interact with Fkbp5 (Mouse Genome Database, accessed September 2015). Tln1, Talin 1 bridges between vinculin and integrins providing a link between the cytoskeleton and extra-cellular matrix, the reduced expression of Tln1 likely relates to the release of AML cells from the bone marrow into the peripheral circulation that occurs characteristically late in the pathogenesis of radiation AML in the mouse. Tuba4a is a tubulin, also indicating modification of the cytoskeleton. Amongst the wider group of 55 proteins that were commonly deregulated in the three AML cell lines (Table 2) are several involved in proliferative signalling, apoptosis, myeloid cells and leukaemia. Ten of these proteins are also identified as part of the 17 gene/protein signature, characteristic of all AML materials identified through the multiomic analysis, and some of these are considered further below.
Bringing the data together in a multiomic analysis has confirmed that there is significant similarity in the pathways deregulated at the transcriptome and proteome levels. Perhaps most importantly, this analysis indicated a small set of 17 genes/proteins (Table 3) that is characteristically deregulated in all the AML materials examined in this study and therefore represent a potential gene/protein signature of mouse AML. The expression levels of these genes/proteins can further distinguish the two primaries (RF12-p, RF26-p) from the remaining cell line – derived samples and indicate greater similarity amongst in vitro cultured samples than between in vitro and in vivo cultured samples. This point emphasises the need to examine primary AML materials to be more confident to identify key changes related to the disease pathogenesis. The affected pathways (Table 4) include many related to cancer, and specifically acute myeloid leukaemia, along with some relating to growth regulation, apoptotic signalling and the haemopietic system. Interestingly many of the genes/proteins included in the 17 member ‘signature’ have been found to have roles in human and mouse leukaemogenesis. In particular Coronin 1 (Coro1a), a member of the evolutionarily conserved coronin protein family involved in a variety of cellular processes, is highly expressed in all leukocytes. In mice and human, genetic inactivation of Coro1a results in immuno-deficiencies and it was recently shown that human PU.1 is a direct transcriptional regulator of CORO1A in acute promyelocytic leukaemia and acute myeloid leukaemia [12]. We also identified pontin/RUVBL1 a gene which is upregulated by AML1-ETO and generated by one of the most frequent chromosomal rearrangements in human AML (translocation t(8;21)(q22;q22)) and reported to participate in the oncogenic growth of t(8;21) cells [13]. The percentage of MPO positive leukemic cells is a simple and highly significant prognostic factor in AML patients [14]. MPO as well as lamin B1 (LMNB1) genes carry polymorphisms which predict risk of relapse of childhood acute lymphoblastic [15]. Integrin Beta 3 (Itgb3) is essential for leukaemogenesis but dispensable for normal haematopoiesis hence suggesting that Itgb3 signalling pathway is a potential therapeutic target in AML [16]. Succinylacetone (SA; 4,6-dioxoheptanoic acid), a specific inhibitor of delta-aminolevulinic acid dehydrase (ALAD), leading to growth inhibition of leukaemia cells, [17] is associated with an important pathway in mouse erythroleukaemia cells [18]. In mouse, erythroleukaemias are associated with Friend virus infection where a clonal leukaemia develops through the proviral insertional activation of Spi1/Pu.1 leading to overexpression of PU.1 [19, 20]. Porphyria is a disease which can be caused by mutations in ALAD as well as the HMBS gene hydroxymethylbilane synthase (Hmbs). Interestingly Porphyria is often diagnosed in erythropoietic protoporphyria in association with haematological malignancy [21]. Two other genes identified are members of histone cluster 1 and a PcG methylation of the histone cluster 1 (including Hist1h1a and Hist1h4h) has been identified as an epigenetic marker of AML [22].
Table 4: Pathways affected at the transcriptional and protein level in all analysed AML materials in the multiomics analysis
A) overrepresented KEGG pathways
# | ID | KEGG pathway name | p-value |
---|---|---|---|
1 | mmu04066 | HIF-1 signalling pathway | 0.0000845 |
2 | mmu04145 | Phagosome | 0.0002630 |
3 | mmu00860 | Porphyrin and chlorophyll metabolism | 0.0004380 |
4 | mmu00010 | Glycolysis / Gluconeogenesis | 0.0011415 |
5 | mmu04919 | Thyroid hormone signalling pathway | 0.0038027 |
6 | mmu04380 | Osteoclast differentiation | 0.0041962 |
7 | mmu05034 | Alcoholism | 0.0062702 |
8 | mmu05205 | Proteoglycans in cancer | 0.0111167 |
9 | mmu04510 | Focal adhesion | 0.0114341 |
10 | mmu04015 | Rap1 signalling pathway | 0.0123001 |
11 | mmu04810 | Regulation of actin cytoskeleton | 0.0124103 |
12 | mmu05206 | MicroRNAs in cancer | 0.0171720 |
13 | mmu04320 | Dorso-ventral axis formation | 0.0198055 |
14 | mmu05216 | Thyroid cancer | 0.0229399 |
15 | mmu00030 | Pentose phosphate pathway | 0.0237220 |
16 | mmu00020 | Citrate cycle (TCA cycle) | 0.0245036 |
17 | mmu00051 | Fructose and mannose metabolism | 0.0276238 |
18 | mmu05020 | Prion diseases | 0.0276238 |
19 | mmu04151 | PI3K-Akt signalling pathway | 0.0299092 |
20 | mmu05219 | Bladder cancer | 0.0299578 |
21 | mmu05213 | Endometrial cancer | 0.0407804 |
22 | mmu05223 | Non-small cell lung cancer | 0.0438517 |
23 | mmu05221 | Acute myeloid leukemia | 0.0446180 |
24 | mmu04370 | VEGF signaling pathway | 0.0461490 |
25 | mmu04730 | Long-term depression | 0.0469137 |
26 | mmu05210 | Colorectal cancer | 0.0499664 |
B) overrepresented PANTHER pathways
# | ID | PANTHER Pathways | P value |
---|---|---|---|
1 | P02746 | Heme biosynthesis | 0.000054 |
2 | P02744 | Fructose galactose metabolism | 0.008570 |
3 | P00024 | Glycolysis | 0.000229 |
4 | P00020 | FAS signalling pathway | 0.023400 |
5 | P00032 | Insulin/IGF pathway-mitogen activated protein kinase kinase/MAP kinase cascade | 0.024100 |
6 | P05911 | Angiotensin II-stimulated signalling through G proteins and beta-arrestin | 0.026900 |
7 | P00011 | Blood coagulation | 0.040100 |
8 | P00056 | VEGF signalling pathway | 0.040800 |
9 | P00054 | Toll receptor signalling pathway | 0.040800 |
10 | P00010 | B cell activation | 0.045600 |
11 | P00034 | Integrin signalling pathway | 0.006930 |
The mitogen-activated protein kinase kinase 1 (MAP2K1) is a member of the dual specificity protein kinase family which acts as a mitogen-activated protein kinase (also known as extracellular signal-regulated kinase ERK). Mutations in FLT3 kinase which are frequent in AML patients and were identified in radiation-induced mouse AML [3], affect the extracellular signal-regulated kinase ERK1/2 [23]. Sorcin (Sri) is highly expressed in the heart and in the brain, and overexpressed in many cancer cells in general and in AML [24]. The subunit B Succinate dehydrogenase (SDHb) expression is modified in AML [25]. Levels of serum Lysozyme (Lyz2) are sometimes used as an aid for diagnostic AML subtyping and prognosis in AML [26]. Glutaredoxin (GRX) is a redox-regulating protein putatively associated with neoplastic process in a human leukaemia HL-60 cell line [27]. There are some reports on aldolase (Aldoa) activity in B chronic lymphocytic leukaemia and hyperaldolasaemia has been detected in patients with AML [28]; although literature screening failed to find a known direct involvement of Enolase 1 (Eno1) in AML, it is a tumour marker without recognised role in cancer [29]. Finally, the last gene identified, Annexin A6 (Anxa6) belongs to a family of calcium- and phospholipid-binding proteins in which increased expression leads to the constitutive activation of extracellular signal-regulated kinase ERK. Interestingly Annexin A1 was reported to be a PU.1 target in leukaemic cells [30].
Our analysis also highlights the potential importance of the MYC oncogene (Figures 8, 9); MYC is known to be commonly overexpressed in AML due to trisomy 8/15 (human/mouse), FLT3-ITD mutation, or gene amplifications (double minute chromosomes) [31]. C-Myc also rapidly induces acute myeloid leukaemia in mice when expressed in the bone marrow [32]. In radiation-induced mouse AML, significantly higher expression of c-myc was previously reported especially in the PU.1-deficient (deletion of one copy and R235 point mutation) AMLs [33] and interestingly in therapy-related AML, abnormalities of chromosomes 5 and/or 7 accounted for 76% of all cases with an abnormal karyotype and it has been reported that AML with a -5/del(5q) have a higher expression of c-MYC [34]. Furthermore, c-myc has been identified as an AML driver mutation in a retrovial transduction/transplantation screen in mice [35]. These data confirm that AMLs are characterised by deregulation of transcriptional networks that control the lineage specificity of gene expression. Collectively, the data suggest that this signature has potential value as a biomarker relevant for AML early detection and diagnosis, treatment or prediction of response to therapy. The potential wider application of the signature and its use on human as well as mouse material has been validated using two human AML datasets (Figure 10). Furthermore, while most of the 17 transcripts show similar levels of expression in the human AML datasets, four (CORO1A, ITGB3, RUVBL2, SRI) are expressed differentially between the two human AML datasets (Supplementary Figure S1). These may be useful for AML sub-typing; an issue that would benefit further investigation.
In summary, a combined transcriptome and proteome analysis has been carried out on two primary AMLs, the cell lines from these primaries, one additional AML cell line and an in vivo passage of that cell line. The data were analysed individually and in a combined multiomics analysis. This has revealed a small set of genes/proteins, many related to cancer development and leukaemia specifically, commonly affected in all materials. These provide an insight into potential novel candidates relevant to pathogenesis of radiation-induced AML in the mouse, and confirm that pathways of leukaemogensis in the mouse and human share substantial commonality in the pathways affected. The proteomics analysis suggested that Talin 1 might be involved in the late stage release of leukaemic cells from the bone marrow into the peripheral circulation and subsequent tissue infiltration. A general finding is that in AML cell lines, approximately 50% of the transcriptome changes are related to adaptation to cell culture, and in the proteome this proportion is higher. It is also clear that the growth microenvironment plays a major role in determining gene and protein expression patterns in these AML samples. Therefore it is more efficient to use primary materials to search for additional target genes and proteins related to radiation AML pathogenesis.
MATERIALS AND METHODS
An overview of the materials used is provided in Figure 1.
rAML induction experiments
The protocol for AML induction generated from CBA\H mice whole body irradiated has been described previously [36]. Briefly, mice were whole-body 3 Gy irradiated at 12–15 weeks of age with 250 kVp X-rays at a dose-rate of 0.887 Gy/min (MRC, Harwell, Oxon,UK). AMLs were diagnosed using the criteria described in the Bethesda Proposals for Classification of Non-Lymphoid Neoplasms in mice [37]. Mice were examined daily for signs of illness and euthanised with a rising concentration of CO2. Animals found to have increased white blood cell counts in the peripheral blood film and displaying splenomegaly or hepatosplenomegaly upon dissection were treated as potential AMLs. Samples of spleen were either stored at −70°C in RNAlater (Ambion,Austin, US) for nucleic acid extraction or disaggregated and used for flow cytometry or cytogenetics. All cases defined as AML had a rapid onset,with ≥20% immature forms/blasts found when spleen cell samples were analysed by flow cytometry, and a white blood cell count above that of controls (controls: approx.5–10 × 106/mL). Flow cytometry analysis furthermore established that cases of AML are further defined by cells surface marker expression [38]. Animals were bred and handled according to UK Home Office Animals (Scientific Procedures) Act 1986 and with guidance from the local ethical review committee on animal experiments.
RF12 and RF26 cell lines
Cell lines were established from disaggregated AML spleen cells. Cells (3-5 × 106) were washed with PBS (1200 rpm, 5 mins) and resuspend in 10 ml culture medium (RPMI 1640 with 10 % fetal bovine serum, FBS, and 10ng/ml IL3, Sigma). Cell suspensions were transferred to T25 vented flasks and placed in humidified incubator at 37°C and 5% CO2. Cells were maintained in these conditions for approximately 6 weeks, and split in to new flasks with fresh medium when confluent. IL3 was then removed from the medium, if the culture continued to grow independently of IL3, it became defined as an established cell line. It has been found that only a minority of AMLs adapt to growth in cell culture under this protocol.
MLP3 cell line
Derived from a primary AML induced in a CBA/H mouse from Drs Emmy Meijne and Rene Huiskamp (NRG, Petten, Netherlands). Cell lines were maintained in RPMI 1640 plus 20% foetal bovine serum, with added L-glutamine, penicillin and streptomycin at standard concentrations (Invitrogen, UK). Cultures were maintained in a humidified incubator at 37°C and 5% CO2.
AML passage
AML cells from spleen cells or cell lines grown in culture were passaged in to CBA recipient mice to obtain large amounts of material for genetic and protein analysis.
Cell suspensions were prepared by washing homogenised spleen or cells from in vitro culture twice in Iscove’s Modified Dulbecco’s Medium (IMDM medium, Sigma-Aldrich Co, St Louis, MO, USA). 1 × 106 cells in 250 ml IMDM medium were injected intraperitoneally in to each of 4 recipients. Recipient mice were monitored at least daily for presentation of AML. Any mice displaying symptoms of AML (decreased movement, hunched posture, rapid breathing, pallor, enlarged spleen on palpation), or any other significant deviation from normal were euthanized by a rising concentration of CO2. Tissues and cells were obtained and stored as for primary AMLs.
Sfpi1/PU.1 copy number assessment
The protocol used for chromosome 2 deletion identification has been described previously [38, 39].
Mutation status of Sfpi1/PU.1
DNA was extracted from AML using a DNeasy Blood and Tissue kit (Qiagen) according to manufacturer’s instructions, exon 5 of Sfpi1/PU.1 was amplified by PCR and sequenced as described by Suraweera et al [40].
Immunomagnetic cell separation
To obtain HPSC populations (Lin–or Lin–Sca-1+c-Kit+(LSK) cells), bone marrow cells were flushed from femora and tibias of 8 donor mice either exposed to 3 Gy X-rays 7–9 days beforehand or from unirradiated controls. Lin-depleted cells were selected using the Mouse Hematopoietic Progenitor Enrichment Kit (Stem Cell Technologies, Grenoble, France) according to the manufacturer’s instructions.
RNA extraction and transcriptome analysis
Total RNA was extracted from flash frozen cells using the AllPrep DNA/RNA/protein Mini kit (Qiagen, Hilden, Germany), quality-controlled using a 2100 BioAnalyzer (Agilent, Santa Clara, CA, USA) and quantified using a Nanodrop 2000c spectrophotometer (Thermo Scientific, Wilmington, USA). Only samples with an RNA Integrity Number ‘RIN’ >8.0 were used for further gene expression analysis. Using the WT Expression Kit (Ambion Inc, Austin, TX, USA), cDNA was prepared from 10 μg of purified cRNA, originally synthesised and purified from 0.25 μg of total RNA following the manufacturer’s instructions. The cDNA (2.75 μg) was then used for fragmentation and labeling using GeneChip Terminal Labelling Kit (Affymetrix, Santa Clara, CA, USA). Using GeneChip Hybridization, Wash and Stain (Hybridization module) (Affymetrix, Santa Clara, CA, USA), and Hybridization controls (Affymetrix, Santa Clara, CA, USA), fragmented and labelled cDNA was hybridized to Mouse Gene 2.0 ST Arrays (Affymetrix, Santa Clara, CA, USA). After hybridization under orbital rotation for 16 h at 45°C, arrays were washed and stained using GeneChip Hybridization, Wash and Stain Kit (Stain module) (Affymetrix, Santa Clara, CA, USA) according to the manufacturer’s instructions. Finally, arrays were scanned immediately using Affymetrix GeneChip Scanner GS 3000.
Proteomic analysis
ICPL Labelling and one-dimensional gel electrophoresis (1-DE)
Proteins from AML cell lines and AML primary cells were extracted using 1% Triton-X100 (40 mM Tris, pH=7.6) buffer. The extracted proteins were precipitated with the 2D clean-up kit (GE Healthcare) following the manufacturer’s instructions. The pellets were resuspended in ICPL lysis buffer (SERVA) and triplicate aliquots of 100 μg of protein were labelled with ICPL reagents (SERVA) as described previously [41, 42]. After labelling, equal amounts of light and heavy labelled samples were combined and separated by 12% SDS gel electrophoresis [43] before staining with colloidal Coomassie [44]. The gel lanes were cut in 3 slices and subjected to in-gel digestion. Prior to digestion, proteins were destained with 50 mM NH4HCO3 in 30% acetonitrile (ACN). In-gel digestion was performed overnight with trypsin of sequencing grade (SERVA Electrophoresis GmbH, Germany) using a total protein to enzyme ratio of 50:1 in 10 mM NH4HCO3. Peptides were extracted and acidified with 1% formic acid for subsequent mass spectrometry analysis.
LC/MS/MS analysis
The digested peptides were separated by reversed phase chromatography (PepMap, 15 cm × 75 μm ID, 3 μm/100Å pore size, LC Packings) operated on a nano-HPLC (Ultimate 3000, Dionex) with a nonlinear 170 min gradient using 2% acetonitrile in 0.1% formic acid in water (A) and 0.1% formic acid in 98% acetonitrile (B) and eluted with a flow rate of 250 nl/min. The gradient settings were: 0-140 min: 2-30% B, 140-150 min: 31-99% B, 151-160 min: stay at 99% B and equilibrate for 10 min at starting conditions. The nano-LC was connected to a linear quadrupole ion trap-Orbitrap (LTQ Orbitrap XL) mass spectrometer (Thermo Fisher, Bremen, Germany) equipped with a nano-ESI source. The mass spectrometer was operated in the data-dependent mode to automatically switch between Orbitrap-MS and LTQ-MS/MS acquisition. Survey full scan MS spectra (from m/z 300 to 1500) were acquired in the Orbitrap with resolution R = 60,000 at m/z 400 (after accumulation to a target of 1,000,000 charges in the LTQ). The method used allowed sequential isolation of up to ten most intense ions depending on signal intensity, for fragmentation on the linear ion trap using collision-induced dissociation at a target value of 100,000 ions with a normalised collision energy of 35 % and an activation time of 30 ms. Minimum signal intensity required was 200, isolation with 2 amu and default charge state 2. Precursor masses were selected in a data-dependent manner. High resolution MS scans in the Orbitrap and MS/MS scans in the linear ion trap were performed in parallel.
Target peptides already selected for MS/MS were dynamically excluded for 30 seconds. General mass spectrometry conditions were: electrospray voltage, 1.25-1.4 kV; no sheath and auxiliary gas flow. An activation Q-value of 0.25 and activation time of 30 ms were also applied for MS/MS. The acquired MS/MS spectra were searched against the Ensembl Mus musculus database using an in-house version of Mascot (release 62 with 54,576 sequences). A version of MASCOT (Matrix Science, version 2.3.02 with a number of residues of 26 203 053) was used with the following parameters: MS/MS spectra were searched with a precursor mass tolerance of 10 ppm and a fragment tolerance of 0.8 Da. MASCOT scores are probability-based MOWSE score: –10xLog(P), where P is the probability that the observed match is a random event. Scores >34 indicate identity or extensive homology; p <0.05. One missed cleavage was allowed. Carbamidomethylation was set as fixed modification. Oxidised methionine and the heavy and light ICPL labels of lysines as well as heavy and light ICPL labels of the protein N-terminus were set as variable modifications.
Data processing for protein identification and quantification of ICPL pairs was performed using Proteome Discoverer version 1.3 (Thermo Fisher) as described before [45]. Proteome Discoverer (Thermo Scientific) software performs automated statistical analysis of the results and uses unique peptides to calculate accurate relative protein quantification. All proteins showing significance p < 0.05 and fold-change >2 or <0.5 in Proteome Discoverer and Perseus software tool [45] were considered as deregulated.
Immunoblotting
For the validation of protein expression changes by immunoblotting, 20 μg protein extract was separated on 8% or 12% SDS polyacrylamide gels according to Laemmli [43]. Proteins were transferred to nitrocellulose membranes (GE Healthcare) using a semidry blotting system at 100 mA for 90 min. Membranes were saturated for one hour with 5% advance blocking reagent (GE Healthcare) in TBS (50 mM Tris.HCl, pH 7.6 and 150 mM NaCl) containing 0.1% Tween 20 (TBS/T). Blots were incubated overnight at +4°C with antibodies against either cMYC (Santa Cruz Biotechnology, Inc.) or thrombospondin (Abcam).
After washing three times in Tris-buffered saline/Tween 20 TBS/T, blots were incubated for one hour at room temperature with horseradish peroxidase-conjugated anti-mouse or anti-goat secondary antibody (Santa Cruz Biotechnology) in blocking buffer (TBS/T with 5% w/v advance blocking reagent). Immunodetection was performed either with ECL advance Western blotting detection kit (GE Healthcare) following standard procedures. The protein bands were quantified using ImageQuant 5.2 software (GE Healthcare) by integration of all pixel values in the band area after background correction and normalised to the loading control, RAD50 (GeneTex, Taiwan) or actin (Santa Cruz Biotechnology, Inc., US).
Bioinformatics, statistics and data handling
RMA algorithm [47] with Dai annotation [48, version 19, November 2014] was used for normalisation of transcriptomic data. Gaussian Mixture Model based algorithm [49] was applied for both noise level detection and filtration of uninformative features. Two-way and one-way ANOVA followed by Tukey-Kramer tests for pairwise comparisons were performed to verify the hypothesis on mean value equality independently on cell type and AML passage. The analysis was carried out separately for hypotheses on up- and down- regulation. Benjamini-Hochberg correction for multiple testing was applied to avoid high level of false discoveries [50]. Generalised Dice index [51] and its 95% confidence interval were used as a measure of set similarity. Functional analysis of differentially expressed genes was performed using Bioconductor TopGO package (Alexa A. and Rahnenfuhrer J.: topGO: Enrichment analysis for Gene Ontology. R package version 2.22.0) with parentchild method and hypergeometric test for overrepresentation [52].
In case of proteomic data, only proteins detected in at least four of six experimental conditions were considered. The missing data were imputed by k-nearest neighbours algorithm combined with linear regression technique [53]. The proteins with significant fold change greater than 2 or lower than 0.5 were recognised as deregulated.
Integration of multiomic data was performed by mapping of Protein Ensembl identifers to Gene Entrez identifiers with the use of Bioconductor biomaRt package [54]. Spearman correlation coefficient was calculated to measure association between gene and protein expression levels. Hierarchical clustering UPGMA (Unweighted Group Method with Arithmetic Mean) bottom-up algorithm with standardised Euclidean metric and average linkage was applied to create the samples’ dendrogram [55]. Validation of the 17 gene/protein signature utilised the publically available human AML datasets, GSE35008 (four AMLs with normal karyotype), and GSE35010 (six AMLs with monosomy of chromosome 7) [56, 57] - before performing UPGMA hierarchical clustering of averaged mouse and human samples, due to the different platforms on which gene expression was measured, expression levels for each of the 17 transcripts were transformed into z-scores within each dataset using the respective healthy control as a reference.
ACKNOWLEDGMENTS
The authors wish to thank Kevin Whitehill and Christopher Ottaway for assistance.
CONFLICTS OF INTEREST
The authors declare no conflicts of interest.
GRANT SUPPORT
The studies reported here were supported by the EC EURATOM DoReMi Network of Excellence, grant agreement 249689 and NCN Harmonia grant 2013/08/M/ST6/924 (JP and AB). Calculations were performed using the IT infrastructure supported by POIG.02.03.01-24-099/ grant: “GeCONiI - Upper Silesian Center for Computational Science and Engineering”.
REFERENCES
1. Hsu WL, Preston DL, Soda M, Sugiyama H, Funamoto S, Kodama K, Kimura A, Kamada N, Dohy H, Tomonaga M, Iwanaga M, Miyazaki Y, Cullings HM, et al. The incidence of leukemia, lymphoma and multiple myeloma among atomic bomb survivors: 1950-2001. Radiat. Res. 2013; 179: 361-382.
2. Verbiest T, Bouffler S, Nutt SL, and Badie C. PU.1 downregulation in murine radiation-induced acute myeloid leukaemia (AML): from molecular mechanism to human AML. Carcinogenesis. 2015; 36: 413-419.
3. Finnon R, Brown N, Moody J, Badie C, Olme CH, Huiskamp R, Meijne E, Sutmuller M, Rosemann M, and Bouffler SD. Flt3-ITD mutations in a mouse model of radiation-induced acute myeloid leukaemia. Leukemia. 2012; 26: 1445-1446.
4. Genik PC, Vyazunova I, Steffen LS, Bacher JW, Bielefeldt-Ohmann H, McKercher S, Ullrich RL, Fallgren CM, Weil MM, and Ray FA. Leukemogenesis in heterozygous PU.1 knockout mice. Radiat. Res. 2014; 182: 310-315.
5. Silver A, Moody J, Dunford R, Clark D, Ganz S, Bulman R, Bouffler S, Finnon P, Meijne E, Huiskamp R, and Cox R. Molecular mapping of chromosome 2 deletions in murine radiation-induced AML localizes a putative tumor suppressor gene to a 1.0 cM region homologous to human chromosome segment 11p11-12. Genes Chromosomes Cancer. 1999; 24: 95-104.
6. Alexa A, and Rahnenfuhrer J. TopGO: enrichment analysis for Gene Ontology. R package version, 2010 - bioconductor.uib.no.
7. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 2000; 25: 25-29.
8. de Sousa Abreu R, Penalva LO, Marcotte EM, and Vogel C. Global signatures of protein and mRNA expression levels. Mol. Biosyst. 2009; 5: 1512-1526.
9. Vogel C, and Marcotte EM. Insights into the regulation of protein abundance from proteomic and transcriptomic analyses. Nat. Rev. Genet. 2012; 13: 227-232.
10. Koussounadis A, Langdon SP, Um IH, Harrison DJ, and Smith VA. Relationship between differentially expressed mRNA and mRNA-protein correlations in a xenograft model system. Sci. Rep. 2015; 5: 10775. doi: 10.1038/srep10775.
11. Kang CB, Hong Y, Dhe-Paganon S, and Yoon HS. FKBP family proteins: immunophilins with versatile biological functions. Neurosignals. 2008; 16: 318-325.
12. Federzoni EA, Humbert M, Valk PJ, Behre G, Leibundgut EO, Torbett BE, Fey MF, and Tschan MP. The actin-binding protein CORO1A is a novel PU.1 (SPI1)- and CEBPA-regulated gene with significantly lower expression in APL and CEBPA-mutated AML patients. Br. J. Haematol. 2013; 160: 855-859.
13. Breig O, Bras S, Martinez Soria N, Osman D, Heidenreich O, Haenlin M, and Waltzer L. Pontin is a critical regulator for AML1-ETO-induced leukemia. Leukemia. 2014; 28: 1271-1279.
14. Storr J, Dolan G, Coustan-Smith E, Barnett D, and Reilly JT. Value of monoclonal anti-myeloperoxidase (MPO7) for diagnosing acute leukaemia. J. Clin. Pathol. 1990; 43: 847-849.
15. Wesolowska-Andersen A, Borst L, Dalgaard MD, Yadav R, Rasmussen KK, Wehner PS, Rasmussen M, Orntoft TF, Nordentoft I, Koehler R, Bartram CR, Schrappe M, Sicheritz-Ponten T, et al. Genomic profiling of thousands of candidate polymorphisms predicts risk of relapse in 778 Danish and German childhood acute lymphoblastic leukemia patients. Leukemia. 2015; 29: 297-303.
16. Miller PG, Al-Shahrour F, Hartwell KA, Chu LP, Jaras M, Puram RV, Puissant A, Callahan KP, Ashton J, McConkey ME, Poveromo LP, Cowley GS, Kharas MG, et al. In Vivo RNAi screening identifies a leukemia-specific dependence on integrin beta 3 signaling. Cancer Cell. 2013; 24: 45-58.
17. Ebert PS, Hess RA, and Tschudy DP. Augmentation of hematoporphyrin uptake and in vitro-growth inhibition of L1210 leukemia cells by succinylacetone. J. Natl. Cancer Inst. 1985; 74: 603-608.
18. Fuchs O. Ferrochelatase, glutathione peroxidase and transferrin receptor mRNA synthesis and levels in mouse erythroleukemia cells. Stem Cells. 1993; 11 Suppl 1: 13-23.
19. Moreau-Gachelin F, Tavitian A, and Tambourin P. Spi-1 is a putative oncogene in virally induced murine erythroleukaemias. Nature. 1988; 331: 277-280.
20. Nguyen VC, Ray D, Gross MS, de Tand MF, Frezal J, and Moreau-Gachelin F. Localization of the human oncogene SPI1 on chromosome 11, region p11.22. Hum. Genet. 1990; 84: 542-546.
21. Bharati A, Badminton MN, Whatley SD, O’Brien DV, and Bell HK. Late-onset erythropoietic protoporphyria in association with haematological malignancy. Clin. Exp. Dermatol. 2006; 31: 668-670.
22. Tiberi G, Pekowska A, Oudin C, Ivey A, Autret A, Prebet T, Koubi M, Lembo F, Mozziconacci MJ, Bidaut G, Chabannon C, Grimwade D, Vey N, et al. PcG methylation of the HIST1 cluster defines an epigenetic marker of acute myeloid leukemia. Leukemia. 2015; 29: 1202-1206.
23. Radomska HS, Basseres DS, Zheng R, Zhang P, Dayaram T, Yamamoto Y, Sternberg DW, Lokker N, Giese NA, Bohlander SK, Schnittger S, Delmotte MH, Davis RJ, et al. Block of C/EBP alpha function by phosphorylation in acute myeloid leukemia with FLT3 activating mutations. J. Exp. Med. 2006; 203: 371-381.
24. Tan Y, Li G, Zhao C, Wang J, Zhao H, Xue Y, Han M, and Yang C. Expression of sorcin predicts poor outcome in acute myeloid leukemia. Leuk. Res. 2003; 27: 125-131.
25. Schildgen V, Wulfert M, and Gattermann N. Impaired mitochondrial gene transcription in myelodysplastic syndromes and acute myeloid leukemia with myelodysplasia-related changes. Exp. Hematol. 2011; 39: 666-675.e661.
26. Astrom M, Bodin L, Hornsten P, Wahlin A, and Tidefelt U. Evidence for a bimodal relation between serum lysozyme and prognosis in 232 patients with acute myeloid leukaemia. Eur. J. Haematol. 2003; 70: 26-33.
27. Ulger C, Toruner GA, Alkan M, Mohammed M, Damani S, Kang J, Galante A, Aviv H, Soteropoulos P, Tolias PP, Schwalb MN, and Dermody JJ. Comprehensive genome-wide comparison of DNA and RNA level scan using microarray technology for identification of candidate cancer-related genes in the HL-60 cell line. Cancer Gene Cytogenet. 2003; 147: 28-35.
28. Morioka M. [Clinical significance of aldolase A in sera of patients with leukemia]. Rinsho Ketsueki. 1992; 33: 1191-1198. Japanese.
29. Vizin T, and Kos J. Gamma-enolase: a well-known tumour marker, with a less-known role in cancer. Radiol. Oncol. 2015; 49: 217-226.
30. Iseki Y, Imoto A, Okazaki T, Harigae H, and Takahashi S. Identification of annexin 1 as a PU.1 target gene in leukemia cells. Leuk. Res. 2009; 33: 1658-1663.
31. Alitalo K, Saksela K, Winqvist R, Alitalo R, Keski-Oja J, Laiho M, Ilvonen M, Knuutila S, and de la Chapelle A, Acute myelogenous leukaemia with c-myc amplification and double minute chromosomes. Lancet. 1985; 2: 1035-1039.
32. Luo H, Li Q, O’Neal J, Kreisel F, Le Beau MM, and Tomasson MH. c-Myc rapidly induces acute myeloid leukemia in mice without evidence of lymphoma-associated antiapoptotic mutations. Blood. 2005; 106: 2452-2461.
33. Hirouchi T, Takabatake T, Yoshida K, Nitta Y, Nakamura M, Tanaka S, Ichinohe K, Oghiso Y, and Tanaka K. Upregulation of c-myc gene accompanied by PU.1 deficiency in radiation-induced acute myeloid leukemia in mice. Exp. Hematol. 2008; 36: 871-885.
34. Larson RA, and Le Beau MM. Therapy-related myeloid leukaemia: a model for leukemogenesis in humans. Chem. Biol. Interact. 2005; 153-154: 187-195.
35. Jang SH, Lee S, and Chung HY. Characterization of Leukemia-Inducing Genes Using a Proto-Oncogene/Homeobox Gene Retroviral Human cDNA Library in a Mouse In Vivo Model. PLoS One. 2015; 10: e0143240. doi: 10.1371/journal.pone.0143240.
36. Olme CH, Brown N, Finnon R, Bouffler SD, and Badie C. Frequency of acute myeloid leukaemia-associated mouse chromosome 2 deletions in X-ray exposed immature haematopoietic progenitors and stem cells. Mutat. Res. 2013; 756: 119-126.
37. Kogan SC, Ward JM, Anver MR, Berman JJ, Brayton C, Cardiff RD, Carter JS, de Coronado S, Downing JR, Fredrickson TN, Haines DC, Harris AW, Harris NL, et al. Bethesda proposals for classification of nonlymphoid hematopoietic neoplasms in mice. Blood. 2002; 100: 238-245.
38. Brown NL, Finnon R, Bulman RA, Finnon P, Moody J, Bouffler SD, and Badie C. Sfpi1/PU.1 mutations in mouse radiation-induced acute myeloid leukaemias affect mRNA and protein abundance and associate with disrupted transcription. Leuk. Res. 2011; 35: 126-132.
39. Peng Y, Brown N, Finnon R, Warner CL, Liu X, Genik PC, Callan MA, Ray FA, Borak TB, Badie C, Bouffler SD, Ullrich RL, Bedford JS, et al. Radiation leukemogenesis in mice: loss of PU.1 on chromosome 2 in CBA and C57BL/6 mice after irradiation with 1 GeV/nucleon 56Fe ions, X rays or gamma rays. Part I. Experimental observations. Radiat. Res. 2009; 171: 474-483.
40. Suraweera N, Meijne E, Moody J, Carvajal-Carmona LG, Yoshida K, Pollard P, Fitzgibbon J, Riches A, van Laar T, Huiskamp R, Rowan A, Tomlinson IP, and Silver A. Mutations of the PU.1 Ets domain are specifically associated with murine radiation-induced, but not human therapy-related, acute myeloid leukaemia. Oncogene. 2005; 24: 3678-3683.
41. Barjaktarovic Z, Schmaltz D, Shyla A, Azimzadeh O, Schulz S, Haagen J, Dorr W, Sarioglu H, Schafer A, Atkinson MJ, Zischka H, and Tapio S. Radiation-induced signaling results in mitochondrial impairment in mouse heart at 4 weeks after exposure to X-rays. PLoS One. 2011; 6: e27811. doi: 10.1371/journal.pone.0027811.
42. Barjaktarovic Z, Shyla A, Azimzadeh O, Schulz S, Haagen J, Dorr W, Sarioglu H, Atkinson MJ, Zischka H, and Tapio S. Ionising radiation induces persistent alterations in the cardiac mitochondrial function of C57BL/6 mice 40 weeks after local heart exposure. Radiother. Oncol. 2013; 106: 404-410.
43. Laemmli UK. Cleavage of structural proteins during the assembly of the head of bacteriophage T4. Nature. 1970; 227: 680-685.
44. Heukeshoven J, and Dernick R. Simplified method for silver staining of proteins in polyacrylamide gels and the mechanism of silver staining. ELECTROPHORESIS. 1985; 6: 103-112.
45. Azimzadeh O, Scherthan H, Sarioglu H, Barjaktarovic Z, Conrad M, Vogt A, Calzada-Wack J, Neff F, Aubele M, Buske C, Atkinson MJ, and Tapio S. Rapid proteomic remodeling of cardiac tissue caused by total body ionizing radiation. Proteomics. 2011; 11: 3299-3311.
46. Geiger T, Cox J, and Mann M. Proteomic changes resulting from gene copy number variations in cancer cells. PLoS Genet. 2010; 6: e1001090. doi: 10.1371/journal.pgen.1001090.
47. Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, and Speed TP. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003; 4: 249-264.
48. Dai M, Wang P, Boyd AD, Kostov G, Athey B, Jones EG, Bunney WE, Myers RM, Speed TP, Akil H, Watson SJ, and Meng F. Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data. Nucleic Acids Res. 2005; 33: e175. doi: 10.1093/nar/gni179.
49. Marczyk M, Jaksik R, Polanski A, and Polanska J. Adaptive filtering of microarray gene expression data based on Gaussian mixture decomposition. BMC Bioinformatics. 2013; 14: 101. doi: 10.1186/1471-2105-14-101.
50. Benjamini Y, and Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society. Series B (Methodological). 1995; 57: 289-300.
51. Dice LR. Measures of the Amount of Ecologic Association Between Species. Ecology. 1945; 26: 297-302.
52. Alexa A, Rahnenfuhrer J, and Lengauer T. Improved scoring of functional groups from gene expression data by decorrelating GO graph structure. Bioinformatics. 2006; 22: 1600-1607.
53. Armitage EG, Godzien J, Alonso-Herranz V, Lopez-Gonzalvez A, and Barbas C. Missing value imputation strategies for metabolomics data. Electrophoresis. 2015; 36: 3050-3060.
54. Durinck S, Spellman PT, Birney E, and Huber W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat. Protoc. 2009; 4: 1184-1191.
55. Sokal, R, and Michener, C. A statistical method for evaluating systematic relationships. University of Kansas Science Bulletin, 1958; 38:1409–1438.
56. Barreyro L, Will B, Bartholdy B, Zhou L, Todorova TI, Stanley RF, Ben-Neriah S, Montagna C, Parekh S, Pellagatti A, Boultwood J, Paietta E, Ketterling RP, et al. Overexpression of IL-1 receptor accessory protein in stem and progenitor cells and outcome correlation in AML and MDS. Blood. 2012; 120: 1290-1298.
57. Schinke C, Giricz O, Li W, Shastri A, Gordon S, Barreyro L, Bhagat T, Bhattacharyya S, Ramachandra N, Bartenstein M, Pellagatti A, Boultwood J, Wickrema A, et al. IL8-CXCR2 pathway inhibition as a therapeutic strategy against MDS and AML stem cells. Blood. 2015; 125: 3144-3152.