- Open Access
Elucidating the identity of resistance mechanisms to prednisolone exposure in acute lymphoblastic leukemia cells through transcriptomic analysis: A computational approach
Journal of Clinical Bioinformatics volume 1, Article number: 36 (2011)
It has been shown previously that glucocorticoids exert a dual mechanism of action, entailing cytotoxic, mitogenic as well as cell proliferative and anti-apoptotic responses, in a dose-dependent manner on CCRF-CEM cells at 72 h. Early gene expression response implies a dose-dependent dual mechanism of action of prednisolone too, something reflected on cell state upon 72 h of treatment.
In this work, a generic, computational microarray data analysis framework is proposed, in order to examine the hypothesis, whether CCRF-CEM cells exhibit an intrinsic or acquired mechanism of resistance and investigate the molecular imprint of this, upon prednisolone treatment. The experimental design enables the examination of both the dose (0 nM, 10 nM, 22 uM, 700 uM) effect of glucocorticoid exposure and the dynamics (early and late, namely 4 h, 72 h) of the molecular response of the cells at the transcriptomic layer.
In this work, we demonstrated that CCRF-CEM cells may attain a mixed mechanism of response to glucocorticoids, however, with a clear preference towards an intrinsic mechanism of resistance. Specifically, at 4 h, prednisolone appeared to down-regulate apoptotic genes. Also, low and high prednisolone concentrations up-regulates genes related to metabolism and signal-transduction in both time points, thus favoring cell proliferative actions. In addition, regulation of NF-κB-related genes implies an inherent mechanism of resistance through the established link of NF-κB inflammatory role and GC-induced resistance. The analysis framework applied here highlights prednisolone-activated regulatory mechanisms through identification of early responding sets of genes. On the other hand, study of the prolonged exposure to glucocorticoids (72 h exposure) highlights the effect of homeostatic feedback mechanisms of the treated cells.
Overall, it appears that CCRF-CEM cells in this study exhibit a diversified, combined pattern of intrinsic and acquired resistance to prednisolone, with a tendency towards inherent resistant characteristics, through activation of different molecular courses of action.
Resistance to glucocorticoids (GC) is considered to be one of the most important factors in the prognosis of leukemia [1, 2]. In a previous study, it has been shown that when a resistant T-cell leukemia cell line (CCRF-CEM) is treated with prednisolone, the drug exerts a dual (biphasic) effect on these cells . At low doses, prednisolone has a mitogenic/anti-apoptotic effect, whereas at higher doses it manifests a cytotoxic/mitogenic effect. Also, it has been shown that the actual underlying effect of prednisolone, either mitogenic or cytotoxic, becomes apparent at 72 h of prednisolone exposure, providing evidence for activation of a cellular, homeostatic, feedback mechanism at the transcriptional or translational layer (protein synthesis) .
In addition, it remains elusive whether cells possess inherent mechanisms inducing GC tolerance on them, or their responce upon GC treatment is one of gradual adjustment, meaning that originally sensitive cells become resistant. Thus, as glucocorticoid receptor regulates directly or indirectly several thousands of genes, this partly refers to activation of genes related to anti-apoptosis and mitogenesis. In this sense, those mechanisms may possibly, through intricate, regulatory actions and cross-talks, confer to the induction of resistance in leukemic cells. Apoptosis evasion, or proliferation stimulation are two alternative mechanisms through which cells exhibit resistance. In the present work we refer to acute lymphoblastic leukemia (ALL), though glucocorticoid treatment belongs to the first-line of medications against lymphoid malignancies in general [4, 5]. There is adequate evidence supporting a far more intricate mechanism of resistance to glucocorticoids than mere down-regulation of steroid receptors.
In this sense, several, possible resistance mechanisms of leukemic cells to glucocorticoid administration have been proposed, like the presence of somatic mutations on the GR gene that may lead to aberrant regulation of the receptor through intracellular signaling. Besides, several polymorphisms, but not somatic mutations, have been found in normal and ALL populations, not linked to resistance or sensitivity induction though, either in vivo or in vitro[6, 7]. Other GC resistance scenarios are emphasizing in defects in intracellular signaling pathways that involve interactions of GR with other sequence-specific transcription factors, such as AP-1 and Nuclear Factor kappa-light-chain-enhancer of activated B cells (NF-κB) . In a normal cell, ligand-activated GR may potentially interfere with transcription factor c-Jun or p65 NF-κB and thereby repress genes promoting cell proliferation and cell survival [6, 9, 10]. GR-dependent inhibition of the transcription factor p65 NF-κB, plays a significant role in the manifestation of apoptotic and anti-apoptotic effects of GR in leukemia cells and has been identified as a pivotal component of the mechanism of cancer cell resistance to chemotherapy . Previous studies of GC effects on leukemia cells identified c-myc and cyclin D3 as early GR-regulated targets, in GC-sensitive cells . Further studies showed that introduction of a conditionally expressed cyclin-dependent kinase inhibitor p16 (INK4A) gene, sensitized GC-resistant leukemia cells, through induction of cell cycle arrest .
Thus, p16 inactivation may change GR levels, affecting GR-mediated gene regulation and resulting in resistance to GCs. For this purpose, the parental CCRF-CEM cell line was chosen as the system of study for the effects of prednisolone treatment, a T-cell leukemia cell line characterized by a mutation (L753F) on one GR gene allele that impairs ligand binding . It is known that both the DNA and ligand binding domains of the GR are required in order to repress NF-κB transactivation . Interestingly, concerning the question whether this mutation would affect GC resistance, it has been reported previously that both the GC-resistant, as well as the GC-sensitive CCRF-CEM subclones, express heterogeneous populations of the GR (GRwt/GRL753F) [15, 16]. The CCRF-CEM cell line has been reported to be resistant to GCs, presumably due to the accumulation of more resistant variants after long periods of prolonged culture . In addition, utilization of an in vitro system provides reproducibility, an expedient system to systematically examine the impact of intracellular signals and at the same time minimize the effect of undesired crosstalks introduced by other in vivo-participating systems.
A detailed molecular explanation of the intricate mechanisms, underlying the resistance phenotype to GC-induced apoptosis, remains elusive. The present work proposes a rational computational framework in order to aid the elucidation of the question whether the system under study, has intrinsic or acquired mechanisms of resistance. Our presumption is that the system in study possesses an intrinsic mechanism of resistance to glucocorticoids i.e. prednisolone. Using the proposed computational analysis workflow, we have analyzed microarray data from two time points (4 and 72 h treatment) and three different concentrations (10 nM, 22 uM and 700 uM). For the 4 h time point, we used a 1.2 k platform, comprising of cancer specific genes, which has been reported and analyzed previously [3, 18]. In order to expand our view of prednisolone effects on the cell line, we used a 4.8 k platform. Genes included in the 1.2 k platform are also represented in the 4.8 k platform. Data analysis was performed in order to find groups of genes associated with characteristics related to anti-apoptosis and apoptosis, cell cycle arrest, drug resistance etc.
The CCRF-CEM cell line was obtained from the European Collection of Cell Cultures (ECACC). Concentrations of prednisolone (Pharmacia) were: 0 uM (control), 10 nM, 22 uM, and 700 uM. In general, all three prednisolone concentrations correspond to in vivo dosages administrated intravenously to children at ages between 1 month and 12 years old . Specifically, the 10 nM and 700 uM prednisolone concentrations were chosen as indicative of manifestation of specific phenotypic effects, i.e. anti-apoptosis accompanied with mitogenic effect and cytotoxicity accompanied with resistance, as observed by flow cytometry . Moreover, the high concentration (700 uM) used, is similar to concentrations used in different studies in CCRF-CEM cells  as well as primary cell cultures derived from childhood ALL patients . The 22 uM prednisolone concentration was chosen as an intermediate concentration between the aforementioned two.
RNA was isolated with Trizol (Invitrogen Inc.) according to the manufacturer's instructions. At least 40 ug of RNA from each sample was used. cDNA microarray chips from two platforms (1.2 k and 4.8 k) were obtained from TAKARA (IntelliGene® Human Cancer CHIP Version 4.0 and IntelliGene® II CHIP, respectively). Hybridization was performed with the CyScribe Post-Labeling kit (RPN5660, Amersham) as described by the manufacturer . The experimental setups consisted of the five following pairs: control vs. 10 nM prednisolone at 4 h (designated as '1'), 10 nM vs. 700 uM prednisolone at 4 h (designated as '2'), control vs. 700 uM prednisolone at 4 h (designated as '3'), 22 uM vs. 700 uM prednisolone at 72 h (designated as '4'), and control vs. 700 uM prednisolone at 72 h (designated as '5'). In this study, triplicate hybridizations are utilized for the 4 h experiments. Experimental pairs were co-hybridized on the same slide, each stained with a different fluorophore. Fluorophores used were Cy3 and Cy5. Slides were scanned with the ScanArray 4000XL microarray scanner. Images were generated with ScanArray microarray acquisition software (GSI Lumonics, USA). Image analysis was performed with the ImaGene® 6.0 software (Biodiscovery Inc., USA). The raw datasets have been deposited in NCBI's Gene Expression Omnibus (GEO), and are accessible through GEO Series accession number [GEO: GSE28154].
A common for both platforms data preprocessing stage, namely the median intensity value in each channel, was applied to the raw data. Specifically, the well performing robust version of the robust loess-based background correction (rLsBC) approach, as proposed by , was applied. rLsBC assumes that the background noise affects the spot intensities in a multiplicative manner . Instead of using the measurements of the local (feature-related) background for the correction, rLsBC utilizes the regression estimate of the logarithmic background distribution BR, Gaccording to the logarithmic foreground intensity FR, Gfor each channel (R: Red and G: Green). Thus, rLsBC provides a robust estimation of the channel-specific background noise, utilized to background-correct the logarithmic foreground intensities:
where is the logarithmic background-corrected foreground intensity, and the robust estimate of background noise, for each channel. The absolute background-corrected foreground intensity for each channel is then calculated as:
In order to reduce the complexity of the data set, we followed the replicate averaging approach proposed by . In this approach, instead of estimating a constant c and utilizing it to adjust each of the individual replicate measurements, equivalently the replicates were averaged by taking their geometric mean, that is:
where is the (background-corrected) foreground intensity of the replicate r i , i = 1, 2, 3, and is the averaged foreground intensity across all replicates (henceforth referred simply as signal intensity), for each channel (Red and Green).
Since outliers can significantly influence one or more of the subsequent processing steps, extreme outlier values, that is signal intensities deviating more than 3 interquartile distances from the first or the third quartile , were identified and excluded in an iterative process.
The signal intensities of each dataset were further normalized in order to mitigate the effect of extraneous, non-biological variation in the measured gene expression levels. The robust version of the intensity-dependent scatter-plot smoother loess [25, 26] with a quadratic polynomial model was applied to the M-A scatter-plot , where M and A are the log-ratio and log-mean, respectively:
where FR and FG are the corresponding logarithmic signal intensities for each channel. The smoothing parameter of the loess procedure used equals to 10%, which was considered appropriate for the relatively small number of probes attached in the microarrays.
In order to identify and remove dubious features, a filtering approach was followed based on (i) the loop-design [28–31] used for the experimental setups at 4 h (experiments '1', '2', and '3'), and (ii) the philosophy of the replicate filtering approach proposed by . In particular, the fold changes of experiment '1' (control vs. 10 nM prednisolone) and experiment '2' (10 nM vs. 700 uM prednisolone) should roughly equal the fold change in experiment '3' (control vs. 700 uM prednisolone), that is:
where are the signal intensity of the experiment i, i = 1, 2, 3, for each channel, or equivalently, the following quantity should ideally be equal to zero:
where M i is the log-ratio of the experiment i, i = 1, 2, 3. Thus, we sought to filter out features whose LdF deviates greatly from this expected value of zero. We calculated the mean and standard deviation (SD) of LdF, and eliminated features whose LdF is greater than 2 SD from the mean. The selected number of standard deviations of the mean ensures that on the one hand the features used for the subsequent analysis roughly comply with this loop-design rule with relatively high confidence, while on the other hand selection thresholds are not too stringent, and thus reject potentially interesting genes.
Before proceeding to the cross-platform analysis, since two different microarray platforms (IntelliGene® Human Cancer CHIP Version 4.0 and IntelliGene® II CHIP) were utilized in the present study, we chose to perform data integration at a lower level , instead of conducting a meta-analysis. That is, the preprocessed datasets of each array were combined to a single, unified dataset, in which standard statistical procedures were applied. Nonetheless, whenever applicable to the nature of the present study, some of the key issues that need to be addressed, i.e. preprocessing, preparation and annotation of the individual datasets, complied to the guidelines suggested by .
In order to perform data integration, two main issues had to be resolved: (i) matching reporters on the two microarray platforms, and (ii) normalizing data to address platform related differences .
The first task was rather straightforward, since both platforms were obtained from the same manufacturer and many reporters were common between the 1.2 k and the 4.8 k platforms. Specifically, each reporter-level identifier (GenBank accession numbers) was mapped to a UniGene identifier (UniGene Cluster ID) [36–40]. The mapping was performed through the web-based tool SOURCE  in the 19th of November 2010, simultaneously for both platforms, in order to avoid inconsistencies . All mapped reporter-level identifiers had one-to-one relationship with the gene-level identifiers that is, each reporter was associated with a single UniGene identifier and no more than one reporter was mapped to the same UniGene identifier. Reporters having insufficient information to be mapped to any gene-level identifier were omitted. Thus, a fully updated set of unique gene-level identifiers was generated for each platform. The intersection of these two sets formed a common gene set (CGS) consisting of 490 UniGene identifiers common in all setups, which was utilized for the subsequent analysis.
Then, a cross-platform normalization procedure was applied in order to address batch effect issues, namely the median rank score (MRS) normalization method [43, 44]. After choosing a microarray set as reference, this simple approach replaces the expression values for all the others (non-reference) microarray sets by median expression values of genes from the reference set. The reference set of choice was the 4.8 k platform set, since it outperforms in data quality when compared to the 1.2 k one . The improved comparability of the data from the two different platforms after cross-platform normalization is shown in Additional File 1, where it can be seen that the distributions of gene expression values derived from the 1.2 k and 4.8 k platforms are more similar after application of MRS in comparison to non-integrated data.
The following analysis steps were performed on the integrated gene expression values of the CGS.
Identification of differentially expressed genes
In order to identify potential, differentially expressed (DE) genes for each slide, whenever the experimental design did not include replicates for the same condition, we adjusted a selection process, exploiting the intensity-dependent calculation of the standard Z-score  of each feature within each slide. This approach utilizes a sliding window to calculate smoothed local means and standard deviations (SDs), which are then used to calculate the Z-score of the logarithmic ratio values for each data point in the normalized MA-plot. At the present study, a sliding window of width equal to 20% of the total number of data points was utilized. The selected percentage on one hand represents a plausible tradeoff between the adequate sample size for statistics calculation, and neighbourhood sensitivity regarding the intensity-dependent action. The DE genes per experiment were identified at a confidence level of 95%.
In order to partition the gene expression profiles throughout all five experimental setups and exploit similarities in the expression profiles, for surmising functional relevance, possibly through common regulatory actions, which orchestrate genomic expression with respect to the studied phenotypes in this study, an unsupervised cluster analysis was utilized.
The unsupervised clustering was applied to an appropriately selected subset of the CGS, since simulations have indicated that keeping irrelevant genes during cluster analysis, results in reduced accuracy . Specifically, each gene of the CGS was ranked by SD across experiments. The top 100 genes with the highest SD (SD100) were selected for downstream analysis. As demonstrated in the recent thorough evaluation study for two-channel microarray data , this widely used method is one of the best performing gene selection approaches.
The method utilized for cluster analysis was the k-means clustering [47–49], considered as one of the best performing clustering options in particular for microarray class discovery studies . The k-means algorithm applied  uses the squared Euclidean as a distance measure, which has been frequently found to outperform, regargding ratio-based measurements . Also, in order to evade the problem of possible local minima, the algorithm is repeated a number of executions with different initializations. Specifically, the clustering procedure is repeated 100 times, each with a new set of initial cluster centroid positions (seeds), selected at random, and the best execution (the one that minimizes the sum, over all clusters, of the within-cluster sums of object-to-cluster-centroid distances) is taken as the final result.
In order to predict the optimal number of clusters for interpretation, which constitutes a fundamental problem in unsupervised clustering, a cluster validity measure was applied. The validity measure of choice was the average silhouette width for the entire dataset S comprising N objects [52, 53]:
where S i is the silhouette width of object i, which is defined as:
where a i is the average distance between object i and all the other objects in the cluster, and b i is the minimum of the average distances between i and objects in other clusters. To determine the optimal number of clusters , the clustering was executed for k varying between 2 to 30. For each k, the aforementioned k-means algorithm was repeated 1,000 times. The best (maximum) values of obtained by each k were plotted as the function of k (Additional File 2). As illustrated in this figure, since the plot did not exhibit any specific increasing or decreasing trend, we sought for its maximum value . Thus, the optimal cluster number was the one corresponding to the maximum value of the plot and was found to be equal to 7.
The implementation of all preprocessing steps, along with the identification of the DE genes and cluster analysis, was performed in the Matlab® 7.9 (R2009b) (The MathWorks Inc. Natick, MA, USA) computing environment.
Gene Ontology based analysis
In order to compare different groups of genes, highlighting different functionalities among all experimental setups, interesting genes formed study sets that were further subjected to Gene Ontology (GO) based analysis to test the nature of the observed resistance mechanism. Specifically, for each study set formed, statistical analysis of GO term overrepresentation was performed against the CGS, which was utilized as a reference set, as proposed by . The chosen approach was the parent-child-union method , since it was found to outperform the standard method of overrepresentation analysis (ORA) in GO. The standard approach treats each GO term independently and hence does not take dependencies between parent and child terms into account, ignoring the structure of the GO hierarchy. It was shown that this behavior can result in certain types of false positive results, with potentially misleading biological interpretation . In contrast, the parent-child method measures the overrepresentation of a term with respect to the presence of its parental terms in the set, and hence resolving the problem of the standard method which tends to falsely detect an overrepresentation of more specific terms below terms known to be overrepresented.
ORA was performed with the publicly available Ontologizer 2.0 tool  using GO terms definitions and associations between genes and GO downloaded from the Gene Ontology consortium  on the 26th of November 2010.
In order to identify the transcription factors driving the observed changes in the gene expression, we investigated the Transcription Factor Binding Motifs (TFBMs) in the Transcription Element Listening System Database (TELiS) . The TRANSFAC transcription factor database was used for the identification of gene transcription factor binding sites .
Chromosome mapping appears to be a promising method for identifying patterns among genes. The main idea reported initially by  is to map genes in chromosomal regions and in this way, if correlations do exist, they appear through the location of genes on chromosomal regions, since consistent expression of whole functional entities is related with activation of given chromosomal regions. For chromosome mapping analyses, we used the Gene Ontology Tree Machine, WebGestalt web-tool .
Pathway analysis was performed in order to find genes that participate in known pathways related to the investigated mechanisms, such as apoptosis or cell proliferation. This approach was used in order to gain further insight into the mechanisms underlying the common genes identified by our proposed model. The differentially expressed genes were mapped on different pathways using the Pathway Explorer software . First of all, it was investigated the percentage of genes present in all known pathways using the databases available through the Pathway Explorer software. The KEGG Pathways database was used for our analysis [64–66].
Hypothesis examination and computational work flow
In order to answer the question of inherently disposed or acquired resistance and examine our initial working hypothesis, significant genes derived from the computational workflow were grouped in three sets, for each experiment i, with i = 1...5, S i , U i , and D i , where: S i includes genes which expression is considered to be unchanged, U i includes genes up-regulated, and D i includes genes down-regulated. Furthermore, in order to facilitate interpretation, gene functions were categorized in two major categories, F p and F a , where F p is related to apoptosis evasion and cell cycle progression, whereas F a is related to apoptosis induction and cell cycle arrest. In other words, F p is related to genes that are not desired targets of prednisolone regulation and the glucocorticoid receptor (GR) in the system under study, while F a is related to genes involved in apoptosis, which are the desired ones of GC regulation.
Concerning the 4 h treatment, namely the early treatment, the following functional categories were defined:
i) If genes simultaneously remain unaffected at the low and high prednisolone doses (intersection of sets S 1 and S 3 (Figure 1a), which points out dose-independent genes), and GO functions are related to F p , then those functions seem not to confer to the manifestation of resistive behavior by the cells, thus suggesting that these cells are originally sensitive and gradually develop resistant phenotypic characteristics. On the contrary, if gene functions are related to F a then cells tend to remain unresponsive to GC treatment, thus implying an intrinsic mechanism of resistance.
ii) If genes are simultaneously up-regulated at both concentrations (intersection of sets U1 and U3 (Figure 1b), which points out probable early time-dependent genes or a generalized reaction mechanism to GCs which is dose-independent), and GO functions are related to F p , then cells have an intrinsic mechanism of resistance. On the contrary, if gene functions are related to F a then cells are originally sensitive and gradually develop, or homeostatically adjust to resistant phenotypic characteristics.
iii) If genes are simultaneously down-regulated at the low and high prednisolone doses (intersection of sets D1 and D3 (Figure 1c), which points out dose-independent genes, that manifest the same expression profile under these two extreme GC concentrations, viz, down-regulated in the cell system in its natural condition), and GO functions are related to F p , then cells are originally sensitive and gradually develop resistant phenotypic characteristics. On the contrary, if gene functions are related to F a this predicates upon an intrinsic mechanism of resistance by the cells.
iv) If genes are simultaneously up-regulated at the low prednisolone concentration and down-regulated at the high prednisolone dose (intersection of sets U1 and D3 (Figure 1d), which points out the dose-dependent genes) and GO functions are related to F p , then cells tend to present a dose dependent sensitization to GC exposure, thus postponing resistance manifestation at a later point, something which implies acquired resistance mechanisms. On the contrary, if gene functions are related to F a then cells have an intrinsic mechanism of resistance.
v) Finally, if genes are simultaneously down-regulated at the low prednisolone concentration and up-regulated at the high prednisolone dose (intersection of sets D 1 and U 3 (Figure 1e), which points out the dose-dependent genes) and GO functions are related to F p , then cells show a dose-dependent intrinsic mechanism of resistance. On the contrary, if gene functions are related to F a then cells present dose-dependent sensitization, postponing resistance manifestation at a later point something which implies acquired resistance mechanisms.
Further on, extrapolating the aforementioned approach in the analysis of the 72 h experiments, namely the late treatment, the following functional categories were defined:
i) If genes remain unchanged (intersection of sets S 4 and S 5 (Figure 1f) or are up-regulated (intersection of sets U 4 and U 5 (Figure 1g) at both treatments at 72 h, and GO functions are related to F p , then cells manifest a time-dependent acquired mechanism of resistance. If the opposite is true, then the nature of the resistance mechanism cannot be defined.
ii) If genes are down-regulated (intersection of sets D4 and D5 (Figure 1h) and GO functions are related to F a , then the mechanism of resistance is probably inherent. Again, if the opposite is true, then the nature of the resistance mechanism cannot be defined.
iii) If genes are up-regulated at the 22 uM vs. 700 uM experiment and down-regulated at the control vs. 700 uM experiment (intersection of sets U4 and D5 (Figure 1i) or are down-regulated at the 22 uM vs. 700 uM experiment and up-regulated at the control vs. 700 uM experiment (intersection of sets D4 and U5 (Figure 1j), and GO functions are related to F p , then cells manifest an acquired resistance mechanism which is partially remitting to a dose-dependent re-sensitization by large GC doses. On the other hand, if gene functions are related to F a , then the mechanism of resistance cannot be defined.
All aforementioned combinations are summarized in the simplified flowcharts of Figure 2.
Gene expression profiling concerning two distinct time points (4 h and 72 h) followed mRNA isolation from the CCRF-CEM cell line, cultured in three different prednisolone concentrations (10 nM, 22 uM, and 700 uM). The mRNA collected was used for hybridization on two cDNA microarray platforms (1.2 k and 4.8 k). Specifically, the experimental design consisted of the five following pairs: control vs. 10 nM prednisolone at 4 h (designated as '1'), 10 nM vs. 700 uM prednisolone at 4 h (designated as '2'), control vs. 700 uM prednisolone at 4 h (designated as '3'), 22 uM vs. 700 uM prednisolone at 72 h (designated as '4'), and control vs. 700 uM prednisolone at 72 h (designated as '5').
A common for both platforms data preprocessing stage was applied to the raw expression datasets, including proper background correction for each replicate slide, within-slide normalization and filtering. Concerning data integration, a two-step procedure was applied, firstly by matching reporters on the two, microarray platforms and then applying a cross-platform, normalization approach. In this way the preprocessed expression values of the common gene-level identifiers, existing in both platforms, were selected to form a unified dataset on which standard statistical procedures could be applied. Each subsequent analysis step was performed on the integrated gene expression values of this unified dataset (Additional file 3), which includes a total number of 490 genes, common in all experimental setups (see Methods for details).
Gene Ontology analysis based on statistically significant genes
In order to identify differentially expressed (DE) genes for each experiment, we utilized the intensity-dependent calculation of the standard Z-score  of each common feature within each slide, at a confidence level of 95%. Then, for each experiment i, with i = 1...5, all genes were further classified into the three sets: S i , U i , and D i , where S i includes genes which expression is considered to be unchanged, U i includes genes up-regulated, and D i includes genes down-regulated (Additional file 4).
In order to address the question of the prevalence of either inherently disposed or acquired mechanisms of resistance, the following intersections were defined, in line with the formulations described in the Methods section (Figure 1): S1 ∩ S3, U1 ∩ U3, D1 ∩ D3, U1 ∩ D3, D1 ∩ U3, S4 ∩ S5, U4 ∩ U5, D4 ∩ D5, U4 ∩ U5, and D4 ∩ U5 (Additional file 4). Then, these intersections were subjected to statistical analysis of Gene Ontology (GO) term overrepresentation, in order to test the nature of the observed resistance mechanism (Additional file 5). The generated from the GO-based analysis gene functions were also further categorized in two major categories: F p and F a , where F p is related to apoptosis evasion and cell cycle progression, whereas F a is related to apoptosis induction and cell cycle arrest (Figure 2).
The results of this examination demonstrated specifically that:
i) Genes belonging to intersection S 1 ∩ S 3 (Figure 1a) do not incorporate any functions related to apoptosis induction (F a ) or cell proliferation (F p ), thus the nature of the resistance mechanism cannot be defined.
ii) No common genes were observed for sets U1 and U3 (Figure 1b), implying that no genes from the present dataset are dose-independent upon prednisolone treatment.
iii) Intersection D1 ∩ D3 (Figure 1c) showed only one common gene, the DAPK1. However, DAPK1 is known to be a positive mediator of gamma interferon induced cell death , thus down-regulation of DAPK1 implies a possible linkage with inherent resistance mechanisms. Moreover, it appears that DAPK1's down-regulation is dose-independent and the glucocorticoid receptor (GR), when stimulated in the system under study, deactivates it.
iv) Intersections U1 ∩ D3 (Figure 1d) and D1 ∩ U3 (Figure 1e) do not yield common genes. Since there are no genes that are regulated dose-dependently and in opposing manners, it may be concluded that, at least as far as the present dataset is concerned, prednisolone regulates different sets of genes in a dose-dependent manner.
Regarding the 72h time point:
i) Genes belonging to intersection S 4 ∩ S 5 (Figure 1f) comprised functions related to induction of apoptosis and cell cycle arrest (F a ), thus they do not confer any knowledge about the nature of the resistance mechanism.
ii) Intersections U4 ∩ U5 (Figure 1g) and D4 ∩ D5 (Figure 1h) did not yield common genes, whereas intersection U4 ∩ D5 (Figure 1i) captured one gene, namely the gene KIT. This gene is reported to play a role in a variety of human tumors, including acute myelogenous leukemia, in its mutated forms . From the present study it appears that it is also active in the regulation of GC action.
iii) Finally, one gene is derived from the intersection of D4 ∩ U5 (Figure 1j), namely MADD (also known as IG20), which represents a very interesting gene since it has an established role in the mediation of the death signal from TNF-alpha downwards to the apoptosis activators . In general, it is also known to be expressed at higher levels in neoplastic cells than in normal cells , which is also the case for our system. However, it is unclear if MADD over-expression in tumor cells incurs increased apoptosis or not. The case of this specific gene becomes more interesting as it is known that at least six different splice variants with equally different functions are expressed, each one performing different functions, both apoptotic and anti-apoptotic .
From the aforementioned it seems that dose-dependent prednisolone administration induces varying expression of different sets of genes. In this sense, a plausible presumption is that genes differentially expressed by prednisolone should to a large extent be implicated in the resistance mechanisms, responding to prednisolone treatment. Thus, apart from the aforementioned intersections, the sets U1, U3, D1 and D4 for the 4 h time point, and the sets U4, U5, D4 and D5 for the 72 h time point were also examined (Additional file 5). More specifically:
i) Gene set U 1, concerning response to low dose GC administration was related to apoptotic and cell cycle regulation mechanisms. It is worth noting that this set (upregulated by 10 nM prednisolone) includes gene BCL2L2, which has been reported to be an anti-apoptotic gene .
ii) Gene set U3, concerning genes overexpressing as a result of high GC presence, did not highlighted functions relevant to apoptosis induction or positive regulation of cell death (F a ). In this sense, it provides evidence for an inherent mechanism of resistance, since it seems that the high dose does not comply with the expected apoptotic actions. Interestingly, the same concentration (700 uM) seemed to stir metabolic as well developmental mechanisms.
iii) Both sets D1 and D3 did not incorporate functions related to apoptosis induction and positive cell death regulation (F a ). In other words, the genes down-regulated both in low or high doses of prednisolone administration are not related to apoptosis induction. Nevertheless in set D1, gene DAP, which encodes for a protein that is positive regulator of programmed cell death , is related in addition to the process of autophagy. Moreover, this gene is a member of the mTOR pathway, which is at the same time, a negative regulator of autophagy . Again, it appears that 10 nM prednisolone stimulates cell proliferation in addition to an anti-apoptotic effect. It cannot be precluded that prednisolone could potentially enhance alternative nutrition mechanisms as an alternative way to attain survival. It appears that GC presence is impacting the way the cell will metabolize.
Regarding the 72h time point:
i) It appears that prednisolone exerts a diversified intricate mechanism as it up-regulates (U 4) genes, such as BCL2, which is an anti-apoptotic gene. Moreover, functions were revealed that had to do with positive regulation of locomotion, indicating the activation of a possible mechanism of evasion from a hostile environment.
ii) Also, prednisolone up-regulated genes related to apoptosis evasion (F p ) at the 700 uM dose as compared to untreated cells (U5), among these, gene BIRC5. The BIRC family of genes belongs to the larger IAP family of genes (inhibitors of apoptosis genes). BIRC5 especially is considered to be an apoptosis evasion factor with significant presence in a variety of tumors . The fact that prednisolone up-regulates such a gene, supports the case of an acquired mechanism of action.
iii) In addition, prednisolone down-regulated genes (D4 and D5) are related to stimulation of cell cycle progression (F p ), such as BCL2L2 and KIT, respectively. Also, genes related to positive induction of cell death, such as IKBKE and PPP3CC  are down-regulated. However, it is worth mentioning that there is also a shift of molecular function towards NF-κB-related effects. Especially, gene IKBKE, which is a non-canonical Iκ-B kinase, a known controller of NF-κB , appears to be down regulated by prednisolone at 72 h. This is the same gene that is involved in induction of cell death, i.e. IKBKE, which is a non-canonical Iκ-B kinase, a known controller of NF-κB .
A very interesting case is presented in these clusters with the MCL1 gene. As mentioned before, this gene is instrumentally linked to cell survival and the fact that the low dose activates it, is a strong indication for inherent resistance in the present system. This indication becomes stronger by the finding that this gene is similarly expressed in untreated cells and cells treated with various concentrations of prednisolone. As a matter of fact it seems that even variations in concentrations such as the 22uM and 700uM do not affect its expression, at least at late time points, or its expression has been stabilized. In cluster 7, gene functions appeared to be related to cell cycle regulation. This group of clusters seem to outline the effects of the high prednisolone dose, as this dose activates genes related to cell cycle progression. For example, in these clusters appear genes such as BMP5, FGF7 and MCL1. Those genes, are promoters of anti-apoptosis and proliferation. Over-expression of such genes, at later time intervals, points out the existence of an inherent mechanism of reaction to the glucocorticoid. Yet, the fact that their expression at 4h remains stable, implies that for this gene set the decision for gene activation or not is taken later on during GC treatment. The fact however, is that cells react with anti-apoptotic signals already at 4h and our hypothesis is that the 72h activation of anti-apoptotic genes has to do with a re-enforcement of the resistant phenotype that the cells already possess.
Taken together the results of all the studied subsets, summarized in Table 1, showed that the cell system under study exerts an intricate response pattern of inherent and acquired mechanisms of GC resistance, which seems however to favor inherent resistance mechanisms.
We have seen that at early time points, there are no genes, which are regulated dose-dependently or in opposing manners. Therefore, we concluded that prednisolone regulates different sets of genes in a dose-dependent manner, at least as far as the present data set is concerned. This helped us to conclude that genes differentially expressed by prednisolone should play a role in the resistance mechanism to prednisolone.
Gene Ontology analysis based on clustering
K-means clustering is an effective way of classifying gene expression profiles, based on their similarities as well as on probable common regulatory mechanisms. In this way, it can be considered as an effective way to suggest candidate targets of putative, common regulatory mechanisms. Thus, as described in Methods, the top 100 genes with the highest SD (SD100) from the CGS was subjected to k-means clustering using squared Euclidean as a distance measure, in order to identify groups of genes presenting similar expression profiles during prednisolone treatment early and late response and possibly co-regulated. In Figure 3, the k-means clustering is presented, illustrating the seven clusters derived (Additional file 6). GO-based enrichment analysis, that followed, linked co-expressed genes in each cluster to several functional categories (Additional file 5). In particular, we noted that:
i) Cluster 1 (C 1) contains time-dependent genes, which expression decreases at 700 uM prednisolone and the 72 h time point (resembling the behaviour of the D 5 set), while remaining unchanged at all other conditions (including genes from the S 1 ∩ S 2 set). However, genes under this cluster did not yield any significant GO terms.
ii) Cluster 2 (C2) mainly comprises genes showing early down-regulation at 700 uM prednisolone (including genes from the D3 set), while remaining unchanged at all other conditions (probably outlining genes of the S4 ∩ S5 set). GO-based analysis revealed genes involved in developmental processes, implying a possible existence of stem cell related functions in the cell system under study.
iii) Cluster 3 (C3) depicts genes showing early down-regulation at 10 nM prednisolone (including genes from the D1 set), while remaining unaffected by the 700 uM prednisolone treatment. This cluster comprised genes related to positive regulation of cell death. The fact that these genes are suppressed under low prednisolone concentration at the early time point is in agreement with the anti-apoptotic and survival effect reported previously by , which implies an intrinsic mechanism of resistance.
iv) Cluster 4 (C4), similarly to (C1), presents genes which expression at 700 uM prednisolone decreases at the 72 h time point (resembling the behaviour of the D5 subset), while remaining unchanged at all other conditions (including genes from the S1 ∩ S3 subset). However, genes under this cluster did not present any interesting GO terms.
v) Cluster 5 (C 5) contains genes remaining unchanged at all conditions (including genes from the S 1 ∩ S 3 intersection) except for experimental setup '4', where they are down-regulated (resembling the behaviour of the D 4 set). GO-based analysis of cluster 5 revealed genes that are involved in the opposing processes of cell cycle progression, as well as cell cycle arrest, and cell death functions.
vi) Cluster 6 (C6) depicts genes that are also unchanged at all conditions (including genes from the S4 ∩ S5 intersection) except for experimental setup '2', where they seem to be down-regulated (resembling the behaviour of the D2 set). Functions represented by genes in cluster 6 include cell death functions so their down regulation at the specific setup supports the case of inherent resistance mechanisms.
vii) Finally, cluster 7 (C7) groups together time-dependent genes that remain unaffected at all experimental setups (including genes from the S1 ∩ S3 intersection), but at 700 uM after 72 h of treatment (resembling the behaviour of the U5 set), where they are positively regulated. It appears that those genes participate in cell cycle regulation. This cluster seems to outline the effects of the high prednisolone dose, as this dose activates genes related to cell cycle progression. Special attention was drawn to the MCL1 gene. This gene is a member of the BCL2 family and it produces an anti-apoptotic protein responsible for cell survival.
The cluster analysis results, summarized in Table 1, also confirm that, although the gene expression profiles of the cell system under study present a mixed response of inherent and acquired mechanisms of GC resistance, however there is a tendency towards inherent resistance mechanisms.
A next step on our analysis was the identification of common transcription factors that would regulate genes in a similar way. Cluster 1 (C1) manifested a common transcription factor namely OCT1 (p = 5 · 10-5). Cluster 2 (C2) similarly manifested a common transcription factor, MZF1 (p = 8 · 10-4). Cluster 5 (C5) were predicted to be commonly regulated by the CCAAT (NFYA) transcription factor (p = 5 · 10-7). Cluster 6 (C6) presented an interesting case, as the common most prevalent transcription factor was JUN. Finally, Cluster 7 (C7) also manifested different transcription factors. In particular, the most prevalent transcription factor was OCT1 (p = 1 · 10-10).
In order to find further patterns of expression in the system under study, we have performed a chromosome mapping analysis, where the mean expression of genes with respect to chromosomes has been studied. A general remark from this analysis is that expression seemed to be equivalently distributed; at least as far as positive regulation is concerned, while chromosomes 14 and 22 appeared to have genes with most deactivation as compared to untreated genes (Figure 4a).
Further on, we have separated gene expression based on the time points i.e. we have calculated the mean expressions for the 4 h and 72 h treatments (Figure 4b and 4c, respectively). We have tried to identify whether the mean expression values, either positive or negative, correlate to the number of genes represented on each chromosome. In Figure 4d, the number of genes per chromosome is presented. Table 2 presents the calculated correlation values of mean expression values for experimental time points, while Figure 5 graphically illustrates the Pearson's correlation values comparing gene numbers to mean gene expression. Interestingly, gene number is negatively correlated to negative mean expression, which means that the more genes are represented on chromosomes the more these genes are down-regulated as compared to untreated cells or, in order to be more concise, the more these genes are inactivated as compared to control.
In order to gain further insight into the possible mechanisms that probe for the preponderance of inherence or acquired resistance mechanisms, we have attempted to perform a pathway analysis with the defined genes from our dataset. We have mapped the common dataset on known pathways and in particular on the KEGG Pathways database. In particular, we have outlined two pathways: the MAPK pathway, as it incorporated the larger number of involved genes to be mapped on it (Figure 6), and the apoptosis pathway, since we were interested in examining the possible role of genes related to apoptotic mechanisms (Figure 7).
It is known that the MAPK signaling pathway is involved in cell cycle progression and apoptosis. We have outlined several cross-talking molecules in this pathway that were revealed also by our dataset, such as the NF-κB, JNK, p38 and ERK5. NF-κB remains stable across the experimental setups indicating a steady role in apoptosis. Yet, as it is regulated by the GR then its levels should be lowering as concentration increases. This hints towards a more specific, more pronounced involvement of the NF-κB factor in the observed resistance. At the same time, the JNK kinase, which is a mediator of apoptosis, appears to remain unaffected by the prednisolone concentrations indicating that the MAPK pathway involved in apoptosis does not function in the way it should have, hence supporting an inherent mechanism of resistance. Finally, an interesting molecule is the ERK5, which appears to be down-regulated at the low doses and unaffected thereafter. This molecule is involved in cell proliferation and its down-regulation is one of the very few implications that prednisolone behaves as expected, hence indicating an acquired mechanism of resistance.
Moving to the apoptosis pathway, several interesting genes that play a role in the observed apoptosis were identified. In particular, based on the signaling on Figure 7, we can discriminate between the TNF/FADD/CASP/BIRC system (top of the figure), where it appears that molecules responsible for degradation are down-regulated under prednisolone treatment. In particular, BIRC5 was found to be activated in the 72 h treatment, while remained unchanged in the previous time point. In addition, CASP1 was found to be down-regulated at 10 nM treatment and 4 h, indicating an inherent mechanism. This part of the pathway seems to function in such a way as cells already possess glucocorticoid resistance. The second part of the pathway consisting from NGBF/NTRK1/AKT1/NFκB/BIRC/BCL2 (family) shows the following. AKT1 appeared to remain unchanged at all concentrations and time points. NF-κB1 is down-regulated at the late time points, indicating a late response to the glucocorticoid, while it is unchanged at the early time points. The BCL family is a super-family of proteins that when expressed promotes survival. In the present dataset, the BCL2 gene was found to remain unchanged, while a member of the super-family, BCL2L2, was up-regulated. BCL2L2 is one of the genes that participate in survival in the pathway, confirming the fact that we have an early anti-apoptosis response. The role of late NFKB1 inactivation and early BCL2L2 activation probably favors an acquired mechanism of resistance. Finally, an apoptotic signaling avenue is via the regulation of Ca+2 where the caspases participating in this pathway are inactivated in the present system. The PPP3CC protein that regulates phosphorylation of down-stream targets has a relation to apoptosis over the activation of CASP proteins. Both are inactivated in the present system with PPP3CC being inactivated at 72 h and CASP1 at 4 h. This presents a mixed mechanism of apoptosis evasion.
In the present work, we have set up and propose a rational computational analysis framework, in order to aid the elucidation of the molecular mechanisms of glucocorticoid resistance and specifically whether these are to a larger extent inherent or acquired. To the best of our knowledge, there are no reports trying to analyze systematically the underlying, inherent or acquired molecular mechanisms of GC resistance. The issue whether leukemic cells possess the inherent genetically imprinted resistance mechanisms or it is rather a post effect of evolutionary adoption of originally sensitive cells upon GC treatment is still a controversial one, with potentially significant interest for design of novel therapeutic approaches in cancer treatment. A similar work  reported recently that in ex vivo samples, leukemic cells exhibit an at least in part, intrinsic mechanism of reaction. In another work , it has also been mentioned that glucocorticoids can induce intrinsic mechanisms of GC-resistance such as the BCL2 relais.
The initial working hypothesis was that the in vitro system of the present study presents an inherent mechanism of reaction to glucocorticoids, resulting in resistance to apoptosis. Our results showed that cells may attain a mixed mechanism of response to glucocorticoids, however, there is clear evidence predicating towards an intrinsic mechanism of resistance.
Specifically, several interesting genes, whose expression is regulated by the GR, were identified. One of these genes was MCL1. This gene belongs to the BCL super-family, which is responsible for cell survival. It has been reported that down-regulation of MCL1 sensitizes T-cell leukemia cells to treatment with glucocorticoids . In our system this gene appeared to be stable across all concentrations while, interestingly, it was up-regulated by the low prednisolone dose, confirming the anti-apoptotic effect observed previously by the low dose. At the same time another member of the BCL family, BCL2L2, also responsible for cell survival was up-regulated by the low dose of prednisolone. There are no reports connecting BCL2L2 with leukemia, yet some reports refer to its pro-survival role in leukemogenesis [80, 81]. An interesting gene also revealed by the present analysis was DAPK1, a tumor suppressor gene, simultaneously down-reglulated at both prednisolone concentrations. Methylations of this gene have been linked to hereditary character of chronic leukemias [82, 83] as well as its expression has been linked to childhood leukemia [84, 85]. Additionally, when moving to 72 h of treatment a gene down-regulated by prednisolone was KIT, a proto-oncogene homolog to c-KIT. Its inhibition has been reported to be involved in leukemia treatment and glucocorticoid activity [86–88]. Further on, a gene that is up-regulated by the high prednisolone dose was MADD. This gene is a propagating agent of the death signal induced by TNF. It mediates the signal from TNF to MAPK pathway thus inducing apoptosis. It is reported to be overexpressed in several neoplasms  and also it is reported that its over-expression is linked to tumor survival .
MADD is a propagating agent of TNF induced death signals, and more specifically, it mediates the signal from TNF to MAPK pathway, thus inducing apoptosis. MADD is phosphorylated by AKT which then inhibits TRAIL-induced apoptosis . Considering the fact that MADD's expression pattern is in line with the resistance observed in this study, a lingering question is if this over-expression is the result of an inherent or acquired mechanism.
In the search for shared transcription factor binding motifs an interesting finding was that of JUN. This factor has been reported previously to play a role in the apoptosis regulation in the same system studied here .
Chromosome mapping and chromosomal-relative expression did not correlate with the number of genes represented on each chromosome, thus suggesting a well-coordinated GR-induced regulation. GCs are known to lead to chromatin modification about 2 h following treatment. A detectable change in the expression of genes that are immediately regulated by the GC is expected to reach the stage of mRNA steady-state levels at approximately 3-4 h after treatment. Then, genes regulated by the GC, begin to influence subsequent gene expression. cDNA microarray analysis upon 4 h of treatment has been chosen, in order to reduce the complexities that arise later due to ensuing feedback mechanisms, at the same time focusing on the GR/NF-κB-linked, direct target genes . Moreover, the delay in the action of GCs has been reported depending on the cell type and lasting from 2 h to 24 h . Specifically, in our model a prednisolone action lag lasts up to 72 h , as reported in different studies . Chromosomal-related expression, is closely linked to chromatin remodeling and it could be the object of future investigations, as it appears that the GR affects almost all chromosomes at all time points as could be easily hypothesized from the abundance of GR response elements throughout the human genome . This confirms the idea of complete genome regulation by the glucocorticoid receptor in cellular systems.
The final remarks regarding resistance to glucocorticoids, comes from pathway analysis. Based on our observations, probably the apoptosis induced by either TNF receptors or NGFB over the AKT pathway and in concordance to the MAPK pathway is ruled out, since AKT remains unchanged. That means that in the present system, glucocorticoids do not significantly affect this pathway. It is also reported that protein-kinase networks control in a major way the observed resistance to glucocorticoids . On the other hand, based on the pathway model, two alternative ways remain; the mitochondria-induced apoptotic pathway and Ca+2 induced pathway. From those two the mitochondria-directed pathway appears to entail genes that remain unchanged from glucocorticoid treatment. If we also take into account the notion that GR translocation to the mitochondria is part of the resistance mechanisms to glucocorticoids  then this action could be related to the induction of resistance in our system, as through translocation to mitochondria GR ceases to exert its regulatory effects. Alternatively, a likely mechanism for induction of resistance to glucocorticoid-induced apoptosis could be the Ca+2 signaling pathway. Several key genes in it appear to be differentially expressed, such as PPP3CC and CASP1, at the early time points, thus making evasion of apoptosis probable.
Attention should be also drawn on two categories of genes regulated by prednisolone. These are metabolic genes and signal-transduction related genes. In both time points, high prednisolone concentration regulates such genes, thus grounding for cell proliferation machinery. In addition, the regulation of NF-κB-related genes implies an inherent mechanism of resistance through the established link of NF-κB inflammatory role and GC-induced resistance.
Overall, the results of the present study support our initial working hypothesis for a rather inherent GC resistance mechanism. However, further exploitation of the proposed computational analysis workflow through the conduct of a larger genome-wide transcriptomic study could promote the extent and level of detail regarding our knowledge about the molecular underpinnings which orchestrate the manifestation and dynamics of the mechanisms of GC resistance and desensitization.
The leukemic cells used in this study are known to be resistant to glucocorticoids and in particular to prednisolone. In order to gain more insight to the mechanisms of resistance we have developed a rational computational analysis framework along with experimental approaches in order to answer the question of whether cells exhibit this resistance due to an inherent or, an over time, acquired mechanism. We have used the biological information derived from our modeling approach to interpret the findings observed regarding our initial hypothesis. The analysis of the results supports a complex mechanism of action for the cells which tends to favor an intrinsic mechanism of resistance, although in one case prednisolone appeared to exert the expected effects i.e. positive regulation of apoptotic genes, thus supporting an acquired mechanism. It seems that a crucial determinant for the manifestation of either phenotype seems to be the dosage as other haphazard environmental factors, which partly regulate the cellular circuitry towards either direction. The ability to discriminate between acquired or intrinsic mechanisms of resistance is of major importance both in the mechanics of glucocorticoid signal transduction as well as to the clinical praxis since GCs are still front line medications for the treatment of malignant diseases, especially in the case of leukemia.
common gene set
nuclear factor kappa beta
top 100 highest standard deviation genes.
Lauten M, Stanulla M, Zimmermann M, Welte K, Riehm H, Schrappe M: Clinical outcome of patients with childhood acute lymphoblastic leukaemia and an initial leukaemic blood blast count of less than 1000 per microliter. Klin Padiatr. 2001, 213 (4): 169-174. 10.1055/s-2001-16848.
Den Boer ML, Harms DO, Pieters R, Kazemier KM, Gobel U, Korholz D, Graubner U, Haas RJ, Jorch N, Spaar HJ, Kaspers GJ, Kamps WA, Van der Does-Van den Berg A, Van Wering ER, Veerman AJ, Janka-Schaub GE: Patient stratification based on prednisolone-vincristine-asparaginase resistance profiles in children with acute lymphoblastic leukemia. J Clin Oncol. 2003, 21 (17): 3262-3268. 10.1200/JCO.2003.11.031.
Lambrou GI, Vlahopoulos S, Papathanasiou C, Papanikolaou M, Karpusas M, Zoumakis E, Tzortzatou-Stathopoulou F: Prednisolone exerts late mitogenic and biphasic effects on resistant acute lymphoblastic leukemia cells: Relation to early gene expression. Leuk Res. 2009, 33 (12): 1684-1695. 10.1016/j.leukres.2009.04.018.
Sionov RV, Spokoini R, Kfir-Erenfeld S, Cohen O, Yefenof E: Mechanisms regulating the susceptibility of hematopoietic malignancies to glucocorticoid-induced apoptosis. Advances in cancer research. 2008, 101: 127-248.
Schlossmacher G, Stevens A, White A: Glucocorticoid receptor-mediated apoptosis: mechanisms of resistance in cancer cells. J Endocrinol. 2011, 211 (1): 17-25. 10.1530/JOE-11-0135.
Tissing WJ, Meijerink JP, den Boer ML, Brinkhof B, van Rossum EF, van Wering ER, Koper JW, Sonneveld P, Pieters R: Genetic variations in the glucocorticoid receptor gene are not related to glucocorticoid resistance in childhood acute lymphoblastic leukemia. Clin Cancer Res. 2005, 11 (16): 6050-6. 10.1158/1078-0432.CCR-04-2097.
Russcher H, Smit P, van den Akker EL, van Rossum EF, Brinkmann AO, de Jong FH, Lamberts SW, Koper JW: Two polymorphisms in the glucocorticoid receptor gene directly affect glucocorticoid-regulated gene expression. The Journal of clinical endocrinology and metabolism. 2005, 90 (10): 5804-10. 10.1210/jc.2005-0646.
Kofler R, Schmidt S, Kofler A, Ausserlechner MJ: Resistance to glucocorticoid-induced apoptosis in lymphoblastic leukemia. J Endocrinol. 2003, 178 (1): 19-27. 10.1677/joe.0.1780019.
Takada Y, Kobayashi Y, Aggarwal BB: Evodiamine abolishes constitutive and inducible NF-kappaB activation by inhibiting IkappaBalpha kinase activation, thereby suppressing NF-kappaB-regulated antiapoptotic and metastatic gene expression, up-regulating apoptosis, and inhibiting invasion. J Biol Chem. 2005, 280 (17): 17203-17212. 10.1074/jbc.M500077200.
Rao NAS, McCalman MT, Moulos P, Francoijs K, Chatziioannou A, Kolisis FN, Alexis MN, Mitsiou DJ, Stunnenberg HG: Coactivation of GR and NFKB alters the repertoire of their binding sites and target genes. Genome Research. 2011, 21 (9): 1404-1416. 10.1101/gr.118042.110.
Medh RD, Webb MS, Miller AL, Johnson BH, Fofanov Y, Li T, Wood TG, Luxon BA, Thompson EB: Gene expression profile of human lymphoid CEM cells sensitive and resistant to glucocorticoid-evoked apoptosis. Genomics. 2003, 81 (6): 543-555. 10.1016/S0888-7543(03)00045-4.
Ausserlechner MJ, Obexer P, Wiegers GJ, Hartmann BL, Geley S, Kofler R: The cell cycle inhibitor p16(INK4A) sensitizes lymphoblastic leukemia cells to apoptosis by physiologic glucocorticoid levels. J Biol Chem. 2001, 276 (14): 10984-10989. 10.1074/jbc.M008188200.
Thompson EB, Johnson BH: Regulation of a distinctive set of genes in glucocorticoid-evoked apoptosis in CEM human lymphoid cells. Recent Prog Horm Res. 2003, 58: 175-197. 10.1210/rp.58.1.175.
Wissink S, van Heerde EC, Schmitz ML, Kalkhoven E, van der Burg B, Baeuerle PA, van der Saag PT: Distinct domains of the RelA NF-kappaB subunit are required for negative cross-talk and direct interaction with the glucocorticoid receptor. The Journal of biological chemistry. 1997, 272 (35): 22278-84. 10.1074/jbc.272.35.22278.
Palmer LA, Harmon JM: Biochemical evidence that glucocorticoid-sensitive cell lines derived from the human leukemic cell line CCRF-CEM express a normal and a mutant glucocorticoid receptor gene. Cancer research. 1991, 51 (19): 5224-31.
Powers JH, Hillmann AG, Tang DC, Harmon JM: Cloning and expression of mutant glucocorticoid receptors from glucocorticoid-sensitive and -resistant human leukemic cells. Cancer research. 1993, 53 (17): 4059-65.
Norman MR, Thompson EB: Characterization of a glucocorticoid-sensitive human lymphoid cell line. Cancer Res. 1977, 37 (10): 3785-3791.
Sifakis EG, Lambrou GI, Prentza A, Koutsouris D, Tzortzatou-Stathopoulou F: cDNA microarray analysis of a glucocorticoid treated acute lymphoblastic leukemia cell line. Proceedings of the 8th IEEE International Conference on BioInformatics and BioEngineering 2008 (IEEE-BIBE 2008): October 8-10; Athens, Greece. 2008, 1-8.
Laane E, Panaretakis T, Pokrovskaja K, Buentke E, Corcoran M, Soderhall S, Heyman M, Mazur J, Zhivotovsky B, Porwit A, Grander D: Dexamethasone-induced apoptosis in acute lymphoblastic leukemia involves differential regulation of Bcl-2 family members. Haematologica. 2007, 92 (11): 1460-1469. 10.3324/haematol.10543.
Tissing WJ, den Boer ML, Meijerink JP, Menezes RX, Swagemakers S, van der Spek PJ, Sallan SE, Armstrong SA, Pieters R: Genomewide identification of prednisolone-responsive genes in acute lymphoblastic leukemia cells. Blood. 2007, 109 (9): 3929-35. 10.1182/blood-2006-11-056366.
Sifakis EG, Prentza A, Koutsouris D, Chatziioannou AA: Evaluating the effect of various background correction methods regarding noise reduction, in two-channel microarray data. Comput Biol Med.
Zhang D, Zhang M, Wells MT: Multiplicative background correction for spotted microarrays to improve reproducibility. Genet Res. 2006, 87 (3): 195-206. 10.1017/S0016672306008196.
Quackenbush J: Microarray data normalization and transformation. Nat Genet. 2002, 32 (Suppl): 496-501.
Montgomery DC, Runger GC: Applied statistics and probability for engineers. 2003, New York: John Wiley & Sons, 3
Cleveland WS: Robust locally weighted regression and smoothing scatterplots. J Am Stat Assoc. 1979, 74 (368): 829-836. 10.2307/2286407.
Cleveland WS, Devlin SJ: Locally Weighted Regression: An Approach to Regression Analysis by Local Fitting. J Am Stat Assoc. 1988, 83 (403): 596-610. 10.2307/2289282.
Duboit S, Yang Y, Speed T, Callow M: Statistical methods for detecting differentially expressed genes in replicated cDNA microarray experiments. Statistica Sinica. 2002, 12: 111-140.
Altman NS, Hua J: Extending the loop design for two-channel microarray experiments. Genet Res. 2006, 88 (3): 153-163. 10.1017/S0016672307008476.
Kerr MK, Churchill GA: Experimental design for gene expression microarrays. Biostatistics (Oxford, England). 2001, 2 (2): 183-201.
Kerr MK, Churchill GA: Statistical design and the analysis of gene expression microarray data. Genet Res. 2001, 77 (2): 123-128.
Kerr MK, Churchill GA: Statistical design and the analysis of gene expression microarray data. Genet Res. 2007, 89 (5-6): 509-514. 10.1017/S0016672308009713.
Yang IV, Chen E, Hasseman JP, Liang W, Frank BC, Wang S, Sharov V, Saeed AI, White J, Li J, Lee NH, Yeatman TJ, Quackenbush J: Within the fold: assessing differential expression measures and reproducibility in microarray assays. Genome biology. 2002, 3 (11): research0062-
Shabalin AA, Tjelmeland H, Fan C, Perou CM, Nobel AB: Merging two gene-expression studies via cross-platform normalization. Bioinformatics (Oxford, England). 2008, 24 (9): 1154-60. 10.1093/bioinformatics/btn083.
Ramasamy A, Mondry A, Holmes CC, Altman DG: Key Issues in Conducting a Meta-Analysis of Gene Expression Microarray Datasets. PLoS Med. 2008, 5 (9): e184-10.1371/journal.pmed.0050184.
Russel S, Meadows L, Russel RR: Microarray Technology in Practice. 2009, London: Elsevier Inc.
Schuler GD: Sequence mapping by electronic PCR. Genome research. 1997, 7 (5): 541-50.
Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL: GenBank. Nucleic acids research. 2003, 31 (1): 23-7. 10.1093/nar/gkg057.
Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL: GenBank: update. Nucleic acids research. 2004, 32 (Database): D23-6.
Wheeler DL, Church DM, Federhen S, Lash AE, Madden TL, Pontius JU, Schuler GD, Schriml LM, Sequeira E, Tatusova TA, Wagner L: Database resources of the National Center for Biotechnology. Nucleic acids research. 2003, 31 (1): 28-33. 10.1093/nar/gkg033.
Wheeler DL, Church DM, Edgar R, Federhen S, Helmberg W, Madden TL, Pontius JU, Schuler GD, Schriml LM, Sequeira E, Suzek TO, Tatusova TA, Wagner L: Database resources of the National Center for Biotechnology Information: update. Nucleic acids research. 2004, 32 (Database): D35-40.
Diehn M, Sherlock G, Binkley G, Jin H, Matese JC, Hernandez-Boussard T, Rees CA, Cherry JM, Botstein D, Brown PO, Alizadeh AA: SOURCE: a unified genomic resource of functional annotations, ontologies, and gene expression data. Nucleic acids research. 2003, 31 (1): 219-23. 10.1093/nar/gkg014.
Perez-Iratxeta C, Wjst M, Bork P, Andrade MA: G2D: a tool for mining genes associated with disease. BMC genetics. 2005, 6: 45-10.1186/1471-2156-6-45.
Warnat P, Eils R, Brors B: Cross-platform analysis of cancer microarray data improves gene expression based classification of phenotypes. BMC bioinformatics. 2005, 6: 265-10.1186/1471-2105-6-265.
Toedling J, Spang R: Assessment of Five Microarray Experiments on Gene Expression Profiling of Breast Cancer. Proceedings of the 7th IEEE Annual International Conference on Computational Biology: April 10-13; Berlin, Germany. 2003
Tritchler D, Parkhomenko E, Beyene J: Filtering genes for cluster and network analysis. BMC bioinformatics. 2009, 10: 193-10.1186/1471-2105-10-193.
Freyhult E, Landfors M, Onskog J, Hvidsten T, Ryden P: Challenges in microarray class discovery: a comprehensive examination of normalization, gene selection and clustering. BMC Bioinformatics. 2010, 11 (1): 503-10.1186/1471-2105-11-503.
Forgy E: Cluster analysis of multivariate data: efficiency vs interpretability of classifications. Biometrics. 1965, 21: 768-769.
Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability. Edited by: Le Cam LM, Neyman J. 1967, University of California Press
Lloyd S: Least squares quantization in PCM. IEEE Transactions on Information Theory. 1982, 28 (2): 129-137. 10.1109/TIT.1982.1056489. 129
The MathWorks Incorporated: MATLAB - The Language of Technical Computing. 2009, 7.9 R2009b:
Gibbons FD, Roth FP: Judging the quality of gene expression-based clustering methods using gene annotation. Genome research. 2002, 12 (10): 1574-81. 10.1101/gr.397002.
Rousseeuw PJ, Leroy AM: Robust Regression and Outlier Detection. 1987, New York: Wiley-Interscience
Kaufman L, Rousseeuw PJ: Finding groups in data: An introduction to cluster analysis. 1990, New York: Wiley
Halkidi M, Batistakis Y, Vazirgiannis M: On Clustering Validation Techniques. J Intell Inform Syst. 2001, 17 (2-3): 107-145.
Khatri P, Drăghici S: Ontological analysis of gene expression data: current tools, limitations, and open problems. Bioinformatics. 2005, 21 (18): 3587-3595. 10.1093/bioinformatics/bti565.
Grossmann S, Bauer S, Robinson PN, Vingron M: Improved detection of overrepresentation of Gene-Ontology annotations with parent-child analysis. Bioinformatics. 2007, 23 (22): 3024-3031. 10.1093/bioinformatics/btm440.
Bauer S, Grossmann S, Vingron M, Robinson PN: Ontologizer 2.0--a multifunctional tool for GO term enrichment analysis and data exploration. Bioinformatics. 2008, 24 (14): 1650-1651. 10.1093/bioinformatics/btn250.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nature genetics. 2000, 25 (1): 25-9. 10.1038/75556.
Cole SW, Yan W, Galic Z, Arevalo J, Zack JA: Expression-based monitoring of transcription factor activity: the TELiS database. Bioinformatics (Oxford, England). 2005, 21 (6): 803-10. 10.1093/bioinformatics/bti038.
Wingender E, Dietze P, Karas H, Knuppel R: TRANSFAC: a database on transcription factors and their DNA binding sites. Nucleic acids research. 1996, 24 (1): 238-41. 10.1093/nar/24.1.238.
Cohen BA, Mitra RD, Hughes JD, Church GM: A computational analysis of whole-genome expression data reveals chromosomal domains of gene expression. Nature genetics. 2000, 26 (2): 183-6. 10.1038/79896.
Zhang B, Schmoyer D, Kirov S, Snoddy J: GOTree Machine (GOTM): a web-based platform for interpreting sets of interesting genes using Gene Ontology hierarchies. BMC bioinformatics. 2004, 5: 16-10.1186/1471-2105-5-16.
Mlecnik B, Scheideler M, Hackl H, Hartler J, Sanchez-Cabo F, Trajanoski Z: PathwayExplorer: web service for visualizing high-throughput expression data on biological pathways. Nucleic acids research. 2005, 33 (Web Server): W633-7. 10.1093/nar/gki391.
Ogata H, Goto S, Sato K, Fujibuchi W, Bono H, Kanehisa M: KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic acids research. 1999, 27 (1): 29-34. 10.1093/nar/27.1.29.
Kanehisa M, Goto S: KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic acids research. 2000, 28 (1): 27-30. 10.1093/nar/28.1.27.
Kanehisa M: The KEGG database. Novartis Foundation symposium. 2002, 247: 91-101. discussion 101-3, 119-28, 244-52
Feinstein E, Druck T, Kastury K, Berissi H, Goodart SA, Overhauser J, Kimchi A, Huebner K: Assignment of DAP1 and DAPK--genes that positively mediate programmed cell death triggered by IFN-gamma--to chromosome regions 5p12.2 and 9q34.1, respectively. Genomics. 1995, 29 (1): 305-7. 10.1006/geno.1995.1255.
Wang Y, Zhao L, Wu C, Liu P, Shi L, Liang Y, Xiong S, Mi J, Chen Z, Ren R, Chen S: C-KIT mutation cooperates with full-length AML1-ETO to induce acute myeloid leukemia in mice. Proceedings of the National Academy of Sciences. 2011, 108 (6): 2450-2455. 10.1073/pnas.1019625108.
Li P, Jayarama S, Ganesh L, Mordi D, Carr R, Kanteti P, Hay N, Prabhakar BS: Akt-phosphorylated mitogen-activated kinase-activating death domain protein (MADD) inhibits TRAIL-induced apoptosis by blocking Fas-associated death domain (FADD) association with death receptor 4. J Biol Chem. 2010, 285 (29): 22713-22722. 10.1074/jbc.M110.105692.
Kurada BR, Li LC, Mulherkar N, Subramanian M, Prasad KV, Prabhakar BS: MADD, a splice variant of IG20, is indispensable for MAPK activation and protection against apoptosis upon tumor necrosis factor-alpha treatment. The Journal of biological chemistry. 2009, 284 (20): 13533-41. 10.1074/jbc.M808554200.
Kelly JL, Novak AJ, Fredericksen ZS, Liebow M, Ansell SM, Dogan A, Wang AH, Witzig TE, Call TG, Kay NE, Habermann TM, Slager SL, Cerhan JR: Germline variation in apoptosis pathway genes and risk of non-Hodgkin's lymphoma. Cancer Epidemiol Biomarkers Prev. 2010, 19 (11): 2847-2858. 10.1158/1055-9965.EPI-10-0581.
Levy-Strumpf N, Kimchi A: Death associated proteins (DAPs): from gene identification to the analysis of their apoptotic and tumor suppressive functions. Oncogene. 1998, 17 (25): 3331-40.
Koren I, Reem E, Kimchi A: DAP1, a novel substrate of mTOR, negatively regulates autophagy. Curr Biol. 2010, 20 (12): 1093-1098. 10.1016/j.cub.2010.04.041.
Liu X, Gao R, Dong Y, Gao L, Zhao Y, Zhao L, Zhao X, Zhang H: Survivin gene silencing sensitizes prostate cancer cells to selenium growth inhibition. BMC Cancer. 2010, 10: 418-10.1186/1471-2407-10-418.
Guo JP, Shu SK, Esposito NN, Coppola D, Koomen JM, Cheng JQ: IKKepsilon phosphorylation of estrogen receptor alpha Ser-167 and contribution to tamoxifen resistance in breast cancer. J Biol Chem. 2010, 285 (6): 3676-3684. 10.1074/jbc.M109.078212.
Lambrou GI, Vlahopoulos S, Papathanasiou C, Papanikolaou M, Karpusas M, Zoumakis E, Tzortzatou-Stathopoulou F: Prednisolone exerts late mitogenic and biphasic effects on resistant acute lymphoblastic leukemia cells: Relation to early gene expression. Leukemia research. 2009, 33 (12): 1684-1695. 10.1016/j.leukres.2009.04.018.
Cario G, Fetz A, Bretscher C, Moricke A, Schrauder A, Stanulla M, Schrappe M: Initial leukemic gene expression profiles of patients with poor in vivo prednisone response are similar to those of blasts persisting under prednisone treatment in childhood acute lymphoblastic leukemia. Annals of hematology. 2008, 87 (9): 709-16. 10.1007/s00277-008-0504-x.
Schmidt S, Rainer J, Riml S, Ploner C, Jesacher S, Achmuller C, Presul E, Skvortsov S, Crazzolara R, Fiegl M, Raivio T, Janne OA, Geley S, Meister B, Kofler R: Identification of glucocorticoid-response genes in children with acute lymphoblastic leukemia. Blood. 2006, 107 (5): 2061-9. 10.1182/blood-2005-07-2853.
Obexer P, Hagenbuchner J, Rupp M, Salvador C, Holzner M, Deutsch M, Porto V, Kofler R, Unterkircher T, Ausserlechner MJ: p16INK4A sensitizes human leukemia cells to FAS-and glucocorticoid-induced apoptosis via induction of BBC3/Puma and repression of MCL1 and BCL2. The Journal of biological chemistry. 2009, 284 (45): 30933-40. 10.1074/jbc.M109.051441.
Beverly LJ, Varmus HE: MYC-induced myeloid leukemogenesis is accelerated by all six members of the antiapoptotic BCL family. Oncogene. 2009, 28 (9): 1274-9. 10.1038/onc.2008.466.
Craig RW: MCL1 provides a window on the role of the BCL2 family in cell proliferation, differentiation and tumorigenesis. Leukemia. 2002, 16 (4): 444-54. 10.1038/sj.leu.2402416.
Raval A, Tanner SM, Byrd JC, Angerman EB, Perko JD, Chen SS, Hackanson B, Grever MR, Lucas DM, Matkovic JJ, Lin TS, Kipps TJ, Murray F, Weisenburger D, Sanger W, Lynch J, Watson P, Jansen M, Yoshinaga Y, Rosenquist R, de Jong PJ, Coggill P, Beck S, Lynch H, de la Chapelle A, Plass C: Downregulation of death-associated protein kinase 1 (DAPK1) in chronic lymphocytic leukemia. Cell. 2007, 129 (5): 879-90. 10.1016/j.cell.2007.03.043.
Qian J, Wang YL, Lin J, Yao DM, Xu WR, Wu CY: Aberrant methylation of the death-associated protein kinase 1 (DAPK1) CpG island in chronic myeloid leukemia. European journal of haematology. 2009, 82 (2): 119-23. 10.1111/j.1600-0609.2008.01178.x.
Holleman A, den Boer ML, de Menezes RX, Cheok MH, Cheng C, Kazemier KM, Janka-Schaub GE, Gobel U, Graubner UB, Evans WE, Pieters R: The expression of 70 apoptosis genes in relation to lineage, genetic subtype, cellular drug resistance, and outcome in childhood acute lymphoblastic leukemia. Blood. 2006, 107 (2): 769-76. 10.1182/blood-2005-07-2930.
Larramendy ML, Niini T, Elonen E, Nagy B, Ollila J, Vihinen M, Knuutila S: Overexpression of translocation-associated fusion genes of FGFRI, MYC, NPMI, and DEK, but absence of the translocations in acute myeloid leukemia. A microarray analysis. Haematologica. 2002, 87 (6): 569-77.
Gnoni A, Marech I, Silvestris N, Vacca A, Lorusso V: Dasatinib: an anti-tumour agent via Src inhibition. Curr Drug Targets. 2011, 12 (4): 563-578. 10.2174/138945011794751591.
Aplenc R, Blaney SM, Strauss LC, Balis FM, Shusterman S, Ingle AM, Agrawal S, Sun J, Wright JJ, Adamson PC: Pediatric Phase I Trial and Pharmacokinetic Study of Dasatinib: A Report From the Children's Oncology Group Phase I Consortium. Journal of Clinical Oncology. 2011
Andrade MV, Hiragun T, Beaven MA: Dexamethasone suppresses antigen-induced activation of phosphatidylinositol 3-kinase and downstream responses in mast cells. J Immunol. 2004, 172 (12): 7254-62.
Bandyopadhyay S, Chiang C, Srivastava J, Gersten M, White S, Bell R, Kurschner C, Martin CH, Smoot M, Sahasrabudhe S, Barber DL, Chanda SK, Ideker T: A human MAP kinase interactome. Nat Meth. 2010, 7 (10): 801-805. 10.1038/nmeth.1506.
Medh RD, Wang A, Zhou F, Thompson EB: Constitutive expression of ectopic c-Myc delays glucocorticoid-evoked apoptosis of human leukemic CEM-C7 cells. Oncogene. 2001, 20 (34): 4629-39. 10.1038/sj.onc.1204680.
Thompson EB, Johnson BH: Regulation of a distinctive set of genes in glucocorticoid-evoked apoptosis in CEM human lymphoid cells. Recent progress in hormone research. 2003, 58: 175-97. 10.1210/rp.58.1.175.
Laane E, Panaretakis T, Pokrovskaja K, Buentke E, Corcoran M, Soderhall S, Heyman M, Mazur J, Zhivotovsky B, Porwit A, Grander D: Dexamethasone-induced apoptosis in acute lymphoblastic leukemia involves differential regulation of Bcl-2 family members. Haematologica. 2007, 92 (11): 1460-9. 10.3324/haematol.10543.
Kfir-Erenfeld S, Sionov RV, Spokoini R, Cohen O, Yefenof E: Protein kinase networks regulating glucocorticoid-induced apoptosis of hematopoietic cancer cells: fundamental aspects and practical considerations. Leuk Lymphoma. 2010, 51 (11): 1968-2005. 10.3109/10428194.2010.506570.
This study was supported in part by Norwegian, Iceland, Lichtenstein & Hellenic ministry of Finance, Grant EL0049 and by the Information Society Technology program of the European Commission "e-Laboratory for Interdisciplinary Collaborative Research in Data Mining and Data-Intensive Sciences (e-LICO)" (IST-2007.4.4-231519).
The authors declare that they have no competing interests.
EGS designed the structure of the preprocessing and analysis pipeline, implemented the corresponding Matlab® scripts, performed the computational analyses and drafted the manuscript. GIL conceived the idea of the hypothetic model of the study, performed the biological experiments, contributed to the biological interpretation of the results and drafted the manuscript. AP supervised the design and implementation of the computational analyses and participated in manuscript revision. DK, SV and FTS contributed in the supervision of various aspects of the work, provided technical consultation and participated in manuscript revision. AAC participated to the conception and design of the study, supervision of the computational analyses, results interpretation and manuscript revision, evaluated the computational analyses results and proofread the manuscript. All authors have read and approved the final manuscript.
Emmanouil G Sifakis, George I Lambrou contributed equally to this work.
Electronic supplementary material
Additional file 1: Cross-platform normalization. One microarray slide per platform (1.2 k and 4.8 k) was selected and a quantile-quantile plot (QQ-plot) was produced (a) before and (b) after the application of cross-platform normalization. In each QQ-plot, the quantiles of all gene expression values of the first slide were plotted against the quantiles of all gene expression values of the second slide. In the case where the gene expression values of the two slides come from the same distribution, the points in the plot should fall near the straight line. (PDF 15 KB)
Additional file 2: Optimal cluster number determination. K-means clustering was executed for a number of clusters, varying between 2 to 30. For each cluster number, the best (maximum) value of all the average silhouette widths obtained at 1,000 executions was plotted against the cluster number. Since the maximum values of the average silhouette width did not exhibit any specific trend, the optimal cluster number was determined as the one corresponding to the maximum value of the plot, indicated by the arrow. For the computation of the silhouettes the squared Euclidean distance was also used. (PDF 9 KB)
Additional file 3: Unified dataset after data integration. The list of the 490 genes, common in all experimental platforms and replicates (CGS) and their corresponding fold change ratios per experiment. Data integration was performed after (i) matching the reporters on the two microarray platforms through UniGene Cluster IDs, and (ii) applying a cross-platform normalization approach. (XLSX 63 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.