- Open Access
Microenvironmental genomic alterations reveal signaling networks for head and neck squamous cell carcinoma
© Bebek et al; licensee BioMed Central Ltd. 2011
- Received: 18 March 2011
- Accepted: 2 August 2011
- Published: 2 August 2011
Advanced stage head and neck squamous cell carcinoma (HNSCC) is an aggressive cancer with low survival rates. Loss-of-heterozygosity/allelic imbalance (LOH/AI) analysis has been widely used to identify genomic alterations in solid tumors and the tumor microenvironment (stroma). We hypothesize that these identified alterations can point to signaling networks functioning in HNSCC epithelial-tumor and surrounding stroma (tumor microenvironment).
Under the assumption that genes in proximity to identified LOH/AI regions are correlated with the tumorigenic phenotype, we mined publicly available biological information to identify pathway segments (signaling proteins connected to each other in a network) and identify the role of tumor microenvironment in HNSCC. Across both neoplastic epithelial cells and the surrounding stromal cells, genetic alterations in HNSCC were successfully identified, and 75 markers were observed to have significantly different LOH/AI frequencies in these compartments (p < 0.026). We applied a network identification approach to the genes in proximity to these 75 markers in cancer epithelium and stroma in order to identify biological networks that can describe functional associations amongst these marker-associated genes.
We verified the involvement of T-cell receptor signaling pathways in HNSCC as well as associated oncogenes such as LCK and PLCB1, and tumor suppressors such as STAT5A, PTPN6, PARK2. We identified expression levels of genes within significant LOH/AI regions specific to stroma networks that correlate with better outcome in radiation therapy. By integrating various levels of high-throughput data, we were able to precisely focus on specific proteins and genes that are germane to HNSCC.
- Human Papilloma Virus
- Cold Spot
- Neoplastic Epithelial Cell
- NCI60 Cell Line
- Stroma Sample
HNSCC is the sixth most common cancer and remains a major cause of cancer morbidity and mortality worldwide . More than 85% of head and neck squamous cell carcinomas (HNSCC) are related to tobacco use, while others may have a relationship to viral etiologies such as human papillomavirus (HPV) infection/colonization. Nevertheless, advanced stage HNSCC remains an aggressive cancer with low survival rates. Molecular studies suggest that HNSCC results from cumulative epigenetic and genetic alterations [2–4]. Various genomic regions and/or genes have been correlated with survival in HNSCC or classified as early detection/aggressiveness markers . Albeit incomplete, such baseline knowledge of HNSCC genetics builds a foundation for exploration of functional associations between these structural alterations and tumorigenesis. Identifying such networks through a more systematic examination of HNSCC is a challenge and the focus of this study.
Recent genome-scanning technologies uncovered an unexpectedly large amount of structural variation (SV) in the human genome [2, 5–9]. Structural variations comprise a large set of alterations including deletions, duplications, large-scale copy-number variants, inversions and translocations in the genome . On the extreme, cancer genomes are known to attain frequent alterations in their gross chromosomal structure by amplification, deletion, translocation and/or inversion of chromosomal segments . These structural variations can inactivate genes, produce multiple copies of genes thereby increasing gene activity or, in rare situations, result in the fusion of two genes. Alterations in tandem may be critical to cancer onset and progression.
Loss-of-heterozygosity/allelic imbalance (LOH/AI) scanning has been widely used to identify genetic alterations in tumor samples. The absence or an imbalanced signal of a DNA marker in the tumor sample would suggest LOH/AI in these cancerous cells . Numerous studies reporting localized and/or genome-wide LOH/AI analyses have discovered specific loci with consistently high frequencies of LOH/AI in HNSCC. These observations have provided key clues for identification of tumor suppressor genes in this malignancy [2, 13]. Moreover, it is now common practice to utilize laser capture micro-dissection (LCM) and LOH/AI analysis of tumor compartments, namely, neoplastic epithelial cells and the surrounding cancer-associated (previously presumed to be non-cancerous) stromal cells (part of the tumor microenvironment) [14–21]. For example, LOH/AI analysis of DNA from the neoplastic epithelial cells of invasive breast carcinomas and surrounding stroma revealed that stromal somatic mutations of TP53 in stromal cells, but not epithelial neoplasia, correlated with regional nodal metastases . In the absence of stromal TP53 mutation, LOH/AI at 5 specific loci in the stromal cells also correlated with regional nodal metastases . Subsequently, only with extensive empiric molecular and cell biology studies did a mechanism for this genetic observation emerge . In general, however, extended functional associations of genes within these regions with their cellular signaling mechanisms have yet to be made. It is hoped that the approach described here will minimize the time and effort put forward for pinpointing functional mechanisms from tumor-associated bicompartmental somatic genomic observations without prolonged repeated empiric work on multiple candidate pathways.
In this study, therefore, we have applied an integrated network discovery framework [24–26] to identify distinct signaling pathway networks (SPN) of the two compartments of HNSCC. Genes of interest are surveyed and signaling networks identifying genes affected by these variations are visualized. We also investigated bicompartmental genomic alterations and their associated SPN's in the context of radiation therapy and human papilloma virus (HPV) status, both germane factors in HNSCC treatment response. Ultimately, our systems biology approach of pathway identification should provide invaluable knowledge in understanding the inter-compartmental and inter-network-based events in HNSCC tumorigenesis and importantly, guide empiric molecular and cellular biology experiments in a targeted manner.
LOH/AI gene identification
Patient Characteristics (N = 122)*
Frequency, No. (%)
Age, mean (SD), y
T3 or T4
Regional nodal metastases
Low G1 or 2
Hot spot-, cold spot- and clinicopathological feature-associated microsatellite markers identified in HNSCC tissue compartments
Epithelium and stroma
For this set of 75 markers, we extended marker locations 250 kb in both directions of a marker to identify genes within proximity. The parameter (250 kb) was chosen for computational flexibility. This extension returned 273 genes that lay within proximity of these marker locations (See Additional File 1, Table S3). The number of genes included in the region increases linearly as the flanking regions are extended (See Additional File 2, Supp. Text). A larger set of genes diminishes the effectiveness of the methodology since the number of unrelated genes increases. For instance, if genes within the same loci of identified markers are used (> 250 kb, varying based on loci size), the mapping would return ~2200 genes for these 75 markers. Thus, we decided against this all-encompassing approach so that we could establish an effective methodology (See Additional File 2, supplementary text and Additional File 3, Figure S4).
Genes are filtered by establishing networks
Signaling networks highlight functional associations of tumor related genes
Signaling events in the cell play a critical role in the execution of key biological functions. To further investigate the role of the filtered genes within LOH/AI regions of interest, we searched for signaling pathway networks, which were generated using mRNA expression levels, known key signaling pathways, protein-protein interactions, and characteristics of these proteins. The signaling pathway search greatly decreased the number of genes associated with each marker (down ~50 from 273). In this way, an extended list of genes was reduced to a short list of genes that are functionally correlated with one another in the HNSCC context.
We then compared our short list of genes with the oncogenes and tumor suppressor genes that have been previously identified to be associated with HNSCC and other cancers in earlier studies (listed in Table 2). Earlier studies may have identified genes of interest by observing their structural loss or reduction in function. In the present work, our network search workflow (Figure 1) has been successfully verified by identifying genes that were previously associated with HNSCC. For further verification, we found that our methodology accurately identified genes that were previously identified as tumor suppressors, proto-oncogenes and metastasis-related genes in earlier studies (see Additional File 1, Table S6, Table S7 and Table S8). In addition, the generated networks included known head and neck cancer biomarkers. Common fragile site genes (DAG1 and PARK2) and various genes that have elevated expression patterns in cancers are also connected in these networks.
Structural variations are involved in initiation and progression of HNSCC
In summary, networks signifying medium- to large-scale structural variations are predicted through integration of genome-wide LOH/AI analysis, tumor-derived mRNA expression levels, and high throughput proteomic, pathway and annotation data.
Verifying networks identified via cluster analysis of mRNA expression data
To strengthen our claim that the generated networks are highly significant in describing the disease, in this case HNSCC, we also analyzed randomly picked genes from the protein-protein interaction database, the primary database of the computational framework. We acquired microarray expression levels from the same mRNA dataset for these genes and generated an unsupervised clustering for these genes for comparison. These clusters show increased disarray (See Additional File 3, Figure S2) when compared to the expression patterns of the genes placed in the network through the computational framework.
Radiation response of identified genes and networks
Proteins in the signaling networks associated with structural hot- and cold-spots generated for stroma and epithelium of HNSCC
Tumor Suppressor Gene
ACVR1B, DOK2, PARK2, PTPN6, STAT5A
DOK2, PTPN6, STAT5A
CRK, EGFR , LCK, SRC
EGFR, LCK, PARK2, SRC
Oncogenic when overexpressed
Increased expression in Cancers
CDC2, CREM, HOXB4, NTSR2, PVR, PVRL1
ADRBK1, CCR4 , HOXB4
Inactive in Cancers
Serine/threonine protein kinase
Role in Metastasis
HNSCC association not identified yet
ADAM15, BSN, CBLC, CNTN4, EMX1, GRIN2B, HOXB1, KRT82, SKAP1, TLE6
DSCAM, EMX1, GRIN2B, HOXB1, SKAP1, TLE6
In this study, we sought to identify signaling pathway networks in HNSCC-derived carcinoma and their associated stroma by genotyping DNA from each compartment with microsatellite markers and integrating independent publicly available HNSCC-relevant somatic microarray-based mRNA expression and other datasets , resulting in a first description of integrative -omics-derived genes to signaling pathway networks in the neoplastic epithelial carcinoma cells and the surrounding tumor-associated stromal fibroblasts. We followed a computational workflow that integrates extensive amounts of high-throughput data, namely protein-protein interactions, gene ontology annotations, protein colocalization data, and known signaling pathways to form signaling networks that can lead to a better understanding of HNSCC (Figure 1). In addition, it is hoped that this type of approach would also result in specific pathways that can be targeted for empiric study linking genomic variation and pathogenesis without taking a candidate approach in functional analysis.
Genetic alterations such as copy number aberrations or LOH/AI have been shown to be associated with HNSCC initiation and progression . In LOH/AI testing, the microsatellite markers are informative in a location-specific manner, however, these markers are an average of 9 cM apart. Hence, we extended the coverage to 250 kb flanking each side of the 75 significant (71 hot + cold spot markers + 4 markers associated with tumor size/nodal metastases) marker locations to generate a list of genes in close proximity. By choosing a shorter segment of the genomic region(s) near a significant marker (500 kb/marker instead of the marker's whole locus, in this case, 9 × 2 cM), we were able to narrow our search space and increase efficiency and minimizing false positives. We believe that our approach has identified significant genomic regions with viable functional associations. In this study, we have utilized microarray data that were generated from tumor samples that were at least 80% tumor cellularity . An 80% tumor-cellularity does not mean 20% are stroma. Because we are looking at tumor-associated stroma, the 80% tumor-cellularity should also contain its tumor-associated stromal cells, but the precise make-up is unknown. The lack of publicly available subcompartment-specific gene expression profiles certainly poses its own challenges. However, since the pathway analysis is seeded from the genomic alterations of the subcompartments, the microarray data should still carry general patterns of expression profiles from head and neck tissue. Hence, the resulting pathways so identified should represent reasonably accurate stroma- and epithelium-specific signaling pathway networks. The generated networks contain a significant number of stroma- and epithelium-specific genes identified through the genotyping experiments. The networks reflect this classification via utilization of an integrative -omics approach. This reduces any false signals that might be introduced via any platform that is utilized.
Signaling pathways of HNSCC
In this study, identifying large numbers of frequent LOH/AI in stroma suggests that genetic alterations in this compartment of the tumor precede the genetic alteration in the surrounding cells, which might be consistent with the well-known field-effect theory of cancerization [33, 34]. We report networks based on these significant markers, which also highlight hot spot marker genes that mostly cluster in the stromal network. In recent studies, the interaction of epithelium and stroma of breast carcinoma was investigated [14, 35, 36]. Similar to our observation in HNSCC, the location of the LOH/AI regions in the epithelial cells of breast cancer are concentrated in a smaller region, specifically, a smaller number of markers with much higher LOH/AI frequencies; whereas in the stromal cells, they are more spatially complex, distributed over a larger number of loci. Using these observations, a model of carcinogenesis was developed, at least for breast cancers: transformation initiates in the epithelial cells (higher LOH/AI frequencies) while stromal genomic alterations may dictate biology and in term affect the epithelial component . This genomic observation has been mechanistically validated .
We have observed a larger number of genetic alterations in the tumor stroma than in the epithelium. Additionally, not all changes in epithelium are observed in stroma. However, these added changes could play a different and parallel role in HNSCC carcinogenesis. This observation parallels the findings of independent studies showing somatic mutations and/or LOH/AI in the stroma of breast cancer [14, 37] colon cancer , bladder cancer , and ovarian cancer . It is plausible that the large number of markers involved in the stroma of HNSCC and bladder cancer could represent a field effect after exposure to shared carcinogens. Just as in other cancers such as those of the breast, the diversity of markers in the stroma may explain biological diversity . The changes in the stroma resulting from alterations in the signaling pathways may then cross-talk with the cancer epithelium and induce genomic instability, as has been shown by Lisanti's group for breast cancers .
Similarities and differences between stroma and epithelium networks
The identified signaling pathway networks point out differences in signaling in stroma and in epithelium (Figure 2). In investigating the expression correlations among proteins to identify the pattern of up/down regulation in tumors versus their corresponding normal tissues, PARK2 (Parkinson disease [autosomal recessive, juvenile] 2, parkin), a gene lying in a hotspot, was found to be common in both compartments' networks. PARK2 is within the fragile site FRA6E on chromosome 6, a region shown to be unstable and prone to breakage and rearrangements. DAG1 from this region was observed as inactivated in multiple cancers . We observe that both of these genes in HNSCC are affected (Figure 2) with likely loss of expression.
EGFR-PTK2B signaling modulates ubiquitin (Ub)/proteasome pathway-mediated intracellular trafficking. PYK2B activation is also critical for the activation of SRC downstream of EGFR, which we do not observe in HNSCC. In this study, we observe that EGFR transactivation prevented the phosphorylation of the nonreceptor tyrosine kinases PYK2B and SRC, locating these kinases downstream of the transactivated EGFR as noted. Although as highlighted in the signaling networks, SKAP1 (SRC kinase associated phosphoprotein 1) is identified as a cold spot, the lack of signaling starting through EGFR prevents SRC activation. SRC is expressed at low levels in most cell types and, in the absence of appropriate extracellular stimuli, maintained in an inactive conformation.
We associated genes that are activated or have gain-of-function in other cancers with those found in HNSCC, e.g., HOXB4, PVR, RHOA, ADRBK1 and CCR4. Pathways that are common to multiple cancers can be identified by incorporating these types of oncogenes from multiple studies (Additional File 1, Table S6 and Table S7). Moreover, genes like CCR4 and GSK3A highlighted in this study are already candidate targets for therapy in other cancers . Human breast, ovarian, renal, lung and colon tumor specimens have been analyzed for somatic RHOA mutations previously. No intragenic mutations in RHOA were found, nor a correlation between RHOA mRNA expression and the presence or absence of 3p21 deletions. This suggests likely duplication of RHOA in HNSCC as well (also verified by the increased mRNA expression levels shown in Figure 2) .
Regulatory T cells are important in modulating antitumor immune response. In both compartments, we see T-cell related signaling proteins, such as proto-oncogene LCK (T cell-specific protein-tyrosine kinase), tumor suppressor PTPN6 (Tyrosine-protein phosphatase), and SKAP1 (Src kinase-associated phosphoprotein 1). In cells, SKAP1 has a critical role in inside-out signaling (regulatory signaling that originate within the cell cytoplasm and are then transmitted to the external ligand-binding domain of a receptor) by coupling T-cell antigen receptor stimulation to the activation of integrins. In both compartments, SKAP1 interacts with LCK, which is most commonly found in T cells (Figure 2). In an earlier study , STAT5B was shown to contribute to LCK-induced cell proliferation and resistance to apoptosis. Similarly STAT5A, a STAT5B isoform, might be carrying out a similar activity in HNSCC. Hence, increased constitutive activation of STAT5 was detected in transformed compared with normal squamous cells. It is known that blockade of TGF-alpha or EGFR, ended STAT5 activation . However, observing down regulation of EGFR in this cancer (Figure 2 and Figure 3), we conclude that the control on proliferation is lost.
HNSCC biology is consistent in both HPV+ and HPV- patients
In this study, we did not observe differences in biological networks of HNSCC with and without human papillomavirus (HPV) in the context of the stroma. HPV infection is a strong risk factor for HNSCC  regardless of other factors such as tobacco or alcohol use. However, it should be noted that the HPV "effect" is germane only in certain oropharyngeal sites of HNSCC. Furthermore, depending on the manner and quantity of subtyping HPV, in fact, the jury is still out regarding the role of HPV in HNSCC. Unsupervised hierarchal clustering of mRNA expression levels of the network genes in stroma and epithelium showed that the expression patterns of HPV+ and HPV- patients are similar (Figure 3) in these networks. In essence, we hypothesize that these networks represent the biology in both tissue types, since they are built upon the genomic alterations and integrated with tissue specific message signals. Therefore, these heat maps, as opposed to subclustering into two patient groups (+ and -), reveal that only patient and normal sample differences are observed. Interestingly, when methylated genes are clustered via the same microarray data used in our network identification, the HPV+ and HPV - patients clearly separate from each other (Figure 4). To rule out that this might be due to tumor site differences, since HPV+ cases are observed with highest prevalence in oropharynx and base of tongue , we have repeated this clustering with a subset of patients that are site matched (11 pairs matched by site and stage), and observed a similar result (Figure S3).
The difference in methylated genes between HPV+ and HPV- patients is mostly due to changes in cellular machinery caused by either HPV's integration into the genome or the latter's inflammation-related effects on methylation. For instance p16 (CDKN2A) is known to be overexpressed in HPV+ HNSCC patients . Although these differences were explained by HPV modifying the cellular expression machinery to create different expression profiles , our networks show that the essential genes in tumorigenesis are similar in both patient groups. In other words, HPV integration affects genes with similar downstream consequences as other mutational events observed in HPV- tumors. Hence, HNSCC-relevant pathways derived from LOH/AI regions of HNSCC as shown in our results (Figure 2, and Figure 3) has not been differentiated with HPV initiated HNSCC. Indeed, our current data suggest that whether it is HPV-associated or not, what is important are the final common pathways.
HNSCC is a result of cumulative genetic and epigenetic alterations . In this study, we have only considered genetic markers of HNSCC and mRNA levels in the tissue to measure these changes. The computational data mining approach can be easily adapted to include any other high throughput experimentation, such as methylation profiles, single nucleotide polymorphisms etc. In future studies, using this successful workflow as a basis, we will extend our knowledgebase to fine map HNSCC signaling pathway networks.
Stroma may mediate better response to radiation therapy
Microarray profiles of radiation response in the NCI60 cell lines
sd of Q
Epithelium Network (EN)
Stroma Network (SN)
Intersection of EN & SN
Hot/Cold spots in SN
Hot/Cold spots in EN
Genes within 250 K of Epithelium Markers
Genes within 250 K of Stroma Markers
The proposed framework establishes valuable foundations towards building tumor-specific signaling pathway networks, which in return will provide a more thorough understanding of the pathobiology of HNSCC. The framework not only reduces the search space but also enables us to focus on specific proteins and genes that are active in HNSCC, including novel proteins related to molecular mechanisms involved in HNSCC. Pathways and networks are built up efficiently, utilizing widely available high-throughput data and providing powerful discovery tools for research. Our present work also demonstrates that this approach and framework can be applied to the tumor microenvironment whose role in tumorigenesis, invasion, progression and response in therapy will only gain in prominence [22, 23, 48].
Hot and cold spots of LOH/AI in HNSCC
In our study, we analyzed HNSCC samples that were previously collected and genotyped in an earlier study . The two compartments of the neoplastic tissue (epithelium and stroma) in 122 samples (Table 1) were isolated using LCM as previously described in . LOH/AI markers used in this study have coverage of 7 to 29 markers per chromosome, i.e. about 9-cM intermarker distance. In total, 366 microsatellite markers were analyzed in both epithelium and stroma samples from the 122 patients (Overall 244 samples, 122 epithelium and 122 stroma samples of the 122 patients). In this earlier study, all significant regions were named hot spots, attributing to their importance, regardless of their LOH/AI frequency.
In this study, hot and cold spots of regional LOH/AI are defined and identified. The LOH/AI regions that have significantly higher frequency of LOH/AI (p-value < 0.05) at a marker or markers compared with other markers along the same chromosome are named hot spots. In contrast if LOH/AI regions have significantly lower frequency of LOH/AI (p-value < 0.05) at a marker or markers compared with other markers along the same chromosome, we named these regions cold spots (See Table 2 for a summary, Additional File 1, Table S1 for epithelium hot/cold spots, and Additional File 1, Table S2 for stroma hot/cold spots). Since hot spot regions have significantly higher frequencies of LOH/AI, we expected to observe variation in these regions, whereas cold spots are regions that were less likely to carry variation. Among the 75 genomic locations of interest, 34 of them are cold spots, and 37 are hot spots. We also included four more markers that correlate to tumor size and nodal status that is not characterized as hot or cold spot in HNSCC (three of the four observed in stroma, and one in epithelium) .
Tissue specific mRNA microarray data acquisition
Based on the assumption that genes affected by carcinogenesis should reflect their altered state on their respective mRNA expression levels, we acquired HNSCC somatic expression array data. Genes within close proximity of significant markers of LOH/AI can be associated with the available expression array data to further reveal relationships that can lead to clues about the role of these genes in biological pathways. Hence, publicly available microarray expression data for HNSCC is acquired from the Gene Expression Omnibus . The array source is screened by the platform used, tissue (for controls) and tumors analyzed. The probe sets in this study are processed using the robust multiarray averaging (RMA). In this publicly available data set human gene expression levels were measured using Affymetrix U133 Plus 2.0 arrays. This array is a comprehensive genome-wide expression analysis chip that analyzes the expression level of over 47,000 transcripts and variants. We have merged all of the expression profiles and calculated the Pearson's Correlation Coefficient of all the genes over all the samples.
Identifying signaling pathway networks of HNSCC
Network analysis frameworks are commonly used for computational analysis of high-throughput molecular interaction data and are useful in determining the conservation and divergence of functional organization in biological systems. Current widely used approaches are generally limited to specific target patterns, such as conserved sub-networks and motifs utilizing shortest path algorithms, nearest neighbor queries or topological properties based on limited abstractions from well annotated pathway databases such as KEGG. On the other hand, the computational framework utilized in this study (Figure 1) facilitates identification of components and features of the cellular network that characterize similarities and differences between cancerous and normal cells from a functional perspective. Details of this framework are given in the supplementary text.
Identifying genes within vicinity of LOH/AI markers
First, possible LOH/AI regions of HNSCC are identified using a similar approach presented in . Although carcinogenesis pathway of HNSCC does not directly represent functional relationships of cancer related genes, this information can be utilized to identify signaling pathways via a more systematic and integrated manner.
The LOH/AI regions identified from the signaling pathway will be associated with possible genes. Although each marker can correspond to more than one gene, by associating the genes with available high-throughput data that is related to cancer, genes can be eliminated. Here our assumption is that, if these LOH/AI regions are correlated with each other in terms of carcinogenesis, the effect should be observed in expression levels as well as interaction patterns of these genes after translation. Hence, if the genes within close proximity of these markers are associated with available high-throughput data, we should observe these relationships and eventually form hypotheses over functional relationships of these genes in terms of pathways. Previous studies [49–51] have used a similar approach, where they measured gene expression through mRNA levels to identify tumor suppressor genes in HNSCC. Identifying these hot spot genes will allow us to form a hypothesis of the functional relationships among these genes.
Identifying gene expression correlation with outcome via Global Test
Global Test is a statistical test, giving a score for association of the expression profile of one or more groups of genes to a given outcome . The test is based on the Cox proportional hazards model and is calculated using martingale residuals. This procedure allows us to test hypotheses about the influence of these groups of genes on survival directly; in our case response to genotoxic stress. A study reporting large-scale gene expression changes in response to genotoxic stress is utilized. The data is downloaded from GEO (accession GSE7505) . In this dataset, radiation response was measured in NCI60 cell lines using NHGRI Homo sapiens 6 K array. Out of the 63 array samples, 15 cell lines were labeled as sensitive/resistant to genotoxic stress. Using these 15 samples and the seven set of genes (hot and cold spot genes (Additional File 1, Table S4 and Table S5), stroma and epithelium networks genes (Figure 2, Additional File 1, Table S6 and Table S7) and their intersection, and only the hot/cold spot genes that are identified in these two networks) the global test is run and p-values are acquired (Table 4). P-value calculation method was done using permutations over the whole microarray experiment downloaded. The number of permutations was limited to number of genes on each array used.
Unspervised hierarchial clustering of gene expression profiles
The gene expression profile of each sample is first normalized by transforming values so that the mean is 0 and the standard deviation is 1. The clustering is performed by calculating Pearson's correlation coefficients between mRNA expression profiles over all samples and based on these distances building dendograms with hierarachical clustering method as it is implemented in Matlab (Matworks, Natick, MA). The heatmaps are generated based on the final clustering.
GB is supported in part from National Institutes of Health grants R25T-CA094186, P30-CA043703 and UL1-RR024989. CE is the Sondra J. and Stephen R. Hardis Endowed Chair of Cancer Genomic Medicine at the Cleveland Clinic, was a recipient of the Doris Duke Distinguished Clinical Scientist Award and is a recipient of the American Cancer Society Clinical Research Professorship, generously supported, in part, by the F.M. Kirby Foundation.
- American Cancer Society: Cancer Facts and Figures. 2009Google Scholar
- Chen Y, Chen C: DNA copy number variation and loss of heterozygosity in relation to recurrence of and survival from head and neck squamous cell carcinoma: A review. Head & Neck. 2008, 30: 1361-1383. 10.1002/hed.20861.View ArticleGoogle Scholar
- Bennett KL, Romigh T, Eng C: Disruption of transforming growth factor-beta signaling by five frequently methylated genes leads to head and neck squamous cell carcinoma pathogenesis. Cancer Res. 2009, 69: 9301-9305. 10.1158/0008-5472.CAN-09-3073.View ArticlePubMedGoogle Scholar
- Bennett KL, Lee W, Lamarre E, Zhang X, Seth R, Scharpf J, Hunt J, Eng C: HPV status-independent association of alcohol and tobacco exposure or prior radiation therapy with promoter methylation of FUSSEL18, EBF3, IRX1, and SEPT9, but not SLC5A8, in head and neck squamous cell carcinomas. Genes Chromosomes Cancer. 2010, 49: 319-326.PubMedGoogle Scholar
- Cooper GMM, Zerr T, Kidd JMM, Eichler EEE, Nickerson DAA: Systematic assessment of copy number variant detection via genome-wide SNP genotyping. Nature genetics. 2008Google Scholar
- Goidts V, Cooper DN, Armengol L, Schempp W, Conroy J, Estivill X, Nowak N, Hameister H, Kehrer-Sawatzki H: Complex patterns of copy number variation at sites of segmental duplications: an important category of structural variation in the human genome. Hum Genet. 2006, 120: 270-284. 10.1007/s00439-006-0217-y.View ArticlePubMedGoogle Scholar
- Itsara A, Cooper GM, Baker C, Girirajan S, Li J, Absher D, Krauss RM, Myers RM, Ridker PM, Chasman DI, et al: Population analysis of large copy number variants and hotspots of human genetic disease. Am J Hum Genet. 2009, 84: 148-161. 10.1016/j.ajhg.2008.12.014.PubMed CentralView ArticlePubMedGoogle Scholar
- Jakobsson M, Scholz SW, Scheet P, Gibbs RJ, Vanliere JM, Fung H-C, Szpiech ZA, Degnan JH, Wang K, Guerreiro R, et al: Genotype, haplotype and copy-number variation in worldwide human populations. Nature. 2008, 451:Google Scholar
- Redon R, Ishikawa S, Fitch KR, Feuk L, Perry GH, Andrews TD, Fiegler H, Shapero MH, Carson AR, Chen W, et al: Global variation in copy number in the human genome. Nature. 2006, 444: 444-454. 10.1038/nature05329.PubMed CentralView ArticlePubMedGoogle Scholar
- Feuk L, Carson AR, Scherer SW: Structural variation in the human genome. Nat Rev Genet. 2006, 7: 85-97.View ArticlePubMedGoogle Scholar
- Mardis ER, Wilson RK: Cancer genome sequencing: a review. Hum Mol Genet. 2009, 18: R163-168. 10.1093/hmg/ddp396.PubMed CentralView ArticlePubMedGoogle Scholar
- Alberts B, Johnson A, Lewis J, Raff M, Roberts K, Walter P: Molecular Biology of the Cell [Book and CD-ROM]. Garland Science. 2002Google Scholar
- Chin D, Boyle GM, Theile DR, Parsons PG, Coman WB: Molecular introduction to head and neck cancer (HNSCC) carcinogenesis. Br J Plast Surg. 2004, 57: 595-602. 10.1016/j.bjps.2004.06.010.View ArticlePubMedGoogle Scholar
- Fukino K, Shen L, Matsumoto S, Morrison CD, Mutter GL, Eng C, Fukino K, Shen L, Matsumoto S, Morrison CD, et al: Combined total genome loss of heterozygosity scan of breast cancer stroma and epithelium reveals multiplicity of stromal targets. Cancer Res. 2004, 64:Google Scholar
- Kurose K, Hoshaw-Woodard S, Adeyinka A, Lemeshow S, Watson PH, Eng C: Genetic model of multi-step breast carcinogenesis involving the epithelium and stroma: clues to tumour-microenvironment interactions. Hum Mol Genet. 2001, 10:Google Scholar
- Weber F, Xu Y, Zhang L, Patocs A, Shen L, Platzer P, Eng C, Weber F, Xu Y, Zhang L, et al: Microenvironmental genomic alterations and clinicopathological behavior in head and neck squamous cell carcinoma. JAMA. 2007, 297:Google Scholar
- Yan PS, Venkataramu C, Ibrahim A, Liu JC, Shen RZ, Diaz NM, Centeno B, Weber F, Leu YW, Shapiro CL, et al: Mapping geographic zones of cancer risk with epigenetic biomarkers in normal breast tissue. Clin Cancer Res. 2006, 12:Google Scholar
- Curino A, Patel V, Nielsen BS, Iskander AJ, Ensley JF, Yoo GH, Holsinger FC, Myers JN, El-Nagaar A, Kellman RM, et al: Detection of plasminogen activators in oral cancer by laser capture microdissection combined with zymography. Oral Oncology. 2004, 40: 1026-1032. 10.1016/j.oraloncology.2004.05.011.View ArticlePubMedGoogle Scholar
- Bian Y, Knobloch TJ, Sadim M, Kaklamani V, Raji A, Yang GY, Weghorst CM, Pasche B: Somatic acquisition of TGFBR1*6A by epithelial and stromal cells during head and neck and colon cancer development. Hum Mol Genet. 2007, 16: 3128-3135. 10.1093/hmg/ddm274.PubMed CentralView ArticlePubMedGoogle Scholar
- Patel V, Hood BL, Molinolo AA, Lee NH, Conrads TP, Braisted JC, Krizman DB, Veenstra TD, Gutkind JS: Proteomic Analysis of Laser-Captured Paraffin-Embedded Tissues: A Molecular Portrait of Head and Neck Cancer Progression. Clinical Cancer Research. 2008, 14: 1002-1014. 10.1158/1078-0432.CCR-07-1497.View ArticlePubMedGoogle Scholar
- Leethanakul C, Patel V, Gillespie J, Pallente M, Ensley JF, Koontongkaew S, Liotta LA, Emmert-Buck M, Gutkind JS: Distinct pattern of expression of differentiation and growth-related genes in squamous cell carcinomas of the head and neck revealed by the use of laser capture microdissection and cDNA arrays. Oncogene. 2000, 19: 3220-3224. 10.1038/sj.onc.1203703.View ArticlePubMedGoogle Scholar
- Patocs A, Zhang L, Xu Y, Weber F, Caldes T, Mutter GL, Platzer P, Eng C: Breast-cancer stromal cells with TP53 mutations and nodal metastases. N Engl J Med. 2007, 357: 2543-2551. 10.1056/NEJMoa071825.View ArticlePubMedGoogle Scholar
- Lisanti MP, Martinez-Outschoorn UE, Pavlides S, Whitaker-Menezes D, Pestell RG, Howell A, Sotgia F: Accelerated aging in the tumor microenvironment: Connecting aging, inflammation and cancer metabolism with personalized medicine. Cell Cycle. 2011, 10:Google Scholar
- Bebek G, Yang J: PathFinder: mining signal transduction pathway segments from protein-protein interaction networks. BMC Bioinformatics. 2007, 8:Google Scholar
- Bebek G, Patel V, Chance MR: PETALS: Proteomic Evaluation and Topological Analysis of a mutated Locus' Signaling. BMC Bioinformatics. 2010, 11: 596-10.1186/1471-2105-11-596.PubMed CentralView ArticlePubMedGoogle Scholar
- Patel VN, Bebek G, Mariadason JM, Wang D, Augenlicht LH, Chance MR: Prediction and testing of biological networks underlying intestinal cancer. PLoS One. 2010, 5:Google Scholar
- Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000, 25:Google Scholar
- Ongenaert M, Van Neste L, De Meyer T, Menschaert G, Bekaert S, Van Criekinge W: PubMeth: a cancer methylation database combining text-mining and expert annotation. Nucleic Acids Res. 2008, 36: D842-846.PubMed CentralView ArticlePubMedGoogle Scholar
- Goeman JJ, van de Geer SA, de Kort F, van Houwelingen HC: A global test for groups of genes: testing association with a clinical outcome. Bioinformatics. 2004, 20: 93-99. 10.1093/bioinformatics/btg382.View ArticlePubMedGoogle Scholar
- Khodarev NN, Beckett M, Labay E, Darga T, Roizman B, Weichselbaum RR: STAT1 is overexpressed in tumors selected for radioresistance and confers protection from radiation in transduced sensitive cells. Proc Natl Acad Sci USA. 2004, 101: 1714-1719. 10.1073/pnas.0308102100.PubMed CentralView ArticlePubMedGoogle Scholar
- Amundson SA, Do KT, Vinikoor LC, Lee RA, Koch-Paiz CA, Ahn J, Reimers M, Chen Y, Scudiero DA, Weinstein JN, et al: Integrating global gene expression and radiation survival parameters across the 60 cell lines of the National Cancer Institute Anticancer Drug Screen. Cancer Res. 2008, 68: 415-424. 10.1158/0008-5472.CAN-07-2120.View ArticlePubMedGoogle Scholar
- Pyeon D, Newton MA, Lambert PF, den Boon JA, Sengupta S, Marsit CJ, Woodworth CD, Connor JP, Haugen TH, Smith EM, et al: Fundamental differences in cell cycle deregulation in human papillomavirus-positive and human papillomavirus-negative head/neck and cervical cancers. Cancer Res. 2007, 67: 4605-4619. 10.1158/0008-5472.CAN-06-3619.PubMed CentralView ArticlePubMedGoogle Scholar
- Braakhuis BJ, Tabor MP, Kummer JA, Leemans CR, Brakenhoff RH: A genetic explanation of Slaughter's concept of field cancerization: evidence and clinical implications. Cancer Res. 2003, 63: 1727-1730.PubMedGoogle Scholar
- Ge L, Meng W, Zhou H, Bhowmick N: Could stroma contribute to field cancerization?. Medical hypotheses. 2010, 75: 26-31. 10.1016/j.mehy.2010.01.019.View ArticlePubMedGoogle Scholar
- de Neergaard M, Kim J, Villadsen R, Fridriksdottir AJ, Rank F, Timmermans-Wielenga V, Langerød A, Børresen-Dale A-L, Petersen O-W, Rønnov-Jessen L: Epithelial-Stromal Interaction 1 (EPSTI1) substi- tutes for peritumoral fibroblasts in the tumor microenvironment. Am J Pathol. 2010Google Scholar
- Fukino K, Shen L, Patocs A, Mutter GL, Eng C, Fukino K, Shen L, Patocs A, Mutter GL, Eng C: Genomic instability within tumor stroma and clinicopathological characteristics of sporadic primary invasive breast carcinoma. JAMA. 2007, 297:Google Scholar
- Wernert N, Locherbach C, Wellmann A, Behrens P, Hugel A: Presence of genetic alterations in microdissected stroma of human colon and breast cancers. Anticancer Res. 2001, 21: 2259-2264.PubMedGoogle Scholar
- Paterson RF, Ulbright TM, MacLennan GT, Zhang S, Pan CX, Sweeney CJ, Moore CR, Foster RS, Koch MO, Eble JN, Cheng L: Molecular genetic alterations in the laser-capture-microdissected stroma adjacent to bladder carcinoma. Cancer. 2003, 98: 1830-1836. 10.1002/cncr.11747.View ArticlePubMedGoogle Scholar
- Tuhkanen H, Anttila M, Kosma VM, Yla-Herttuala S, Heinonen S, Kuronen A, Juhola M, Tammi R, Tammi M, Mannermaa A: Genetic alterations in the peritumoral stromal cells of malignant and borderline epithelial ovarian tumors as indicated by allelic imbalance on chromosome 3p. Int J Cancer. 2004, 109: 247-252. 10.1002/ijc.11733.View ArticlePubMedGoogle Scholar
- McAvoy S, Zhu Y, Perez DS, James CD, Smith DI: Disabled-1 is a large common fragile site gene, inactivated in multiple cancers. Genes, Chromosomes and Cancer. 2008, 47: 165-174. 10.1002/gcc.20519.View ArticlePubMedGoogle Scholar
- Ishida T, Ueda R: CCR4 as a novel molecular target for immunotherapy of cancer. Cancer Science. 2006, 97: 1139-1146. 10.1111/j.1349-7006.2006.00307.x.View ArticlePubMedGoogle Scholar
- Faried A, Nakajima M, Sohda M, Miyazaki T, Kato H, Kuwano H: Correlation between RhoA overexpression and tumour progression in esophageal squamous cell carcinoma. Eur J Surg Oncol. 2005, 31: 410-414. 10.1016/j.ejso.2004.12.014.View ArticlePubMedGoogle Scholar
- Shi M, Cooper JC, Yu CL: A constitutively active Lck kinase promotes cell proliferation and resistance to apoptosis through signal transducer and activator of transcription 5b activation. Mol Cancer Res. 2006, 4: 39-45. 10.1158/1541-7786.MCR-05-0202.View ArticlePubMedGoogle Scholar
- Leong PL, Xi S, Drenning SD, Dyer KF, Wentzel AL, Lerner EC, Smithgall TE, Grandis JR: Differential function of STAT5 isoforms in head and neck cancer growth control. Oncogene. 2002, 21: 2846-2853. 10.1038/sj.onc.1205385.View ArticlePubMedGoogle Scholar
- D'Souza G, Kreimer AR, Viscidi R, Pawlita M, Fakhry C, Koch WM, Westra WH, Gillison ML: Case-control study of human papillomavirus and oropharyngeal cancer. N Engl J Med. 2007, 356: 1944-1956. 10.1056/NEJMoa065497.View ArticlePubMedGoogle Scholar
- Munck-Wikland E, Attner P, Nasman A, Hammerstedt L, Ramqvist T, Dalianis T: "Epidemic" increase of tonsillar and base of tongue cancer. Explanation: parallel increase of HPV infections. Lakartidningen. 2010, 107: 1702-1704.PubMedGoogle Scholar
- Slebos RJ, Yi Y, Ely K, Carter J, Evjen A, Zhang X, Shyr Y, Murphy BM, Cmelak AJ, Burkey BB, et al: Gene expression differences associated with human papillomavirus status in head and neck squamous cell carcinoma. Clin Cancer Res. 2006, 12:Google Scholar
- Eng C: Microenvironmental protection in diffuse large-B-cell lymphoma. N Engl J Med. 2008, 359: 2379-2381. 10.1056/NEJMe0808409.View ArticlePubMedGoogle Scholar
- Beder LB, Gunduz M, Ouchida M, Fukushima K, Gunduz E, Ito S, Sakai A, Nagai N, Nishizaki K, Shimizu K, et al: Genome-wide analyses on loss of heterozygosity in head and neck squamous cell carcinomas. Lab Invest. 2003, 83:Google Scholar
- Beder LB, Gunduz M, Ouchida M, Gunduz E, Sakai A, Fukushima K, Nagatsuka H, Ito S, Honjo N, Nishizaki K, et al: Identification of a candidate tumor suppressor gene RHOBTB1 located at a novel allelic loss region 10q21 in head and neck cancer. J Cancer Res Clin Oncol. 2006, 132:Google Scholar
- Gunduz M, Nagatsuka H, Demircan K, Gunduz E, Cengiz B, Ouchida M, Tsujigiwa H, Yamachika E, Fukushima K, Beder L, et al: Frequent deletion and down-regulation of ING4, a candidate tumor suppressor gene at 12p13, in head and neck squamous cell carcinomas. Gene. 2005, 356:Google Scholar
- Chung CH, Parker JS, Karaca G, Wu J, Funkhouser WK, Moore D, Butterfoss D, Xiang D, Zanation A, Yin X, et al: Molecular classification of head and neck squamous cell carcinomas using patterns of gene expression. Cancer Cell. 2004, 5:Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.