A systematic analysis of a mi-RNA inter-pathway regulatory motif
© Di Carlo et al.; licensee BioMed Central Ltd. 2013
Received: 31 May 2013
Accepted: 16 October 2013
Published: 24 October 2013
The continuing discovery of new types and functions of small non-coding RNAs is suggesting the presence of regulatory mechanisms far more complex than the ones currently used to study and design Gene Regulatory Networks. Just focusing on the roles of micro RNAs (miRNAs), they have been found to be part of several intra-pathway regulatory motifs. However, inter-pathway regulatory mechanisms have been often neglected and require further investigation.
In this paper we present the result of a systems biology study aimed at analyzing a high-level inter-pathway regulatory motif called Pathway Protection Loop, not previously described, in which miRNAs seem to play a crucial role in the successful behavior and activation of a pathway. Through the automatic analysis of a large set of public available databases, we found statistical evidence that this inter-pathway regulatory motif is very common in several classes of KEGG Homo Sapiens pathways and concurs in creating a complex regulatory network involving several pathways connected by this specific motif. The role of this motif seems also confirmed by a deeper review of other research activities on selected representative pathways.
Although previous studies suggested transcriptional regulation mechanism at the pathway level such as the Pathway Protection Loop, a high-level analysis like the one proposed in this paper is still missing. The understanding of higher-level regulatory motifs could, as instance, lead to new approaches in the identification of therapeutic targets because it could unveil new and “indirect” paths to activate or silence a target pathway. However, a lot of work still needs to be done to better uncover this high-level inter-pathway regulation including enlarging the analysis to other small non-coding RNA molecules.
Systems biology is increasingly highlighting that a discrete biological function can only rarely be attributed to a single molecule. Instead, most biological characteristics arise from complex interactions among the cell’s numerous constituents, such as proteins, DNA, RNA and small molecules [1–3]. Understanding the structure and the dynamics of complex intercellular networks that contribute to the structure and function of a living cell is therefore mandatory.
The fast development of technologies to collect high-throughput biological data allows us to determine how different molecules interact with each other, leading to a proliferation of biological networks (e.g., protein-protein interaction, metabolic, signaling and transcription-regulatory networks). Several public and commercial network repositories including the WikiPathway database [4, 5], the Ingenuity database , and the Kyoto Encyclopedia of Genes and Genomes (KEGG) [7, 8], collect large amount of curated biological networks that can be explored and analyzed for high-level systemic analysis. None of these networks is independent, instead they form a complex network of networks that is responsible for the behavior of the cell. In this paper we concentrate on the key role that small non-coding RNAs, and in particular micro RNAs (miRNAs), have in this intricate biological network of networks.
Several results have been achieved in the past few years from the research of recurrent motifs in complex Gene Regulation Networks [9–18], highlighting the central role of miRNAs in governing specific regulation mechanisms at the network level [19–22]. In order to move toward the identification of higher-level mechanisms of transcriptional regulation, in this paper we performed a systemic analysis on a large set of well known biological networks to underline the presence of an inter-network regulatory motif in which miRNAs seem involved in a high-level regulatory activity among different networks. Rather then searching for pure topological motifs, available networks have been enriched with biological information from several public repositories to attempt to link obtained results to selected biological mechanisms .
When the pathway is activated, one or more pathway genes co-express one or more intragenic miRNAs.
The intragenic miRNAs expressed by the pathway target one or more transcription factors of some of the PAGs. We call these transcription factors Pathway Antagonist Transcription Factors (PATFs). In some situations, the pathway intragenic miRNA may also target the pathway itself, but this mechanism belongs to well-studied intra-pathway regulations that are not the focus of this work.
The down-regulation of the PATFs has a repressive effect on the expression of the corresponding PAGs.
Down-regulated PAGs have lower ability to express the Antagonist miRNAs. It is worth here to remember that miRNAs have a post-transcriptional regulation role. Intragenic miRNAs that directly target the PAGs would not actually prevent the production of the related Antagonist miRNAs since miRNAs are expressed during transcription. The only way PPLs can form is therefore by mediating the PAGs down-regulation through their corresponding PATFs .
The reduced presence of Antagonist miRNAs contributes to the successful expression of the pathway genes, thus closing the protection loop.
An interesting characteristic of PPL is its hierarchical structure: a very small number of intragenic miRNAs (usually one or two) is able to “defend” the expression of either a large number or even the most important pathway genes.
This paper proposes an extensive systems biology study to analyze the existence and the characteristics of this new motif on a large set of public available networks. Results will show statistical evidence that this inter-network regulatory motif is very common in several classes of considered networks, and it can be used to identify an intricate set of links among networks thus building a high-level pathway to pathway interaction network. Finally, to further support the proposed research activity, literature mining allowed us to find clues of possible dysregulated PPLs in several papers targeting the study of tumors [25, 26].
Results and discussion
To assess the presence of PPLs we analyzed a set of networks for the Homo Sapiens species available in the KEGG database .
The KEGG database contains a set of 203 networks related to the Homo Sapiens species and represents one of the most curated and reliable source of pathway information. KEGG is unique for its focus and coverage of yeast, mouse, and human metabolic and signaling pathways . All 203 pathways have been carefully analyzed in order to keep only representative and reliable networks. Human disease pathways have been excluded from the analysis since they represent deviations from correct behaviors that may change the mechanisms responsible for the formation of PPLs. Moreover, a set of few additional pathways not actually containing a regulatory network have been excluded, obtaining a final set of 158 pathways available for the analysis. All these 158 pathways have been manually checked and are all regulatory sub-networks. Each is, of course, part of THE regulatory network including the whole genome. Nevertheless, the separation in single “functional” pathways is necessary to make the problem manageable with the current tools.
The final set of considered KEGG pathways is reported in the two files Additional file 1 and Additional file 2 provided as additional files to this paper. The KEGG pathway repository contains several classes of networks describing a very large set of biological processes. The type of biological process, and consequently the involved actors (e.g., genes, proteins, metabolites, etc.) may bias the presence or the absence of PPLs. It has therefore been taken into account in our analysis. KEGG already categorizes all pathways according to a hierarchical ontology called the KEGG BRITE hierarchy. We exploited the first hierarchical level of this ontology to cluster all considered pathways into two main categories related to the ability of the corresponding nodes to be involved in miRNA mediated regulatory processes as will be discussed in the Statistical Analysis section. The first category contains 107 metabolic pathways while the second category contains 51 non-metabolic pathways (9 from KEGG cellular processes classification, 16 from KEGG environmental information processes classification, 6 from KEGG genetic information processes classification and 20 from KEGG organismal systems classification).
All pathways have been analyzed to search for the presence of PPLs resorting to a bioinformatics pipeline featuring the aggregation of information from several public on-line biological databases (see the Materials and methods section). To statistically analyze the existence of PPLs in the selected pathways we compared the obtained results with the ones gathered analyzing a population of randomly generated pathways. We generated a random population of 100 randomized networks. The size of each random network has been selected by first computing the mean (μsize) and the standard deviation (σsize) of the size of all networks in the KEGG dataset and by sampling a Normal distribution N(μsize, σsize) to obtain random network sizes comparable with the real ones. Genes composing each network have been then randomly selected from the Sanger Genecode database release 9 (Sanger) .
We analyzed data obtained from our analysis using the R language and its environment for statistical computing . The full set of analyzed data is available in the files Additional file 1 and Additional file 2 provided as additional files to this paper.
Statistical analysis of PPL occurrence in pathways (mirSVR < -0.3): PPL occurrence contingency matrix
Group of pathways
Presence of loops
10 (9.3), [15.8]
97 (90.7), [49.7]
28 (54.9), [44.4]
23 (45.1), [11.8]
25 (25), [39.8]
75 (75), [38.5]
Pairwise Pearson’s Chi-square tests among all possible pairs of pathway groups (i.e., non-metabolic vs. random, metabolic vs. non-metabolic and metabolic vs. random) for PPLs identified with mirSVR < -0.3. p-values have been adjusted applying Holms adjustment
χ2 = 36.7867
p = 3.953649 × 10-09
χ2 = 7.9363
χ2 = 11.9768
p = 4.845243 × 10-03
p = 1.077333 × 10-03
Pearson’s Chi-squared test among the three groups points out that there is significant statistical dependence between rows and columns of the contingency matrix reported in Table 1 (χ2 = 38.8678, d.f. 2, p = 3.631× 10-09), thus confirming our hypothesis that PPLs manifest with different frequencies based on the considered groups. To better understand where differences among groups lie, post-hoc analysis has been performed. We performed a chi-squared test considering all possible pairs of groups (i.e., non-metabolic vs. random, metabolic vs. non-metabolic and metabolic vs. random). Analyzing the obtained results reported in Table 2, we noticed that pathways including PPLs appear with a significant higher frequency in non-metabolic pathways (55%) than in metabolic pathways (9%) (χ2 = 36.7867, p = 3.953649 × 10-09). This insight is in accordance with the study done in , that suggests the presence of a “universe” of miRNAs deeply involved in the regulation of signaling pathways, which represent a large portion of the non-metabolic pathways group. Overall, considering KEGG signaling pathways only, about 71% of them contain PPLs. Instead, as expected, metabolic pathways exhibit a reduced percentage of PPLs due to the high presence of metabolites in their nodes, which are unable to express pathway intragenic miRNAs that are responsible for the creation of PPLs. Pathways including PPLs also appear with significant higher frequency in non-metabolic pathways (55%) than in random pathways (25%) (χ2 = 11.9768, p = 1.077333 × 10-03) confirming our hypothesis that the establishment of this motif is not due to chance. Moreover, the frequency of pathways with PPLs is higher in the random group compared to the metabolic group (χ2 = 7.9363, p = 4.845243 × 10-03). Again, this is non surprising at all. As already stated, metabolic pathways are in large part formed by metabolites unable to express the intragenic miRNAs required to create a PPL. Differently, random pathways include nodes which are randomly sampled from the full set of genes available in the Sanger Genecode database and have a higher probability to include genes expressing miRNAs potentially able to establish a PPL.
To further analyze the characteristics of the proposed motif, we also investigated if we can observe statistical difference in the number of PPLs per pathway among the different groups (this information is available in the Additional file 1). We analyzed the distribution of this variable for Normality with the Kolmogorov-Smirnov test using the R lillie.test procedure. The result confirmed the lack of normality (D = 0.4335, p < 2.2 × 10-16). Eventually, the Kruskal-Wallis test, a non-parametric analysis of variance , has been performed resorting to the R kruskal.test procedure.
Mann–Whitney U test on the PPL numerosity among all possible pairs of pathway groups (i.e., non-metabolic vs. random, metabolic vs. non-metabolic and metabolic vs. random) for PPLs identified with mirSVR < -0.3. p-values have been adjusted applying Holms adjustment
p = 1.0 × 10-09
p = 0.0028
p = 0.0028
Statistical analysis of PPL occurrence in pathways (mirSVR < -0.6): PPL occurrence contingency matrix
Group of pathways
Presence of loops
7 (6.5), [14.0]
100 (93.5), [48.08]
24 (47.05), [48.0]
27 (52.95), [12.98]
19 (19), [38.00]
81 (81), [38.94]
Pairwise Pearson’s Chi-square tests among all possible pairs of pathway groups (i.e., non-metabolic vs. random, metabolic vs. non-metabolic and metabolic vs. random) for PPLs identified with mirSVR < -0.6. p-values have been adjusted applying Holms adjustment
χ2 = 33.4281
p = 2.2218396 × 10-08
χ2 = 6.2143
χ2 = 11.7142
p = 1.26723 × 10-02
p = 1.240482 × 10-03
Mann–Whitney U test on the PPL numerosity among all possible pairs of pathway groups (i.e., non-metabolic vs. random, metabolic vs. non-metabolic and metabolic vs. random) for PPLs identified with mirSVR < -0.6. p-values have been adjusted applying Holms adjustment
p = 9.5 × 10-09
p = 0.0049
p = 0.0029
Pearson’s Chi-squared test on the contingency matrix reported in Table 4, that indicates the frequency in which PPLs manifest in the three considered groups of pathways with this new mirSVR threshold, still confirms that there is significant statistical dependence between rows and columns (χ2 = 36.3039, d.f. 2, p = 1.308 × 10-08), thus confirming that even reducing the set of miRNA targets to the ones with higher score we still observe that PPLs manifest with different frequencies based on the considered groups. Table 5 further confirms this result when post-hoc analysis is performed to analyze differences among pairs of groups. Finally, Kruskal-Wallis rank sum test on the number of loops among the three groups of pathways confirms statistical differences also in this case (H = 34.1145, d.f. 2, p = 3.91 × 10-08), and this difference is confirmed also in Table 6 when Mann–Whitney U tests are used for post-hoc analysis among the different pairs of groups.
This outcome is particularly interesting since it highlights that the identified PPLs mainly involve high-score miRNA gene predictions, thus adding reliability to our findings.
Interaction among networks
Each node of the network represents a KEGG pathway. Two types of nodes are available: (1) hexagonal nodes represent pathways in which PPLs have been detected, (2) rhomboidal nodes are pathways in which no PPLs have been detected but containing at least one PAG. A directed weighted edge connects two pathways if a PPL generated from the first pathway targets a PAG contained in the second pathway. The weight of the edges represents the number of PPLs connecting the two pathways. Furthermore, each node is labeled with an additional parameter reporting the number of PAGs of the pathway that have not been detected in any of the KEGG pathways.
The network reported in Figure 2 clearly shows how the PPL motif creates a very intricate regulatory mechanism among different pathways. 79 pathways are involved in this mechanism and 552 edges identify interactions between pathways involving at least a PPL.
By analyzing the nodes generating PPLs using the Cytoscape network analyzer plugin, it is also possible to highlight that on average, each pathway generating PPLs is connected to 25.111 pathways thus confirming the complexity of the identified motif, which involves the cooperation of several pathways.
This enforces the idea that miRNAs cover different, and often even conflicting, roles in gene regulatory networks. From this perspective, we can identify three important and complementary roles for miRNAs in gene regulation: the first is the well-known intra-pathway regulatory role targeting genes belonging to the pathway itself; the second is an inter-pathway down-regulatory effect, where miRNAs expressed by a pathway directly silence mRNAs from genes belonging to pathways that may be biochemically or functionally incompatible with the pathway that is being expressed; the third is an indirect up-regulatory function where miRNAs, thanks to the PPL motif, indirectly contribute to the pathway up-regulation by down-regulating the Transcription Factors of its PAGs.
In the remaining of this paper two pathways manifesting the PPL motif will be analyzed in detail. The full list of pathways where PPLs have been identified has been provided as additional material to this submission in the form of dot graph files.
PPLs in mTOR signaling pathway
The mTOR signaling pathway has been identified as a hub. It integrates the output of several upstream pathways, including insulin, growth factors and amino acids , as well as cellular nutrition, energy levels and redox status . The mTOR pathway is actually under the analysis of several research units. Its pharmacological targeting looks like an effective method for acting against multiple types of cancer (e.g., leukemia, glioblastoma, myelodysplasia breast, hepatic and pancreatic [35, 36]), in which the mTOR pathway appears dysregulated . The mTOR pathway contains two main complexes: mTORC1 and mTORC2. mTORC1 has been largely analyzed, whereas mTORC2 (regulated by insulin, growth factors, serum, and nutrient levels ) has been less clearly investigated. In order to better understand the role of the mTORC2 complex several knockouts experiments have been performed on its genes and direct interactors. In particular, the RICTOR gene has been highlighted as responsible for metastasis and inhibition of growth factors . Its down-regulation is directly linked to the reduced phosphorylation of AKT and PKC, which leads to an impaired differentiation of Th2 cells, producing IL-4, IL-5, IL-10, and IL-13, responsible for strong antibody production, eosinophil activation, and inhibition of several macrophage functions, providing phagocyte-independent protective responses . Dysregulated type 1/type 2 cytokine production and their skewed development have been implicated in the progression of multiple immune disorders including asthma [41, 42], leukemia , and other cancers . This also leads to a renewed interest in using type 1 and type 2 cytokines as markers of human immune function.
Interestingly, the RICTOR gene appears as an actor in the PPL identified within the mTOR pathway. As shown in Figure 3, the PPL is composed of a pathway host-gene (RSK) expressing a protective intragenic miRNA (miR-1976) which acts against the expression of the MLL transcription factor. MLL is responsible for HOXA9 (one of the PAGs) transcription, leading to the expression of miR-196b (the Antagonist miRNA), which would target RICTOR, if expressed, dysregulating the mTORC2 complex. This result seems to strongly confirm the central role of MLL, the HOXA cluster (HOXA9), and both miR-196b and miR-1976 in Acute Lymphoblastic Leukemia (ALL), as presented by Schotte et al. [25, 45]. The PPL overall suggests that aberrant miR-196b expression may in fact contribute to leukemogenesis, as dysregulation of HOX genes were shown to directly induce leukemia in mice . The results presented by Schotte et al. [25, 45] may suggest that many of the observed dysregulations are compatible with disruption of the observed PPL. The common assumption is that miRNAs discovered in a pathological context have a dysregulatory role; in this case the PPL suggests instead that miR-1976  may have a protective role, and its slight over expression may indicate the mTOR pathway attempt to protect itself.
The most common pathological rearrangements of MLL (t(4;11), t(11;19), t(9;11) and t(1;11)) may mislead the proper miR-1976 regulatory function because the MLL translocation may imply changes in its miRNAs binding sites. Popovic et al.  showed that leukemogenic MLL fusion proteins cause over- expression of miR-196b, while treatment of MLL-AF9 transformed bone marrow cells with miR-196 specific antagomir abrogates their replating potential in methylcellulose. This may suggest that miR-196b function is necessary for MLL fusion-mediated immortalization and it may justify the fact that the mTOR pathway protects itself by not allowing its expression through the PPL. Similarly, the same work shows that the level of miR-196b is decreased up to 14-fold in the absence of MLL, thus confirming the down-regulatory role of miR-1976 on MLL.
To further validate these observations, we analyzed the expected level of expression of interactors in ALL disease retrieved from Gene Expression Atlas (http://www.ebi.ac.uk/gxa/). The expression of both RSK and RICTOR appears compatible with the identified PPL: (1) RSK (E-MTAB-62 experiment, filtered by ALL) shows an upper regulation in ALL (p = 0.002), and it may confirm the attempt of the pathway to protect its correct behavior maximizing the production of protective miR-1976 by over-expressing its host gene; (2) RICTOR (E-MTAB-37 experiment, filtered by ALL) appears globally down (p = 9.07e - 4), accordingly with the observed miR-196b up-regulation.
The reliability of our findings depends on the reliability of the miRNA target predictions, since the more targets are considered, the more loops may appear. In order to consider only reliable predictions we filtered miRNA targets for mirSVR score lower than -0.3 (see Materials and methods section). Relaxing this threshold may identify additional PPLs with weaker target affinity. In this case, we identified three additional low-score miR-1976 targets in the mTOR pathway, which are PAG Transcription Factors involved in PPLs. However, their role and possible involvement in the disease dynamics are still under investigation.
PPLs in the antigen processing and presentation (APP) pathway
APP is composed of two inner pathways responsible for synthesis of major histocompatibility complexes I and II (MHCI and MHCII), which are responsible for cell destruction (when MHCI expression is low) and specific immunization (MCHII).
The NFY complex is well known for peptide presentation in antigen presenting cells  and for being highly enriched in specific phases of the cell-cycle , thus playing a central role in cell control and maturation. The identified PPLs look compatible with the NFY behavior observed in mammals, in which the complex acts as an on/off switch by post-transcriptional mechanisms, and other more subtle post-translational regulations .
miR-30e, located in the intronic region of NFY-C and co-transcribed with its host gene, has been highlighted as responsible for maintaining differentiated cell phenotypes. For instance, the knock out of miR-30 miRNA family induces epithelial-mesenchymal transition of pancreatic islet cells . Moreover, miR-30e is under-expressed in breast, head, neck, and lung tumors, with experimental evidences confirming that its ectopic expression suppresses uncontrolled cell growth . This regulatory role seems compatible with the PPL behavior in which miR-30e is the only miRNA deputed to protect the pathway. Thus, the miR-30e dysregulation may lead to a wrong antigen exposition, which does not allow proper T cells to target the dysregulated cells, avoiding apoptosis driven either by CD8+ or NK T cells .
miR-30e directly targets the STAT1 transcription factor that belongs to the signal transducers and activators of transcription family. STAT1 is involved in up-regulation of genes (interferon stimulated genes) in response to different interferon based stimulation. In particular, after IFN-γ stimulation, STAT1 forms homodimers or heterodimers with STAT3 for binding with GAS (interferon-gamma activated sequence) promoter elements and their further regulation . This feedback loop targeting IFN-γ, may suggest a fine tuning loop between IFN-γ and STAT1.
STAT1 promotes two PAGs: RUNX1 and UGT8. RUNX1, also known as AML1 or CBFA2, is a transcription factor that regulates the fate of hematopoietic stem cell populations and is generally regulated by 2 enhancers, which are tissue specific and drive the binding of lymphoid or erythroid regulatory proteins . RUNX1 takes part in cell fate process mediating the transition of an endothelial cell into a haematopoietic cell. Evidences in RUNX1 knock out mice showed that primitive erythrocytes displayed a defective morphology, and the size of blast cell population was substantially reduced . At least 39 forms of RUNX1 mutations are implicated in various myeloid malignancies. Chromosomal translocations involving RUNX1 are associated with several types of leukemia including AML . As for MLL in the mTOR pathway previously discussed (see sub-section PPLs in mTOR signaling pathway), single nucleotide polymorphism (SNP), chimerism and translocation may invalidate the standard PPL regulation machinery, causing unexpected misbehaviors.
UGT8 encodes for an enzyme involved in glycosphingolipids synthesis, in particular galactosylceramides (GalCer lipids), which are involved in a variety of cellular processes including differentiation, cell-cell interaction, and transmembrane signaling [58, 59]. It is also noticeable that UGT8 is mainly localized in the endoplasmic reticulum, but not in the Golgi complex, nor in the plasma membrane . The same characteristic applies to the final PPL target, the HLA-DM complex targeted via miR-577, which is also only localized in the endoplasmic reticulum.
Furthermore, previous studies highlighted that an induced dose-dependent inhibition of GalCer expression on the cell surface, after treatment with recombinant gamma-interferon (rIFN-γ), caused reduced viral (HIV- 1) infection by decreasing GalCer synthesis and expression . This may be explained by a certain level of competition between IFN-γ and UGT8, in accordance with the identified PPL. As for RUNX1, also UGT8 is known to have multiple non-synonymous SNPs which could affect structures and/or biological functions of the respective gene products .
miR-802, co-expressed with RUNX, targets multiple pathway genes: IFN-γ, NFY-C, CANX, and the HLA-DM complex. It is worth noticing that among its targets we find NFY-C, which is responsible for the PPLs initiation. CANX is a chaperone protein responsible for protein folding and quality control. It retains unfolded or mis-folded proteins in the endoplasmic reticulum, in order to have only well assembled proteins in the cytoplasm. CANX also controls the folding of the MHC class I alpha chain. This central role in the MHCI synthesis makes it a possible critical target in PPL dysregulation. The HLA-DM protein, another chaperone, finally, is targeted by both miR-802 and miR-577. HLA-DM regulates the peptides that bind to MHCII, and controls/presents the antigen in antigen presenting cells. It plays a central role in the MHCII complex stability by favoring more stable peptide-MHC complexes. Dysregulation of HLA-DM is associated with negative prognosis in breast cancer, since patients with tumors that co-express HLA-DR, Ii and HLA-DM have improved recurrence-free survival as compared with patients with tumors that express HLA-DR and Ii in the absence of HLA-DM , and, accordingly to the discussion of miR-30e, HLA-DM negative patients show a general paucity of infiltrating CD3+, CD4+ and CD8+ T cells . Under expression of HLA-DM is also proven in autoimmune processes in Rheumatoid Arthritis  and Hodgkin Lymphoma .
The discovery of the Pathway Protection Loops is suggesting a level of transcriptional regulation at the pathway level not fully investigated before. Studies conducted on specific miRNAs such as the one published by Barik  confirm the presence of this type of regulatory motif, but a high-level analysis such as the one proposed in this paper is still missing.
The understanding of this and other higher-level regulatory motifs could, for example, lead to new approaches in the identification of therapeutic targets because it could unveil new and “indirect” paths to activate or silence a target pathway.
A lot of work still needs to be done to better uncover this high-level inter-pathway regulation. miRNA are not the only small RNAs that are involved in regulatory mechanisms. For example, ceRNA have been recently identified as miRNA down-regulators . Unfortunately data available on these new mechanisms is still very limited and therefore it is not yet possible to include genome-wide investigations like the one presented in this paper.
Materials and methods
To study the characteristics and properties of PPLs, we designed a software pipeline that, combining pathway data available via PathwayAPI  with the Micronome data extracted from public databanks (e.g., Microrna.org , miRBase , etc.), is able to search for miRNA mediated interactions at the pathway level, thus searching for the existence of PPLs.
The full software pipeline that is available at (http://www.testgroup.polito.it/index.php/bio-menu-tools/item/185-pathway-rotection-loops-finder) has been implemented as a collection of PHP classes, given the need of interfacing our software with several web based sources of information. All collected data have been saved into a unified relational database used for mining information about PPLs.
Pathway data sources
The search for the existence of the PPL motif starts from the analysis of a collection of pathways. Several public and commercial pathway resources currently exist on the web. However, these biological databases are very diverse, making it extremely laborious to carry out even simple queries across databases . To overcome with this limitation, pathway related information have been retrieved through Pathway API . Pathway API is an aggregated database combining and unifying databases from three major sources of information: (1) the WikiPathway database , the (2) Ingenuity database  and the (3) KEGG . One of the main advantages of Pathway API is the normalization of the network nodes that are all consistently translated and named using the corresponding NCBI Gene ID , thus enabling an easy data integration with the other data sources considered in this work.
Micronome and gene interaction data sources integration
In several cases, these databases use different convention for identifying specific entities (e.g., genes). Whenever possible, information from the cited databases have been dumped into a local unified relational database and all entries have been then preprocessed to unify the different identifiers. Working with a local dump of the information also allowed us to speed-up the information retrieval process which requires a massive access to these information sources.
Intragenic miRNA identification
To identify miRNAs co-expressed with the pathway genes we restricted our search to the set of intragenic miRNA. Intragenic miRNA represent around 50% of the mammalian miRNAs [75–79]. Most of these intragenic miRNA are located within introns of protein coding genes (miRNA host genes) and are referred to as intronic miRNA, whereas the remaining miRNAs are overlapping with exons of their host genes and are thus called exonic miRNA. Moreover the majority of intragenic miRNAs are sense strand located while only a very small portion is anti-sense strand located. Our analysis considers intronic and exonic miRNAs both sense and anti-sense strand located.
We assume that intragenic miRNA are in general co-expressed with their related host-genes as supported by previous studies [75, 80–83]. Recently Chunjiang et al.  also suggested that evolutionary conserved intragenic miRNA tend to be co-expressed with their host genes more likely than poorly conserved ones. This consideration could further refine the outcome of our analysis, however at the current stage it has not yet been implemented in our pipeline.
Intragenic miRNAs are retrieved through the miRBase database. miRBase is a searchable database of published miRNA sequences and annotations. About 94.5% of the available mature miRNA sequences considered in this paper have experimental evidence, thus representing a reliable source of information. Each miRNA entry in miRBase is correlated with the related information on the genetic location that is exploited to identify the host genes.
To identify intragenic miRNA of a given host gene we first search for the coordinate of the gene using the e-Utils. Once obtained the gene coordinates we search for all miRNAs with coordinates embedded in the ones of the gene resorting to miRBase.
Intragenic miRNA targets identification
We searched for potential targets of each identified intragenic miRNA resorting to microRNA.org. microRNA.org searches for miRNA targets applying the miRanda algorithm . The miRanda algorithm identifies potential binding sites by looking for high-complementarity regions on the 3′UTRs. The scoring matrix used by the algorithm is built so that complementary bases at the 5′ end of the miRNA are rewarded more than those at the 3′ end. The resulting binding sites are then evaluated thermodynamically, using the Vienna RNA folding package  and each prediction is finally associated with a down-regulation score named mirSVR score . Newer miRanda versions  implement a strict model for the binding sites that requires almost-perfect complementarity in the seed region with only a single wobble pairing, thus increasing the prediction accuracy. Other miRNA target databases such as TargetScan  use different prediction algorithms that aim at filtering many false positives from the beginning of the prediction process. However, the availability of the mirSVR score in microRNA.org provided us an additional degree of freedom to investigate the robustness of our prediction when changing the way microRNA targets are filtered. The second advantage offered by microRNA.org compared to other repositories such as TargetScan is the possibility of downloading the full database in a relational form. Given the amount of queries required by the proposed analysis this was a mandatory requirement to keep the computation time into a reasonable range.
To work with reliable predictions and limit the amount of returned miRNA-gene interactions, during the analysis we restricted our search to the microRNA.org “Good mirSVR score, Conserved miRNA” and “Good mirSVR score, Non-conserved miRNA” with negative mirSVR score lower than -0.3/-0.6. Given the selected intragenic miRNA name, searching for the targets simply requires an SQL query into the microRNA.org database.
Antagonist miRNA identification
Antagonist miRNAs are miRNAs that target one of the genes of the pathway and similarly to the Intragenic miRNA targets can be retrieved through microRNA.org. Given the NCBI GeneID we query the microRNA.org database to identify the set of miRNAs targeting the gene. Query to microRNA.org at this step follows the same filtering rules on the mirSVR score applied for the identification of the intragenic miRNA targets.
Antagonist miRNA host gene identification
The identification of an antagonist miRNA host gene follows an inverted flow compared to the one employed to identify the pathway intragenic miRNAs. For each antagonist miRNA we identify the related coordinates using miRBase, and, given the coordinates, we search into Sanger for a gene whose coordinates embrace the one of the considered miRNA. Genes identified at this step represent potential PAGs.
Antagonist miRNA host gene TF identification
As already mentioned in the introduction of this paper, miRNAs have a post-transcriptional regulation role . Intragenic miRNAs that directly target the PAGs would not actually prevent the production of the related Antagonist miRNAs since miRNAs are expressed during transcription whereas the down-regulatory action is post-transcriptional. However, the expression of miRNAs can be activated or repressed by transcription factors of the related host genes, which therefore can serve as upstream regulators of miRNA . For each antagonist miRNA host gene we therefore search for the related transcription factors. Searching for the transcription factors of the antagonist miRNA host genes is a critical step due to the limited availability of information from public databases that may strongly reduce our ability of identifying PPLs. For this reason we tried to integrate more than one data source in our search using two databases: (1) TargetMine and (2) TFe.
Both TargetMine and TFe provide web services to access the related database. To speed up the analysis all information contained in these two repositories have been downloaded and merged into a single database table containing relations between TFs and related target genes.
To download the information contained in TargetMine, we retrieved from Sanger the full list of NCBI GeneIDs considered in our analysis. For each geneID we then searched for TF targeting the selected gene through the REST service http://targetmine.nibio.go.jp:8080/targetmine/service/template/results?name=Gene_TFSource&constraint1=Gene&op1=LOOKUP&format=xml&&extra1=H.+sapiens&value1=⟨targetgeneid⟩. The resulting xml formatted information has then been processed and integrated in the database.
A similar approach has been applied to download the information provided by TFe. The list of all TFs available in TFe has been downloaded through the REST service http://www.cisreg.ca/cgi-bin/tfe/api.pl?code=all-tfids. For each TF in the list, the list of targets has been computed calling the REST service http://www.cisreg.ca/cgi-bin/tfe/api.pl?code=entrez-gene-id&tfid=⟨TFID⟩. The resulted information has ben finally added to the local database and joined with the ones provided by TargetMine.
With the availability of a local database, searching for TFs targeting a given host gene simply requires to query the related database tables.
This work has been partially supported by Grant No. CUP B15G13000010006 awarded by the Regione Valle d’Aosta for the project: “Open Health Care Network Analysis” and by the Italian Ministry of Education, University & Research (MIUR) (Project PRIN 2010, MIND).
- Alon U: Biological networks: the tinkerer as an engineer. Science. 2003, 301 (5641): 1866-1867. 10.1126/science.1089072.PubMedGoogle Scholar
- AL B ́a, Oltvai ZN: Network biology: understanding the cell’s functional organization. Nat Rev Genet. 2004, 5 (2): 101-113. 10.1038/nrg1272.Google Scholar
- Emmert-Streib F, Glazko GV: Pathway analysis of expression data: deciphering functional building blocks of complex diseases. PLoS Comput Biol. 2011, 7 (5): e1002053-10.1371/journal.pcbi.1002053. doi:10.1371/journal.pcbi.1002053PubMed CentralPubMedGoogle Scholar
- Kelder T, Van Iersel MP, Hanspers K, Kutmon M, Conklin BR, Evelo CT, Pico AR: WikiPathways: building research communities on biological pathways. Nucleic Acids Res. 2012, 40 (Database issue): D1301-D1307.PubMed CentralPubMedGoogle Scholar
- WikiPathways Database. [Online] http://www.wikipathways.org/ 2012
- Ingenuity Systems: Ingenuity Database. [Online] http://www.ingenuity.com 2012
- Kanehisa M, Goto S: KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000, 28: 27-30. 10.1093/nar/28.1.27.PubMed CentralPubMedGoogle Scholar
- Kanehisa Laboratories: KEGG Database. 2012, [Online] http://www.genome.jp/kegg/pathway.htmlGoogle Scholar
- Cheng C, Yan KK, Hwang W, Qian J, Bhardwaj N, Rozowsky J, Lu ZJ, Niu W, Alves P, Kato M, Snyder M, Gerstein M: Construction and analysis of an integrated regulatory network derived from high- throughput sequencing data. PLoS Comput Biol. 2011, 7: e1002190-10.1371/journal.pcbi.1002190.PubMed CentralPubMedGoogle Scholar
- Kaplan S, Bren A, Dekel E, Alon U: The incoherent feed-forward loop can generate non-monotonic input functions for genes. Mol Syst Biol. 2008, 4: 203-PubMed CentralPubMedGoogle Scholar
- Knabe JF, Nehaniv CL, Schilstra MJ: Do motifs reflect evolved function?–No convergent evolution of genetic regulatory network subgraph topologies. BioSystems. 2008, 94: 68-74. 10.1016/j.biosystems.2008.05.012.PubMedGoogle Scholar
- Konagurthu AS, Lesk AM: Single and multiple input modules in regulatory networks. Proteins. 2008, 73: 320-324. 10.1002/prot.22053.PubMedGoogle Scholar
- Maeda YT, Sano M: Regulatory dynamics of synthetic gene networks with positive feedback. J Mol Biol. 2006, 359: 1107-1124. 10.1016/j.jmb.2006.03.064.PubMedGoogle Scholar
- Ma HW, Kumar B, Ditges U, Gunzer F, Buer J, Zeng AP: An extended transcriptional regulatory network of Escherichia coli and analysis of its hierarchical structure and network motifs. Nucleic Acids Res. 2004, 32: 6643-6649. 10.1093/nar/gkh1009.PubMed CentralPubMedGoogle Scholar
- Zaslaver A, Mayo AE, Rosenberg R, Bashkin P, Sberro H, Tsalyuk M, Surette MG, Alon U: Just-in-time transcription program in metabolic pathways. Nat Genet. 2004, 36: 486-491. 10.1038/ng1348.PubMedGoogle Scholar
- Mangan S, Zaslaver A, Alon U: The coherent feedforward loop serves as a sign-sensitive delay element in transcription networks. J Mol Biol. 2003, 334: 197-204. 10.1016/j.jmb.2003.09.049.PubMedGoogle Scholar
- Shen-Orr SS, Milo R, Mangan S, Alon U: Network motifs in the transcriptional regulation network of Escherichia coli. Nat Genet. 2002, 31: 64-68. 10.1038/ng881.PubMedGoogle Scholar
- Kalir S, McClure J, Pabbaraju K, Southward C, Ronen M, Leibler S, Surette MG, Alon U: Ordering genes in a flagella pathway by analysis of expression kinetics from living bacteria. Science. 2001, 292 (5524): 2080-2083. 10.1126/science.1058758.PubMedGoogle Scholar
- Beezhold K, Castranova V, Chen F: Microprocessor of microRNAs: regulation and potential for ther- apeutic intervention. Mol Cancer. 2010, 9: 134-PubMed CentralPubMedGoogle Scholar
- Tu K, Yu H, Hua Y, Li Y, Liu L, Xie L, Li Y: Combinatorial network of primary and secondary microRNA-driven regulatory mechanisms. Nucleic Acids Res. 2009, 37: 5969-5980. 10.1093/nar/gkp638.PubMed CentralPubMedGoogle Scholar
- Yuan X, Liu C, Yang P, He S, Liao Q, Kang S, Zhao Y: Clustered microRNAs’ coordination in regulating protein-protein interaction network. BMC Syst Biol. 2009, 3: 65-10.1186/1752-0509-3-65.PubMed CentralPubMedGoogle Scholar
- Benso A, Di Carlo S, Politano G, Savino A: A new miRNA motif protects pathways’ expression in gene regulatory networks. Proceedings IWBBIO 2013: International Work-Conference on Bioinformatics and Biomedical Engineering. 2013, Granada 18014, Spain: Copicentro Granada S L, AV Andalucia, 38, Granada, 377-384.Google Scholar
- Kim W, Li M, Wang J, Pan Y: Biological network motif detection and evaluation. BMC Syst Biol. 2011, 5 (Suppl 3): S5-10.1186/1752-0509-5-S3-S5.PubMed CentralPubMedGoogle Scholar
- Wang J, Lu M, Qiu C, Cui Q: TransmiR: a transcription factor–microRNA regulation database. Nucleic Acids Res. 2010, 38 (suppl 1): D119-D122.PubMed CentralPubMedGoogle Scholar
- Schotte D, Chau J, Sylvester G, Liu G, Chen C, van der Velden V, Broekhuis M, Peters T, Pieters R, Den Boer M: Identification of new microRNA genes and aberrant microRNA profiles in childhood acute lymphoblastic leukemia. Leukemia. 2008, 23 (2): 313-322.PubMedGoogle Scholar
- Oldford S, Robb J, Codner D, Gadag V, Watson P, Drover S: Tumor cell expression of HLA-DM associates with a Th1 profile and predicts improved survival in breast carcinoma patients. Int Immunol. 2006, 18 (11): 1591-1602. 10.1093/intimm/dxl092.PubMedGoogle Scholar
- Nagasaki M, Saito A, Doi A, Matsuno H, Miyano S: Using cell illustrator and pathway databases. Foundations of Systems Biology. 2009, : Volume 13 of Computational Biology, Springer, 5-18. ISBN 978-1-84882-022-7Google Scholar
- Sanger Institute: Sanger GeneCode database. 2012, [Online] ftp://ftp.sanger.ac.uk/pub/gencode/release_9/genecode.v9.annotation.gtf.gzGoogle Scholar
- Development Core Team: R: A Language and Environment for Statistical Computing. 2010, Vienna, Austria: R Foundation for Statistical Computing, http://www.R-project.org. [ISBN 3-900051-07-0]Google Scholar
- Holm S: A simple sequentially rejective multiple test procedure. Scandinavian Journal of Statistics. 1979, 6 (2): 65-70. doi:10.2307/4615733Google Scholar
- Shirdel EA, Xie W, Mak TW, Jurisica I: NAViGaTing the micronome–using multiple microRNA prediction databases to identify signalling pathway-associated microRNAs. PLoS ONE. 2011, 6 (2): e17429-10.1371/journal.pone.0017429.PubMed CentralPubMedGoogle Scholar
- Kruskal WH, Wallis WA: Use of Ranks in One-Criterion Variance Analysis. J Am Stat Assoc. 1952, 47 (260): 583-621. 10.1080/01621459.1952.10483441. [http://dx.doi.org/10.2307/2280779]Google Scholar
- Hay N, Sonenberg N: Upstream and downstream of mTOR. Genes Dev. 2004, 18 (16): 1926-1945. 10.1101/gad.1212704.PubMedGoogle Scholar
- Tokunaga C, Yoshino K, Yonezawa K: mTOR integrates amino acid and energy-sensing pathways. Biochem Biophys Res Commun. 2004, 313 (2): 443-446. 10.1016/j.bbrc.2003.07.019.PubMedGoogle Scholar
- Easton JB, Houghton PJ: mTOR and cancer therapy. Oncogene. 2006, 25 (48): 6436-6446. 10.1038/sj.onc.1209886.PubMedGoogle Scholar
- Faivre S, Kroemer G, Raymond E: Current development of mTOR inhibitors as anticancer agents. Nat Rev Drug Discov. 2006, 5 (8): 671-688. 10.1038/nrd2062.PubMedGoogle Scholar
- Beevers CS, Li F, Liu L, Huang S: Curcumin inhibits the mammalian target of rapamycin-mediated signaling pathways in cancer cells. Int J Cancer. 2006, 119 (4): 757-764. 10.1002/ijc.21932.PubMedGoogle Scholar
- Frias MA, Thoreen CC, Jaffe JD, Schroder W, Sculley T, Carr SA, Sabatini DM: mSin1 is necessary for Akt/PKB phosphorylation, and its isoforms define three distinct mTORC2s. Curr Biol. 2006, 16 (18): 1865-1870. 10.1016/j.cub.2006.08.001.PubMedGoogle Scholar
- Zhang F, Zhang X, Li M, Chen P, Zhang B, Guo H, Cao W, Wei X, Cao X, Hao X, Zhang N: mTOR complex component Rictor interacts with PKCzeta and regulates cancer cell metastasis. Cancer Res. 2010, 70 (22): 9360-9470. 10.1158/0008-5472.CAN-10-0207.PubMedGoogle Scholar
- Romagnani S: Th1/Th2 cells. Inflamm Bowel Dis. 1999, 5 (4): 285-94. 10.1097/00054725-199911000-00009.PubMedGoogle Scholar
- Mazzarella G, Bianco A, Catena E, De Palma R, Abbate GF: Th1/Th2 lymphocyte polarization in asthma. Allergy. 2000, 55 (61): 6-9.PubMedGoogle Scholar
- Steinke JW, Borish L: Th2 cytokines and asthma. Interleukin-4: its role in the pathogenesis of asthma, and targeting it for asthma treatment with interleukin-4 receptor antagonists. Respir Res. 2001, 2 (2): 66-70. 10.1186/rr40.PubMed CentralPubMedGoogle Scholar
- Zhang XL, Komada Y, Chipeta J, Li QS, Inaba H, Azuma E, Yamamoto H, Sakurai M: Intracellular cytokine profile of T cells from children with acute lymphoblastic leukemia. Cancer Immunol Immunother. 2000, 49 (3): 165-172. 10.1007/s002620050616.PubMedGoogle Scholar
- Skinnider BF, Mak TW: The role of cytokines in classical Hodgkin lymphoma. Blood. 2002, 99 (12): 4283-4297. 10.1182/blood-2002-01-0099.PubMedGoogle Scholar
- Schotte D, Lange-Turenhout E, Stumpel D, Stam R, Buijs-Gladdines J, Meijerink J, Pieters R, Den Boer M: Expression of miR-196b is not exclusively MLL-driven but is especially linked to activation of HOXA genes in pediatric acute lymphoblastic leukemia. Haematologica. 2010, 95 (10): 1675-1682. 10.3324/haematol.2010.023481.PubMed CentralPubMedGoogle Scholar
- Buske C, Humphries RK: Homeobox genes in leukemogenesis. Int J Hematol. 2000, 71 (4): 301-308.PubMedGoogle Scholar
- Popovic R, Riesbeck LE, Velu CS, Chaubey A, Zhang J, Achille NJ, Erfurth FE, Eaton K, Lu J, Grimes HL, Chen J, Rowley JD, Zeleznik-Le NJ: Regulation of mir-196b by MLL and its overexpression by MLL fusions contributes to immortalization. Blood. 2009, 113 (14): 3314-3322. 10.1182/blood-2008-04-154310.PubMed CentralPubMedGoogle Scholar
- Mach B, Steimle V, Martinez-Soria E, Reith W: Regulation of MHC class II genes: lessons from a disease. Annu Rev Immunol. 1996, 14: 301-331. 10.1146/annurev.immunol.14.1.301.PubMedGoogle Scholar
- Bolognese F, Wasner M, Dohna CL, Gurtner A, Ronchi A, Muller H, Manni I, Mossner J, Piaggio G, Mantovani R, Engeland K: The cyclin B2 promoter depends on NF-Y, a trimer whose CCAAT-binding activity is cell-cycle regulated. Oncogene. 1999, 18 (10): 1845-1853. 10.1038/sj.onc.1202494.PubMedGoogle Scholar
- Mantovani R: The molecular biology of the CCAAT-binding factor NF-Y. Gene. 1999, 239: 15-27. 10.1016/S0378-1119(99)00368-6.PubMedGoogle Scholar
- Joglekar M, Patil D, Joglekar V, Rao G, Reddy D, Mitnala S, Shouche Y, Hardikar A: The miR-30 family microRNAs confer epithelial phenotype to human pancreatic cells. Islets. 2009, 1 (2): 137-147. 10.4161/isl.1.2.9578.PubMedGoogle Scholar
- Wu F, Zhu S, Ding Y, Beck W, Mo Y: MicroRNA-mediated regulation of Ubc9 expression in cancer cells. Clin Cancer Res. 2009, 15 (5): 1550-1557. 10.1158/1078-0432.CCR-08-0820.PubMed CentralPubMedGoogle Scholar
- Bosshart H, Jarrett R: Deficient major histocompatibility complex class II antigen presentation in a subset of Hodgkin’s disease tumor cells. Blood. 1998, 92 (7): 2252-2259.PubMedGoogle Scholar
- Katze M, He Y, Gale M: Viruses and interferon: a fight for supremacy. Nat Rev Immunol. 2002, 2 (9): 675-687. 10.1038/nri888.PubMedGoogle Scholar
- Okuda T, Nishimura M, Nakao M, Fujitaa Y: RUNX1/AML1: a central player in hematopoiesis. Int J Hematol. 2001, 74 (3): 252-257. 10.1007/BF02982057.PubMedGoogle Scholar
- Wang Q, Stacy T, Binder M, Marin-Padilla M, Sharpe A, Speck N: Disruption of the Cbfa2 gene causes necrosis and hemorrhaging in the central nervous system and blocks definitive hematopoiesis. Proc Natl Acad Sci. 1996, 93 (8): 3444-3449. 10.1073/pnas.93.8.3444.PubMed CentralPubMedGoogle Scholar
- Asou N: The role of a Runt domain transcription factor AML1/RUNX1 in leukemogenesis and its clinical implications. Crit Rev Oncol Hematol. 2003, 45 (2): 129-150. 10.1016/S1040-8428(02)00003-3.PubMedGoogle Scholar
- Varki A: Biological roles of oligosaccharides: all of the theories are correct. Glycobiology. 1993, 3 (2): 97-130. 10.1093/glycob/3.2.97.PubMedGoogle Scholar
- Zeller C, Marchase R: Gangliosides as modulators of cell function. American Journal of Physiology-Cell Physiology. 1992, 262 (6): C1341-C1355.Google Scholar
- Sprong H, Kruithof B, Leijendekker R, Slot J, Van Meer G, van der Sluijs P: UDP-galactose: ceramide galactosyltransferase is a class I integral membrane protein of the endoplasmic reticulum. J Biol Chem. 1998, 273 (40): 25880-25888. 10.1074/jbc.273.40.25880.PubMedGoogle Scholar
- Yahi N, Spitalnik S, Stefano K, De Micco P, Gonzalez-Scarano F, Fantini J: Interferon-[gamma] decreases cell surface expression of galactosyl ceramide, the receptor for HIV-1 GP120 on human colonic epithelial cells. Virology. 1994, 204 (2): 550-557. 10.1006/viro.1994.1568.PubMedGoogle Scholar
- Iida A, Saito S, Sekine A, Mishima C, Kitamura Y, Kondo K, Harigae S, Osawa S, Nakamura Y: Catalog of 86 single-nucleotide polymorphisms (SNPs) in three uridine diphosphate glycosyltransferase genes: UGT2A1, UGT2B15, and UGT8. J Hum Genet. 2002, 47 (10): 505-510. 10.1007/s100380200075.PubMedGoogle Scholar
- Xu C: Tumor cell survival strategies in Hodgkin lymphoma. 2010, The Netherland’s: PhD thesis, Rijksuniversiteit GroningenGoogle Scholar
- Barik S: An intronic microRNA silences genes that are functionally antagonistic to its host gene. Nucleic Acids Res. 2008, 36 (16): 5232-5241. 10.1093/nar/gkn513.PubMed CentralPubMedGoogle Scholar
- Salmena L, Poliseno L, Tay Y, Kats L, Pandolfi P: A ceRNA hypothesis: the Rosetta Stone of a hidden RNA language?. Cell. 2011, 146 (3): 353-358. 10.1016/j.cell.2011.07.014.PubMed CentralPubMedGoogle Scholar
- Soh D, Dong D, Yike G, Wong L: PathwayAPI. 2012, [Online] http://www.pathwayapi.com/Google Scholar
- Memorial Sloan-Kettering Cancer Center: microRNA.org - Targets and Expression Database. [Online] http://www.microrna.org/ 2012Google Scholar
- Kozomara A, Griffiths-Jones S: miRBase: integrating microRNA annotation and deep-sequencing data. Nucleic Acids Res. 2011, 39 (suppl 1): D152-D157. [http://nar.oxfordjournals.org/content/39/suppl_1/D152.abstract]PubMed CentralPubMedGoogle Scholar
- Soh D, Dong D, Guo Y, Wong L: Consistency, comprehensiveness, and compatibility of pathway databases. BMC Bioinforma. 2010, 11: 449-10.1186/1471-2105-11-449. http://www.biomedcentral.com/1471-2105/11/449,Google Scholar
- NCBI Gene. [Online] http://www.ncbi.nlm.nih.gov/ gene 2012
- Sayers E: E-utilities Quick Start. 2010, Bethesda (MD): National Center for Biotechnology Information (US)Google Scholar
- Betel D, Wilson M, Gabow A, Marks DS, Sander C: The microRNA.org resource: targets and expression. Nucleic Acids Res. 2008, 36 (Database issue): D149-D153.PubMed CentralPubMedGoogle Scholar
- Chen YA, Tripathi LP, Mizuguchi K: TargetMine, an integrated data warehouse for candidate gene prioritisation and target discovery. PLoS ONE. 2011, 6 (3): e17844-10.1371/journal.pone.0017844. [http://dx.doi.org/10.1371%2Fjournal. pone.0017844]PubMed CentralPubMedGoogle Scholar
- Wasserman Lab: Transcription Factor Encyclopedia (TFe). 2012, http://www.cisreg.ca/tfe,Google Scholar
- Rodriguez A, Griffiths-Jones S, Ashurst J, Bradley A: Identification of mammalian microRNA host genes and transcription units. Genome Res. 2004, 14 (10a): 1902-1910. 10.1101/gr.2722704.PubMed CentralPubMedGoogle Scholar
- Ozsolak F, Poling L, Wang Z, Liu H, Liu X, Roeder R, Zhang X, Song J, Fisher D: Chromatin structure analyses identify miRNA promoters. Genes Dev. 2008, 22 (22): 3172-3183. 10.1101/gad.1706508.PubMed CentralPubMedGoogle Scholar
- Monteys A, Spengler R, Wan J, Tecedor L, Lennox K, Xing Y, Davidson B: Structure and activity of putative intronic miRNA promoters. Rna. 2010, 16 (3): 495-505. 10.1261/rna.1731910.PubMed CentralPubMedGoogle Scholar
- Saini H, Enright A, Griffiths-Jones S: Annotation of mammalian primary microRNAs. BMC Genomics. 2008, 9: 564-10.1186/1471-2164-9-564.PubMed CentralPubMedGoogle Scholar
- Lin SL, Miller JD, Ying SY: Intronic microRNA (miRNA). J Biomed Biotechnol. 2006, 2006 (4): 26818-PubMed CentralPubMedGoogle Scholar
- Baskerville S, Bartel D: Microarray profiling of microRNAs reveals frequent coexpression with neighboring miRNAs and host genes. Rna. 2005, 11 (3): 241-247. 10.1261/rna.7240905.PubMed CentralPubMedGoogle Scholar
- Shomron N, Levy C: MicroRNA-biogenesis and pre-mRNA splicing crosstalk. J Biomed Biotechnol. 2009, 2009 (594678): http://dx.doi.org/10.1155/2009/594678,Google Scholar
- Smalheiser N, et al: EST analyses predict the existence of a population of chimeric microRNA precursor-mRNA transcripts expressed in normal human and mouse tissues. Genome Biol. 2003, 4 (7): 403-10.1186/gb-2003-4-7-403.PubMed CentralPubMedGoogle Scholar
- Kim Y, Kim V: Processing of intronic microRNAs. EMBO J. 2007, 26 (3): 775-783. 10.1038/sj.emboj.7601512.PubMed CentralPubMedGoogle Scholar
- He C, Li Z, Chen P, Huang H, Hurst L, Chen J: Young intragenic miRNAs are less coexpressed with host genes than old ones: implications of miRNA–host gene coevolution. Nucleic Acids Res. 2012, 40 (9): 4002-12. 10.1093/nar/gkr1312.PubMed CentralPubMedGoogle Scholar
- Enright A, John B, Gaul U, Tuschl T, Sander C, Marks D: MicroRNA targets in Drosophila. Genome Biol. 2003, 5 (1): R1-10.1186/gb-2003-5-1-r1.PubMed CentralPubMedGoogle Scholar
- Wuchty S, Fontana W, Hofacker I, Schuster P, et al: Complete suboptimal folding of RNA and the stability of secondary structures. Biopolymers. 1999, 49 (2): 145-165. 10.1002/(SICI)1097-0282(199902)49:2<145::AID-BIP4>3.0.CO;2-G.PubMedGoogle Scholar
- Betel D, Koppal A, Agius P, Sander C, Leslie C: Comprehensive modeling of microRNA targets predicts functional non-conserved and non-canonical sites. Genome Biol. 2010, 11 (8): R90-10.1186/gb-2010-11-8-r90.PubMed CentralPubMedGoogle Scholar
- John B, Enright A, Aravin A, Tuschl T, Sander C, Marks D: Human microRNA targets. PLoS Biol. 2004, 2 (11): e363-10.1371/journal.pbio.0020363.PubMed CentralPubMedGoogle Scholar
- Lewis B, Shih I, Jones-Rhoades M, Bartel D, Burge C, et al: Prediction of mammalian microRNA targets. Cell. 2003, 115 (7): 787-798. 10.1016/S0092-8674(03)01018-3.PubMedGoogle Scholar
- Filipowicz W, Bhattacharyya S, Sonenberg N: Mechanisms of post-transcriptional regulation by microRNAs: are the answers in sight?. Nat Rev Genet. 2008, 9 (2): 102-114.PubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.