® Open Access Full Text Article

ORIGINAL RESEARCH

Network analysis reveals potential markers for pediatric adrenocortical carcinoma

This article was published in the following Dove Press journal: Onco Targets and Therapy 26 July 2016 Number of times this article has been viewed

Anurag Kulshrestha1 Shikha Suman2 Rakesh Ranjan’

‘Bioinformatics Division, National Bureau of Animal Genetic Resources, Karnal, 2Division of Applied Sciences, Indian Institute of Information Technology, Allahabad, India

Abstract: Pediatric adrenocortical carcinoma (ACC) is a rare malignancy with a poor outcome. Molecular mechanisms of pediatric ACC oncogenesis and advancement are not well understood. Accurate and timely diagnosis of the disease requires identification of new markers for pediatric ACC. Differentially expressed genes (DEGs) were identified from the gene expression profile of pediatric ACC and obtained from Gene Expression Omnibus. Gene Ontology functional and pathway enrichment analysis was implemented to recognize the functions of DEGs. A protein- protein interaction (PPI) and gene-gene functional interaction (GGI) network of DEGs was constructed. Hub gene detection and enrichment analysis of functional modules were performed. Furthermore, a gene regulatory network incorporating DEGs-microRNAs-transcription factors was constructed and analyzed. A total of 431 DEGs including 228 upregulated and 203 downregu- lated DEGs were screened. These genes were largely involved in cell cycle, steroid biosynthesis, and p53 signaling pathways. Upregulated genes, CDK1, CCNB1, CDC20, and BUB1B, were identified as the common hubs of PPI and GGI networks. All the four common hub genes were also part of modules of the PPI network. Moreover, all the four genes were also present in the largest module of GGI network. A gene regulatory network consisting of 82 microRNAs and 100 transcription factors was also constructed. CDK1, CCNB1, CDC20, and BUBIB may serve as potential biomarker of pediatric ACC and as potential targets for therapeutic approach, although experimental studies are required to authenticate our findings.

Keywords: gene expression profiling, protein-protein interaction network, network module, gene-gene functional interaction network

Introduction

Adrenocortical tumor (ACT) is an aberrant and highly aggressive malignancy originating from the adrenal cortex. It is accountable for~0.2% of all childhood cancers. The majority of pediatric ACTs are functional as compared to adult ACTs, which are mostly nonfunctional.1 Girls are more frequently affected with pediatric adrenocortical carcinoma (ACC) than boys.2 Increased production of androgens, aldosterone, corti- costeroids, and estrogen with ~80% showing virilization are the symptoms associated with pediatric ACC.3 Like most pediatric embryonal tumors, a good result demands early detection and complete surgical resection.4 Traditional chemotherapeutic agents have shown little value in treating children with ACC. Long-term problems associ- ated with mitotane plus EDP (etoposide, doxorubicin, and cisplatin) treatment are a troublesome issue for children with ACC. Moreover, leukemogenesis may develop on treatment with topoisomerase inhibitors such as etoposide and doxorubicin.3 Surgery and exhaustive chemotherapy have shown poor outcomes in children with locally advanced or metastasis disease.4

Correspondence: Anurag Kulshrestha Bioinformatics Division, National Bureau of Animal Genetic Resources, Karnal, Haryana 132001, India Tel +91 896 029 8856 Email anurag.kulshrestha23@gmail.com

f

D

CC

BY NC

Pediatric ACC is commonly reported in families with Li-Fraumeni syndrome, which are generally related with TP53 tumor-suppressor germ line mutations5 or inherent genetic and/or epigenetic changes affecting chromosome 11p15 (Beckwith-Wiedemann syndrome).6 Although the elements advancing to occasional pediatric ACCs are not established, their similarity to cases with an inherent sus- ceptibility indicates a common method of tumorigenesis. Size of the tumor and verification of the tumor after surgery form the basis of staging of pediatric ACC.7 Early-stage tumor tissue can be accessed through the distinguishable clinical symptoms of ACC such as Cushing syndrome and virilization. Embryonal tumors share the epidemiological and molecular characteristics with ACC.4

Molecular studies differentiating pediatric ACC from age-matched normal adrenals have been established from gene expression profiling. Rarity of this tumor has been a problem in identifying the potential markers. Increased levels of insulin-like growth factor 2 (IGF-2) are found in 90% of pediatric ACC due to genetic or epigenetic changes in chromosome 11p15.8 KCNQ1 (potassium channel, volt- age gated KQT-like subfamily Q, member 1) and CDKN1C (cyclin-dependent kinase inhibitor 1C) are among the most strongly downregulated genes in pediatric ACC. Genes associated with mitogen-activated kinase and growth factor receptor pathways are impaired in pediatric ACC, indicat- ing their plausible utility as therapeutic targets. HSD3B2, a steroidogenic enzyme, and its transcriptional regulators NR4A1 (nuclear receptor subfamily 4, group A, member 1) and NR4A2 (nuclear receptor subfamily 4, group A, member 2) are downregulated in pediatric ACC.2 Another highly downregulated gene in pediatric ACC is NOV (nephroblas- toma overexpressed), which encodes a multimodular protein that has proapoptotic function on adrenocortical cancer cells.9 MicroRNA (miRNA) expression profiling of pediatric ACC revealed downregulation of miR-99a and miR-100, which in turn downregulates the expression of insulin-like growth factor 1 receptor (IGF-1R), mechanistic target of rapamycin (mTOR), and regulatory associated protein of MTOR, com- plex 1 (RPTOR) in adrenocortical cell lines.10

A listing of differentially expressed genes (DEGs) is provided through gene expression analysis. Protein-protein interactions (PPIs) utilize known relationships among the protein molecules. Analyzing the PPI network recounts the importance of these interactions in relation to signal trans- duction and biochemistry. In a particular biological context, all proteins interact with other proteins to perform particular functions.11 Substantial amount of attention has been given to coherent analysis of microarray gene expression data

with PPI networks in recent years.12,13 Potential biomarker identification, elucidation of protein function and protein interaction, functional module identification, and drug target identification are some of the applications of analyzing PPI networks.14,15

This study focuses on analyzing the gene expression profile of children with ACC, based on the understanding of interaction networks utilizing a system biology approach. To obtain more knowledge from gene expression profiles, analysis should transcend identification of the affected genes leading to the proteins underlying the inflated biological phenotypes.

Materials and methods

A bioinformatics approach with myriad of computational tools, software, and databases was utilized for shedding light on the underlying mechanisms of pediatric ACC. The entire workflow representing the procedure employed for the study is shown in Figure 1.

Raw biological data

The raw DNA microarray data were obtained from Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo) for healthy children and children suffering from ACC. The chip dataset GSE7541516 included samples from healthy children, children with adenomas, and children with ACC. Eighteen ACC and seven normal adrenal samples were extracted from the dataset. Studying the gene expression enables to identify potential biomarkers for early detection of ACC. Gene expression profiling was performed using Affymetrix Human Genome U133A chips. (Affymetrix, Inc., Santa Clara, CA, USA)

Data preprocessing and normalization

DNA microarray analysis begins with preprocessing and normalization of raw biological data. This process removes noise from the biological data and ensures its integrity. Back- ground correction of probe data, normalization, and summa- rization were executed by robust multi-array average analysis algorithm17 in affy package of R.18 Raw intensity and normal- ized intensity box plot were also generated for the analysis.

Elucidation of DEGs

Normalization of data was followed by analysis of differ- ential expression by Linear Models for MicroArray data package of R.19 Delineating parameters such as adjusted P-value, false discovery rate (FDR), and fold change were utilized for filtering of DEGs between healthy and diseased states. To reduce error from multiple hypothesis testing,20

Figure 1 Workflow utilized for identification of potential markers of pediatric ACC. Abbreviations: ACC, adrenocortical carcinoma; DEGs, differentially expressed genes; GGI, gene-gene functional interaction; miRNA, microRNA; PPI, protein-protein interaction; TFs, transcription factors.

Gene expression data

DEGS

Function and pathway enrichment

PPI network

GGI network

TFs and miRNAs

Hub identification

Module extraction

Regulatory network

Identification of putative markers of pediatric ACC

Benjamini-Hochberg method21 was employed to obtain adjusted P-values. Genes were further screened on account of adjusted P-value <0.05 and fold change >2.

Functional enrichment of DEGs

Discerning the implication of DEGs in ACC, biological attributes of DEGs such as biological processes, molecular functions, and cellular components were extracted from Gene Ontology (GO)22 enrichment analysis. Database for Annota- tion, Visualization, and Integrated Discovery,23 WEB-based Gene SeT Analysis Toolkit (WebGestalt),24 and Funrich tools25 were surveyed for GO and pathway enrichment of DEGs. Hypergeometric test was utilized to assess the func- tional enrichment against predefined functional categories through large genomic, proteomic, and genetic datasets with P-value <0.05 and gene count >2.

PPI network construction

The recent version of STRING v1026 database was employed for construction of PPI network of proteins encoded by DEGs. STRING is a database of known and predicted PPIs. The interactions include direct (physical) and indirect (functional) associations. The interactions are procured from genomic context, high-throughput experiments, coexpres- sion, and previous knowledge. Three PPI networks were constructed by mapping upregulated DEGs, downregulated DEGs, and total DEGs, respectively, to STRING database

with confidence score >0.4. PPI networks were visualized and analyzed in Cytoscape27 software, which furnishes diverse plugins for multiple analyses. Cytoscape represents PPI networks as graphs with nodes illustrating proteins and edges depicting associated interactions. Hub protein nodes of the PPI network with connectivity degree >10 were identified.

Network topology analysis

Network Analyzer28 was employed to analyze the topological parameters of networks. Architecture of complex networks was examined with topological parameters such as clustering coefficient, centralization, density, and network diameter. Undirected edges were considered for all the networks. The number of directly connected neighbors of a node in a network was termed as degree of a node. Node degree distri- bution P(k) is termed as the number of nodes with a degree k for k=0, 1, 2, … Power law of distribution of node degrees, one of the most crucial network topological characteristics, was analyzed. A line can be fitted on the node degree dis- tribution data to visualize the pattern of their dependencies. Network Analyzer uses the least squares method and only the points with positive coordinate values are considered for fit. R-squared value (R2), also known as the coefficient of determination, gives the proportion of variability in the dataset. When R2 value is close to 1, the fit is considered to be good. Also, other network parameters were analyzed.

Module identification and enrichment

analysis

Module identification is imperative as two interacting proteins have a higher probability of sharing the same function than two proteins not interacting. Molecular Complex Detection (MCODE)29 was used to find local dense subnetworks or modular clusters through vertex weighing by local neighborhood density and outward traversal from a locally dense protein node to isolate dense regions in PPI network. Module identification criteria included degree cutoff of 2, node score cutoff of 0.2, k-core of 2, and maximum depth of 100. Significant mod- ules were identified with MCODE score ≥4 and nodes ≥6. Biological significance of these predicted modules was inferred by BiNGO30 plugin of Cytoscape. GO enrichment was per- formed with P-value <0.05 based upon hypergeometric test and corrected by Benjamini and Hochberg FDR. GO Slim is utilized by BINGO for functional annotation.

Construction of gene-gene functional interaction network

Gene-gene interaction network incorporating up- and downregulated DEGs was constructed for identifying the functional interactions between DEGs. DEGs’ list was fur- nished to GeneMANIA31 that incorporates large functional association data such as coexpression data, colocalization data, physical interactions, shared protein domains, path- way, and genetic interactions. Twenty additional genes were added to the interaction network based on GO term (biological process)-based weighting and Homo sapiens as the species to identify genes that may have been missed

A

Expression values

10

12

14

Normal 1

Normal 2

Normal 3

Normal 4

Normal 5

Normal 6

Normal 7

Cancer 1

Cancer 2

Cancer 3

Cancer 4

Cancer 5

Cancer 6

Cancer 7

Cancer 8

Cancer 9

Cancer 10

Cancer 11

Cancer 12

Cancer 13

Cancer 14

Cancer 15

Cancer 16

Cancer 17

Cancer 18

(B) after normalization.

during the screening process. Hub genes were identified with connectivity degree >10. MCODE was employed for iden- tification of modules in the gene-gene functional interaction (GGI) network. BINGO and Gene Set Enrichment Analysis32 were utilized to identify the GO terms and pathways associ- ated with modules with FDR q-value below 0.05.

Construction of transcription factor-miRNA regulatory network Genes must interact with and respond to an organism’s environment, as they independently cannot control the organ- ism by themselves. Transcription factors (TFs) and miRNA regulate the gene expression at transcriptional and posttran- scriptional levels. Information pertinent to TFs, miRNAs, and their respective target genes may help to better understand the intrinsic processes of pediatric ACC. Molecular Signatures Database v5.1 (MSigDB)32 was explored for the identifica- tion of TFs and miRNAs associated with DEGs with FDR q-value below 0.05. MSigDB incorporates annotated gene sets derived from a large variety of resources. A gene regu- latory network incorporating DEGs, TFs, and miRNAs was constructed in Cytoscape.

Results

The comprehensive study focused on identifying and ana- lyzing the genes, proteins, and probable patterns that are expressed in children with ACC, as compared to normal children. The ACC dataset was exposed to preprocessing and normalization in order to remove noise by robust multiaver- age analysis algorithm (Figure 2).

B

10

12

Normal 1

Normal 2

Normal 3

Normal 4

Normal 5

Normal 6

Normal 7

Cancer 1

Cancer 2

Cancer 3

Cancer 4 1

Cancer 5

Cancer 6

Cancer 7

Cancer 8

Cancer 9

Cancer 10

Cancer 11

Cancer 12

Cancer 13

Cancer 14

Cancer 15

Cancer 16

Cancer 17

Cancer 18

Notes: (A) Raw intensity box plot. (B) Normalized intensity box plot. Bars represent interquartile range. Red in (A) represents dataset before normalization and blue

Figure 2 Preprocessing and normalization of expression data.

0

CO

1

-

T

F

1

P

M

F

-

1

4

L

T

-

T

F

4

1

F

4

-

-

M

1

-4

F

H

F

4

1

-

1

4

-

T

-

4

1

4

1

1

1

-

1

P

-

4

1

4

1

T

Expression values

N

4

6

Co


-

0:00

1

1

DO

0

1

0

1

0

1

00

F

0

F

00

1

1

1

O

-

P

1

0

1

1

1

D

7

1

000

1

1

1

F

C

I

Identification of DEGs

The normalized data were subjected to analysis to reveal genes with altered expression profiles between healthy and diseased datasets. A total of 431 DEGs were obtained through a threshold of adjusted P-value <0.05 and a fold change >2. Among the total DEGs, 228 were upregulated and 203 were downregulated genes.

Functional enrichment of DEGs

Biological significance of DEGs was established by enriching the GO functions such as biological processes, cellular com- ponents, and molecular functions. Among the total DEGs, both upregulated and downregulated DEGs were largely involved in metabolic process and protein binding. Most of the upregulated DEGs were present on the nucleus, while the downregulated DEGs existed on the membrane (Figure 3).

Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis revealed that the upregulated DEGs were mostly enriched in cell cycle, steroid biosynthesis, and p53 signaling pathway, while the downregulated DEGs were mainly involved in complement and coagulation cas- cades (Table 1). Funrich enrichment analysis of the biological pathways associated with DEGs is shown in Figure 4.

PPI network construction

STRING furnishes original reliable protein data for conse- quent analysis. Upregulated, downregulated, and total DEGs were mapped to generate three PPI networks. A PPI network was formed with upregulated DEGs containing 194 nodes and 1,122 edges, as these DEGs had literature related to interacting proteins. Moreover, a PPI network was formed with 130 nodes and 262 edges with the available literature. The total DEG PPI network was formed with 366 nodes and 1,858 edges (Figure 5A).

Hub genes of the total DEGs PPI network were identified as glyceraldehyde-3-phosphate dehydrogenase (GAPDH), cyclin-dependent kinase 1 (CDK1), topoisomerase (DNA) II alpha (TOP2A), cyclin B1 (CCNB1), and ATP citrate lyase (ACLY). Top 20 hub genes of the overall PPI network are shown in Table 2.

Network topological analysis

PPI networks or biological networks are notably different from random networks on the basis of differentiable topo- logical characteristics. The most important indicator is the power law of node degree distribution. The power law of degree distribution was followed with an R2=0.749, 0.859,

Figure 3 Gene ontology functional analyses of DEGs. Notes: Gene Ontology category (A) biological process, (B) molecular function, and (C) cellular component. Blue denotes upregulated DEGs and red depicts downregulated DEGs. Abbreviation: DEGs, differentially expressed genes.

A

Number of genes

Biological process

B

180

Number of genes

Molecular function

160

160

140

140

120

120

100

100

80

80

60

60

40

40

20

20

0

Metabolic process

Biological regulation

Response to stimulus

Cellular component organization

Cell communication

Multicellular organismal process

Developmental process

Localization

Reproduction

Cell profileration

Death

Growth

Multiorganism process

Unclassified

0

Protein binding

Lon binding

Nucleotide binding

Transferase activity

Hydrolase activity

Nucleic acid binding

Enzyme regulator activity

Transporter activity

Structural molecule activity

Molecular transducer activity

Electron carrier activity

Carbohydrate binding

Lipid binding

Molecular adaptor activity

Chromatin binding

Antioxidant activity

Unclassified

C

Number of genes

Cellular component

120

100

80

60

40

20

0

Nucleus

Membrane

Membrane-enclosed lumen

Macromolecular complex

Cytosol

Endomembrane system

Cytoskeleton

Endoplasmic reticulum

Mitochondrion

Chromosome

Envelope

Extracellular space

Vesicle

Extracellular matrix

Cell projection

Golgi apparatus

Lipid particle

Vacuole

Endosome

Microbody

Ribosome

Unclassified

Table 1 KEGG pathway enrichment analysis of DEGs (top five in each)
CategoryTermCategoryCountAdjusted P-value
Upregulated
KEGGhsa04110Cell cycle202.70E-09
KEGGhsa00100Steroid biosynthesis60.0016
KEGGhsa04115p53 signaling pathway90.0051
KEGGhsa04114Oocyte meiosis110.0049
KEGGhsa00900Terpenoid backbone biosynthesis40.0961
Downregulated
KEGGhsa04610Complement and coagulation cascades90.0012
KEGGhsa02010ABC transporters40.8510

Abbreviations: DEGs, differentially expressed genes; KEGG, Kyoto Encyclopedia of Genes and Genomes.

and 0.836 for upregulated, downregulated, and total DEGs, respectively. This implies that all the PPI subnetworks were scale-free, a major attribute of complex biological networks.33 Various parameters of the PPI networks such as clustering coefficient, network centralization, and network density are shown in Table 3.

Module identification and enrichment analysis

The overall PPI network of the DEGs was surveyed for identification of functional modules in the network. Five modules were identified in the PPI network with MCODE score ≥4 and nodes ≥6: module P-A with MCODE score of 8.625 (nodes =17), module P-B with MCODE score of 6.857 (nodes =22), module P-C with MCODE score of 5.727 (nodes =23) (Figure 5B), module P-D with MCODE

score of 5.2 (nodes =16) (Figure 5C), and module P-E with MCODE score of 5 (nodes =19). Hub genes, namely, CDK1, CCNB1, and cell division cycle 20 (CDC20), were present in module P-D, and BUB1 mitotic checkpoint serine/threonine kinase B (BUB1B) was present in module P-C. Modules P-C and P-D were scrutinized further for function and pathway enrichment analysis.

GO functional enrichment of the modules revealed that module P-C was enriched in sterol and steroid metabolic process and that module P-D was enriched in cell cycle pro- cess and cell cycle phase (Table 4). Also, KEGG pathway enrichment analysis revealed that genes in module P-C were enriched in cell cycle and p53 signaling pathway and that genes in module P-D were significantly enriched in aminoacyl-tRNA biosynthesis and cell cycle pathways (Table 5).

Figure 4 Pathway enrichment analyses of (A) upregulated and (B) downregulated DEGs. Abbreviations: DEGs, differentially expressed genes; IGF, insulin-like growth factor; IGFBPs, insulin-like growth factor binding proteins.

A

-Log10 (P-value)

B

-Log10 (P-value)

0 2 4 6 8 10 12 14 16

0

1

2

3

4

5

Cholesterol biosynthesis I

5.5%

P<0.001

B1 integrin cell surface interactions

37.9%

P<0.001

Biological pathway

Cholesterol biosynthesis II (via 24,25-dihydrolanosterol)

5.5%

Biological pathway

B2 integrin cell surface interactions

4.9%

P<0.001

P<0.001

Mitotic G1-G1/S phases

13.8%

3.9%

P<0.001

Initial triggering of complement

P<0.001

Cholesterol biosynthesis

6.9%

Regulation of IGF Activity by IGFBPs

3.9%

P<0.001

P<0.001

Superpathway of cholesterol biosynthesis

8.3%

Complement cascade

4.9%

P<0.001

P<0.001

Cell cycle, mitotic

24.1%

41.7%

P<0.001

Integrin family cell surface interactions

P<0.001

0

5

10

15

20

25

0

10

20

30

40

50

Percentage of genes

Percentage of genes

Percentage of genes

P=0.05 reference

P-value

Dovepress

Figure 5 PPI network of pediatric ACC along with the modules. Notes: (A) Overall PPI network of the DEGs. Upregulated genes are depicted in blue (circle) and downregulated genes in red (triangle). (B) Module with MCODE score 5.2 and (C) module with MCODE score 5.727. Abbreviations: ACC, pediatric adrenocortical carcinoma; DEGs, differentially expressed genes; MCODE, molecular complex detection; PPI, protein-protein interaction.

A

B

DNAJB1

BARD 1

TUBB3

HSPA4

SKP2

CCK1

SIRT1

CCNB1

DICER1

DNMT1

CDC20

CCNB2

GADD45A

Z

VIN

TACC3

MI

IP

C

NCAPG

CONE1 SMARCA4

CEP55

BUB1

IARS

DDX39A

BUB1B

TOP2A

MELK

TARS

MARS

TCP1

TXN

EPRS

ENO1

HSD3B2

C14 orf

CAT

HSPA5

LOS

SR

F1

DUR

Dovepress

Table 2 Top 20 hubs of overall PPI network
IDDegreeIDDegreeIDDegreeIDDegree
GAPDH88CDC2051HSPA538RRM233
CDK176AURKA51HDAC137HSPA433
TOP2A74CCNB249EGR136SMARCA432
CCNB160FOS49BUB1B35MMP232
ACLY52BUB140CAT34MCM531

Abbreviations: DEGs, differentially expressed genes; PPI, protein-protein interaction.

Table 3 Topological parameters of PPI networks
PPI networkNumber of nodesNumber of edgesR2CorrelationClustering coefficientNetwork centralizationNetwork densityNetwork diameter
Overall DEGs3661,8580.8360.8800.3040.2140.0289
Upregulated1941,1220.7490.8490.4040.2800.0608
Downregulated1302620.8590.9960.2130.2040.0319

Abbreviations: DEGs, differentially expressed genes; PPI, protein-protein interaction.

Table 4 GO enrichment analysis of modules of PPI network (top five in each)
CategoryClassDescriptionGene countAdjusted P-value
Module P-C
BP16,125Sterol metabolic process56.3681E-05
BP8,202Steroid metabolic process66.3681E-05
BP43,038Amino acid activation46.3681E-05
BP43,039tRNA aminoacylation46.3681E-05
BP6,418tRNA aminoacylation for protein translation46.3681E-05
Module P-D
BP22,402Cell cycle process91.6615E-06
BP22,403Cell cycle phase82.3258E-06
BP7,047Cell cycle98.3642E-06
BP278Mitotic cell cycle71.3412E-05
BP6,996Organelle organization103.8684E-05

Abbreviations: BP, biological process; GO, Gene Ontology; PPI, protein-protein interaction.

Table 5 Pathway enrichment analysis of modules of PPI network (top five in each)
CategoryClassDescriptionGene countAdjusted P-value
Module P-C
KEGGhsa04110Cell cycle66.13E-06
KEGGhsa04115p53 signaling pathway48.55E-04
KEGGhsa04114Oocyte meiosis40.0023
KEGGhsa04914Progesterone-mediated oocyte maturation30.0256
Module P-D
KEGGhsa00970Aminoacvl-tRNA biosynthesis40.0030
KEGGhsa04110Cell cycle30.3684

Abbreviations: PPI, protein-protein interaction; KEGG, Kyoto Encyclopedia of Genes and Genomes.

GGI network

A GGI network of the overall DEGs was constructed to infer the biological meaning of the recognized DEGs at the gene level. The gene interaction network was composed of 449 nodes and 14,848 edges (Figure 6A). Approximately 64.66% genes show physical interactions, and 17.38% of genes show coexpression in the GGI network.

The hub genes of the GGI network were identified as ubiquitin C (UBC), interleukin enhancer binding factor 2 (ILF2), karyopherin subunit alpha 2, CCNB1, and CDC28 protein kinase regulatory subunit 2 (CKS2). The top 20 hubs of GGI network are shown in Table 6.

Ten modules were identified in the GGI network with MCODE score ≥4 and nodes ≥6. They were as follows: module G-A (MCODE score - 70.541) with 75 nodes (Figure 6B), module G-B (MCODE score - 18.698) with 64 nodes, module G-C (MCODE score - 12.963) with 28 nodes, module G-D (MCODE score - 7.333) with 28 nodes, module G-E (MCODE score - 5.647) with 35 nodes, module G-F (MCODE score - 5.556) with ten nodes, mod- ule G-G (MCODE score - 5.286) with 29 nodes, module G-H (MCODE score - 5.2) with eleven nodes, module G-I (MCODE score - 4.839) with 32 nodes, and module G-J (MCODE score-4.833) with 13 nodes. Hub genes, namely, CDK1, CCNB1, CDC20, and BUB1B, were also present in the largest module G-A of the GGI network. Genes in module G-A were further scrutinized for enrichment analysis.

Genes in module G-A were found to be significantly enriched in cell cycle and cell cycle phase. Moreover, KEGG pathway enrichment revealed that genes in module G-A were enriched in cell cycle and oocyte meiosis pathways (Table 7).

Gene regulatory network

Identification of DEGs was preceded by recognizing TFs and miRNA associated with DEGs to better understand the process of gene regulation.

Eighty-two miRNAs such as TGGTGCT, MIR-29A, MIR-29B, MIR-29C, GTACTGT, MIR-101, GTGCCTT, and MIR-506 were found to be associated with DEGs. One hundred TFs such as GGGCGGR_VNFAT_Q4_01 were mapped to DEGs with FDR q-value below 0.05. Top ten TFs and miRNAs associ- ated with DEGs are shown in Tables 8 and 9, respectively.

The DEGs-miRNAs-TFs regulatory network consisted of 520 nodes and 2,782 edges (Figure 7). Sp1 TF (SP1) was identified as the hub of the network with a node degree of 88.

Discussion

PPIs and protein expression are responsible for the patho- logical changes induced by the development of carcinoma. Multiple resources such as alterations in gene expression, PPI network, gene functional interaction network, hubs, and mod- ule identification were employed to identify potential diag- nostic markers that can distinguish children with ACC.

Figure 6 GGI network for pediatric ACC along with module. Notes: (A) A GGI network of overall DEGs. Upregulated genes are shown in blue, downregulated DEGs in red, and additional genes in light blue. (B) Functional module with MCODE score 70.541.

A

B

FOXM1

RACGAP1

UBC

MCM2

MLFOGH

TOP2A

MCM6

RFC4

TYMS

CKS1B

FANCI

GINS1

TPX2

KIAA0101

MCM7

MCM3

PBK

DDX39A

HMMR

H2AFZ

CCNA2

CKS2

DNMT1

NCAPG

GMNN

EM19LA

“SE

CCNB2

CDKN3

AURKA

CDTUBA1B

VRK1

NCAPH

CEP55

TTK

DLGAP5

SMC4

NDC80

CENPF

BUB1B

RRM2

PTTG1

BUB1

UBE25

DTL

MELK

CDC7

TACC3

MAD2L1

ASF1B

CDK2

HMGB2

UBE2C

ZWINT

KPNA2

TMPO

PRC1

MICM5

PLK1

NCAPD2

ASPM

CCNB1

MCM4

EMNB

520A

SMC2

CDK1

BARD1

CDC20

KIF23

RRM1

Abbreviations: ACC, pediatric adrenocortical carcinoma; DEGs, differentially expressed genes; GGI, gene-gene functional interaction; MCODE, molecular complex detection.

Table 6 The top 20 hub genes of GGI network
IDDegreeIDDegreeIDDegreeIDDegree
UBC322CDK2135H2AFZ127PLK1123
ILF2141CDK1133CKS1B127PSMD14122
KPNA2138PTTG1131UBE2S125DDX39A122
CCNB1137BUB1B130MCM6124ZWTNT121
CKS2137CDC20127DNMT1124MCM3121

Abbreviation: GGI, gene-gene functional interaction.

A total of 431 DEGs including 228 upregulated and 203 downregulated DEGs were identified by microarray data analysis. Pathway enrichment analysis demonstrated that cell cycle, terpenoid backbone biosynthesis, and oocyte meiosis were overrepresented amid upregulated genes. IGF-2 was the most highly expressed gene in pediatric ACC, as compared to normal adrenal. Among the upregulated DEGs, CDK1, CCNB1, CDC20, and BUB1B were the common hubs among PPI and GGI networks. More centralized genes in the network are suggested to be key drivers of proper cellular function, in comparison to peripheral genes.34 Moreover, all these four genes were incorporated in the functional modules of the interaction networks.

CDK1, also known as CDC2, is a representative of serine/ threonine protein kinase family. CDK1 is a catalytic subunit of M-phase promoting factor, a well-conserved protein kinase complex crucial for G1/S and G2/M phase transitions of cell cycle in eukaryotes. CDK1 has previously been reported to be upregulated in ACC samples by Glover et al. Moreover, in vivo inhibition of CDK1 by targeted miR-7 delivery has been

Table 7 GO and KEGG pathway enrichment analyses of the most significant module of gene-gene interaction network with MCODE score 70.541 (top five in each)
CategoryClassDescriptionGene countAdjusted P-value/FDR q-value
GO
BP7,049Cell cycle473.4537E-39
BP22,403Cell cycle phase375.8834E-35
BP278Mitotic cell cycle354.0954E-34
BP51,301Cell division335.9482E-34
BP279M phase345.9482E-34
Pathway
KEGGhsa04110Cell cycle211.35E-34
KEGGhsa04114Oocyte meiosis115.32E-15
KEGGhsa03030DNA replication71.15E-11
KEGGhsa04914Progesterone-mediated oocyte maturation87.63E-11
KEGGhsa04115P53 signaling pathway66.33E-08

Abbreviations: FDR, false discovery rate; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes; MCODE, molecular complex detection; BP, biological process.

proposed as a therapeutic approach for ACC.35 CDC2 was found to be dysregulated in cell cycle pathway through meta- analysis of gene expression and comprehensive genomic hybridization profiling data of ACC.36

CCNB1 encodes for a regulatory protein that is involved in mitosis. Pinto et al reported that pediatric ACC based on TP53 and somatic ATRX mutations had shown high expres- sion of genes associated with chromosome instability and deregulation of cell cycle control, such as CCNB1.4 Soon et al reported that CCNB1 expression was found to be appreciably higher in ACC as compared to adrenocortical adenoma. Also, the combination of IGF-2 and either MAD2L1, CCNB1, or Ki-67 is highly sensitive and specific for ACC.37

CDC20 acts as regulatory protein involved in cell cycle progression, apoptosis, and ciliary disassembly. CDC20 has been proposed to exhibit oncogenic function, dem- onstrating its utility as a potential therapeutic target for combating human cancers.38 CDC20 has previously been reported to be upregulated in transcriptome analysis of adrenocortical cancer.39

BUB1B is an essential spindle assembly checkpoint pro- tein that forms mitotic check point complex, which on activa- tion controls premature chromosome segregation.40 Mitotic inhibitor drugs such as taxanes, which disrupt the process of cell division, have proved to be potent anticancer drugs.

Table 8 Top ten TFs associated with DEGs
TFNumber of target genesFDR q-value
GGGCGGR_V$SP1_Q6884.96E-20
TGGAAA_VSNFAT_Q4_01671.40E-18
GATTGGY_V$NFY_Q6_01518.78E-18
GGGAGGRR_V$MAZ_Q6701.43E-16
CTTTGT_V$LEF1_Q2643.33E-16
CAGGTG_V$E12_Q6709.63E-15
AACTTT_UNKNOWN561.81E-12
RYTTCCTG_V$ETS2_B401.39E-11
RTAAACA_V$FREAC2_01351.74E-10
V$NFY_01192.60E-10

Abbreviations: DEGs, differentially expressed genes; FDR, false discovery rate; TFs, transcription factors.

Table 9 Top ten miRNAs associated with DEGs
miRNANumber of target genesFDR q-value
TGGTGCT, MIR-29A, MIR-29B, MIR-29C201.72E-05
GTACTGT, MIR-101141.72E-05
GTGCCTT, MIR-506213.41E-04
ATACCTC, MIR-202103.41E-04
CAGTGTT, MIR-141, MIR-200A133.41E-04
AATGTGA, MIR-23A, MIR-23B153.86E-04
GTGACTT, MIR-22494.89E-04
TGCCTTA, 1IR-124A174.89E-04
ACCAAAG, MIR-9164.89E-04
ATGTTTC, MIR-49494.89E-04

Abbreviations: DEGs, differentially expressed genes; FDR, false discovery rate; miRNA, microRNA.

Combined expression of BUB1B-PINK1 has been proposed to be slightly related with disease-free survival in the pedi- atric group.41

A gene regulatory network incorporating DEGs- miRNAs-TFs was also constructed to better understand the process of gene regulation. Upon analysis, SP1 was found to be hub of the gene regulatory network. SP1 is a versatile sequence-specific DNA-binding protein involved in the expression of different genes.42 It is overexpressed in various human cancers and is involved in angiogenesis, cell growth, and resistance to apoptosis.43-45

The study has some limitations as the data utilized in the study consisted of 18 ACC and seven control samples, which

Figure 7 A gene regulatory network incorporating DEGs-TFs-miRNAs. Notes: DEGs are shown in red (circle), TFs in green (triangle), and miRNAs in blue (diamond). Abbreviations: DEGs, differentially expressed genes; miRNA, microRNA; TFs, transcription factors.

were restricted in quantity and downloaded from the Gene Expression Omnibus database, not generated by us.

Conclusion

Four novel genes, CDK1, CCNB1, CDC20, and BUB1B, associated with pediatric ACC were identified by bioinfor- matics approaches. These DEGs were present in the hubs and modules of PPI and GGI networks, suggesting their potential utility as potential biomarker for pediatric ACC. These genes may also provide prospective targets for pediatric ACC therapy, although further experimental studies are essential to confirm the role of these genes and their potential to be developed as molecular targets for pediatric ACC.

Acknowledgment

Computing facilities at the National Bureau of Animal Genetic Resources, Karnal, India are gratefully acknowledged.

Disclosure

The authors report no conflicts of interest in this work.

References

1. Gulack BC, Rialon KL, Englum BR, et al. Factors associated with sur- vival in pediatric adrenocortical carcinoma: an analysis of the National Cancer Data Base (NCDB). J Pediatr Surg. 2016;51(1):172-177.

2. Lalli E, Figueiredo BC. Pediatric adrenocortical tumors: what they can tell us on adrenal development and comparison with adult adrenal tumors. Front Endocrinol. 2015;6:23.

3. Pinto EM, Morton C, Rodriguez-Galindo C, et al. Establishment and characterization of the first pediatric adrenocortical carcinoma xenograft model identifies as a potential chemotherapeutic agent. Clin Cancer Res. 2013;19(7):1740-1747.

4. Pinto EM, Chen X, Easton J, et al. Genomic landscape of paediatric adrenocortical tumors. Nat Commun. 2015;6:6302.

5. Wasserman JD, Zambetti GP, Malkin D. Towards an understanding of the role of p53 in adrenocortical carcinogenesis. Mol Cell Endocrinol. 2012;351(1):101-110.

6. Weksberg R, Shuman C, Beckwith JB. Beckwith-Wiedemann syndrome. Eur J Hum Genet. 2010;18(1):8-14.

7. Ribeiro RC, Pinto EM, Zambetti GP, Rodriguez-Galindo C. The Interna- tional Pediatric Adrenocortical Tumor Registry initiative: contributions to clinical, biological and treatment advances in pediatric adrenocortical tumors. Mol Cell Endocrinol. 2012;351(1):37-43.

8. Ribeiro RC, Pinto EM, Zambetti GP. Familial predisposition to adreno- cortical tumors: clinical and biological features and management strate- gies. Best Pract Res Clin Endocrinol Metabol. 2010;24(3):477-490.

9. Doghman M, Arhatte M, Thibout H, et al. Nephroblastoma overexpressed/cysteine-rich protein 61/connective tissue growth factor/ nephroblastoma overexpressed gene-3 (NOV/CCN3), a selective adre- nocortical cell proapoptotic factor, is down-regulated in childhood adre- nocortical tumors. J Clin Endocrinol Metab. 2007;92(8):3253-3260.

10. Doghman M, El Wakil A, Cardinaud B, et al. Regulation of insulin-like growth factor -mammalian target of rapamycin signaling by microRNA in childhood adrenocortical tumors. Cancer Res. 2010;70(11):4666-4675.

11. WuJ, Vallenius T, Ovaska K, Westermarck J, Mäkelä TP, Hautaniemi S. Integrated network analysis platform for protein-protein interactions. Nat Methods. 2009;6(1):75-77.

12. Li M, Wu X, Wang J, Pan Y. Towards the identification of protein complexes and functional modules by integrating PPI network and gene expression data. BMC Bioinformatics. 2012;13:109.

13. Bapat SA, Krishnan A, Ghanate AD, Kusumbe AP, Kalra RS. Gene expression: protein interaction systems network modelling identifies transformation-associated molecules and pathways in ovarian cancer. Cancer Res. 2010;70(12):4809-4819.

14. Sharan R, Ulitsky I, Shamir R. Network-based prediction of protein function. Mol Syst Biol. 2007;3:88.

15. Li Y, Li J. Disease gene identification by random walk on multigraphs merging heterogeneous genomic and phenotype data. BMC Genomics. 2012;13 (Suppl 7):S27.

16. West AN, Neale GA, Pounds S, et al. Gene expression profiling of childhood adrenocortical tumors. Cancer Res. 2007;67(2):600-608.

17. Irizarry RA, Hobbs B, Collin F, et al. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003;4(2):249-264.

18. Gautier L, Cope L, Bolstad BM, Irizarry RA. Affy - analysis of Affyme- trix GeneChip data at the probe level. Bioinformatics. 2004;20(3): 307-315.

19. Smyth GK. Limma: linear models for microarray data. In: Gentleman V, Carey S, Dudoit R, Irizary W, Huber, W, editors. Bioinformatics and Computational Biology Solutions Using R and Bioconductor. New York: Springer; 2005:397-420.

20. Bender R, Lange S. Adjusting for multiple testing - when and how? J Clin Epidemiol. 2001;54(4):343-349.

21. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Stat Soc B (Methodological). 1995;57(1):289-300.

22. Ashburner M, Ball CA, Blake JA, et al. Gene Ontology: tool for the unification of biology. Nat Genet. 2000;25(1):25-29.

23. Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID Bioinformatics Resources. Nat Protoc. 2009;4(1):44-57.

24. Wang J, Duncan D, Shi Z, Zhang B. Web-based gene set analysis toolkit (WebGestalt): Update 2013. Nucleic Acids Res. 2013;41(Web Server issue): W77-W83.

25. Pathan M, Keerthikumar S, Ang CS, et al. Funrich: An open access standalone functional enrichment and interaction network analysis tool. Proteomics. 2015;15(15):2597-2601.

26. Szklarczyk D, Franceschini A, Wyder S, et al. STRING v10: protein- protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 2015;43(Database issue): D447-D452.

27. Shannon P, Markiel A, Ozier O, et al. Cytoscape: a software environ- ment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498-2504.

28. Assenov Y, Ramírez F, Schelhorn SE, Lengauer T, Albrecht M. Com- puting topological parameters of biological networks. Bioinformatics. 2008;24(2):282-284.

29. Bader GD, Hogue CW. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinformatics. 2003;4:2.

30. Maere S, Heymans K, Kuiper M. BINGO: a Cytoscape plugin to assess overrepresentation of Gene Ontology categories in biological networks. Bioinformatics. 2005;21(16):3448-3449.

31. Warde-Farley D, Donaldson SL, Comes O, et al. The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function. Nucleic Acids Res. 2010;38(Web Server issue): W214-W220.

32. Subramanian A, Tamayo P, Mootha VK, et al. Gene set enrichment anal- ysis: a knowledge-based approach for interpreting genome-wide expres- sion profiles. Proc Natl Acad Sci U S A. 2005;102(43):15545-15550.

33. Wu B, Li C, Du Z, et al. Network based analyses of gene expression profile of LCN2 overexpression in esophageal squamous cell carcinoma. Sci Rep. 2014;4:5403.

34. Horvath S, Zhang B, Carlson M, et al. Analysis of oncogenic signaling networks in glioblastoma identifies ASPM as a molecular target. Proc Natl Acad Sci U S A. 2006;103(46):17402-17407.

35. Glover AR, Zhao JT, Gill AJ, et al. microRNA-7 as a tumor suppres- sor and novel therapeutic for adrenocortical carcinoma. Oncotarget. 2015;6(34):36675-36688.

36. Szabó PM, Tamási V, Molnár V, et al. Meta-analysis of adrenocortical tumour genomics data: novel pathogenic pathways revealed. Oncogene. 2010;29(21):3163-3172.

37. Soon PS, Gill AJ, Benn DE, et al. Microarray gene expression and immunohistochemistry analyses of adrenocortical tumors identify IGF2 and Ki-67 as useful in differentiating carcinomas from adenomas. Endocr Relat Cancer. 2009;16(2):573-583.

38. Wang L, Zhang J, Wan L, Zhou X, Wang Z, Wei W. Targeting Cdc20 as a novel cancer therapeutic strategy. Pharmacol Ther. 2015;151:141-151.

39. Ragazzon B, Assié G, Bertherat J. Transcriptome analysis of adreno- cortical cancers: from molecular classification to the identification of new treatments. Endocr Relat Cancer. 2011;18(2):R15-R27.

40. Wan X, Yeung C, Kim SY, et al. Identification of FoxM1/Bub1b signaling pathway as a required component for growth and survival of rhabdomyosarcoma. Cancer Res. 2012;72(22):5889-5899.

41. Fragoso MC, Almeida MQ, Mazzuco TL, et al. Combined expression of BUB1B, DLGAP5, and PINK1 as predictors of poor outcome in adrenocortical tumors: validation in a Brazilian cohort of adult and pediatric patients. Eur J Endocrinol. 2012;166(1):61-67.

42. Suske G. The Sp-family of transcription factors. Gene. 1999;238(2): 291-300.

43. Lou Z, O’Reilly S, Liang H, Maher VM, Sleight SD, McCormick JJ. Down-regulation of overexpressed sp1 protein in human fibrosar- coma cell lines inhibits tumor formation. Cancer Res. 2005;65(3): 1007-1017.

44. Wang L, Guan X, Gong W, et al. Altered expression of transcription factor Sp1 critically impacts the angiogenic phenotype of human gastric cancer. Clin Exp Metastasis. 2005;22(3):205-213.

45. Kanai M, Wei D, Li Q, et al. Loss of Krüppel-like factor 4 expression contributes to Sp1 overexpression and human gastric cancer develop- ment and progression. Clin Cancer Res. 2006;12(21):6395-6402.

OncoTargets and Therapy

Dovepress

Publish your work in this journal

OncoTargets and Therapy is an international, peer-reviewed, open access journal focusing on the pathological basis of all cancers, potential targets for therapy and treatment protocols employed to improve the management of cancer patients. The journal also focuses on the impact of management programs and new therapeutic agents and protocols on

Submit your manuscript here: http://www.dovepress.com/oncotargets-and-therapy-journal

patient perspectives such as quality of life, adherence and satisfaction. The manuscript management system is completely online and includes a very quick and fair peer-review system, which is all easy to use. Visit http://www.dovepress.com/testimonials.php to read real quotes from published authors.