frontiers in Oncology

Check for updates

OPEN ACCESS

Edited by: Gelina Kopeina, Lomonosov Moscow State University, Russia

Reviewed by:

Mantang Qiu, Peking University People’s Hospital, China Yongsong Chen, Shantou University, China

*Correspondence: Xiangqing Kong xiangqing_kong@sina.com

Jing Shi shijing5499@jsph.org.cn +These authors have contributed equally to this work

Specialty section:

This article was submitted to Molecular and Cellular Oncology, a section of the journal Frontiers in Oncology Received: 17 November 2020 Accepted: 07 April 2021 Published: 19 May 2021

Citation:

Yang C, Wu T, Zhang J, Liu J, Zhao K, Sun W, Zhou X, Kong X and Shi J (2021) Prognostic and Immunological Role of mRNA ac4C Regulator NAT10 in Pan-Cancer: New Territory for Cancer Research? Front. Oncol. 11:630417.

doi: 10.3389/fonc.2021.630417

Prognostic and Immunological Role of mRNA ac4C Regulator NAT10 in Pan-Cancer: New Territory for Cancer Research?

Chuanxi Yang1,21, Tingting Wu11, Jing Zhang11, Jinhui Liu3, Kun Zhao1, Wei Sun1, Xin Zhou4, Xiangqing Kong1* and Jing Shi1*

1 Department of Cardiology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China, 2 Department of Cardiology, Yangpu Hospital, Tongji University School of Medicine, Shanghai, China, 3 Department of Gynecology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China, 4 Department of Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China

Background: NAT10 (also known as human N-acetyltransferase-like protein) is a critical gene that regulates N4-acetylcytidine formation in RNA, similar to the multiple regulators of N6-methyladenosine. However, the underlying functions and mechanisms of NAT10 in tumor progression and immunology are unclear.

Methods: In this study, we systematically analyzed the pan-cancer expression and correlations of NAT10, using databases including Oncomine, PrognoScan, GEPIA2, and Kaplan-Meier Plotter. The potential correlations of NAT10 with immune infiltration stages and gene marker sets were analyzed using the Tumor Immune Estimation Resource and GEPIA2.

Results: Compared with normal tissues, NAT10 showed higher expression in most cancers based on combined data from TCGA and GTEx. In different datasets, high NAT10 expression was significantly correlated with poor prognosis in adrenocortical carcinoma, head and neck squamous cell carcinoma, liver hepatocellular carcinoma, kidney renal papillary cell carcinoma, and pheochromocytoma and paraganglioma. Moreover, there were significant positive correlations between NAT10 expression and immune infiltrates, including B cells, CD8+ T cells, CD4+ T cells, neutrophils, macrophages, dendritic cells, endothelial cells, and fibroblasts in LIHC. NAT10 expression showed strong correlations with diverse immune marker gene sets in LIHC.

Conclusion: NAT10 expression affects the prognosis of pan-cancer patients and is significantly correlated with tumor immune infiltration. Furthermore, it represents a potential target for cancer therapy.

Keywords: NAT10, prognosis, pan-cancer, tumor infiltration, N4-acetylcytidine

BACKGROUND

RNA modification was first discovered in 1956 by Cohn et al. (1) and Davis et al. (2). In recent years, extensive research in RNA biology has revealed diverse modifications of RNA at the post- transcription stage. More than 100 RNA modifications have been shown to have important roles in regulating RNA stability (3), localization (4), transport, shearing (5), and translation (6). N4- acetylcytidine (ac4C) is considered to be a conservative chemically modified nucleoside on tRNA and rRNA (7). Recently, several studies proved that the presence of ac4C on tRNA, rRNA and mRNA is important for increasing and maintaining the fidelity of protein translation (8-11). Furthermore, studies by Thomale et al. (12) and Liebich et al. (13) found significant increases in modified nucleosides (including ac4C) in the urine of tumor mice and cancer patients. Besides, increased levels of ac4C in urine were observed in colorectal cancer (14), urogenital cancer (15), ovarian epithelial cancer (16), and breast cancer (17). These findings suggest that ac4C is a potential biomarker for cancer.

NAT10 (also known as hALP, human N-acetyltransferase- like protein), which was first reported in 2003, is a protein with histone acetylation activity that can enhance telomerase activity by stimulating transcription of hTERT (18). NAT10 or a homologous enzyme in other species increased the formation of ac4C on tRNA, rRNA, and mRNA, thereby maintaining the accuracy of protein translation and stabilizing the mRNA (11). Tuan et al. first showed that NAT10 was associated with cancer by demonstrating that it could significantly promote cell growth in epithelial ovarian cancer and breast cancer (19, 20). NAT10 also has a potential role in increasing melanogenesis and melanoma growth (21). In addition, high NAT10 expression was found to be related to poor survival in human hepatocellular carcinoma (22) (23), acute myeloid leukemia (24) and to promote colorectal cancer progression by increasing micronuclei (25). These findings suggest that NAT10 has multifaceted functional roles in cancers. Several studies have also shown that levels of ac4C are associated with inflammatory responses (26, 27). However, the underlying functions and

Abbreviations: ACC, adrenalcortical carcinoma; BLCA, bladder urothelial carcinoma; BRCA, breast invasive carcinoma; CCLE, Cancer Cell Line Encyclopedia; CHOL, cholangiocarcinoma; COAD, colon adenocarcinoma; DFI, disease-free interval; DLBC, lymphoid neoplasm diffuse large B-cell lymphoma; DSS, disease-specific survival; ESCA, esophageal carcinoma; GBM, glioblastoma multiforme; GTEx, The Genotype- Tissue Expression; International Cancer Genome Consortium; HNSC, Head and Neck squamous cell carcinoma; KICH, kidney chromophobe; KIRC, kidney renal clear cell carcinoma; KIRP, kidney renal papillary cell carcinoma; LAML, acute myeloid leukemia; LGG, brain lower grade glioma; LIHC, liver hepatocellular carcinoma; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; MESO, mesothelioma; MSI, microsatellite instability; OS, overall survival; OV, ovarian serous cystadenocarcinoma; PAAD, pancreatic adenocarcinoma; PCPG, pheochromocytoma and paraganglioma; PD-1, programmed cell death protein 1s; PRAD, prostate adenocarcinoma; READ, rectum adenocarcinoma; ROC, Receive Operating Characteristic; SARC, sarcoma; SKCM, skin cutaneous melanoma; STAD, stomach adenocarcinoma; TCGA, The Cancer Genome Atlas; TGCT, testicular germ cell tumor; THYM, thymoma; TIMER2, Tumor IMmune Estimation Resource; TMB, tumor mutation burden; UCEC, uterine corpus endometrial carcinoma; UVM, uveal melanoma.

mechanisms of NAT10 in tumor progression and tumor immunology remain unclear.

In the current study, we systematically analyzed the pan- cancer expression of NAT10 and its correlations, using databases including Oncomine, PrognoScan, GEPIA2, and Kaplan-Meier Plotter. We then investigated the potential correlations of NAT10 with immune infiltration stages using the Tumor Immune Estimation Resource2 (TIMER2) and GEPIA2. The findings from our study indicate that NAT10 expression affects the prognosis of pan-cancer patients as well as being significantly correlated with tumor-immune infiltration. Furthermore, it may serve as a potential target for cancer therapies.

MATERIALS AND METHODS

Ethics Approval

Ten paired human para-tumor and tumor tissue samples were obtained from newly diagnosed Liver hepatocellular carcinoma in the First Affiliated Hospital of Nanjing Medical University (Nanjing, China). The tissues were stored in the fridge at -80℃. The study was conducted with the approval of the Institutional Review Board and the Ethics Committee of the First Affiliated Hospital of Nanjing Medical University (ID: 2017-SRFA-104).

Data Mining for NAT10 in Public Databases

First, to investigate the pan-cancer differential expression of NAT10 mRNA, several databases were mined, including: Oncomine (http://www.oncomine.org/resource/login.html) with thresholds of P-value 0.05 and fold change 1.5; The Cancer Genome Atlas (TCGA); the Broad Institute Cancer Cell Line Encyclopedia (CCLE); and GEPIA2 (http://GEPIA2.cancer- pku.cn/). Simply, we search the different expression of NAT10 between tumor and adjacent normal tissues for the TCGA project at “Gene_DE” module of TIMER2 (tumor immune estimation resource, version 2) web (http://TIMER2.cistrome. org). GBM (Glioblastoma multiforme), LAML (Acutemyeloid leukemia), etc., which without normal or with highly limited normal tissues, are using the “Expression analysis -Box Plots” module of the GEPIA2 (Gene Expression Profiling Interactive Analysis, version 2) to get box plots about NAT10 expression between these tumor tissues and the corresponding normal tissues of the GTEx (Genotype -Tissue Expression) database (setting with P-value cutoff = 0.01, log2FC (fold change) cutoff =1, and “Match TCGA normal and GTEx data).

Then, HPA (Human protein atlas) database (http://www. proteinatlas.org/humanproteome/pathology) were used to get the expression of NAT10 in different cells and tissues under physiological conditions. The detailed information about low specificity of NAT10 was stated by “NX (Normalized expression) ≥ 1 in at least one tissue/region/cell type but not elevated in any tissue/region/cell type” which can be found at the link http:// proteinatlas.org//search/NAT10.

In addition, the NAT10 expression transformed to log2 [TPM (Transcripts per million) +1] in different pathological stage

(stage I, stage II, stage III and stage IV) of tumors was showed in violin plots at the “Pathological Stage Plot” module of GEPIA2. Furthermore, the UALCAN portal (http://ualcan.path.uab.edu/ analysis-prot.html) was used to process protein expression analysis of the CPTAC (clinical proteomic tumor analysis consortium) dataset. The expression level of total protein of NAT10 between primary tumor and normal tissues (breast cancer, ovarian cancer, colon cancer, clear cell renal cell carcinoma, uterine corpus endometrial carcinoma and lung adenocarcinoma) was explored by entering “NAT10”.

Survival Analysis in GEPIA2, PrognoScan, and Kaplan-Meier Plotter

Cox regression analysis was performed to test the correlations between NAT10 expression and patients’ overall survival (OS), disease-free survival (DFS), disease-specific survival (DSS), and progression-free survival (PFS) in each cancer type using TCGA in the R environment. PrognoScan (http://dna00.bio.kyutech.ac. jp/PrognoScan-cgi/PrognoScan.cgi) microarray datasets were used to examine the relationships of NAT10 expression levels with prognosis. The threshold was adjusted to Cox P-value < 0.05. GEPIA2, an interactive online platform with information from TCGA and GTEx, was used to assess the effects of NAT10 expression on OS and DFS in each available cancer type (total number = 34). Kaplan-Meier Plotter is a relatively comprehensive online tool that can be used to analyze the effects of 54,675 genes on survival in 21 cancer types. We analyzed the relationships of NAT10 with OS and relapse-free survival (RFS) in liver hepatocellular carcinoma (LIHC), Head and neck squamous cell carcinoma (HNSC), adrenocortical carcinoma (ACC), kidney renal papillary cell carcinoma (KIRP), and pheochromocytoma and paraganglioma (PCPG). Hazard ratios (HRs) with 95% confidence intervals (CIs) and log-rank P-values were calculated.

Correlation Between NAT10 Expression and Immune Status in TIMER2 and GEPIA2

TIMER2, a powerful online platform for the systematic analysis of immune infiltration in abundant cancer types, contains 10,897 samples spanning 32 cancer types from the TCGA database, which can be used to evaluate the diversity of immune infiltration. Therefore, we analyzed NAT10 expression with all six types of immune infiltrates: B cells, CD4+ T cells, CD8+ T cells, neutrophils, macrophages, and dendritic cells (DCs). Correlations between expression levels of NAT10 and tumor purity were also analyzed by using the “Immune- Gene” module.

Furthermore, the correlations between immune cell markers and NAT10 expression were identified using correlation modules in GEPIA2. The gene markers included markers of B cells, CD8+ T cells, follicular helper T cells (Tfh), T-helper 1 (Th1) cells, T-helper 2 (Th2) cells, T-helper 9 (Th9) cells, T-helper 17 (Th17) cells, T-helper 22 (Th22) cells, regulatory T cells (Tregs), exhausted T cells, M1 macrophages, M2 macrophages, tumor- associated macrophages (TAMs), monocytes, natural killer (NK) cells, neutrophils, and DCs. Immune gene markers from R&D systems (https://www.rndsystems.com/cn/resources/cell-

markers/immune-cells) was selected for analyzing. The gene expression level was adjusted with log2 RSEM with x-axis representing NAT10 and y-axis representing immune gene markers. Correlation scores were calculated for LIHC, HNSC, ACC, KIRP, and PCPG with the Spearman method by using scatterplots.

To explore the proteins potentially interacting with NAT10, the STRING database (http://string-db.org) which was searched by a single protein name (“NAT10”) and organism (“Homo sapiens”). Furthermore, the following main parameters: meaning of network edges (“evidence”), minimum required interaction score [“Low confluence (0.150)”], max number of interactors to show (“no more than 50 interactors”) and active interaction sources (“experiments”) was set to obtain the available experimentally determined NAT10-binding proteins. Then, by using GEPIA2 with the module of “Similar Gene Detection”, we get the top 100 NAT10-correlated genes which was conducted by all TCGA tumor and normal tissues. Particularly, “correlation analysis” in GEPIA2 was used to perform the Pearson correlation between NAT10 with the top 5 selected genes, and “Gene_Corr” module in TIMER2 was used to supply the heatmap data of the top 5 selected genes. Then, the 100 NAT10-correlated genes and the 50 interactors were subjected to network analyses (https://portal.genego.com) as previous described (28). Pearson correlation analysis was used to determine the associations of proteins with NAT10.

RNA Analysis and Real-Time Quantitative PCR (qRT-PCR)

Total RNA isolation from tissue samples were performed using the RNazol B method and a Qiagen RNeasy kit, according to the manufacturer’s instructions. RNA was reverse transcribed (Applied Biosystems) using random hexamer priming. Real- time qRT-PCR was performed using SYBR Green reagent (Applied Biosystems) and rat-specific primers on the ABI Prism 7500 Sequence Detection system. GAPDH was used as an internal control. The relative gene expression levels were calculated using the 2-A/Ct method (n=10).

Primers for real-time PCR:

NAT10 forward: 5’-ATAGCAGCCACAAACATTCGC-3’,

NAT10 reverse: 5’-ACACACATGCCGAAGGTATTG-3’; GAPDH forward: 5’-GAACGGGAAGCTCACTGG-3’,

GAPDH reverse: 5’-GCCTGCTTCACCACCTTCT-3’.

Immunohistochemistry Staining

Para-tumor and tumor tissue samples were embedded in formalin. Each tissue was cut to 4-um thick and mounted on a glass slide. Dewaxing sections were performed as previously described (29). Endogenous peroxidase activity was inhibited and blocked with 5% bovine serum albumin for 30 min at 37℃. The slices were incubated in anti-NAT10 (1:1000 dilution, 13365-1-AP, Proteintech) overnight at 4℃, washed three times with PBS for 5 min, and then incubated with secondary anti-horseradish peroxide at 37℃ for 30 min. After three more washes with PBS, the slices were visualized in diaminobenzidine

chromogenic solution. Microscopic images were obtained by light microscopy (Carl Zeiss, Oberkochen, Germany).

Statistical Analysis

Low and high NAT10 expression groups were established using normalized NAT10 mRNA expression values from the various datasets, based on P-values determined by t-tests. The Spearman correlation test was used to assess the correlations between NAT10 expression and targets of interest, including neoantigens, tumor mutational burden (TMB), and microsatellite instability (MSI). We used log-rank tests to calculate HRs and log-rank P-values in Kaplan-Meier Plotter, PrognoScan, and GEPIA2. P-values less than 0.05 were considered significant. All graphs were produced using the R software (version 4.0.2, www.r-project.org) with the ggplot2 and forestplot packages.

RESULTS

NAT10 Expression Analysis Data

In our study, by using integrated datasets [HPA (Human protein atlas), GTEx, FANTOM5 (Function annotation of the mammalian genome 5), Monaco and Schmiedel], we first assessed the expression of NAT10 in different cells and normal tissues. As shown in Figure 1A, expression of NAT10 showed low RNA blood cell type specificity in different blood cells. NAT10 showed highest expression in the Tonsil, Parathyroid gland and Testis (Figure 1B). However, NAT10 can be expressed in all tissues (all consensus normalized expression value >1) showing low RNA tissue specificity.

Then, the mRNA expression levels of NAT10 were analyzed in Oncomine over a cancer-wide range. NAT10 expression was higher in cancer groups compared with the respective normal groups, including bladder, breast, colorectal, esophageal, gastric, liver, lung, kidney, and prostate cancers, as well as leukemia and myeloma. Interestingly, lower expression of NAT10 was found in one leukemia dataset (Figure 1C). The NAT10 expression data for multiple cancers from Oncomine are summarized in Supplementary Table 1.

Furthermore, the pan-cancer expression of NAT10 was examined based on RNA sequencing data from TCGA using TIMER2. As shown in Figure 1D, the expression of NAT10 in tumor tissues of BLCA, BRCA, CHOL, COAD, ESCA, HNSC, KICH, KIRC, LIHC, LUAD, LUSC, READ, SKCM, STAD and THCA is higher than the corresponding normal tissues. After using the GTEx dataset as controls, similarly increasing level of NAT10 expression was found in DLBC and THYM (Supplementary Figure 1). However, we did not obtain a significant difference for other tumors. We also used the “Pathological Stage Plot” module in GEPIA2 to get the correlation between NAT10 expression and the pathological stages of cancers, including KIRP, LIHC, LUAD and PAAD (Figure 1E, all p <0.05) but not others (Supplementary Figure 1).

Moreover, using the CPTAC dataset, the total protein of NAT10 is higher expressed in the primary tumor of clear cell

RCC, breast cancer, colon cancer, LUAD, ovarian cancer and UCEC than in normal tissues (Figure 1F). Also, in different stages of these six types of cancer, the total protein of NAT10 showed higher expression in the primary tumor except ovarian cancer (Supplementary Figure 1). The immunohistochemical findings from HPA database showed positive in prostate, lung, liver, breast and colorectal cancer than normal tissues (Supplementary Figure 2). Especially, the immuno histochemical and mRNA results in LIHC from 10 patients showed higher expression of NAT10 compared with Paracancerous tissues.

We assessed the correlation between the respective expression levels of NAT10 and OS, PFS, DFS, and DSS in different cancer types using a single-variate Cox regression analysis based on TCGA. The results are summarized in Figures 2A-D. Nine of the 33 cancer types showed significant relationships between NAT10 expression levels and OS, seven showed significant relationships with PFS, five with DFS, and seven with DSS. Overall, the HRs for NAT10 were significant for LIHC, HNSC, ACC, KIRP, and PCPG with respect to OS, PFS, DFS, and DSS. In addition, survival curves comparing high and low expression of NAT10 in different types of cancer in the TCGA database were shown in Supplementary Figure 3.

Using Kaplan-Meier Plotter and GEPIA2, high expression of NAT10 in HNSC, KIRP, LIHC and PCPG had worse outcomes from Kaplan-Meier Plotter in OS and RFS (Figures 3A-H). For ACC, HNSC, KIRP and LIHC, NAT10 significantly decreased the OS in GEPIA2 (Figures 3I-M). In addition, compared with low expression levels, high expression levels of NAT10 were correlated with poorer DFS in ACC, HNSC and LIHC, but not in KIRP and PCPG in GEPIA2 (Figures 3N-R). Using PrognoScan, we analyzed the role of NAT10 in each cancer type (number of cancer types = 12) and the relationships between NAT10 expression and prognosis in different cancers. The results are shown in Supplementary Table 2. Therefore, these results suggest that NAT10 expression is an independent risk factor for poor prognosis in these cancers.

High NAT10 Expression Affects the Prognosis of LIHC With Different Clinicopathological Features

In order to determine the relevance and underlying mechanisms of NAT10 expression in LIHC, we first analyzed NAT10 expression at different stages of LIHC, ACC, KIRP, and HNSC using TIMER2. The expression of NAT10 at stage III showed a significant increase compared with stage I (Figures 4A-D). The relationships between NAT10 expression and clinico pathological features were investigated by combining clinical and pathological data in Kaplan-Meier Plotter. With respect to OS and PFS, almost all characteristics showed a detrimental role of NAT10 in patients with LIHC, except for grade 2 (N =174, HR = 1.92, 95% CI = 0.97 to 3.97, P = 0.0564),

FIGURE 1 | Expression levels of NAT10 in different tumors and pathological stages. (A, B) We analyzed the expression of NAT10 gene in different blood cells using the consensus datasets of HPA, Monaco and Schmiedel (A) or in different tissues using the consensus datasets of HPA, GTEx and FANTOM5. (C) Increased or decreased expression of NAT10 in datasets for different cancer tissues, compared with normal tissues from the Oncomine database. The number in each cell is the size of the dataset. (D) The expression status of NAT10 gene in different cancers or specific cancer subtypes was determined by TIMER2. (E) Based on the TCGA data, the expression of NAT10 was analyzed by the main pathological stages (stage I, stage II, stage III and stage IV) of KIRP, LIHC, LUAD and PAAD. Log2 (TPM+1) was applied for log-scale. (F) The expression of NAT10 total protein between normal tissue and primary tissue of breast cancer, ovarian cancer, colon cancer, clear cell RCC and UCEC was analyzed based on the CPTAC dataset. * P < 0.05, *** P < 0.001.

A

T-cells

C

Dendritic cells

NAT 10

Normalized expression (NX)

Cancer VS. Normal

14

-

3

Monocytes

B-cells

Analysis Type by Cancer

-

0

Granulocytes

Bladder Cancer

2

Brain and CNS Cancer

1

Breast Cancer

7

Basophil

Eosinophil

Neutrophil

Classical monocyte

Non-classical monocyte

Intermediate monocyte

T-reg

GdT-cell

MAIT T-cell

Memory CD4 T-cell

Naive CD4 T-cell

Memory CD8 T-cell

Naive CD8 T-cell

Memory B-cell ’

Naive B-cell

Plasmacytoid DC

Myeloid DC

NK-cell

Total PBMC

Cervical Cancer

Colorectal Cancer

17

Esophageal Cancer

2

Gastric Cancer

3

Head and Neck Cancer

B

Kidney Cancer

3

Consensus normalized expression (NX)

Leukemia

4

1

40

I

Liver Cancer

2

2

w

Lung Cancer

3

0

Lymphoma

4

Melanoma

20

Myeloma

2

Other Cancer

1

1

Ovarian Cancer

10

Pancreatic Cancer

Prostate Cancer

1

0

Cerebral cortex

Cerebellum

Olfactory region

Hippocampal formation

Amygdala Basal ganglia

halamus

Hypothalamus

Midbrain

Pons and medulla

Corpus callosum

Spinal cord

Retina

Thyroid gland

Parathyroid gland

Adrenal gland

Pituitary gland

Lung

Salivary gland

Esophagus

Tongue

Stomach

Duodenum

Small intestine

Colon

Rectum

Liver

Gallbladder

Pancreas

Kidney

Urinary bladder

Testis

Epididymis

Seminal vesicle

Prostata

Ductus deferens

Vagina

Ovary

Fallopian tube

Endometrium

Cervix, uterine

lacenta

Breast

Heart muscle

antooth muscle

Skeletal muscle

Adipose tissue

Skin

Thymus

Appendix

Spleen

Lymph node

Tonsil

Bone marrow

Granulocytes

Monocytes

-Colle

B-cells NK-cells

Dendritic cells

Total PBMC

Sarcoma

1

Significant Unique Analyses

52

3

Total Unique Analyses

266

1

5

10

5

1

%

D

Gene rank percentile(%)

8

NAT 10 Expression Level (log2 TPM)







A

*






*



3

+

1

A

..

.

:

2

·

ACC.Tumor

BLCA.Tumor

BLCA.Normal

BRCA.Tumor

BRCA.Normal

BRCA-Basal.Tumor

BRCA-Her2.Tumor

BRCA-LumA.Tumor

BRCA-LumB.Tumor

CESC.Tumor

CESC.Normal

CHOL.Tumor

CHOL.Normal

COAD.Tumor

COAD.Normal

DLBC.Tumor

ESCA.Tumor

ESCA.Normal

GBM.Tumor

GBM.Normal

HNSC.Tumor

HNSC.Normal

HNSC-HPV+.Tumor

HNSC-HPV -. Tumor

KICH.Tumor

KICH.Normal

KIRC.Tumor

KIRC.Normal

KIRP.Tumor

KIRP.Normal

LAML.Tumor

LGG.Tumor

LIHC.Tumor

LIHC.Normal

LUAD.Tumor

LUAD.Normal

LUSC.Tumor

LUSC.Normal

MESO.Tumor

OV.Tumor

PAAD. Tumor

PAAD.Normal

PCPG.Tumor

PCPG.Normal

PRAD.Tumor

PRAD.Normal

READ.Tumor

READ.Normal

SARC.Tumor

SKCM.Tumor

SKCM.Metastasis

STAD.Tumor

STAD.Normal

TGCT.Tumor

THCA.Tumor

THCA.Normal

THYM.Tumor

UCEC.Tumor

UCEC.Normal

UCS.Tumor

UVM.Tumor

E

5.5

KIRP

F value = 3.32 Pr(>F) = 0.0204

LIHC

F value = 4.29 Pr(>F) = 0.00542

F value = 3.76

F value = 3.33 Pr(>F) = 0.0211

0

LUAD

Pr(>F) = 0.0109

0

PAAD

NAT10 expression Log2 (TPM+1)

5.0

OD

10

4.5

«

5

D

-

I

4

3.5

3

5

3.0

es

Stage I

Stage II

Stage III

Stage IV

Stage I

Stage II

Stage III

Stage IV

Stage I

Stage II

Stage III

Stage IV

Stage I

Stage II

Stage III

Stage IV

F

Protein expression of NAT 10 (z-value)

Clear cell RCC p=2.53E-50

1-

Breast cancer

Colon cancer

3-

LUAD p=1.54E-36

a

Ovarian cancer p=0.002158988

34

UCEC

p=4.11E-58

5

2-

p=6.95E-09

2

2

p=1.38E-12

2-

1

1

1

1

0-

0-

1

4

#

0-

8

4

2-

4-

9

2-

-4

4-

4-

-2

-2

5

Normal (N=84)

Primary Tumor (N=110)

4

Normal (N=18)

Primary Tumor (N=125)

$

Normal (N=100)

Primary Tumor (N=97)

4

Normal (N=111)

Primary Tumor (N=111)

-

Normal (N=111)

Primary Tumor (N=111)

4

Normal (N=31)

Primary Tumor (N=100)

A

OS

B

PFS

ACC BLCA

pvalue

Hazard ratio

0.021

2.864(1.172-6.995)

ACC

pvalue

Hazard ratio

3.255(1.603-6.610)

BRCA

0.418

0.896(0.688-1.168)

BLCA

0.001

CESC

0.818

0.169

0.967(0.725-1.290)

1.478(0.847-2.580)

BRCA

0.802

0.967(0.747-1.254)

CESC

0.330

0.052

0.866(0.649-1.156)

CHOL

COAD

0.370

1.690(0.996-2.870)

DLBC

0.024 0.983

0.609(0.206-1.802)

1.613(1.064-2.445)

CHOL

0.274

0.727 0.411-1.287

ESCA

0.243 0.098

0.983(0.213-4.545)

COAD

DLBC

0.413

0.136

1.138(0.835-1.550)

0.110

2.529(0.746-8.580)

GBM

1.370(0.807-2.325)

ESCA

1.419(0.924-2.181)

HNSC

KICH

0.001 0.006

1.422(0.937-2.159)

1.510(1.180-1.933)

GBM

HNSC

0.892

32.336(2.705-386.524)

KICH

0.004

0.971(0.635-1.485)

0.043

0.164

1.414(1.114-1.794)

KIRC KIRP

2.666(0.671-10.593)

LAML LGG LIHC

0.003

1.751(1.019-3.010)

4.783(1.682-13.599)

KIRC

1.255(0.783-2.010)

0.299

1.305(0.790-2.158)

KIRP

0.346

4.321(1.776-10.514)

0.592

0.868(0.517-1.456)

LGG

0.001

<0.001

2.360(1.638-3.399

LIHC

0.483

<0.001

0.858(0.560-1.316)

LUAD

1.725(1.311-2.269)

LUAD LUSC

1.240(0.908-1.695)

LUSC

0.381

0.947

1.125(0.864-1.464)

0.992(0.774-1.271)

MESO OV

0.176 0.213

0.876 0.797

0.857(0.672-1.093

1.063(0.492-2.297

MESO

OV

0.263

0.066

1.693(0.673-4.257

0.828(0.677-1.013)

PAAD

0.078

1.029(0.827-1.280)

PCPG

1.666(0.944-2.939)

PAAD PCPG

0.008

0.009

2.020(1.205-3.386)

PRAD

0.011

0.458

10.461(1.695-64.555)

2.113(0.293-15.221)

PRAD

READ

0.033

3.999(1.418-11.281)

2.110(1.063-4.188)

READ SARC

0.019

SARC SKCM

0.966

0.538

0.987 0.530-1.837)

0.903(0.653-1.249)

SKCM

0.671

0.379(0.168-0.852)

1.081(0.755-1.546)

0.706

1.053(0.806-1.374)

STAD

0.542

1.105(0.801-1.525)

0.897 0.652-1.233)

STAD

0.369

0.868(0.636-1.183)

TGCT

0.502

THCA

0.573

0.813

1.883(0.208-17.039)

TGCT

0.688

1.162(0.558-2.421 1.936(0.554-6.773)

THYM

UCEC

0.974

0.756(0.074-7.683)

THCA

0.957(0.067-13.663)

THYM

0.301

1.038(0.644-1.672)

UCEC

0.911

0.878

UVM

0.123

2.292(0.800-6.569)

UCS

0.110

0.911(0.174-4.753)

1.386(0.929-2.068)

UCS

0.140

1.805(0.824-3.956)

UVM

0.202

0.132

1.614(0.774-3.367

2.020(0.810-5.037)

0.062 1e+00 2e+01 3e+02 Hazard ratio

0.25

1.0 2.0 4.0 8.0

Hazard ratio

C

DFS

D

DSS

ACC

pvalue

Hazard ratio

10.020(1.949-51.516)

ACC

pvalue

Hazard ratio

0.006

2.609(1.021-6.668)

0.866(0.417-1.798)

BLCA

0.045

BLCA

0.700

BRCA

0.495

0.894(0.647-1.234)

BRCA

0.720(0.493-1.052)

CESC

0.654

0.089

CHOL

0.536

1.092(0.744-1.602)

1.343(0.528-3.417)

CESC

0.711

CHOL

0.152

1.165(0.518-2.622)

COAD

0.472

1.577(0.845-2.944)

0.605

0.799(0.434-1.472)

COAD

0.009

2.463(1.253-4.843)

DLBC

0.825

1.120(0.729-1.721)

DLBC

ESCA

0.454

0.194(0.003-14.113)

ESCA

0.229

1.251(0.172-9.121)

GBM

1.455(0.790-2.681)

HNSC

0.634

KICH

0.110

1.230(0.524-2.886)

0.064

1.597 0.973-2.620)

0.315

1.833(0.872-3.852)

HNSC

KICH

0.021

1.404(1.052-1.874)

KIRC

0.247(0.016-3.788)

0.362

0.537(0.141-2.043)

KIRC KIRP

0.052

0.065

6.376(0.985-41.286)

11.302(2.982-42.836)

0.002

1.773(0.964-3.260)

KIRP

<0.001

6.170(1.934-19.677

LGG

LIHC

0.159

2.583(0.690-9.676)

LGG LIHC

0.826

LUAD

0.005

0.007

0.936(0.518-1.691)

1.546(1.138-2.100)

LUAD

1.758(1.165-2.653)

0.125

1.382(0.914-2.091)

LUSC

0.376

0.416

1.170(0.826-1.656)

0.874(0.632-1.208)

LUSC

MESO

0.945

0.646

1.014(0.688-1.493)

MESO

0.338(0.003-34.480)

OV

0.355

0.881

1.605(0.590-4.367

PAAD

0.046

1.018(0.807-1.284)

OV

PAAD

0.003

0.666(0.508-0.875)

0.223

PCPG

0.006

1.915(1.010-3.629)

PCPG

1.884(0.681-5.214)

23.115(2.488-214.791)

PRAD

0.093

9.736(0.683-138.695)

PRAD

0.021

32.544(1.708-620.059)

0.171

READ

0.175

0.557(0.239-1.298)

READ SARC

2.426(0.682-8.620)

0.681

0.711(0.139-3.623)

SARC

SKCM

0.996

0.587

0.999(0.666-1.499)

STAD

1.070(0.700-1.635)

STAD

0.940

1.101(0.778-1.558)

0.754

0.986(0.685-1.421)

0.120

0.657(0.387-1.116)

TGCT

THCA

0.390

0.097

3.393(0.209-54.983)

TGCT

THCA

0.523

1.325(0.558-3.146)

THYM

0.808

15.338(0.613-384.026)

0.718

1.398(0.227-8.626)

1.639(0.031-87.416)

UCEC

UCS

0.829

1.070(0.576-1.988)

UCEC

UCS

0.555

0.473

2.010(0.298-13.557)

UVM

0.138

1.191(0.667-2.126)

0.107

1.817(0.825-4.001)

2.475(0.821-7.458)

0.004 0.125 4e+00 1e+02 Hazard ratio

0.031 0.5008e+00 1e+02 Hazard ratio

FIGURE 2 | Association between NAT10 expression levels and patient prognosis based on multiple tumors from TCGA database. (A) Relationship of NAT10 expression with OS. (B) Relationship of NAT10 expression with progression-free interval (PFI). (C) Relationship of NAT10 expression with disease-free interval (DFI). (D) Relationship of NAT10 expression with DSS. Cox regression analysis; P < 0.05 was considered significant.

AJCC_T 1 (N= 180, HR = 1.6, 95% CI = 0.89 to 2.89, P=0.1146), and micro-vascular invasion (N = 90, HR =2.02, 95% CI =0.9 to 4.57, P =0.0833) for OS; and stage 2 (N = 84, HR = 1.87, 95% CI = 0.99 to 3.54, P = 0.0501), grade 2 (N = 175, HR =1.51, 95% CI = 0.98 to 2.35, P = 0.0619), non-vascular invasion (N = 204, HR = 1.54, 95% CI = 0.96 to 2.49, P = 0.0721), and micro- vascular invasion (N = 91, HR = 1.76, 95% CI = 0.97 to 3.19, P = 0.0583) for PFS (Supplementary Table 3). Therefore, the expression of NAT10 seems to be an independent risk factor in prognosis of LIHC.

NAT10 Expression Is Correlated With Pan-Cancer Immune Infiltration Levels

Previous studies have proved that tumor-infiltrating lymphocytes can affect patient survival (30), and the above results demonstrate a powerful pan-cancer effect of NAT10 on prognosis. Thus, we explored the relationships between inflammatory infiltration and NAT10 expression. Using TIMER2 datasets, we calculated the coefficients of NAT10 expression and immune infiltration levels in 40 cancer types. The results show that NAT10 expression has significant positive

FIGURE 3 | Kaplan-Meier survival curves comparing NAT10 expression in ACC, HNSC, KIRP, LIHC, and PCPG in GEPIA2 and Kaplan-Meier Plotter. (A-D) Differences in OS among groups in HNSC (A), KIRP (B), LIHC (C), and PCPG (D) in Kaplan-Meier Plotter. (E-H) Differences in RFS among groups in HNSC (E), KIRP (F), LIHC (G), and PCPG (H) in Kaplan-Meier Plotter. (I-M) Differences in OS among groups in ACC (I), HNSC (J), KIRP (K), LIHC (L), and PCPG (M) in GEPIA2. (N-R) Differences in DFS among groups in ACC (N), HNSC (O), KIRP (P), LIHC (Q), and PCPG (R) in GEPIA2.

A

HNSC, OS

NAT10

B

KIRP, OS

NAT10

C

LIHC, OS

NAT10

D

PGPC, OS

NAT10

1.0

HR = 1.7 (1.29 - 2.23)

o ~

HR = 2.76 (1.52 - 5.02)

0;

-

HR = 2.57 (1.8-3.67)

9

-

************** *** HR =551601643.93 (0 - Inf)

logrank P = 0.00013

logrank P = 0.00051

logrank P = 8.8e-08

logrank P = 0.023

0.8

0.8

0.8

8

0

Probability

0.6

Probability

0.6

Probability

0.6

Probability

0.6

0.4

0.4

0.4

0.4

2

0

2

Expression

0.

Expression

0

Expression

0.2

Expression

low

low

low

low

0.0

high

0.0

high

0

high

0.0

high

O

0

50

100

150

200

0

50

100

150

200

0

20

40

60

80

100

120

0

50

100

150

200

250

300

Number at risk 328

Time (months)

Number at risk 187

Time (months)

Time (months)

Time (months)

Number at risk

low

68

13 3

Y

Number at risk

0

low

48

9

100

25

3

1

0

low

266

153

71

13

37

15

3

4 2

1 0

86

20 18

Y

high

171

18

0

high

high

104

29

5

high

92

0

1

0

1

0

1

0 4

E

HNSC, RFS

NAT10

F

KIRP, RFS

NAT10

G

LIHC, RFS

NAT10

H

PGPC, RFS

NAT10

0.

O

-

HR = 2.54 (1.12 - 5.74)

0,

o,

~

++

HR = 3.15 (1.46 - 6.8)

-

HR = 1.89 (1.35 - 2.64)

-

HA #872742923.7910 - Int)

logrank P = 0.021

logrank P = 0.0021

logrank P = 0.00017

logrank P = 0.019

0.8

00

+

HH H

0

00 0

8

0

Probability

0.6

Probability

0.6

Probability

00

Probability

0.6

0.4

0.4

0.4

0.4

0.2

Expression

0.2

Expression

Expression

0.2

Expression

low

low

low

low

0.0

high

0.0

high

0

0

high

0.0

high

0

50

100

150

0

20

40

60

80

100

120

0

20

40

60

80

100

120

0

20

40

60

80

100

120

Number at risk 92

Time (months)

Time (months)

Time (months)

Time (months)

Number at risk

Number at risk

low

18 0

5

0

2

0

136

83

43

13

23 9

13

2

7

1

1 0

low

201

115

81 24

35 12

17

3

6 1

2

Number at risk 85

high

32

high

47

27

high

1

1

0

high

74

51 43

28

16

16 9

10 4

7

1

I

ACC Overall Survival

J

HNSC Overall Survival

K

KIRP

Overall Survival

L

LIHC

Overall Survival

M

PGPC Overall Survival

1.0

Low NAT10 TPM

High NAT10 TPM

+

Low NAT10 TPM

1.0

High NAT10 TPM

Low NAT10 TPM

0

2 ¢

High NAT10 TPM

Low NAT10 TPM

High NAT10 TPM

Low NAT10 TPM

High NAT10 TPM

Logrank p=0.025

Logrank p=0.00013

Logrank p=0.026

Logrank p=0.00076

Logrank p=0.58

0.8

HR(high)=2.4

p(HR)=0.03

0.8

HR(high)=1.7

0.8

HR(high)=2

0.8

HR(high)=1.8

p(HR)=0.00015

p[HR)=0.029

p(HR)=0.00092

0.8

HR(high)=1.6

p(HR)=0.58

Percent survival

n(high)=38

Percent survival

n(high)=259

n(high)=141

n(low)=259

Percent survival

n(low)=141

Percent survival

n[high)=182 n(low)=182

Percent survival

n[high)=91

0.6

n(low)=38

0.6

0.6

0.6

0.6

n(low)=91

0.4

0.4

0.4

0.4

0.4

0.2

0.2

0.2

0.2

0.2

0.0

0.0

0.0

0.0

0.0

0

50

100

150

0

50

100

150

200

0

50

100

150

200

0

20

40

60

80

100

120

0

50

100

150

200

250

300

Months

Months

Months

Months

Months

N

Disease Free Survival

O

HNSC Disease Free Survival

P

KIRP Disease Free Survival

Q

LIHC Disease Free Survival

R

PGPC Disease Free Survival

1.0

Low NAT10 TPM

1.0

High NAT10 TPM

Low NAT10 TPM

1.0

Low NAT10 TPM

1.0

High NAT10 TPM

High NAT10 TPM

Low NAT10 TPM

Q

High NAT10 TPM

-

Low NAT10 TPM

Logrank p=0.0072

High NAT10 TPM

Logrank p=0.017

Logrank p=0.67

Logrank p=0.00092

Logrank p=0.22

0.8

HR(high)=2.5

HR(high)=1.5

HR(high)=1.1

HR(high)=1.7

HR(high)=1.8

p[HR)=0.0096

0.8

p(HR)=0.018

0.8

p(HR)=0.67

0.8

p(HR)=0.001

0.8

p(HR)=0.23

Percent survival

n(high)=38

Percent survival

n(high)=259

Percent survival

n(high)=258

Percent survival

n[high)=182

Percent survival

n(high)=91

0.6

n|low)=38

0.6

n(low)=259

0.6

n(low)=258

0.6

n(low)=182

0.6

n|low)=9

0.4

0.4

0.4

H

0.4

0.4

0.2

0.2

0.2

0.2

0.2

0.0

0.0

0.0

0.0

0.0

0

50

100

150

0

50

100

150

200

0

20

40

60

80

100

120

140

0

20

40

60

80

100

120

0

50

100

150

200

Months

Months

Months

Months

Months

Cancer: ACC

Cancer: HNSC

FIGURE 4 | Correlation of NAT10 expression levels with stage in ACC, HNSC, KIRP, and LIHC. Relationships of NAT10 expression with stage in ACC (A), HNSC (B), KIRP (C), and LIHC (D).

A

Stage

Stage I

Stage II

Stage III

Stage IV

B

Stage

Stage I

Stage II

Stage III

Stage IV

0.74

10-

0.97

6

0.9

0.95

0.68

1

0.015

8

0.42

NAT10 expression

5

0.027

NAT10 expression

0.47

0.081

0.43

6

4

4

3

2

Stage I

Stage II

Stage III

Stage IV

Stage I

Stage II

Stage III

Stage IV

Stage

Stage

Cancer: KIRP

Cancer: LIHC

C

Stage

Stage I

Stage II

Stage III

Stage IV

D

Stage

Stage I

Stage II

Stage III

Stage IV

0.6

7

0.15

0.13

0.29

0.11

0.34

5

0.051

6

0.58

NAT10 expression

0.025

0.51

NAT10 expression

0.012

5

0.17

4

4

3

3

2

..

Stage I

Stage II

Stage III

Stage IV

Stage I

Stage II

Stage III

Stage IV

Stage

Stage

correlations with tumor purity in 15 types of cancer. In addition, NAT10 expression had significant correlations with infiltrating levels of B cells in 15 types of cancer, CD8+ T cells in 17 types of cancer, CD4+ T cells in 20 types of cancer, macrophages in 13 types of cancer, neutrophils in 23 types of cancer, and DCs in 19 types of cancer (Supplementary Table 4).

Overall, in five types of cancer (ACC, HNSC, LIHC, KIRP and PCPG), NAT10 expression showed most relevant to immune infiltration levels. Furthermore, we first assessed the relationships between NAT10 expression and tumor purity in the above five types of cancer. Two types (ACC and HNSC) of the five showed significant positive correlations with tumor purity in TIMER2. In addition, consistent positive correlations with different types of infiltrating immune cells were seen in LIHC: neutrophils (R = 0.162, P = 0.009) and DCs (R = 0.129, P = 0.039) in KIRP; B cells (R = 0.243, P = 0.002) and macrophages (R = 0.221, P = 0.004) in PCPG; B cells, CD4+ T cells, neutrophils, and DCs in ACC; and CD8+ T cells, neutrophils and DCs in HNSC showed positive correlations with NAT10 expression (Figures 5A-E). These findings strongly

suggest that NAT10 affects patient survival via interactions with immune cell infiltration in cancers including LIHC.

Relationships Between NAT10 Expression and Immune Markers

To further investigate the correlations between NAT10 and different types of infiltrating immune cells, we analyzed the relationships between NAT10 and immune cell markers using TIMER2 and GEPIA2. In TIMER2, after adjustments for tumor purity, NAT10 expression was significantly associated with 42 of 45 immune cell markers in LIHC; however, it was significantly correlated with only 22 gene markers in KIRP, eight gene markers in ACC, 28 gene markers in HNSC and 21 gene markers in PCPG (Table 1).

As shown in Figure 5, B cells, CD4+ T cells, and macrophages were the three immune cell types most strongly correlated with NAT10 expression in LIHC. However, these correlations were not found in KIRP. The relationships between NAT10 expression and B cells, CD4+ T cells, and macrophage markers also showed differences between LIHC and KIRP. First, as for B

A

NAT10 Expression Level (log2 TPM)

Purity

B Cell

CD8+ T Cell

CD4+ T Cell

Macrophage

Neutrophil

Dendritic Cell

8

cor = 0.243

p = 3.69e-02

partial.cor = 0.422

p = 2.04e-04

partial.cor = 0.245

p = 3.64e-02

partial.cor = - 0.118

p = 3.21e-01

partial.cor = 0.094

p = 4.28e-01

partial.cor = 0.326

p = 4.83e-03

partial.cor = 0.336

p = 3.71e-03

6

ACC

4-

0.2

0.4

0.6

0.8

1.0

0.11

0.12

0.13

0.20

0.25

0.30

0.35 0.07

0.09

0.11

0.13

0.15

0.08

0.12

0.16

0.12

0.14

0.16

0.18

0.49

0.50

0.51

0.52

0.53

B

Infiltration Level

NAT10 Expression Level (log2 TPM)

8

Purity

B Cell

CD8+ T Cell

CD4+ T Cell

Macrophage

Neutrophil

Dendritic Cell

cor = 0.151

7

p = 7.66e-04

partial.cor = - 0.083

p = 6.91e-02

partial.cor = - 0.066

p = 1.53e-01

partial.cor = 0.227

p = 4.978-07

partial.cor = 0.079

p = 8.39e-02

partial.cor = 0.127

p = 5.29e-03

partial.cor = 0.107

p = 1.90e-02

6

HNSC

4

3

0.25

0.50

0.75

1.00 0.0

0.1

0.2

0.3

0.4

0.5

0.0

0.2

0.4

0.6

0.0

0.1

0.2

0.3

0.4

0.5 0.0

0.1

0.2

0.3

0.4

0.0

0.1

0.2

0.3

0.25

0.50

0.75

1.00

1.25

C

Infiltration Level

NAT10 Expression Level (log2 TPM)

Purity

B Cell

CD8+ T Cell

CD4+ T Cell

Macrophage

Neutrophil

Dendritic Cell

5.5.

cor 0.018

p = 7.68e-01

partial.cor = 0.085

p = 1.77e-01

partial.cor = - 0.007

p = 9.16e-01

partial.cor = 0.058

p = 3.51e-01

partial.cor = - 0.068

p = 2.87e-01

partial.cor = 0.162

p = 8.96e-03

partial.cor = 0.129

p = 3.90e-02

5.0

?

4.5

KIRP

4.0

3.5

3.0

0.25

0.50

0.75

1.00 0.0

0.1

0.2

0.3

0.4

0.5

0.1

0.2

0.3

0.4

0.1

0.2

0.3

0.4

0.0

0.1

0.2

0.3

0.4

0.1

0.2

0.2

0.4

0.6

0.8

1.0

D

Infiltration Level

NAT10 Expression Level (log2 TPM)

Purity

B Cell

CD8+ T Cell

CD4+ T Cell

Macrophage

Neutrophil

Dendritic Cell

6

cor = 0.082

p = 1.26e-01

partial.cos = 0.364

p = 3.18e-12

partial.cor = 0.226

p = 2.51€-05

.partial.cor = 0.417

p = 6.65€-16

partial.cor = 0.453

p = 1.20c-18

partial.cor = 0.445

p = 3.36e-18

partial.cor = 0.355

p = 1.61e-11

5

A

LIHC

3-

2.

0.25

0.50

0.75

1.00

0.1

0.2

0.3

0.4

0.2

0.4

0.6

0.0

0.1

0.2

0.3

0.4

0.0

0.1

0.2

0.3

0.05

0.10

0.15

0.20

0.25

0.50

0.75

1.00

E

Infiltration Level

NAT10 Expression Level (log2 TPM)

Purity

B Cell

CD8+ T Cell

CD4+ T Cell

Macrophage

Neutrophil

Dendritic Cell

cor= 0.073

.partial.cor = 0.243

p = 1.55e-03

partial.cor = - 0.008

p = 9.21e-01

partial.cor = 0.03

partial.cor = 0.221

partial.cor = 0.142

p = 3.46e-01

p = 7.04e-01

· p = 4.20e-03

p = 6.71e-02

partial.cor = 0.026

p = 7.37e-01

4

PCPG

3.

0.25

0.50

0.75

1.000.00

0.05

0.10

0.15

0.20

0.15

0.20

0.25

0.30

0.10

0.15

0.20

0.25

0.30

0.00

0.05

0.10

0.15

0.20

0.10

0.15

0.20

0.25

0.3

0.4

0.5

0.6

0.7

Infiltration Level

FIGURE 5 | Correlation of NAT10 expression with immune infiltration levels in ACC, HNSC, KIRP, LIHC, and PCPG. (A) Correlations of NAT10 expression with immune infiltration levels in ACC (A), HNSC (B), KIRP (C), LIHC (D), and PCPG (E).

cells and macrophage markers, we analyzed the correlations of NAT10 expression in tumor and normal tissues for LIHC and KIRP based on the GEPIA2 database. Notably, the correlations between NAT10 and TAMs were similar to those found using TIMER2, suggesting that NAT10 is correlated with TAM infiltration in LIHC. Second, NAT10 expression in LIHC and KIRP showed partial difference in its relationships with CD8+ T cells, Tfh cells, Th2 cells, Th9 cells, Th17 cells, Th22 cells, neutrophils, and NK cells. In addition, NAT10 in LIHC had significant correlations with T cell exhaustion markers including PD-1 and CTLA4, and monocyte markers including CD14 and CD16, whereas NAT10 in KIRP showed no such relationships. We also used MCPcounter datasets to analyze the correlations between NAT10 expression and other immune cells; the results,

shown in Supplementary Figure 4, revealed strong positive correlations of endothelial cells and fibroblasts with NAT10 expression in KIRP and LIHC. Therefore, these results further confirm the findings that NAT10 is specifically correlated with immune infiltrating cells in LIHC, demonstrating that NAT10 has a vital role in immune escape in LIHC.

Pan-Cancer Correlation of NAT10 Expression With Expression of Immune Checkpoint Genes

Tumor immunotherapy is a novel treatment that involves restarting and maintaining the tumor-immune cycle to restore the body’s normal anti-tumor immune response. Immune checkpoint genes are the main direction for monoclonal antibody inhibitors, cancer

vaccines, cell therapies, and small-molecule inhibitors (31). Thus, we analyzed the relationships between NAT10 expression and 47 immune checkpoint genes in the above five types of cancer. Figure 6 shows the most significant positive correlations in KIRP (15 of 47) and LIHC (31 of 47); no such strong relationships were found in HNSC (three of 47), ACC (three of 47), or PCPG (seven of 47), but there were positive correlations. Therefore, these results further suggest that NAT10 expression has a vital role related to immune checkpoint genes in KIRP and LIHC (Figure 6A).

Relationships Between NAT10 Expression and Immune Neoantigens, TMB and MSI

Neoantigens are new unnatural proteins encoded by mutated genes in tumor cells, which can be used to synthesize new antigen

vaccines to activate immunity and achieve a therapeutic effect (32). Hence, we counted the number of new antigens in the above five types of cancer and analyzed the relationships between NAT10 expression and these antigens. The results are shown in Figure 6B. Surprisingly, there was no relationship between NAT10 expression and antigens.

Tumor mutation load (or TMB) (33), a quantifiable biomarker used to reflect the number of mutations contained in tumor cells, and MSI (34), the emergence of a new microsatellite allele in the tumor, are valid prognostic biomarkers and indicators of immune therapy response in many tumor types. Therefore, we analyzed the correlations of NAT10 expression with TMB and MSI in the above five types of cancer, using Person correlation. As shown in Figure 6C, NAT10

A

ADORA2A

BTLA

BTNL2

CD160

CD200

CD200R1

CD244

CD27

CD274

CD276

CD28

CD40

(

CD40LG

CD44

-log10(p value)

2

2

CD48

CD70

5

CD80

.75

CD86

CTLA4

HAVCR2

0

HHLA2

ICOS

ICOSLG

5

IDO1

0

IDO2

correlation

0

KIR3DL1

5

LAG3

LAIR1

9

LGALS9

i

NRP1

PDCD1

PDCD1LG2 TIGIT

TMIGD2

TNFRSF14

TNFRSF18

TNFRSF25

TNFRSF4

TNFRSF8

TNFRSF9

TNFSF14

TNFSF15

TNFSF18

TNFSF4

TNFSF9

VSIR

VTCN1

KIRP

LIHC

HNSC

ACC

PCPG

B

KIRP,P=0.56

C

KIRP,P=0.0061

D

KIRP,P=0.73

0.05

0.16

0.14

0

0-

0

PCPG,P=0.65

LIHC,P=0.4

PCPG,P=0.61

LIHC,P=0.0092

-0.05

-0.16

-0.14

LIHC,P=0.8

HNSC.P=0.48

ACC,P=0.31

HNSC,P=0.39

ACC,P=0.23

HNSC,P=0.31

Immune neoantigen

TMB

MSI

FIGURE 6 | Relationship between NAT10 expression and immune checkpoint gene expression, immune neoantigens, TMB, and MSI in ACC, HNSC, KIRP, LIHC, and PCPG. (A) Correlations of NAT10 expression with immune checkpoint gene expression. The lower triangle in each tile indicates coefficients calculated by Pearson’s correlation test, and the upper triangle indicates the log10-transformed P-value. (B) Correlations between immune neoantigens and NAT10 expression. (C) Correlations between TMB and NAT10 expression. (D) Correlations between MSI and NAT10 expression. * P < 0.05, ** P < 0.01, *** P < 0.001.

TABLE 1 | Correlation analysis between NAT10 and related genes and markers of immune cells in TIMER2.
Cell typeGene markerLIHC (n=371)KIRP (n=290)ACC (n= 79)HNSC (n= 522)PCPG (n= 181)
NonePurityNonePurityNonePurityNonePurityNonePurity
CorPCorPCorPCorPCorPCorPCorPCorPCorPCorP
B cellCD191.43E-**1.77E-***-9.62E-*5.20E-4.05E-0.057794080.612912470.058827360.62104169-0.00699510.873324390.048890720.279102580.076201660.307932860.064110720.41044115
0101020201
CD205.60E-2.80E-1.17E-*1.20E-8.40E-7.00E-9.11E--0.14399170.205499990.004780080.96798446-0.02352380.591791360.037220910.410061560.1162310.119190490.120913780.11958194
02010102010301
CD381.62E-**2.29E-****1.54E-***1.68E-***0.06358480.577725730.26625501*0.20620227****0.22776931****0.20534102**0.20784791**
01010101
CD8+ T CellCD8A1.22E-*1.95E-***-2.00E-9.71E-1.50E-8.12E--0.12100780.288096340.016976310.88664367-0.03315630.449695040.028734040.524866420.34676621****0.34579678****
010103010201
CD8B5.20E-3.21E-1.20E-*-7.60E-2.00E--5.90E-3.43E--0.13488480.235939080.007560780.94938014-0.07274280.09687526-0.02262370.616649170.16943357*0.20526366**
02010102010201
TfhCXCR52.56E-****1.81E-***7.10E-2.27E-4.00E-5.21E-0.052423890.646347280.125032710.29189005-0.0515060.240101520.016299810.718359850.129765060.081668780.126864790.10231163
010102010201
ICOS1.95E-***2.81E-****6.10E-2.98E-1.38E-**-0.06434950.57314980.124170610.295256340.041102050.34864720.09967326*0.20163799**0.22623221**
0101020101
BCL-64.09E-****4.00E-****3.79E-****2.95E-****0.47370983****0.47178104****0.27113488****0.24528851****0.068087350.362436250.079635870.30629919
01010101
Th1IL12RB22.50E-****2.71E-****9.60E-1.03E-1.07E-8.60E-0.151527690.182514610.221889140.059201670.079540560.069399570.10426452*0.16002304*0.18438766*
010102010102
WSX-13.96E-****4.53E-****3.38E-****1.90E-****0.176290170.120154580.223341050.057517340.20896131****0.22182227****0.145872540.05006460.1782193*
01010101
T-BET4.50E-3.91E-1.10E-*-3.00E-9.60E--3.00E-9.64E--0.06751070.554414190.123618060.297427220.004819070.912536610.070514420.118275770.121439950.103414410.16061212*
02010103010301
Th2CCR33.12E-****3.72E-****8.10E-1.97E-8.00E-1.77E--0.2634941*-0.2202270.061178840.002051140.962711920.039221470.385342070.070840560.34330740.064625170.40668787
010102010201
STAT63.93E-****3.83E-****4.38E-****4.40E-****0.175584230.121669740.141454060.232580660.3274151****0.31710451****0.116007930.119905680.076275380.32722448
01010101
GATA-31.78E-***2.79E-****1.29E-*1.43E-*0.25350536*0.221206580.06000723-0.02666360.543293840.002215730.960901580.39382754****0.36768489****
01010101
Th9TGFBR24.17E-****4.48E-****5.74E-****5.81E-****0.093524830.412320910.20270460.085446790.23470935****0.29722585****0.23068423**0.33824744****
01010101
IRF41.83E-***2.67E-****8.70E-1.40E-9.50E-1.30E-0.050468570.658701840.26458911*0.04612580.292852620.11859115**0.17356966*0.20374218**
010102010201
PU.12.86E-****4.02E-****-5.90E-3.21E--8.90E-1.54E--0.240482*-0.13447320.256681390.027940670.524148720.09947565*0.053959480.470634560.057547150.46008926
010102010201
Th17IL-21R2.34E-****3.29E-****1.59E-**1.51E-*0.102251980.369877570.156592280.18583167-0.01501740.732122610.048692830.28105543-0.05079840.4970578-0.05047970.51707903
01010101
IL-23R2.79E-****2.88E-****1.93E-***1.77E-**0.28712054*0.23731782*0.12680544**0.13453352**0.077395650.300392360.089067490.25235957
01010101
STAT33.68E-****4.05E-****6.26E-****6.33E-****0.43062317****0.44841214****0.27642249****0.28223771****0.2708093***0.2661371***
01010101
Th22CCR103.56E-****3.86E-****7.50E-2.01E-4.50E-4.72E-0.148490750.191541820.16564660.161347470.11972348**0.11561832*0.116333760.118862140.112319320.14841281
010102010201
AHR3.72E-****3.90E-****3.62E-****4.03E-****0.194109060.086505920.25659853*0.334679****0.33426283****0.13033210.080332880.21327457**
01010101
TregFOXP31.53E-**1.89E-***2.31E-****2.22E-***0.134128530.238600390.1343370.257167860.10742879*0.16560909***0.45138729****0.4374526****
01010101
CCR83.64E-****4.44E-****1.58E-**1.75E-**0.055513540.627020010.139214720.240134090.18501258****0.23649006****0.20990708**0.24182654**
01010101
CD252.48E-****3.42E-****2.08E-***1.97E-**0.146820240.19664250.31443993**0.14642496***0.22127487****0.134434260.071184790.16780586*
01010101
T cellPD-12.30E-****3.08E-****-1.70E-7.02E--4.50E-3.31E--0.07702040.499889030.000867990.994185010.12554469**0.13635659**0.042472630.570233090.054343990.48547745
exhaustion010102010201
CTLA41.82E-***2.59E-****-5.80E-3.28E--6.20E-3.19E--0.08215670.47164610.058093560.625410060.010377410.813018410.072780580.106878440.051821940.488417290.042168080.58845274
010102010201
MacrophageCD682.23E-****2.84E-****2.00E-7.29E-3.60E-5.64E--0.05564750.626187440.103186440.38499440.10119248*0.14927373***0.09885050.185526710.095721050.21850046
010102010201
CD11b2.71E-****3.29E-****1.23E-*9.70E-1.19E--0.09956180.38266760.048689250.682492820.09386498*0.12810879**0.20477516**0.22566575**
0101010201
M1NOS21.43E-**1.54E-**2.10E-***2.31E-***0.111614210.327431530.172316310.144898640.16362206***0.15766382***0.30284556****0.27522138***
01010101
ROS******0.113158990.320738430.24617754*0.14626064***0.17242222***0.088242250.237499870.083874630.28118635
(Continued)
TABLE 1 | Continued
Cell typeGene markerLIHC (n=371)KIRP (n=290)ACC (n= 79)HNSC (n= 522)PCPG (n= 181)
NonePurityNonePurityNonePurityNonePurityNonePurity
CorPCorPCorPCorPCorPCorPCorPCorPCorPCorP
1.33E-1.29E-1.66E-1.63E-
01010101
M2ARG1-2.62E-****-2.72E-****1.17E-*1.17E-6.00E-0.141420760.213794490.157151210.18424652-0.06568450.13394232-0.05125490.256482630.077519270.299618720.081164980.29707519
0101010102
MRC1-3.30E-5.31E-5.00E-9.33E-1.34E-*1.28E-*-0.01175750.918089020.131492970.267469610.1655064***0.2373637****0.19055309*0.25344635***
020103010101
TAMHLA-G1.07E-*1.18E-*-9.20E-1.17E--5.20E-4.06E-0.159639730.15992520.200243430.089394230.0905184*0.11910112**0.056408230.450706180.03954550.61187369
010102010201
CD803.14E-****4.04E-****1.91E-**1.78E-**-0.03507870.758911110.013865190.907315640.09485578*0.14797968***0.018263560.807208240.026654360.73240816
01010101
CD862.51E-****3.59E-****-2.94E-6.18E--4.41E-4.80E--0.2231256*-0.11667320.325601020.08185140.061659180.14779467**0.079319210.288503210.090382930.2453934
010102010201
MonocyteCD14-3.26E-****-3.17E-****-7.30E-9.01E--3.83E-5.40E--0.13283840.24318839-0.01075570.92803864-0.0039420.928406880.058903250.192115420.106994110.15168520.098447650.20560418
010103010201
CD162.10E-****2.76E-****1.35E-*1.01E-1.05E--0.08310610.466521310.032301620.786166050.10422137*0.16355297***0.036518730.625501490.033543340.66694776
0101010101
NKXCL12.32E-****1.91E-***7.72E-9.90E-9.11E-8.77E-0.022841590.84788697-0.10675070.3490793-0.02603680.56451126-0.03339340.446457760.128689470.097430030.14941162*
010104010301
KIR3DL15.61E-2.80E-8.78E-1.03E-6.03E-3.07E-5.48E-3.80E--0.10801330.34337567-0.02313960.84592659-0.02853470.515362930.004543820.919922790.001248880.986687430.004534820.95361888
0201020102010201
CD71.31E-*1.89E-***-4.34E-4.61E--4.16E-5.05E--0.05961540.601745590.145365630.21978575-0.0643380.14211828-0.00502050.911554450.04252120.569791870.048236810.53589303
010102010201
NeutrophilCD154.23E-****4.56E-****3.19E-****3.30E-****0.46251217****0.46438564****0.37948184****0.38474495****0.42227551****0.43137235****
01010101
MPO7.95E-1.26E-1.04E-5.50E--1.01E-8.60E--1.23E-*-0.10385270.36239205-0.0419020.724848740.1186357**0.16703874***0.019731470.79205248-0.03023990.69806127
02010102010201
DCCD1C1.91E-***1.91E-***1.52E-**1.52E-**0.022110340.846636170.022110340.84663617-0.1013037*-0.1013037*0.205807**0.205807**
01010101
CD1411.19E-*1.67E-**3.96E-****4.06E-****-0.09381690.410856040.02380020.84158423-0.05922020.17670668-0.04811720.2867881-0.10683830.15228572-0.05094260.51323863
01010101

Tfh, follicular helper T cell; Th, T helper cell; Treg, regulatory T cell; TAM, tumor-associated-macrophage; NK, natural killer cell; DC, dendritic cell; None, correlation without adjustment; Purity, correlation adjusted for tumor purity; Cor, R value of Spearman’s correlation. * P< 0.01; ** P < 0.001; *** P < 0.0001, **** P < 0.0001.

expression was positively correlated with low TMB in KIRP (P = 0.0061). In addition, the coefficient values for MSI indicated that NAT10 expression is positively correlated with high MSI in LIHC (P = 0.0092, Figure 6D). Overall, these results show that the relationships of NAT10 expression with TMB and MSI are diverse among these five types of cancer.

Interactions and Correlations of Predicted Proteins With NAT10

NAT10, the only confirmed regulator of mRNA acetyltransferase, shows remarkable correlation with most cancers in immune infiltration. However, as in the case of m6A RNA methylation regulators which change the levels of m6A in immune infiltration of cancers (35, 36), the details of the molecular mechanism of NAT10 involved in the acetylation of mRNA are not clear. We tried to screen out the targeting NAT10-binging proteins and the NAT10-correlated genes for in-depth pathway enrichment analyses. By using STRING tool and GEPIA2, we get total 50 NAT10-binding proteins (Figure 7A) and top 100 genes that correlated with NAT10 expression. Also, we found two common genes, namely, BMS1 and NOL10, between the above two groups (Figure 7B). As shown in Figure 7C, the NAT10 expression level was positively correlated with that of CAPRIN1 (Cell Cycle Associated Protein 1) (R=0.68), EIF3M (Eukaryotic translation initiation factor 3, subunit M) (R=0.56), NCL (Nucleolin) (R=0.55), PDCD11 (Programmed Cell Death 11) (R=0.54), ANAPC1 (Anaphase Promoting Complex Subunit 1) (R=0.54) genes (all p <0.001). Then, the corresponding heatmap showed a strong positive relationship between NAT10 and the above five genes in the most types of cancers (Figure 7D).

In addition, by using Metacore system, pathway maps analysis results showed that the two datasets were significantly enriched in metabolism of RNA (including GTP-XTP metabolism, CTP/UTP metabolism and ATP/ITP metabolism). GO process results showed these genes were significantly enriched in ribonucleoprotein complex biogenesis, rRNA processing and rRNA metabolic process. And, translation initiation and mRNA processing were enriched in process networks (Figure 7E).

DISCUSSION

NAT10 was the first acetylation regulator to be proved to maintain effective translation and stabilize mRNA by forming ac4C on mRNA (11). Although studies of NAT10 have been limited, increased levels of ac4C in urine are known to be correlated with four types of cancer (14-17). In addition, several studies have shown that overexpression of NAT10 could promote tumor progression in cancers including colorectal cancer, epithelial ovarian cancer, and melanoma (20, 21, 26). Here, we report that a higher level of NAT10 mRNA and protein was comprehensively found in multiple cancers according to several different databases. Meanwhile, using OS, PFS, DFS, and DSS data from TCGA, we discovered that three of 33 cancers (ACC, KIRP, LIHC) showed consistent correlations

between unfavorable prognosis wtih NAT10 expression; NAT10 expression in HNSC and PCPG showed significant correlations with OS, PFS, and DSS but not DFS. in particular, there were significant correlations with NAT10 expression in LIHC and KIRP. Prognosis can vary according to characteristics such as gender, race, tumor grade, and tumor stage. First, we found significant increased NAT10 expression at stage III compared with stage I in LICH, KIRP, and ACC. Second, high levels of NAT10 expression were shown to be almost consistently correlated with poor prognosis in liver cancer across gender, race, alcohol consumption, hepatitis virus, tumor stage, tumor grade and AJCC_T, with the highest HRs for poor OS and PFS. In KIRP and HNSC, these correlations were found in our study. Together, these findings strongly suggest that NAT10 represents an independent prognostic biomarker for liver cancer.

The tumor microenvironment (TME) contains various cells including a large proportion of infiltrating immune cells (37). Conventionally, the infiltration of immune cells in the TME is a component of an antitumor strategy to avoid tumor cells being killed (31, 38). Furthermore, in two (ACC and HNSC) of the above five types of cancer, NAT10 expression showed a significant positive correlation with tumor purity in TIMER2 and a significant correlation with prognosis in GEPIA2, whereas KIRP, LIHC, and PCPG had no correlation of NAT10 with tumor purity in TIMER2. In addition, the types of infiltrating immune cells in five types of cancer were as follows: B cells, CD8+ T cells, CD4+ T cells, macrophages, neutrophils, and DCs in LIHC; B cells, CD4+ T cells, neutrophils, and DCs in ACC; CD8+ T cells, neutrophils, and DCs in HNSC; neutrophils and DCs in KIRP; B cells and macrophages in PCPG. Moreover, the relationships between NAT10 expression and immune cell markers reveal the role of NAT10 in regulating tumor immunology in the above five cancers. In particular, NAT10 expression was significantly associated with 42 of 45 immune cell markers in LIHC, that is, all markers except MRC1 for M2 macrophages, KIR3DL1 for NK cells, and MPO for neutrophils. However, only 20 gene markers in PCPG, 22 gene markers in KIRP, 28 gene markers in HNSC, and eight gene markers in ACC showed significant correlations with high NAT10 expression. Interestingly, most gene markers in these five types of cancer were involved in B cell, T cell exhaustion, and TAM. Otherwise, endothelial cells and fibroblasts were strongly positively correlated with NAT10 expression in KIRP and LIHC, whereas moderate or weak correlations with NAT10 expression were found in ACC, HNSC, and PCPG using MCPcounter datasets. These results reveal that NAT10 plays an important part in recruitment and regulation of immune infiltrating cells in LIHC.

Systematic analysis of the correlations between NAT10 expression and immune checkpoint genes (31), immune neoantigens (32), TMB (33), and MSI (34) is conducive to a more comprehensive understanding of TME, which could be used to synthesize new antigen vaccines for antitumor therapies. First, in our study of immune checkpoint genes, we found the most significant positive correlations with NAT10

A

BMS1

NOL6

B

RPS5

RPS14

UTP18

NOP56

RPS8

RPS13

NAT10

RPS24

NOL 10

Correlated

Interacted

RPS4X

RPS6

DCAF13

SNU13

IMP4

WDR3

UTP4

FBL

HEATR1

KRR1

WDR75

IMP3

RCL1

RPS11

RPS16

NOP58

98

2

48

UTP20

NAT10

DNTTIP2

RPS28

MPHOSPH10

RPS15A

RRP9

UTP11

UTP3

PNO1

TBL3

UTP6

WDR36

NOCAL

UTP15

EMG1

BMS1

NOL10

WDR46

FCF1

C

RSL10

RPS9

NOP14

RPS23

p-value = 0

.

M

0

R = 0.68

p-value = 0

R=0,55

*

:

p-value = 0

R = 0.55

p-value = 0

R=0.54

p-value = 0

=

4

R = 0.54

log2(CAPRINI TPM)

·

log2(EIF3M TPM)

-

log2(NCL TPM)

log2(PDCD11 TPM)

>

log2(ANAPCI TPM)

0

.

·

4

*

-

·

+

.

0

%

-

-

0

.

0

-

0

.

0

-

0

*

0

2

4

6

0

2

4

6

0

2

4

8

0

2

4

6

0

2

4

6

log2(NAT10 TPM)

log2(NAT10 TPM)

log2(NAT10 TPM)

log2(NAT10 TPM)

log2(NAT10 TPM)

Spearman_Cor

1

D

0

8 p>0.05

-1

p … 0.05

UVM (n=80)

UCS (n=57)

UCEC (n=545)

THYM (n=120)

THCA (n=509)

TGCT (n=150)

STAD (n=415)

SKCM-Primary (n=103)

SKCM-Metastasis (n=368)

SKCM (n=471)

SARC (n=260)

READ (n=166)

PRAD (n=498)

PCPG (n=181)

PAAD (n=179)

OV (n=303)

MESO (n=87)

LUSC (n=501)

LUAD (n=515)

LIHC (n=371)

LGG (n=516)

KIRP (n=290)

KIRC (n=533)

KICH (n=66)

HNSC-HPV+ (n=98)

HNSC-HPV-(n=422)

HNSC (n=522)

GBM (n=153)

ESCA (n=185)

DLBC (n=48)

COAD (n=458)

CHOL (n=36)

CESC (n=306)

BRCA-LumB (n=219)

BRCA-LumA (n=568)

BRCA-Her2 (n=82)

BRCA-Basal (n=191)

BRCA (n=1100)

BLCA (n=408)

ACC (n=79)

ANAPC1

CAPRIN1

EIF3M

X

NCL

PDCD11

E

Pathway Maps

GO Processes

Process Networks

1

2

3

-log(pValue)

15

30

45

60

75

3

6

9

12

15

-log(pValue)

1

1. GTP-XTP metabolism

-log(pValue)

2

2. CTP/UTP metabolism

1. ribonucleoprotein complex biogenesis

1

1. Translation_Translation

3

3.ATP/ITP metabolism

1

initiation

2

4

4.Cell cycle_Nucleocytoplasmic

2. Transcription_mRNA processing

transport of CDK/Cyclins

2

2.rRNA processing

3

3. Translation_Elongation-

5

5. DNA damage_p53 activation by DNA damage

3

3.rRNA metabolic process

Termination

4

4

4. ribosome biogenesis

4.Cell cycle_Mitosis

6

6. Transport_RAN regulation

pathway

5

7. Cell cycle_Chromosome

5

5.RNA processing

5.Cell cycle_G2-M

7

6

6. Protein folding_Protein folding nucleus

condensation in prometaphase

6

6.RNA metabolic process

8

8. Translation_Role of Retinoic

acid signaling in the initiation of translation

7

7.ncRNA processing

7

7.Cell cycle_G1-S

8

9.CFTR folding and maturation (normal and CF)

8

8. nucleic acid metabolic process

8.Cytoskeleton_Spindle microtubules

9

10.Apoptosis and survival_Role

9

9. gene expression

9

9.DNA damage_DBS repair

10

of nuclear PI3K in NGF/ TrkA signaling

10

10.ncRNA metabolic process

10

10.Protein folding_Folding in normal condition

-

Maps

Processes

Networks

FIGURE 7 | NAT10-related gene enrichment analysis. (A) We obtained the available experimentally determined top 50 NAT10-binding proteins using the STRING database. (B) An intersection analysis of the NAT10-binding and correlated genes was applied. (C) Top five (including ANAPC1, CAPRIN1, EIF3M, NCL and PDCD11) most NAT10-correlated genes in TCGA projects was analyzed by using the GEPIA2. (D) The corresponding heatmap data between NAT10 with these five genes in the detailed cancer types was displayed. (E) Pathway maps analysis (left), GO process analysis (middle) and Process networks analysis (right) of NAT10- binding and correlated genes.

expression in KIRP (15 of 47) and LIHC (31 of 47), whereas HNSC (three of 47), ACC (three of 47), and PCPG (seven of 47) did not show these strong relationships, although there were still positive correlations. Second, no relationship was found between NAT10 expression and antigens. Third, NAT10 expression was positively correlated with low TMB in KIRP (P = 0.0061), and with high MSI in LIHC (P =0.0092). Overall, these results show that the relationships of NAT10 expression with TMB and MSI are diverse in these five types of cancer.

As in the case of m6A RNA methylation regulators, it has been reported that m6A regulators play independent role of immune infiltration in several types of cancers (35, 36, 39). In this study, we first presented the evidence of the potential correlation between NAT10, RNA acetylation regulator, with immune infiltration. Furthermore, we assessed the 50 NAT10- binding proteins and top 100 NAT10-correlated genes across all tumors to get a series of pathway maps, GO processes and process networks and identified the potential enrichment of “metabolism of RNA”, “rRNA metabolic process” and “mRNA

processing” in the etiology or pathogenesis of cancers. Because of the extensive role of NAT10 in post-transcriptional modification (11, 40), we therefore have reason to believe NAT10 may play an important role about immune infiltration in tumors. While, this hypothesis still needs further verification.

However, there were limitations to our study. First, systematic bias may have been generated because of the large proportion of microarray and sequencing data used in our study; higher- resolution methods such as single-cell RNA sequencing could be used to overcome this issue in future studies. Second, although NAT10 plays an important part in ac4C formation in mRNA, there was no information in the databases about the detailed changes in ac4C in these cancers. Third, this study only conducted a bioinformatics analysis of NAT10 expression and patient survival across several databases; further experiments in vivo and in vitro should be performed in future studies.

CONCLUSION

In summary, increased NAT10 expression was correlated with poor prognosis in 12 types of cancer, especially ACC, KIRP, LIHC, HNSC, and PCPG, and with increased immune infiltration levels of CD8+ T cells, CD4+ T cells, macrophages, neutrophils, and DCs in various cancers. In addition, NAT10 expression may contribute to regulation of TAMs, B cells, exhausted T cells, and other immune cells in LIHC. Therefore, NAT10 is likely to have an independent role in immune cell infiltration and could represent a unique prognostic biomarker in patients with liver cancers.

DATA AVAILABILITY STATEMENT

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/ Supplementary Material.

ETHICS STATEMENT

All the data included in the analysis are from public databases without the need of permissions from local ethical committees. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

AUTHOR CONTRIBUTIONS

CY and TW designed this study. JZ, JL, KZ, and WS extracted the information from the databases. CY and JZ analyzed the data. XK and JS supervised the entire study. CY and TW wrote the

manuscript. XZ provided the tissues. All authors contributed to the article and approved the submitted version.

FUNDING

This work is Supported by the Youth Program of National Natural Science Foundation of China (Grant No. 81800313) and the Youth Program of Natural Science Foundation of Jiangsu Province (Grant No. BK20181084).

SUPPLEMENTARY MATERIAL

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2021. 630417/full#supplementary-material

Supplementary Figure 1 | Expression of the NAT10 gene in different tumors and pathological stages. (A) The expression statuses of the NAT10 gene in ACC, CESC, DLBC, GBM, LAML, LGG, OV, PAAD, SKCM, TGCT, THYM and UCS in TCGA project were compared with the corresponding normal tissues of the GTEx databases. (B-E) Expression of the NAT10 gene by different pathological stages of ACC, BLCA, BRCA, CESC (B), CHOL, COAD, DLBC, HNSC, KICH (C), KIRC, LUSC, OV, READ, SKCM (D), STAD, TGCT, THCA, UCEC and UCS (E). (E-G) Expression of the NAT10 total protein by different pathological stages of breast cancer, clear cell RCC, colon cancer (F), LUAD, ovarian cancer and UCEC (G).

Supplementary Figure 2 | The immunohistochemical staining from HPA database. (A-E) The immunohistochemical results from HPA database between prostate (A), lung (B), liver (C), breast (D) and colorectal cancer (E) with normal tissues.

Supplementary Figure 3 | NAT10 expression in LIHC and Paracancerous tissues from 10 patients. (A, B) The immunohistochemical (A) and mean % NAT10- positive cells results (B) in LIHC and Paracancerous tissues from 10 patients. (C) The NAT10 mRNA relevant expression in LIHC and Paracancerous tissues from 10 patients.

Supplementary Figure 4 | Kaplan-Meier survival curves comparing high and low expression of NAT10 in different types of cancer in the TCGA database. (A-F) Survival curves for OS in six different cancers. (G-J) Survival curves for progression-free interval (PFI) in four different cancers. (K-O) Survival curves for disease-free interval (DFI) in five different cancers. (P-S) Survival curves for DSS in four different cancers. Red curve represents patients with high expression of NAT10.

Supplementary Figure 5 | Correlation between NAT10 expression with immune infiltration level in MCPcounter. The lower triangle in each tile indicates coefficients calculated by Pearson’s correlation test, and the upper triangle indicates log10- transformed P-values.

Supplementary Table 1 | NAT10 expression in cancers versus normal tissue in Oncomine database.

Supplementary Table 2 | Correlation between NAT10 expression and patient prognosis for different cancers in PrognoScan database.

Supplementary Table 3 | Short bars are due to limited sample size for parameters; HR values could not be calculated. Bold values indicate P < 0.05.

Supplementary Table 4 | The overview about significant correlation between inflammatory cell types with pan-cancer.

REFERENCES

1. Cohn WE. Pseudouridine, a Carbon-Carbon Linked Ribonucleoside in Ribonucleic Acids: Isolation, Structure, and Chemical Characteristics. J Biol Chem (1960) 235:1488-98. doi: 10.1016/S0021-9258(18)69432-3

2. Davis FF, Allen FW. Ribonucleic Acids From Yeast Which Contain a Fifth Nucleotide. J Biol Chem (1957) 227(2):907-15. doi: 10.1016/S0021-9258(18) 70770-9

3. Wang X, Lu Z, Gomez A, Hon GC, Yue Y, Han D, et al. N6-Methyladenosine- Dependent Regulation of Messenger Rna Stability. Nature (2014) 505 (7481):117-20. doi: 10.1038/nature12730

4. Fustin JM, Doi M, Yamaguchi Y, Hida H, Nishimura S, Yoshida M, et al. Rna-Methylation-Dependent RNA Processing Controls the Speed of the Circadian Clock. Cell (2013) 155(4):793-806. doi: 10.1016/ j.cell.2013.10.026

5. Molinie B, Wang J, Lim KS, Hillebrand R, Lu ZX, Van Wittenberghe N, et al. M(6)a-LAIC-Seq Reveals the Census and Complexity of the M(6)a Epitranscriptome. Nat Methods (2016) 13(8):692-8. doi: 10.1038/nmeth.3898

6. Meyer KD, Patil DP, Zhou J, Zinoviev A, Skabkin MA, Elemento O, et al. 5’ UTR M(6)a Promotes Cap-Independent Translation. Cell (2015) 163(4):999- 1010. doi: 10.1016/j.cell.2015.10.012

7. Zachau HG, Dutting D, Feldmann H. The Structures of Two Serine Transfer Ribonucleic Acids. Hoppe Seylers Z Physiol Chem (1966) 347(4):212-35. doi: 10.1515/bchm2.1966.347.1.212

8. Kawai G, Hashizume T, Miyazawa T, McCloskey JA, Yokoyama S. Conformational Characteristics of 4-Acetylcytidine Found in Trna. Nucleic Acids Symp Ser (1989) 21):61-2.

9. Sharma S, Langhendries JL, Watzinger P, Kotter P, Entian KD, Lafontaine DL. Yeast Kre33 and Human NAT10 are Conserved 18s Rrna Cytosine Acetyltransferases That Modify Trnas Assisted by the Adaptor Tan1/ Thumpd1. Nucleic Acids Res (2015) 43(4):2242-58. doi: 10.1093/nar/gkv075

10. Arango D, Sturgill D, Alhusaini N, Dillman AA, Sweet TJ, Hanson G, et al. Acetylation of Cytidine in Mrna Promotes Translation Efficiency. Cell (2018) 175(7):1872-86.e24. doi: 10.1016/j.cell.2018.10.030

11. Dominissini D, Rechavi G. N(4)-Acetylation of Cytidine in mRNA by NAT10 Regulates Stability and Translation. Cell (2018) 175(7):1725-7. doi: 10.1016/ j.cell.2018.11.037

12. Thomale J, Nass G. Elevated Urinary Excretion of RNA Catabolites as an Early Signal of Tumor Development in Mice. Cancer Lett (1982) 15(2):149-59. doi: 10.1016/0304-3835(82)90045-3

13. Liebich HM, Lehmann R, Xu G, Wahl HG, Haring HU. Application of Capillary Electrophoresis in Clinical Chemistry: The Clinical Value of Urinary Modified Nucleosides. J Chromatogr B BioMed Sci Appl (2000) 745(1):189-96. doi: 10.1016/S0378-4347(00)00263-2

14. Feng B, Zheng MH, Zheng YF, Lu AG, Li JW, Wang ML, et al. [Application of Urinary Nucleosides in the Diagnosis and Surgical Monitoring of Colorectal Cancer]. Zhonghua Wai Ke Za Zhi (2005) 43(9):564-8.

15. Szymanska E, Markuszewski MJ, Markuszewski M, Kaliszan R. Altered Levels of Nucleoside Metabolite Profiles in Urogenital Tract Cancer Measured by Capillary Electrophoresis. J Pharm BioMed Anal (2010) 53(5):1305-12. doi: 10.1016/j.jpba.2010.07.031

16. Zhang T, Wu X, Ke C, Yin M, Li Z, Fan L, et al. Identification of Potential Biomarkers for Ovarian Cancer by Urinary Metabolomic Profiling. J Proteome Res (2013) 12(1):505-12. doi: 10.1021/pr3009572

17. Li H, Qin Q, Shi X, He J, Xu G. Modified Metabolites Mapping by Liquid Chromatography-High Resolution Mass Spectrometry Using Full Scan/All Ion Fragmentation/Neutral Loss Acquisition. J Chromatogr A (2019) 1583:80-7. doi: 10.1016/j.chroma.2018.11.014

18. Lv J, Liu H, Wang Q, Tang Z, Hou L, Zhang B. Molecular Cloning of a Novel Human Gene Encoding Histone Acetyltransferase-Like Protein Involved in Transcriptional Activation of Htert. Biochem Biophys Res Commun (2003) 311(2):506-13. doi: 10.1016/j.bbrc.2003.09.235

19. Liu HY, Liu YY, Yang F, Zhang L, Zhang FL, Hu X, et al. Acetylation of MORC2 by NAT10 Regulates Cell-Cycle Checkpoint Control and Resistance to DNA-damaging Chemotherapy and Radiotherapy in Breast Cancer. Nucleic Acids Res (2020) 48(7):3638-56. doi: 10.1093/nar/gkaa130

20. Tan TZ, Miow QH, Huang RY, Wong MK, Ye J, Lau JA, et al. Functional Genomics Identifies Five Distinct Molecular Subtypes With Clinical

Relevance and Pathways for Growth Control in Epithelial Ovarian Cancer. EMBO Mol Med (2013) 5(7):1051-66. doi: 10.1002/emmm.201201823

21. Oh TI, Lee YM, Lim BO, Lim JH. Inhibition of NAT10 Suppresses Melanogenesis and Melanoma Growth by Attenuating Microphthalmia- Associated Transcription Factor (Mitf) Expression. Int J Mol Sci (2017) 18 (9):1924. doi: 10.3390/ijms18091924

22. Tschida BR, Temiz NA, Kuka TP, Lee LA, Riordan JD, Tierrablanca CA, et al. Sleeping Beauty Insertional Mutagenesis in Mice Identifies Drivers of Steatosis-Associated Hepatic Tumors. Cancer Res (2017) 77(23):6576-88. doi: 10.1158/0008-5472.CAN-17-2281

23. Li Q, Liu X, Jin K, Lu M, Zhang C, Du X, et al. NAT10 is Upregulated in Hepatocellular Carcinoma and Enhances Mutant p53 Activity. BMC Cancer (2017) 17(1):605. doi: 10.1186/s12885-017-3570-4

24. Liang P, Hu R, Liu Z, Miao M, Jiang H, Li C. Nat10 Upregulation Indicates a Poor Prognosis in Acute Myeloid Leukemia. Curr Probl Cancer (2020) 44 (2):100491. doi: 10.1016/j.currproblcancer.2019.06.006

25. Cao Y, Yao M, Wu Y, Ma N, Liu H, Zhang B. N-Acetyltransferase 10 Promotes Micronuclei Formation to Activate the Senescence-Associated Secretory Phenotype Machinery in Colorectal Cancer Cells. Transl Oncol (2020) 13(8):100783. doi: 10.1016/j.tranon.2020.100783

26. Duan J, Zhang Q, Hu X, Lu D, Yu W, Bai H. N(4)-Acetylcytidine is Required for Sustained Nlrp3 Inflammasome Activation Via Hmgb1 Pathway in Microglia. Cell Signal (2019) 58:44-52. doi: 10.1016/j.cellsig.2019.03.007

27. Doskocil J, Holy A. Inhibition of Nucleoside-Binding Sites by Nucleoside Analogues in Escherichia Coli. Nucleic Acids Res (1974) 1(3):491-502. doi: 10.1093/nar/1.3.491

28. Schuierer S, Tranchevent LC, Dengler U, Moreau Y. Large-Scale Benchmark of Endeavour Using MetaCore Maps. Bioinformatics (2010) 26(15):1922-3. doi: 10.1093/bioinformatics/btq307

29. Kawaguchi Y, Cooper B, Gannon M, Ray M, MacDonald RJ, Wright CV. The Role of the Transcriptional Regulator Ptf1a in Converting Intestinal to Pancreatic Progenitors. Nat Genet (2002) 32(1):128-34. doi: 10.1038/ng959

30. June CH. Adoptive T Cell Therapy for Cancer in the Clinic. J Clin Invest (2007) 117(6):1466-76. doi: 10.1172/JCI32446

31. Topalian SL, Drake CG, Pardoll DM. Immune Checkpoint Blockade: A Common Denominator Approach to Cancer Therapy. Cancer Cell (2015) 27(4):450-61. doi: 10.1016/j.ccell.2015.03.001

32. Yamamoto TN, Kishton RJ, Restifo NP. Developing Neoantigen-Targeted T Cell-Based Treatments for Solid Tumors. Nat Med (2019) 25(10):1488-99. doi: 10.1038/s41591-019-0596-y

33. Nikanjam M, Cohen PR, Kato S, Sicklick JK, Kurzrock R. Advanced Basal Cell Cancer: Concise Review of Molecular Characteristics and Novel Targeted and Immune Therapeutics. Ann Oncol (2018) 29(11):2192-9. doi: 10.1093/ annonc/mdy412

34. Srinivas PR, Kramer BS, Srivastava S. Trends in Biomarker Research for Cancer Detection. Lancet Oncol (2001) 2(11):698-704. doi: 10.1016/S1470- 2045(01)00560-5

35. Zhang B, Wu Q, Li B, Wang D, Wang L, Zhou YL. M(6)a Regulator-Mediated Methylation Modification Patterns and Tumor Microenvironment Infiltration Characterization in Gastric Cancer. Mol Cancer (2020) 19(1):53. doi: 10.1186/ s12943-020-01170-0

36. Du J, Ji H, Ma S, Jin J, Mi S, Hou K, et al. M6a Regulator-Mediated Methylation Modification Patterns and Characteristics of Immunity and Stemness in Low-Grade Glioma. Brief Bioinform (2021) bbab013. doi: 10.1093/bib/bbab013

37. Bindea G, Mlecnik B, Tosolini M, Kirilovsky A, Waldner M, Obenauf AC, et al. Spatiotemporal Dynamics of Intratumoral Immune Cells Reveal the Immune Landscape in Human Cancer. Immunity (2013) 39(4):782-95. doi: 10.1016/j.immuni.2013.10.003

38. Gajewski TF, Schreiber H, Fu YX. Innate and Adaptive Immune Cells in the Tumor Microenvironment. Nat Immunol (2013) 14(10):1014-22. doi: 10.1038/ni.2703

39. Chong W, Shang L, Liu J, Fang Z, Du F, Wu H, et al. M(6)a Regulator-Based Methylation Modification Patterns Characterized by Distinct Tumor Microenvironment Immune Profiles in Colon Cancer. Theranostics (2021) 11(5):2201-17. doi: 10.7150/thno.52717

40. Tsai K, Jaguva Vasudevan AA, Martinez Campos C, Emery A, Swanstrom R, Cullen BR. Acetylation of Cytidine Residues Boosts Hiv-1 Gene Expression by

Increasing Viral RNA Stability. Cell Host Microbe (2020) 28(2):306-12.e6. doi: 10.1016/j.chom.2020.05.011

Conflict of Interest: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Copyright @ 2021 Yang, Wu, Zhang, Liu, Zhao, Sun, Zhou, Kong and Shi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.