Identification of hub genes and their SNP analysis in West ...
Transcript of Identification of hub genes and their SNP analysis in West ...
RESEARCH ARTICLE
Identification of hub genes and their SNP analysis in West Nilevirus infection for designing therapeutic methodologies usingRNA-Seq data
Iftikhar Aslam Tayubi1 • Ahmad Firoz2,3 • Omar M. Barukab1 • Adeel Malik4
Received: 9 March 2015 / Accepted: 19 May 2015 / Published online: 24 June 2015
� The Genetics Society of Korea and Springer-Science and Media 2015
Abstract The West Nile virus (WNV) infections are
generally asymptomatic and are considered as immediate
concerns of biodefense due to the lack of any therapeutic
remedies. In this work, we created an interaction network
of 1159 differentially expressed genes to detect potential
hub genes from WNV infected primary human macro-
phages. We go on to explore the genetic variations that can
alter the expression and function of identified hub genes
(HCLS1, SLC15A3, HCK, and LY96) using the PROVEAN
Protein Batch tool and PolyPhen-2. Community analysis of
the network revealed that these clusters were enriched in
GO terms such as inflammatory response and regulation of
proliferation. Analysis of hub genes can aid in determining
their degree of conservation and may help us in under-
standing their functional roles in biological systems. The
nsSNPs proposed in this work may be further targeted
through experimental methods for improving treatment
towards the infection of WNV.
Keywords West Nile virus � Infection � Interactionnetwork � Hub genes � Community analysis � nsSNPs
Introduction
West Nile virus (WNV) is a mosquito-borne neurotropic
viruses and belongs to a member of Japanese encephalitis
virus (JEV) serogroup in the family Flaviviridae (Heinz
et al. 2000), which comprises viruses for yellow fever and
dengue (Brinton 2002). Originally first detected in Uganda,
it has been endemically spreading in various regions across
the world, including the Middle East, some regions of
Africa, Europe and United States at different periods
(Suthar et al. 2013). However, serologically it was first
tested in Assam, India in 2006 (Chowdhury et al. 2014).
Belonging to the Flaviviridae family, it is encoded by a
*11 kb positive-sense, single-stranded RNA (ssRNA)
genome (Suthar et al. 2013). It follows a replication life
cycle, which is involved in binding to cell surface receptor,
leading to fusion with the membrane and final delivery of
infectious RNA genome into the cytoplasm (Suthar et al.
2013). The genome injected is further translated as a single
polyprotein, which undergoes subsequent cleavage by viral
and host proteases to generate structural and non-structural
proteins. The structural proteins form the virion encapsi-
dating the viral RNA, whereas, the non-structural proteins
form the replication complex, which synthesize the nega-
tive- and positive-sense viral RNA. Hence, targeting the
replication pathway and primarily the infectious genome
can aid in curbing the expression of WNV (Suthar et al.
Electronic supplementary material The online version of thisarticle (doi:10.1007/s13258-015-0297-y) contains supplementarymaterial, which is available to authorized users.
& Iftikhar Aslam Tayubi
1 Faculty of Computing and Information Technology, King
Abdulaziz University, Rabigh 21911,
Kingdom of Saudi Arabia
2 School of Chemistry and Biochemistry, Thapar University,
Patiala 147004, Punjab, India
3 Biomedical Informatics Center of ICMR, Post Graduate
Institute of Medical Education and Research (PGIMER),
Chandigarh 160012, India
4 Perdana University Centre for Bioinformatics, MARDI
Complex, Jalan MAEPS Perdana, 43400 Serdang, Selangor,
Malaysia
123
Genes Genom (2015) 37:679–691
DOI 10.1007/s13258-015-0297-y
2013). Additionally, targeting at gene level will also
enhance the effectiveness of therapy involved. Identifica-
tion of the prime regions of interest and further the can-
didate genes for gene therapy can be determined via RNA
sequencing (RNA-Seq) methods (Mercer et al. 2014).
The RNA-Seq method comprises of the direct
sequencing of cDNA for monitoring quantitative gene
expression (Nagalakshmi et al. 2010). It involves direct
sequencing of cDNAs using high-throughput DNA
sequencing technologies followed by mapping of reads to
the reference genome (Nagalakshmi et al. 2010). The
methodology allows identifications of introns and exons
and further helps in recognizing intronic and exonic
boundaries and boundary ends of the gene of interest
(Nagalakshmi et al. 2010). RNA-Seq is presently suggested
as cost-effective new tool, which can be applied for deci-
phering the genetic basis of diseases, and traits that could
not be detected based on previous conventional gene-dis-
covery methods (Bamshad et al. 2011). RNA-Seq has been
employed to investigate biological phenomena in diverse
areas such as viral infections (Jones et al. 2014), cancer
(Young et al. 2014) and cardiomyopathy (Christodoulou
et al. 2014). Recently, we have also used RNA-Seq anal-
ysis to explore the role of glycogenes in skeletal muscle
development in MYOGkd cells (Lee et al. 2014) and also
used precomputed expression values from mouse RNA-Seq
data to understand the role glycogenes in various biological
processes that are involved in the development of brain,
muscle, and liver tissues (Firoz et al. 2014).
RNA-Seq has also been applied to identify 1514 dif-
ferentially expressed (DE) transcripts from primary human
macrophages, in response to infection with WNV (Qian
et al. 2013). In vivo studies of WNV infection using high-
throughput digital gene expression analysis have reported
common genes that are differentially expressed in multiple
tissues upon WNV infections as well as genes that are
differentially expressed in specific tissues following WNV
infection (Pena et al. 2014). These studies have helped in
developing a framework that can aid in the discovery of
genes and their associated biological functions and in turn,
decipher their impact on causing such infectious diseases.
Therefore, in this work, we have performed a network
analysis to identify the key hubs from the list of candidates
differentially expressed transcripts that were recently
detected from RNA-Seq analysis of primary human mac-
rophages infected with WNV (Qian et al. 2013). We also
explored the Single Nucleotide Polymorphisms (SNPs)
within these potential hubs that can be suggested as the
prime regions for susceptibility towards WNV infection.
Additionally, we also explored functionally important
modules in the network of WNV infected DE genes by
using community analysis. The regions suggested in this
study can be further targeted via experimental therapeutic
technologies like gene knockout technique, gene targeting,
etc., for developing treatment towards suppressing the
pathogenic action of WNV.
Materials and methods
Datasets
In this work, 1514 differentially expressed (DE) transcripts
were used for our computational analysis and were down-
loaded from a recent study on WNV infected macrophages
(Qian et al. 2013). These DE transcripts represent the RNA-
Seq analysis originally carried out byQian et al. (2013) using
Illumina Genome Analyzer 2 and expression levels for each
transcript estimated by using maximum likelihood based
method employed in the Cufflinks program (Trapnell et al.
2010). The dataset was filtered by removing any duplicate
transcripts and represent the final set of differentially
expressed genes (DEGs) used in this study.
Network analysis
GeneMANIA [www.genemania.org/] cytoscape plugin
(Montojo et al. 2010) was utilized to determine the func-
tional interactions between DEGs based on the GO term
‘‘biological process’’ and Homo sapiens as reference spe-
cies. The predicted relationships between the genes in the
network comprises of co-expression, physical and genetic
interactions, pathways, co-localization, protein domain
similarity, and predicted interactions.
Identification of hub genes
Biological networks exhibit the scale-free property (Albert
2005) with hubs representing nodes with many connections
in the network. NetworkAnalyzer plugin of Cytoscape
(Smoot et al. 2011) was utilized to determine the hubs by
calculating the values of the node degree distribution.
Three genes with highest node degree distribution were
identified as hubs in the current network.
Community analysis
The modules with functional property were determined by
applying greedy community-structure detection algorithm
via GLay [http://brainarray.mbni.med.umich.edu/sugang/
glay] (Su et al. 2010) plugin inCytoscape. For identifying the
over-represented biological functions within each cluster,
the clusters were subjected to a functional enrichment
analysis by focusing only on communities with at least 10
nodes. The functional enrichment analysis was carried out
using the DAVID functional analysis tool.
680 Genes Genom (2015) 37:679–691
123
Computational analysis of non-synonymous SNPs
(nsSNPs) of predicted hub genes
Data mining the SNP information for hub genes identified
in humans in this study were retrieved from National
Centre for Biotechnology Information (NCBI) database
dbSNP [http://www.ncbi.nlm.nih.gov/projects/SNP]. Three
different programs were used to predict the damaged or
deleterious coding nsSNPs.
Prediction of deleterious or damaging coding
nsSNPs using the PROVEAN protein batch tool
The PROVEAN Protein Batch tool [http://provean.jcvi.org/
index.php] was used to provide PROVEAN (Protein
Variation Effect Analyzer) and SIFT (Sorting Intolerant
From Tolerant) predictions for a list of protein variants.
Both PROVEAN and SIFT are software tools that predict
whether an amino acid substitution has an impact on the
biological function of a protein if the score, lies below a
certain threshold value. In PROVEAN, the clustering of
BLAST hits is carried out, and the best 30 clusters of
closely linked sequences of the supporting sequence set
that is further employed to make the predictions. For every
supporting sequence, a delta alignment score is computed
and then averaged within and across clusters generating a
final PROVEAN score (Choi et al. 2012). A default score
threshold of -2.5 or above is considered deleterious
whereas anything less than this cut-off score has a neutral
effect. On the other hand, SIFT is a multi-step algorithm
and uses sequence homology based method to classify
amino acid substitutions (Kumar et al. 2009). The PRO-
VEAN Protein Batch tool accepts a list of protein sequence
variants as input for the predictions.
Prediction of functional modification of coding
nsSNPs by polymorphism phenotyping v2
(PolyPhen-2)
PolyPhen-2 [http://genetics.bwh.harvard.edu/pph2/] tool
was used to study the possible consequence of nsSNPs on
protein structure and function. The web server predicts the
potential effect of amino acid substitution on the stability
and function of human proteins by employing structural as
well as comparative evolutionary considerations. For each
specified amino acid residue substitution in a protein,
PolyPhen-2 mines a variety of sequence and structural
characteristics of the replacement position and supplies
them to a probabilistic classifier. For this work, we have
used the provided batch query option and selected the
default advance option for generating the predictions
(Adzhubei et al. 2010).
Results
We downloaded a list of 1514 transcripts that were deter-
mined to be differentially expressed between control
unaffected and WNV-infected samples (Qian et al. 2013).
The data were further filtered by removing any duplicate
transcripts so that each transcript represents a unique DEG.
As a result, a non-redundant set of 1159 DEGs was
Fig. 1 Flowchart showing the
overall methodology
implemented in this study
Genes Genom (2015) 37:679–691 681
123
obtained that consisted of 253 down-regulated and 906 up-
regulated genes (Supplementary Table S1). The complete
workflow of computational methodology adopted in this
study is diagrammatically represented in (Fig. 1).
Network construction and identification of hub
genes
How DEGs interact with each other and additional related
genes in the network was investigated by GeneMANIA
Cytoscape plugin (Montojo et al. 2010). A GeneMANIA
network analysis of DEGs points to enrichment of signal-
ing pathways such as cytokine-mediated and type I inter-
feron-mediated signaling pathways. The other over-
represented GO terms includes inflammatory response,
regulation of leukocyte and lymphocyte activation, and
regulation of cell activation (Table 1).
The interaction network deduced via GeneMANIA was
further analyzed in Cytoscape 2.8.2 (Smoot et al. 2011).
The initial network comprising of 952 nodes and 48,263
edges was filtered to 952 nodes and 44,785 edges by
removing duplicate edges. All the genes in the network are
represented by circles and the interactions between them
are represented as edges. The up-regulated genes are shown
in green whereas; red nodes represent down-regulated
genes. Moreover, the additional related genes predicted by
GeneMANIA are shown in cyan (Fig. 2).
The genes determined as hubs with high node degree
distribution encode, HCLS1 (Hematopoietic lineage cell-
specific protein or Hematopoietic cell-specific LYN sub-
strate 1), (Supplementary Fig. 1a) a top hub with the node
degree of 342 whereas, SLC15A3 (Solute carrier family 15
member 3) (Supplementary Fig. 1b) was the second hub
having a node degree of 323. The genes with third highest
node degree are represented by HCK (Tyrosine-protein
Table 1 Top 30 significantly
enriched gene ontology (GO)
terms detected by GeneMANIA
for differentially expressed
genes and additional related
genes
GO ID Description Q-value
GO:0019221 Cytokine-mediated signaling pathway 1.55E-39
GO:0060337 Type I interferon-mediated signaling pathway 3.75E-30
GO:0071357 Cellular response to type I interferon 3.75E-30
GO:0034340 Response to type I interferon 4.83E-30
GO:0034341 Response to interferon-gamma 4.83E-30
GO:0071346 Cellular response to interferon-gamma 6.73E-27
GO:0060333 Interferon-gamma-mediated signaling pathway 1.85E-23
GO:0051707 Response to other organism 3.10E-21
GO:0009607 Response to biotic stimulus 1.00E-20
GO:0001817 Regulation of cytokine production 1.00E-20
GO:0031347 Regulation of defense response 2.62E-20
GO:0001816 Cytokine production 7.21E-20
GO:0006954 Inflammatory response 7.52E-17
GO:0002694 Regulation of leukocyte activation 7.96E-17
GO:0050865 Regulation of cell activation 7.96E-17
GO:0051249 Regulation of lymphocyte activation 1.85E-16
GO:0046649 Lymphocyte activation 1.18E-15
GO:0045088 Regulation of innate immune response 3.63E-15
GO:0002252 Immune effector process 4.69E-15
GO:0050867 Positive regulation of cell activation 4.69E-15
GO:0002696 Positive regulation of leukocyte activation 8.86E-15
GO:0009615 Response to virus 2.20E-14
GO:0051251 Positive regulation of lymphocyte activation 2.68E-14
GO:0042110 T cell activation 3.73E-14
GO:0050778 Positive regulation of immune response 3.37E-13
GO:0031349 Positive regulation of defense response 5.70E-13
GO:0050863 Regulation of T cell activation 1.76E-12
GO:0001819 Positive regulation of cytokine production 2.05E-12
GO:0002237 Response to molecule of bacterial origin 2.25E-12
GO:0007249 I-kappaB kinase/NF-kappaB cascade 3.05E-12
682 Genes Genom (2015) 37:679–691
123
kinase) (Supplementary Fig. 1c) and LY96 (Lymphocyte
antigen 96) (Supplementary Fig. 1d) because both have a
node degree of 318.
Community analysis and functional annotation
of detected modules
Five biologically-related clusters were identified using Fast
Greedy community-structure identification algorithm
(Fig. 3). Among all the detected clusters, Cluster 1 is the
largest with 425 genes and also consists of hubs HCK and
LY96 (Fig. 3a). Cluster 2 and cluster 3 consists of 262
(Fig. 3b) and 254 (Fig. 3c) genes, respectively. Cluster 3
also consists of two hubs, SLC15A3 and HCLS1 (Fig. 3c).
Only 9 genes were observed in cluster 4 whereas cluster 5
has 2 genes (Fig. 3d). Therefore, only those communities
were selected for enrichment analyses that have at least 10
nodes. Based on this criterion, only 3 communities (Cluster
1, 2 and 3) were finally analyzed for over-representation of
GO terms.
To biologically categorize these clusters, DAVID
functional analysis tool was used to classify the genes in
each module, and observed the enrichment of GO term
‘‘Biological Process’’ in three selected modules. The top 20
statistically significant enriched GO terms for DEGs in top
3 clusters for community analysis are summarized in
(Table 2). The three most statistically significant GO terms
that were enriched in cluster 1 are response to wounding,
defense response and inflammatory response. Other
significant GO terms in this cluster include regulation of
apoptosis or programmed cell death as well as positive
regulation of response to stimulus. Cluster 2 shows higher
enrichment for GO terms related to phosphorous metabolic
processes as well as regulation of proliferation. The GO
terms of cluster 3 were mostly related to antigen processing
and presentation of peptide antigen, regulation of I-kappaB
kinase/NF-kappaB cascade and positive regulation of T
cell activation.
SNP analysis
Non-synonymous SNPs (nsSNPs) are positioned predomi-
nantly in coding regions and their consequences are
observed in phenotypic characteristics of translated protein
products (Ramensky et al. 2002). On the other hand,
influence of synonymous SNPs is not determined easily on
translated proteins probably owing to the uncertainty of
genetic code (Hunt et al. 2009). In the current study we
have focussed on deciphering the role of variants on pro-
teins and their related pathways in successive steps,
therefore we confined and selected nsSNPs for our com-
putational analysis. Figure 4 shows the distribution of
nsSNPs and total SNPs in four hub genes. From the figure
it can be observed that in spite of a large number of SNPs
identified in each hub gene, only a small percent represent
the nsSNPs. The top hub HCLS1 has 689 SNPs and 68
nsSNPs, whereas, SLC15A3 has 443 SNPs and 60 nsSNPs.
HCK has 1138 SNPs, which is highest among all four
genes, but has only 48 nsSNPs. Finally, LY96 has 796 SNPs
and only 13 nsSNPs. The estimation of ratio of synony-
mous and nonsynonymous substitution rates helps in
determining the evolutionary effect of substitutions on
protein coding genes (Yang and Nielsen 1998).
Deleterious or damaged nsSNPs by PROVEAN
protein batch tool
The PROVEAN Human protein batch tool was used for the
prediction of damaged or deleterious nsSNPs. The program
provides both PROVEAN as well as SIFT prediction
scores. The input to the PROVEAN protein batch tool
consists of detected coding nsSNPs for four hubs, that is,
68 nsSNPs for HCLS1 (UniProt ID: P14317), 60 for
SLC15A3 (UniProt ID: Q8IY34), 48 for HCK (UniProt ID:
A8K4G3), and 13 for LY96 (UniProt ID: Q9Y6Y9). The
input for each hub gene was submitted independently to
PROVEAN Human protein batch tool. Among 68 nsSNPs
for HCLS1, 29 were identified to be deleterious by PRO-
VEAN whereas 36 were identified to be damaging by SIFT
(Supplementary Table S2). For SLC15A3, 28 and 36 out of
60 nsSNPs were predicted as deleterious or damaging by
PROVEAN and SIFT, respectively (Supplementary
GPR109A
OLIG1CLEC6A
MRGPRF LRRC50F13A1CSF3C7orf68
GPR133
EREGTHBDCCL7
NR4A3OSM
SNAI1
ARNTL2
NCAPH2
AXLSOCS3ELOVL3CHST2TREML1
FPR2
DAB2
GPR132
RAB11FIP1
FN1NTNG2GPBAR1SLC11A1
IGFBP6
FSCN1RUNX2
WNT5A
IGF1SPHK1
C3orf59BMP2
LGI2
MOSC1REPS2
GNG7 SLC19A1PTGFRN
TMPRSS13SLC22A4PPARG
CYP19A1NR4A2GRB10TULP2
FAM151AGSTM1
NRGNCKB
PLEKHN1
SCIN
FSTL1
ITGA2SRC
VWF
TNFRSF4 IRAK2
MXD1
PROCR
CD80
GAS6ARHGEF11COLEC12CPXM1LAMB1
FOXC1
PTRFGJA5
MN1HS3ST3B1
CDKN1C
IL23ANUPR1
PLVAP
TNF
FXYD6PMAIP1
MSX1ITGA10
SSTR2PDGFC
RCN3CNRIP1
GFRA2
BTBD11MMP14EDN1
GPR146
RBP7PLCB2CETP
PRAM1FAM46C
LRRFIP2
PLS3SLAMF1
NKD1ACTA2
RASGRP4
RETN
ASB13ITIH4PI4K2B
PINK1
RBKS
SYNE1
ZNF618
TSPAN5
EHBP1
HIST2H2BESTAG3
CIDEB
IQSEC1
NKX3−1GCNT2
TMEM110
CLIP2NPDC1
HESX1
APOBEC3F
MAML2RRM2
PHACTR2
XBP1PIWIL4CREB3
BMPR2
GMPR
PPPDE2IL15RAKAT2B
UNC84B
ZC3H12D
IFIT2
MIA3TRIB2
TSC22D1SRGAP2ASPHD2PVRL2SEMA4ARHEBL1
CSF2RBDPEP2
ELOVL7
SPATA13
FGD2NEXN
CD38PDCD1GLS
MKLN1ATG16L2
DOK2
IGFBP4
LGALS3BPATP2A3TBC1D10CIFI27TLR3IFIT3
CORO1ASTAT4
DENND5A
MAPK12 ZFXATP2C1
MYOFTESC
SAMD4A
ENDOD1
BAALC
PLA2G16TMEM123PLA2G4CCD7
CASP7
CDKN2CGRAMD1BPHF15
CBARA1TOR1BRICTOR
TRIM25
RNF125
C1GALT1
C15orf48
ZNFX1DEK RSAD2TRIM26HSH2DFAM111ANDC80
ZNF588
POLS
KIAA0226SLC25A28SNX5RNF138USP6NL
REC8 TFGTMEM194A
NUP205C22orf28
GTPBP2 MOV10PPM1K
PNPT1SGPP2
NCOA2TJP2
VAV3
JUPACOX2
C21orf91
CBX7MDKTJP1
BCL2L11
JAK3ARID5AC1S
CDC73BCL2L13
PATL1 NCAPH
BRIP1
TMEM229BNT5C2
SFI1
RUFY3MKL2DDX43
TMEM117
FAM59A
SNX26
C14orf73
MRO
PFTK1ZNRF2GALM
CNP
IMPDH2HIST2H2AA3
STMN1
PELI3
MASTLHNRPLL
CCDC152BARD1DEF6
PHACTR4PARP10
USP9X
C3orf38
RNF6
RFKKIAA1958 PARP15
HIST1H2BC
GPR180HIST2H3A
GPBP1HIRAHIST1H2BDRPS6KC1
KIAA1109
BLZF1
PLCXD1LY6E
C2orf84ARAP2KDM6A
MSL3FAM122CTIA1
AGRN
IL27TMEM26
HIP1RABCB9MMEL1
CAMK2BCYHR1
EPHB6
PRRT2
BBC3RAB39
SBK1DYNC2H1
NOXA1
IGF2BP3
DEPDC7
AK1
NEURL3
C1orf127
GPAT2SH3PXD2B
AATK
ORAI2EGFKIAA1024
GUCY1A3RAPGEF2ENTPD5
DUSP8
CCNA1
PLEKHA4
RND1
P2RX1
SV2A
ARHGEF10LGALNT3
CAMK1MEFV
DPP4
TNK2
NTN1
KRT79DCLK2
PSCA
AFAP1EPB41L5
CPT1B
FAM179A
SNAI3CBLN3
ADCABATARHGAP23
SHANK1
SERPINF2
HSPG2SPTBN4CRYBB1
RASL11BATXN7L1AGFG2
SRGAP1
GCNT4ZDHHC19
ANXA9RAC3
KCNF1KIAA1671
ASAH2B
PTCRA
DPYSHES4EXOC3LRASGEF1C
ZBTB7CCACNA1A
FN3K
C5orf20
SLC29A2
AKAP6
GSDMA
ADCY1
RAPGEF3C20orf195
AKAP7
NIPAL4PLB1
C14orf68OSBPL6
SPTBN2
RP11−274B18.1
PNPLA7ZDHHC9TMEM151AACOT11
FCGBP
MAP2 FLYWCH2TBC1D17
EXD3GAMT
KCTD14AUTS2MYEOV
KHK
DEPDC6
ZNF589
ZNF704NANOG
CASZ1
NFIX
CHST13
SMTNL1OTOFNFIA
ATP8B2
CENPVSPTB
IQGAP3PNPLA3
OLIG2
C2orf7IQCD
SLC25A29
IQSEC3RUFY4
SFXN2SLC1A7
C1orf224
PRKCE
PIH1D2
RASAL1
EIF4BHSPA1A
C5
PIK3C2B
PLEKHM3RASL11A
MCOLN2
CACNG8FER1L6
TSKS DAB2IP
WDR86TTC39A
LINGO3
SHROOM1
C2orf71
FAM72D
C14orf182C21orf58
AKT1S1
USP41
HIST2H2AA4
CHTF18NUDCD1
HIST2H4B
HIST2H3C
HIST2H2AC
HIST2H4A
GTPBP1
CDKL1SLC41A2
RUSC1
SH2D4A
ATP10A
BDH1
CABLES1
WIPF3
C15orf52
HIST2H2AB
TMEM37
HIST2H2BF
CENPN
SLC46A1
ANUBL1
NLN
TTC3
ANKS6
GTF2I
PIGV
USP30
RASGEF1B
PTCHD1COL23A1
COL1A1
IGF2
KCNA2
ASCL2
FNDC5GPR161
FXYD7
SORCS2
C1orf183
ANKRD1ADORA1DBF4B
FZD4KLF4COL18A1
DUSP7FXYD1
MXD4PLD4
RGL3
GJB2SLC16A10MYPNICOSLG
OSBPL5DSPSLC30A4 HSPA2
MYO1A
MCOLN1
FAM125B
CHRNB2NMUR1KLK10TSPAN15
RET
ABTB2
MUSTN1ZMYND10ADRA2B
PLEKHG3KCNN1
TIAM2
SEMA3AAPLN MOSPD3
MDGA1
KCNMB4
PRLRNACADCALHM1
MAP1LC3A
FGD5IRF6GREB1
IRF4
DOCK4
MAFTNFRSF9P2RX7NEDD9IL10RAAGPAT9ID3
SLC15A3
TNFSF15LTA
DNAJB4EXT1
SEMA3GPLEC1
TNFRSF8CUX2E2F2
KLHDC7B
MAP2K6EPHB2TNFSF18
PLCL1
BCAMHDX
LSSMID2CATSPER1
RBMS2
INHBAPPP1R15A
C20orf175
NR4A1CD36
SDS
TNFAIP6FOSL2
GLT1D1
CD300E
GPR84C17orf87
OLR1
CLEC4D
FFAR2IL6
STAB1G0S2PTX3
EMR1CD163L1
PTGS2HGFPFKFB3CSRNP1OTUD1
LRRC25
CEBPB
CBWD7
TNIP3
JUNB
PTGIRADORA3S100B
SLC39A8CTSBHLX
CXCL12
DSECYP27B1
CEACAM3ENTPD1CXCL1
ADAM8
PTGER4RNF144BC10orf10
IL7R
LILRB4EBI3
FAM129A
CXCL3
HCK
STX11
CTSL1
ZFP36AMICA1CSF3R
GPR34GNG2
CLEC4E
CCL2PTPLAD2
CXCL2
GRAMD1A
S100A9
TBXAS1
TAGAP
NFKB1
KCNAB2
KIAA0040
FUT4
RCAN1
TRAF3IP2DUSP
ACSL1
BTG1
ADPRH
SOD2
RIN2
PTGER2
ADAMDEC1
CDA
NFKB2PARVG
PSTPIP2PANX1
ATF3ATF5
AMPD3
CD14
S100A4
CDC42EP2
GNSP2RY13CD5L
PDE4BNCF2
SLC2A6
MCL1GCAC6orf150NFKBIAFAR2FCER1G
RIPK2ITGB7PNRC1
MYO1G
IFI30
TYMP
APOBEC3ALGALS2
CCR7GCH1
CXCL9
HLA−DRA
SLFN12CCL5
RELBSLC38A5HLA−DMB
DAPP1GZMACASP8LMNB1
APOL4
GBP1
IL2RGARHGAP25
JAK2FAM65BHLA−B
IDO1GZMB
GIMAP4
FGL2
DOCK8
CTSC
WTAP
NFE2L3
HLA−DPB1HAPLN3
CASP4TFEC
VAMP5
CD274
TMC8PMLGRIN3AHLA−G
CASP10BTN3A2
HELBHLA−DPA1 IFI16
HLA−A
GBP4B2M
LAP3TRIM22
APOL3
BATF2
SCO2
AGTRAP
IL7GBP5
PLSCR1LYSMD2
SP140IRF9
WARSTAPBP
SAMHD1UBE2L6
TNFSF13B
SPTLC2HLA−F
BAZ1AIL15
GBP2
TANKHLA−E
CD74
FAM26F
HLA−DMA
HLA−DRB1
ELF1
BRCA2
IFITM2
SOCS1
IPCEF1
IFNGPELI1
ZNF200SECTM1ANKIB1
MYD88
RHOH
PPP1R16B
CXCL10MTHFD2
C9orf91SPATS2L
MFN1CBR1
CASP1
HCLS1
SLC46A3TICAM2
ST3GAL5
BST2LAG3
RNF19BCD180
NADK
LY96
IL2RBSIGLEC1TNFSF10SLC7A7
TDRD7
UBE2Z
OPTNUBA7 IFI35FTSJD2OAS1
RBM43RIPK1
STAT2
DDX60 EIF2AK2C19orf66
PCID2
RABGAP1LADAR
RNF213
NT5C3
TRIM5
RNF114
DDX60L
PARP12TRAFD1RFX5BTN3A1
USP18
TAP2DHX58
LCP2IFITM3
MAP3K8
LY86
HPSE
DNAJA1SERPING1
SNX10
KRR1HIVEP1LAMP3
SEMA4DSNX20
NLRP1
LGALS9HIVEP2
BTG3
IFNA13OAS3NUB1
TRIM69IFNA1
TAP1IL32OASL
BLNKIFI6
ISG15IFIH1
TCTEX1D2
CD47
STAT1
IRF7 SAMD9NMI HERC6
EPSTI1
SP140LZC3HAV1
OAS2IFI44L
SAMD9L
CXCL11
MX2DYNLT1
BTN3A3
BIRC3
TRIM21
APOBEC3G
BET1
APOL1
SLFN12L
SLFN5
PSME2FBXO6
AIM2CARD11
PHF11RTP4IFI44
GBP3
DDX58
IFIT5PSME1XAF1LBA1
XRN1PARP9DTX3L
MX1PARP14ISG20IFIT1
HERC5CD48
OTUD4RARRES3IRF1NCF1
PSMB10IFITM1TRIM38
NLRC5PSMB8
APOL6
SP110
PSMB9SP100
PBEF1
NFKBIZ
TNFRSF1BCREM
TLR2BCL2A1C1RL
ICAM1PILRA
KIAA1274
CCL13CCR1
CCL3
TNIP1
TRAF1
TYROBPSCPEP1
CCL4
CCR5SERPINB9
ITGB2
ADAM19
LILRB1
IL4I1
STOM
ADM
ZC3H12A
NBN
MS4A6A
NOD2
CD40
DRAM1
SLAMF7PDCD1LG2
IL16
LYN
MNDA
PTPRCC1QBPLEK
CFLAR
BATF3
SAMSN1CCL8
SELL
BCL3
IL1RNTNFAIP3RASSF4
C3AR1
FAS
CCL4L1
IL8S100A8
FPR3
NAMPTL
CCL4L2
IL1B
CCL3L1
CBWD2
Fig. 2 Interaction network between DEGs and related additional
genes. In this network, green nodes represent up-regulated genes
whereas red nodes represent down-regulated genes and GeneMANIA
predicted genes are shown in cyan. Hub genes are shown as purple
diamonds and represent the genes with highest node degree
Genes Genom (2015) 37:679–691 683
123
CCL3L1CCL4L1
CCL4L2IL4I1
C6orf150
S100A8ADAM8GPR84
PSTPIP2
MYO1G
TBXAS1
CEACAM3RETNCCL7
GLT1D1 CCR1
FFAR2
SOCS3CLEC4D
RASGRP4
SLC11A1
CCL3
CCL4
IL1RN
APOBEC3A
SDSTNFOSMSTX11IL8
NFKB2
BCL3
PFKFB3
GCH1
TNIP3ZC3H12AFOSL2
RAB11FIP1
IL1B
FAM46C
LRRC50
SOD2NADKCD274CLEC4E
AMPD3
TNIP1
CXCL2HS3ST3B1
EMR1IL7R
CCL8
IFI30
TAGAP
HPSE
ACSL1
GCACXCL1MXD1GBP1
CXCL3
NR4A1
OLR1
IL6NFKBIARNF19BPPP1R15A
PROCRDPP4IGF1CTSB
DEPDC6
RUNX2C10orf10
FOXC1
TLR2DRAM1BATF3
PTGIRCASP4CCL2INHBATNFAIP6
OLIG1
SPHK1
HLX
IL10RA
CTSL1
HGF
SLC22A4
NOD2SELLBCL2A1
PLEKMNDA
SERPINB9
DSE
TNFRSF1B
PTGER2FAM129A
S100A4
GNS
F13A1
DOCK4
C1S
ACOX2
HCK
NCF2CSF3R
S100A9
IFITM2CSF3
PLSCR1
SLC39A8
ADM
TNFAIP3PTGS2
ZFP36
ICAM1
KIAA0040
PANX1SNAI1
CREMCSRNP1
NBN
DUSP
MCL1NFKBIZ
PBEF1
NAMPTL
ANKRD1
NR4A3ATF3
OTUD1
FAS
HIST2H2BE
NFKB1RCAN1NFE2L3
CDKN2C
BAZ1A
XBP1
PDE4BRAPGEF2
LMNB1MFN1LRRFIP2
BTG3
MTHFD2HIVEP1IL23A
PMAIP1RND1
ELF1
OTUD4 BET1
ANKIB1
NR4A2TFG TANK
GPBP1
SLC38A5
BCL2L11DNAJA1 ZNF200WTAP
RIPK2C7orf68
PELI1PTGER4EREG
RELB
RNF144B
RICTOR
ADAMDEC1MAP3K8
JUNBSAMSN1
ACTA2
APLNPDGFC
MAML2FCGBP
GJB2
EIF4B
CABLES1
ZNRF2
MKL2
FAM59A
KIAA1958
NUP205
NUDCD1
KIAA1109
C3orf38
A
B
HNRPLL
GPR180
STMN1
TMEM117USP6NL
KAT2BMIA3NCOA2
RFK
TIA1
NFIADEPDC7
TSC22D1
SNX5
AUTS2
GUCY1A3
ID3
SLC41A2
WNT5A
TSPAN15
CNRIP1
FZD4FSTL1
EHBP1
LAMB1
MXD4
NPDC1FAM151A
COL1A1
CPXM1
FXYD1SEMA3G
GRB10
SRC
DPYS
ASAH2B
PTCHD1
FXYD6
GSTM1
NUPR1
FN1ABAT
VWF
IGFBP4AXLSRGAP2
NRGN
MYOF
PTRF
FUT4C1RL
SLC46A3
CXCL12 GPR34RCN3
GAS6
DAB2
GPR146
HSPG2CD163L1SORCS2
CDKN1CCOL18A1
SH3PXD2B
MUSTN1TMPRSS13IGFBP6
GJA5
EXT1PHACTR2GREB1
EPHB2
CD5LPPARG
RIN2CD36
ADPRHSTOM
ARNTL2
SPTLC2
NEDD9
HSPA2SNX10
PNRC1
THBDIFI16CHST2CEBPB
CTSC DENND5AFAM65B
CSF2RBG0S2
LYNPTPRC
PTX3
CFLAR
C20orf175MSX1
REPS2
C3orf59
SCPEP1MOSC1
FCER1G
CDA
CYP27B1
ITGB2LILRB4GRAMD1A
NTNG2SLC19A1
CDC42EP2
LRRC25
CLEC6AP2RX7
FPR2
AMICA1
PILRAC3AR1
P2RY13LCP2
RASSF4SLC16A10
DPEP2LILRB1NKD1
NCF1SLC7A7
FPR3C1QB
SLC2A6
MS4A6ACCL13
CD14
PTGFRN
ARHGEF10L
BAALCPLVAP
GPR109A
MRGPRF
SRGAP1
P2RX1
DOCK8
PRAM1
ATG16L2
COLEC12STAB1IGF2GNG2
SERPING1
CETP
FAR2
GPR133
ENTPD1
TYROBPFSCN1
MMP14MAF
S100B
COL23A1GPR161
GPBAR1
PTPLAD2
MCOLN1
ADORA3RBP7
CENPN
HSPA1A
BLZF1
RNF114
NDC80PCID2
RNF6
DEK
TICAM2
TMEM194A
RNF138
BARD1DNAJB4
USP9XCDC73
PI4K2B LY96
HIVEP2
KIAA1024
PIWIL4
TTC3
TRAF3IP2
C21orf91
CCNA1
IGF2BP3
PLCL1
TJP2
AGPAT9
MKLN1
DYNLT1
C1GALT1
NCAPH
CASP7
KDM6A
GLS
RRM2
DUSP7
PLS3
DSP
KLF4
ARID5A
CD38
ITGA2
SYNE1
BMPR2
BTG1EDN1
SAMD4AMN1
TOR1BBMP2ELOVL7
GALNT3
TJP1
FAM111ANT5C2
KRR1BRIP1
TCTEX1D2POLS
TMEM123HIRA
ATP2C1ZFX
BRCA2
ADC
FLYWCH2
C14orf73
FNDC5
CRYBB1AKAP6
LTA
NMUR1
PLCB2
RASAL1
WDR86
PTCRA
ATXN7L1
PARVG
PIK3C2B
ASPHD2
GCNT4
TMEM151A
PDCD1KCTD14
SMTNL1
RP11−274B18.1
TREML1
KLHDC7B
CACNG8
EXOC3L
TNFRSF4
FER1L6KCNA2
OLIG2ADCY1C20orf195
HES4
C17orf87
NOXA1
APOBEC3FCALHM1
SPTBN4
TNK2
GTPBP1
SHROOM1
C5orf20
DAB2IPRET
ARHGAP23
CASZ1CDKL1
JAK3
CLIP2
PLEC1JUP
SBK1
TRIB2 BBC3
TSKSFAM125B
HIP1R
PLD4RGL3
PLB1HDXITIH4
NANOG
NCAPH2
OSBPL6
ASCL2
KCNMB4
MAP1LC3A
AKAP7
ORAI2
IRF6
IQSEC1
NFIX
RASL11A
MYO1A
PLEKHA4
FXYD7
RAC3
ADRA2B
EGF
GFRA2BCAM
DYNC2H1CUX2
WIPF3RUFY3
RBKS
AFAP1
C15orf48
ANKS6
EPB41L5
FGD5
PLEKHN1CBLN3
HESX1
RHEBL1CYP19A1
PLEKHG3
SH2D4A
CAMK2B
SV2A
C5
DBF4B
ARHGEF11
ATP10A
GCNT2ACOT11
RNF125
RBMS2
FN3K
KCNN1
EXD3
FAM122C
CHRNB2
C1orf183
KLK10
EBI3MID2
ADAM19
TNFRSF8
PRKCESPTBN2ABTB2
AATKSTAG3
ABCB9
MDGA1
NTN1
SPTB
PNPLA3
AGFG2EPHB6
MAP2PVRL2
NACAD
SERPINF2C14orf68MCOLN2
TTC39A
SLC29A2TESC
SLC1A7ZDHHC9
PIH1D2RASL11B
TIAM2
DDX43
ANXA9
TULP2
CCDC152
NKX3−1 CD80NIPAL4KCNF1E2F2
GNG7
ZNF589ZDHHC19
ZMYND10
IQSEC3
GSDMA IL27
C1orf224
OTOFELOVL3
CHST13
SHANK1
SSTR2
CATSPER1
SLC30A4
TNFSF18
TSPAN5
LGI2MROPNPLA7
CD300E
KRT79ICOSLG
GPR132
ADORA1
RASGEF1C
RAPGEF3CPT1B
TMEM26
ANUBL1
CACNA1A
MEFV
SEMA3AITGA10MMEL1
C1orf127LINGO3C21orf58
FAM72D
RAB39
IQCD
C2orf71
PSCA
MYEOV
IQGAP3
C14orf182RASGEF1B
SFXN2
C2orf7
IMPDH2 BDH1
RPS6KC1
SPATA13
NLN
ASB13
ENTPD5
PLEKHM3
PIGV
CENPV
SLC25A29
GPAT2
PRLROSBPL5KHKSCINBCL2L13RUFY4 ATP8B2
BTBD11
LSS
TBC1D17
AK1MOSPD3
PFTK1
ZNF704
MYPN
ZNF588
DCLK2
ATF5
DUSP8
MAP2K6
IRAK2
SNX26
CAMK1
MAPK12
C15orf52
FAM179A
NEURL3
CHTF18
PRRT2
CKB
UNC84BGTF2I
ZNF618
SNAI3MASTL
KIAA1671
AKT1S1
SNX20
ZBTB7C
PINK1GAMT
CYHR1
ARAP2
PELI3
KIAA1274VAV3
Fig. 3 Communities detected
by fast greedy (GLay) clustering
algorithm are shown. In each
community a Cluster 1
b Cluster 2 c Cluster 3 d Cluster
4 e Cluster 5, up- and down-
regulated genes are shown as
green and red nodes
respectively, whereas
GeneMANIA predicted genes
are represented as cyan. Nodes
in purple diamond shape
represent hub genes
684 Genes Genom (2015) 37:679–691
123
Table S3). PROVEAN identified 21 nsSNPs as deleterious
while SIFT predicted 24 as damaging out of total 48
nsSNPs for HCK (Supplementary Table S4). Similarly, 6
and 5 nsSNPs were detected by PROVEAN and SIFT for
13 nsSNPs of LY96 gene (Supplementary Table S5).
Damaged nsSNPs by PolyPhen-2 web server
The protein sequences of each gene and their SNP substi-
tution were submitted independently to the Polyphen-2
web server. Among 68 nsSNPs for HCLS1, 47 were
identified to be damaging (Supplementary Table S2)
whereas, 37 nsSNPs were predicted to have a damaging
effect for SLC15A3 (Supplementary Table S3). Similarly,
the predicted number of damaging nsSNPs observed for
HCK was 21 out of 48 (Supplementary Table S4) and 8
nsSNPs were identified to be damaging for LY96 out of 13
nsSNPs (Supplementary Table S5). Overall, the number of
nsSNPs that are commonly predicted to be deleterious or
damaging by PROVEAN, SIFT and Polyphen-2, which
may affect protein function, is 26 for HCLS1 and
SLC15A3, 16 for HCK, and 5 for LY96 (Table 3),
respectively.
Discussion
Discovery of molecular targets and targeted therapeutics
have turned out to be a vital remedial treatment for dis-
eases, especially with the advancement of bioinformatics in
the last few decades (Dinh et al. 2007). In order to explore
the role of DEGs that were recently identified by RNA-Seq
analysis (Qian et al. 2013), and additionally related genes
involved in WNV infection, an interaction network was
constructed and node degree for each gene in the network
was calculated. HCLS1, SLC15A3, HCK and LY96 were the
genes with the highest degree and considered as hubs in the
network created from DEGs of primary human macro-
phages and additional related genes predicted by Gene-
MANIA. The most significant enriched GO terms within
the interaction network identified by GeneMANIA include
cytokine-mediated signaling and type I interferon-mediated
signaling pathways, processes related to regulation of
defense and immune response. Additional over-represented
GO terms in the network represent processes related to
immune cell activation (Table 1). This analysis is in con-
sistency with the recent study on WNV infection in which
DAVID functional annotation cluster analysis highlighted
the enrichment of processes involved in immune or defense
responses (Qian et al. 2013).
Among HCLS1, SLC15A3, HCK and LY96, only HCK
was detected as one of the hubs from a list of DEGs
whereas GeneMANIA predicted HCLS1, SLC15A3, and
LY96 genes to be a part of interaction network. These genes
may possibly play a key role in regulating the biological
process of WNV resistance and control viral infection.
HCLS1 gene codes hematopoietic cell-specific Lyn sub-
strate 1 protein (HCLS1 or HS1) that consists of an SH3
(Src homology 3) adapter domain and can trigger activa-
tion of receptor-coupled tyrosine kinases (Kitamura et al.
1989; Van Rossum et al. 2005; Huang and Burkhardt
2007). Elevated levels of HCLS1 are related with chronic
lymphoblastic leukemia (Huang et al. 2008; Scielzo et al.
2005), whereas lymphocyte precursors deficient in HCLS1
tends towards defective proliferation and differentiation of
B-lymphocytes following B cell receptor activation (Sko-
kowa et al. 2012). The members of the SLC15 protein
family have been verified to be significant drug targets at
the level of drug transport (Sasawatari et al. 2011).
SLC15A3 and SLC15A4 are the two types of histidine
transporters that have been identified in the lysosomes of
immune cells (Sakata et al. 2001). Previous studies have
established the fact that amino acids are essential in the
PHACTR4
TMEM110CBARA1
GRAMD1B
RABGAP1L
CIDEB
SIGLEC1REC8C22orf28
FTSJD2LBA1
PPM1K
CREB3
SP140L
RHOH
HELBIPCEF1MSL3
RUSC1VAMP5
PHF15RIPK1
MYD88LAMP3
UBA7
LGALS9
ISG20STAT2SP100
IFI44L
SP110
TAP2
C19orf66
SLC25A28
BATF2
APOL6
LAP3
TLR3
PLA2G16
RTP4
DDX60
IL15RA
OPTN
DTX3L
CASP1
GBP4
PSME1
EPSTI1PSME2
SLFN5
PSMB8
CXCL11GBP2
HLA−G
CXCL10PSMB10
TNFSF13B
PSMB9
STAT1
IRF9UBE2L6
TAP1
TNFSF10
BTN3A3
IFITM1
NMI
PMLOAS2
IFIH1
IFITM3
IFIT5
SLC15A3
USP18OAS3
HIST1H2BDSAMD9SPATS2L
MDK
IFI27
OASL
ISG15
IFI6PARP14
TRIM21
MX2
BST2
PHF11
PARP12
IFI44TDRD7ADAR
IRF7XAF1
MX1IFIT2
PPPDE2IFIT1
RNF213
PDCD1LG2
HLA−DPA1
FAM26F
HLA−DRB1GIMAP4
RFX5CXCL9
SOCS1
TNFSF15CASP8IL7
TRIM69SLFN12HLA−BIDO1
WARSTRAFD1
TMEM37
IL15 HLA−DMABTN3A2
TNFRSF9
CD40
SLAMF1
SLAMF7
KCNAB2TBC1D10C
TMC8
STAT4
IL16C2orf84
DAPP1
CD180SEMA4D
NLRP1
LGALS2
HLA−DMB
LYSMD2HLA−DPB1
C9orf91CD47
HLA−DRA
GZMA
GZMBIFNG
GBP5B2M
TFEC
CASP10TAPBPAPOL4
JAK2
CD74
IL2RB
ARHGAP25
ST3GAL5HAPLN3
CCR5
APOBEC3G
TRAF1
USP30
GTPBP2PARP10
AGRN
GMPR
DHX58
ZC3HAV1
ENDOD1
AGTRAP
SGPP2
FGL2
CNP
OAS1
LY6E
HERC6RSAD2
LGALS3BP
HLA−A
NEXN
PARP9
CBR1SAMD9L
TRIM5
FBXO6RBM43
XRN1
IFIT3 IFI35
DDX58HERC5
USP41C
D
ZNFX1
EIF2AK2TRIM25
TMEM229B
MOV10
PATL1NT5C3
DDX60L
PNPT1
HIST1H2BC
IFNA1
IFNA13
SLFN12L NUB1
GALM
GRIN3A GBP3
BLNKLY86
CORO1AIL2RG CCR7
CD48
DOK2
SLC46A1
IRF4KIAA0226
SFI1CARD11
PLCXD1
ZC3H12D
PARP15
HSH2D
ITGB7
UBE2Z
PLA2G4CAIM2
CCL5
LAG3TRIM38
SECTM1
IL32
SAMHD1
HCLS1
TRIM26
NLRC5
BTN3A1
BIRC3HLA−E
IRF1
HLA−F APOL3
DEF6
ATP2A3CBX7
SP140
PPP1R16B
CD7SEMA4A
FGD2
TYMP
SCO2
RARRES3
APOL1TRIM22
CBWD7
HIST2H4A
HIST2H2AA4
HIST2H2AB
HIST2H4B
HIST2H3C
HIST2H3A
HIST2H2AC
CBWD2HIST2H2BF
HIST2H2AA3
Fig. 3 continued
Genes Genom (2015) 37:679–691 685
123
Table 2 Enrichment of GO terms for DEGs in each cluster
Cluster number Term P value
1 GO:0009611, response to wounding 3.3E-28
GO:0006952, defense response 3.63E-25
GO:0006954, inflammatory response 5.64E-25
GO:0010033, response to organic substance 1.81E-15
GO:0002237, response to molecule of bacterial origin 5.35E-10
GO:0042330, taxis 5.6E-09
GO:0006935, chemotaxis 5.6E-09
GO:0042981, regulation of apoptosis 6.16E-09
GO:0043067, regulation of programmed cell death 8.52E-09
GO:0032496, response to lipopolysaccharide 8.93E-09
GO:0001817, regulation of cytokine production 8.97E-09
GO:0010941, regulation of cell death 9.62E-09
GO:0048584, positive regulation of response to stimulus 1.07E-08
GO:0048584, positive regulation of response to stimulus 1.07E-08
GO:0009617, response to bacterium 2.81E-08
GO:0001775, cell activation 2.89E-08
GO:0002684, positive regulation of immune system process 5.7E-08
GO:0002252, immune effector process 3.64E-07
GO:0002252, immune effector process 3.64E-07
GO:0002252, immune effector process 3.64E-07
2 GO:0006468, protein amino acid phosphorylation 6.19E-05
GO:0016310, phosphorylation 0.000353
GO:0046578, regulation of Ras protein signal transduction 0.000769
GO:0006793, phosphorus metabolic process 0.002414
GO:0006796, phosphate metabolic process 0.002414
GO:0042325, regulation of phosphorylation 0.002418
GO:0043242, negative regulation of protein complex disassembly 0.002544
GO:0051693, actin filament capping 0.00345
GO:0051174, regulation of phosphorus metabolic process 0.003524
GO:0019220, regulation of phosphate metabolic process 0.003524
GO:0051129, negative regulation of cellular component organization 0.003904
GO:0030835, negative regulation of actin filament depolymerization 0.004442
GO:0043244, regulation of protein complex disassembly 0.005631
GO:0050670, regulation of lymphocyte proliferation 0.006219
GO:0030834, regulation of actin filament depolymerization 0.006224
GO:0032944, regulation of mononuclear cell proliferation 0.006539
GO:0070663, regulation of leukocyte proliferation 0.006539
GO:0030837, negative regulation of actin filament polymerization 0.0069
GO:0051494, negative regulation of cytoskeleton organization 0.00736
GO:0050671, positive regulation of lymphocyte proliferation 0.00736
3 GO:0019882, antigen processing and presentation 1.35E-16
GO:0048002, antigen processing and presentation of peptide antigen 3.75E-12
GO:0002474, antigen processing and presentation of peptide antigen via MHC class I 2.22E-09
GO:0043122, regulation of I-kappaB kinase/NF-kappaB cascade 5.18E-09
GO:0002684, positive regulation of immune system process 1.27E-08
GO:0002684, positive regulation of immune system process 1.27E-08
GO:0050863, regulation of T cell activation 1.56E-08
GO:0043123, positive regulation of I-kappaB kinase/NF-kappaB cascade 1.66E-08
686 Genes Genom (2015) 37:679–691
123
regulation of immune responses (Sasawatari et al. 2011).
HCK gene is mainly expressed in cells having monocyte or
macrophage lineage (Ziegler et al. 1987) that are important
HIV-1 target cells and viral reservoirs (Martın and Bandres
1999; Orenstein 2001; Crowe et al. 2003). HCK encodes a
505-residue polypeptide, which is closely related to
pp56lck, a lymphocyte-specific protein-tyrosine kinase
(Ziegler et al. 1987). HCK also one of the members of Src
kinase family, is known to interact with NEF (HIV-1 vir-
ulence factor) and promotes viral pathogenesis of HIV/
AIDS (Trible et al. 2006). LY96 gene encodes a glyco-
protein lymphocyte antigen 96 (LY96; also denoted as
ESOP-1 or MD-2) that is involved in endotoxin recognition
and offers a foundation for antiseptic drug development
(Ohto et al. 2007). MD-2 interacts with the extracellular
domain of Toll-like receptor 4 (TLR4) and is necessary for
the activation of TLR4 by lipopolysaccharide (LPS) pre-
sent on the outer membranes of Gram-negative bacterial
cell walls (Mullen et al. 2003; Dziarski and Gupta 2000).
Recognizing the structure and function of biological
networks is indispensable for the investigation of biologi-
cal processes. Many recent studies have used network
based studies to investigate various biological problems
(Malik et al. 2014; Lee and Lee 2013). In this work, we
identified 5 functional modules or communities in the
interaction network using fast greedy algorithm imple-
mented as GLAY (Su et al. 2010) plugin for Cytoscape.
Furthermore, the functional enrichment of only 3 modules
(based on the criteria described in the methods section) was
explored by using functional annotation tool DAVID. Our
analysis shows that the modules are enriched in functions
related to defense response, inflammatory response, phos-
phorous metabolic processes and regulation of I-kappaB
kinase/NF-kappaB cascade. According to previous studies,
subjects with severe infection exhibit high expression
levels of cytokine and chemokine genes (IL-8, TNF),
antiviral signaling genes such as TMEM158 (involved in
antiviral immune signaling) (Ishikawa and Barber 2008),
Table 2 continued
Cluster number Term P value
GO:0010740, positive regulation of protein kinase cascade 2.22E-08
GO:0019884, antigen processing and presentation of exogenous antigen 2.48E-08
GO:0019884, antigen processing and presentation of exogenous antigen 2.48E-08
GO:0051249, regulation of lymphocyte activation 3.45E-08
GO:0050865, regulation of cell activation 4.18E-08
GO:0050867, positive regulation of cell activation 7.7E-08
GO:0010627, regulation of protein kinase cascade 1.47E-07
GO:0002694, regulation of leukocyte activation 1.47E-07
GO:0050870, positive regulation of T cell activation 1.48E-07
GO:0050870, positive regulation of T cell activation 1.48E-07
GO:0051251, positive regulation of lymphocyte activation 1.66E-07
GO:0046649, lymphocyte activation 2.28E-07
Fig. 4 Distribution of total
SNPs and nsSNPs in four hub
genes
Genes Genom (2015) 37:679–691 687
123
Table 3 List of nsSNPs that are commonly predicted to be damaging/deleterious/affect protein function by PROVEAN, SIFT as well as
PolyPhen-2
Gene (No. of
predicted
nsSNPS)
SNP ID AA
change
PPH2_Prob PPH2 PROVEAN
score
PROVEAN
(cutoff = -2.5)
SIFT
score
SIFT
(cutoff = 0.05)
LY96 (5) rs141631661 12 S C 0.997 Probably damaging -3.04 Deleterious 0.003 Damaging
rs199978698 29 D N 1 Probably damaging -2.54 Deleterious 0 Damaging
rs142571384 70 D Y 1 Probably damaging -7.36 Deleterious 0.002 Damaging
rs367973351 99 D H 0.999 Probably damaging -4.5 Deleterious 0.004 Damaging
rs202107994 143 E A 0.962 Probably damaging -2.85 Deleterious 0 Damaging
HCK (16) rs199815006 125 F C 1 Probably damaging -6.34 Deleterious 0 Damaging
rs372761182 168 D N 0.999 Probably damaging -3.61 Deleterious 0.002 Damaging
rs200896933 186 G R 1 Probably damaging -7.13 Deleterious 0 Damaging
rs202001086 193 R Q 0.936 Possibly damaging -3.13 Deleterious 0.001 Damaging
rs199531584 212 G R 0.996 Probably damaging -7.01 Deleterious 0 Damaging
rs189947820 239 R W 1 Probably damaging -6.86 Deleterious 0 Damaging
rs200867413 257 W R 1 Probably damaging -12.37 Deleterious 0 Damaging
rs200178250 271 T M 0.973 Probably damaging -4.89 Deleterious 0 Damaging
rs151054795 339 I T 0.981 Probably damaging -3.86 Deleterious 0.029 Damaging
rs114505022 350 A T 0.983 Probably damaging -3.52 Deleterious 0.001 Damaging
rs145632103 384 R W 1 Probably damaging -7.46 Deleterious 0 Damaging
rs377468533 392 T M 0.554 Possibly damaging -3.83 Deleterious 0.022 Damaging
rs137871205 435 R W 1 Probably damaging -6.94 Deleterious 0 Damaging
rs374360915 435 R Q 0.993 Probably damaging -3.2 Deleterious 0.001 Damaging
rs142491977 470 M V 0.773 Possibly damaging -3.13 Deleterious 0.001 Damaging
rs17093828 482 P Q 1 Probably damaging -7.15 Deleterious 0 Damaging
HCLS1 (26) rs150386736 7 G A 1 Probably damaging -4.789 Deleterious -4.789 Damaging
rs372797954 19 D E 0.999 Probably damaging -3.437 Deleterious -3.437 Damaging
rs148254045 21 W C 1 Probably damaging -11.321 Deleterious -11.321 Damaging
rs146539124 54 I T 0.986 Probably damaging -3.533 Deleterious -3.533 Damaging
rs371758156 155 R C 0.999 Probably damaging -2.749 Deleterious -2.749 Damaging
rs372191150 163 V L 0.971 Probably damaging -2.684 Deleterious -2.684 Damaging
rs34767273 166 D H 1 Probably damaging -5.588 Deleterious -5.588 Damaging
rs144275149 181 T M 1 Probably damaging -2.589 Deleterious -2.589 Damaging
rs147129061 184 H Y 1 Probably damaging -5.782 Deleterious -5.782 Damaging
rs368893881 184 H Q 1 Probably damaging -7.69 Deleterious -7.69 Damaging
rs36104070 200 I T 0.908 Possibly damaging -3.216 Deleterious -3.216 Damaging
rs143408084 203 D E 0.891 Possibly damaging -3.417 Deleterious -3.417 Damaging
rs202020296 211 G S 0.977 Probably damaging -2.55 Deleterious -2.55 Damaging
rs61749596 218 P L 0.97 Probably damaging -2.595 Deleterious -2.595 Damaging
rs144589441 225 T M 1 Probably damaging -3.399 Deleterious -3.399 Damaging
rs150713543 240 A V 0.998 Probably damaging -3.167 Deleterious -3.167 Damaging
rs145562548 291 P L 0.984 Probably damaging -2.73 Deleterious -2.73 Damaging
rs368506277 430 G W 1 Probably damaging -5.551 Deleterious -5.551 Damaging
rs376245237 441 G V 1 Probably damaging -2.85 Deleterious -2.85 Damaging
rs201252242 451 P L 1 Probably damaging -9.08 Deleterious -9.08 Damaging
rs202175021 452 D N 0.997 Probably damaging -3.99 Deleterious -3.99 Damaging
rs142545513 453 D N 1 Probably damaging -4.19 Deleterious -4.19 Damaging
rs377000996 464 G A 1 Probably damaging -5.148 Deleterious -5.148 Damaging
rs78618042 467 R Q 0.999 Probably damaging -3.049 Deleterious -3.049 Damaging
rs374286282 476 L P 1 Probably damaging -6.108 Deleterious -6.108 Damaging
rs142693789 479 A E 0.998 Probably damaging -4.51 Deleterious -4.51 Damaging
688 Genes Genom (2015) 37:679–691
123
and CD69, a C-type lectin that plays a vital role in T cell
activation, proliferation and signaling (Martın and San-
chez-Madrid 2011). Moreover, it has been observed in
subjects with history of severe infection displayed higher
expression levels of anti-inflammatory proteins such as
PI3, a protease inhibitor that acts to reduce inflammation
(Verrier et al. 2012), and TNFAIP3 that restrains NF-kappa
B activation (Song et al. 1996).
One of the prime objectives of human genetics lies in
deciphering the functional consequences of inherited vari-
ations between individuals. Even though millions of SNPs
are said to be present in the human population, the small
percentage of this set will be posing potential disease risk.
Among the genomic variants, non-synonymous SNPs are
of prime interest as they lead to amino acid change in
translated protein products, and hence, these variants are
primarily studied for their relevance in inheritable diseases
and drug sensitivity (Yue and Moult 2006). It has been
further termed as one of the promising arena for research in
the field of genomics (Karchin et al. 2005). The effect of
majority of nsSNPs is non-detrimental as they are deleted
by natural selection. Assessment of lethality of detrimental
nsSNPS is traced primarily based on phylogenetic infor-
mation (i.e. correlation with residue conservation) and
structural approaches (PolyPhen). However, studies have
also revealed many human disease genes as a resultant of
Table 3 continued
Gene (No. of
predicted
nsSNPS)
SNP ID AA
change
PPH2_Prob PPH2 PROVEAN
score
PROVEAN
(cutoff = -2.5)
SIFT
score
SIFT
(cutoff = 0.05)
SLC15A3 (26) rs377584250 51 G R 0.991 Probably damaging -4.34 Deleterious 0.002 Damaging
rs199743083 93 L P 0.994 Probably damaging -4.74 Deleterious 0.001 Damaging
rs201623464 187 V G 1 Probably damaging -6.04 Deleterious 0 Damaging
rs140918009 191 G C 0.992 Probably damaging -7.49 Deleterious 0 Damaging
rs142952480 197 R H 1 Probably damaging -3.35 Deleterious 0.007 Damaging
rs377073448 213 S L 1 Probably damaging -4.21 Deleterious 0.002 Damaging
rs140942899 256 P L 0.999 Probably damaging -5.93 Deleterious 0.002 Damaging
rs147368970 258 G D 1 Probably damaging -5.28 Deleterious 0 Damaging
rs373404805 318 P L 1 Probably damaging -6.64 Deleterious 0.003 Damaging
rs372172597 327 W R 1 Probably damaging -11.75 Deleterious 0.004 Damaging
rs34738190 337 Y C 0.998 Probably damaging -7.53 Deleterious 0 Damaging
rs188010920 369 T M 0.983 Probably damaging -3.17 Deleterious 0.021 Damaging
rs201019779 371 P L 1 Probably damaging -8.64 Deleterious 0.001 Damaging
rs140596697 392 R C 0.99 Probably Damaging -4.74 Deleterious 0.005 Damaging
rs140945002 400 R W 0.518 Possibly damaging -4.65 Deleterious 0.027 Damaging
rs199509067 405 P S 1 Probably damaging -4.34 Deleterious 0.025 Damaging
rs142111240 464 Q R 1 Probably damaging -3.54 Deleterious 0 Damaging
rs377527454 471 S G 1 Probably damaging -3.54 Deleterious 0 Damaging
rs148648672 473 I T 0.998 Probably damaging -3.85 Deleterious 0.002 Damaging
rs377256065 481 E Q 0.999 Probably damaging -2.63 Deleterious 0 Damaging
rs143864458 488 P L 1 Probably damaging -9.1 Deleterious 0 Damaging
rs367924777 489 R C 1 Probably damaging -4.81 Deleterious 0.001 Damaging
rs369243035 531 G E 1 Probably damaging -5.26 Deleterious 0 Damaging
rs374216372 537 R W 0.996 Probably damaging -3.22 Deleterious 0.023 Damaging
rs199844407 552 T M 0.999 Probably damaging -3.56 Deleterious 0.002 Damaging
rs371994103 562 R C 1 Probably damaging -3.57 Deleterious 0.001 Damaging
AA Change amino acid substitution, PPH2_Prob a classifier probability of the variation being damaging as predicted by PolyPhen-2, PPH2
PolyPhen-2, an automatic tool for the prediction of potential effect of an amino acid substitution on the structure and function of human proteins
by using physical and evolutionary comparative considerations, PROVEAN Score score that correlates with biological activity level and might be
used as an indicator for the amount of functional effect of a protein variation. If the PROVEAN score is B to a given threshold, the variation is
predicted as deleterious, whereas the variant is predicted to have a neutral effect if the score is above the threshold, PROVEAN Protein Variation
Effect Analyzer (PROVEAN) is a tool that predicts whether an amino acid substitution or indel effects the biological function of a protein, SIFT
Score scaled probability of an amino acid substitution being tolerated. Substitutions with scores\0.05 are predicted to affect protein function,
SIFT Sorting Tolerant From Intolerant (SIFT) is a tool to predict whether an amino acid substitution affects protein function
Genes Genom (2015) 37:679–691 689
123
exonic or noncoding mutations affecting regulatory regions
(Hudson 2003; Yan et al. 2002).
The findings from this work propose that the application
of computational algorithms, such as PROVEAN, SIFT,
and PolyPhen-2 analysis may offer a substitute approach to
pick target SNPs by understanding the effect of SNPs on
the functional attributes or molecular phenotype of a pro-
tein. In future studies, structural models of the mutants will
be built that may be applicable for predicting the delete-
rious nsSNPs, which in turn will be useful for additional
genotype-phenotype research as well as pharmacogenetics
studies.
Conclusion
Identification of hub genes that possess potential to cause
disease-related symptoms will lead to a comprehensive
approach to devise a therapeutic method for curing com-
plex diseases. This study determined interactions between
hub genes HCLS1, SLC15A3, HCK and LY96 from DEGs
of primary human macrophages. These genes act in signal
transduction, cell differentiation, peptide transport and
related important cellular pathways. Moreover, we short-
listed three functional modules related to defense and
inflammatory responses in WNV infected macrophages.
Hence, these clusters can be targeted in further studies to
devise a therapeutic target that stabilizes their gene
expression and aids in curbing symptoms related to WNV-
linked infection. In addition to targeting multiple targets,
gene-based approach aids in devising more personalized
approach, which further enhances its efficacy.
Acknowledgments The Deanship of Scientific Research (DSR),
King Abdulaziz University, Jeddah, funded this work under Grant No.
(830-011-D1434). The authors, therefore, acknowledge with thanks
DSR for technical and financial support.
Conflict of interest Authors declare no conflict of interest.
References
Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A,
Bork P, Kondrashov AS, Sunyaev SR (2010) A method and
server for predicting damaging missense mutations. Nat Method
7:248–249
Albert R (2005) Scale-free networks in cell biology. J Cell Sci
118:4947–4957
Bamshad MJ, Ng SB, Bigham AW, Tabor HK, Emond MJ, Nickerson
DA, Shendure J (2011) Exome sequencing as a tool for
Mendelian disease gene discovery. Nat Rev Genet 12:745–755
Brinton MA (2002) The molecular biology of West Nile virus: a new
invader of the western hemisphere. Annu Rev Microbiol
56:371–402
Choi Y, Sims GE, Murphy S, Miller JR, Chan AP (2012) Predicting
the functional effect of amino acid substitutions and indels. PLoS
One 7:e46688
Chowdhury P, Khan SA, Dutta P, Topno R, Mahanta J (2014)
Characterization of West Nile virus (WNV) isolates from
Assam, India: insights into the circulating WNV in northeastern
India. Comp Immunol Microb 37:39–47
Christodoulou DC, Wakimoto H, Onoue K, Eminaga S, Gorham JM,
DePalma SR, Herman DS, Teekakirikul P, Conner DA, McKean
DM et al (2014) 50RNA-Seq identifies Fhl1 as a genetic modifier
in cardiomyopathy. J Clin Invest 124:1364–1370
Crowe S, Zhu T, Muller WA (2003) The contribution of monocyte
infection and trafficking to viral persistence, and maintenance
of the viral reservoir in HIV infection. J Leukoc Biol
74:635–641
Dinh P, Sotiriou C, Piccart MJ (2007) The evolution of treatment
strategies: aiming at the target. Breast 16:10–16
Dziarski R, Gupta D (2000) Role of MD-2 in TLR2- and TLR4-
mediated recognition of Gram-negative and Gram-positive
bacteria and activation of chemokine genes. J Endotoxin Res
6:401–405
Firoz A, Malik A, Singh SK, Jha V, Ali A (2014) Comparative
analysis of glycogene expression in different mouse tissues using
RNA-Seq data. Int J Genomics 2014:837365
Heinz FX, Collett MS, Purcell RH, Gould EA, Howard CR, Houghton
M, Moormann RJM, Rice CM, Thiel HJ (2000) Family
Flaviviridae. In: van Regenmortel MHV, Fauquet CM, Bishop
DHL, Carstens EB, Estes MK, Lemon SM et al (eds) Virus
taxonomy: classification and nomenclature of viruses. Academic
Press, San Diego, pp 859–878
Huang Y, Burkhardt JK (2007) T-cell-receptor–dependent actin
regulatory mechanisms. J Cell Sci Rev 120:723–730
Huang Y, Comiskey EO, Dupree RS, Li S, Koleske AJ, Burkhardt JK
(2008) The c-Abl tyrosine kinase regulates actin remodeling at
the immune synapse. Blood 112:111–119
Hudson TJ (2003) Wanted: regulatory SNPs. Nat Genet 33:439–440
Hunt R, Sauna ZE, Ambudkar SV, Gottesman MM, Kimchi-Sarfaty C
(2009) Silent (synonymous) SNPs: should we care about them?
Methods Mol Biol 578:23–39
Ishikawa H, Barber GN (2008) STING is an endoplasmic reticulum
adaptor that facilitates innate immune signaling. Nature
455:674–678
Jones M, Dry IR, Frampton D, Singh M, Kanda RK, Yee MB, Kellam
P, Hollinshead M, Kinchington PR, Otoole EA et al (2014)
RNA-seq analysis of host and viral gene expression highlights
interaction between varicella zoster virus and keratinocyte
differentiation. PLoS Pathog 10:e1003896
Karchin R, Kelly L, Sali A (2005) Improving functional annotation of
non-synonymous SNPs with information theory. Pac Symp
Biocomput 10:397–408
Kitamura D, Kaneko H, Miyagoe Y, Ariyasu T, Watanabe T (1989)
Isolation and characterization of a novel human gene expressed
specifically in the cells of hematopoietic lineage. Nucl Acids Res
17:9367–9379
Kumar P, Henikoff S, Ng PC (2009) Predicting the effects of coding
non-synonymous variants on protein function using the SIFT
algorithm. Nat Protoc 4:1073–1081
Lee J, Lee J (2013) Hidden information revealed by optimal
community structure from a protein-complex bipartite network
improves protein function prediction. PLoS One 8:e60372
Lee EJ, Malik A, Pokharel S, Ahmad S, Mir BA, Cho KH, Kim J,
Kong JC, Lee DM, Chung KY et al (2014) Identification of
genes differentially expressed in myogenin knock-down bovine
muscle satellite cells during differentiation through RNA
sequencing analysis. PLoS One 9:e92447
690 Genes Genom (2015) 37:679–691
123
Malik A, Lee J, Lee J (2014) Community-based network study of
protein-carbohydrate interactions in plant lectins using glycan
array data. PLoS One 9:e95480
Martın JC, Bandres JC (1999) Cells of the monocyte-macrophage
lineage and pathogenesis of HIV-1 infection. J Acquir Immune
Defic Syndr 22:413–429
Martın P, Sanchez-Madrid F (2011) CD69: an unexpected regulator
of TH17 cell-driven inflammatory responses. Sci Signal 4:pe14
Mercer TR, Clark MB, Crawford J, Brunck ME, Gerhardt DJ, Taft RJ,
Nielsen LK, Dinger ME, Mattick JS (2014) Targeted sequencing
for gene discovery and quantification using RNA Cap-
tureSeq. Nat Protoc 5:989–1009
Montojo J, Zuberi K, Rodriguez H, Kazi F, Wright G, Donaldson SL,
Morris Q, Bader GD (2010) GeneMANIA Cytoscape plugin: fast
gene function predictions on the desktop. Bioinformatics
26:2927–2928
Mullen GE, Kennedy MN, Visintin A, Mazzoni A, Leifer CA, Davies
DR, Segal DM (2003) The role of disulfide bonds in the
assembly and function of MD-2. Proc Natl Acad Sci USA
100:3919–3924
Nagalakshmi U, Waern K, Snyder M (2010) RNA-Seq: a method for
comprehensive transcriptome analysis. Curr Protoc Mol Biol
11:1–13
Ohto U, Fukase K, Miyake K, Satow Y (2007) Crystal structures of
human MD-2 and its complex with antiendotoxic lipid IVa.
Science 316:1632–1634
Orenstein JM (2001) The macrophage in HIV infection. Immunobi-
ology 204:598–602
Pena J, Plante JA, Carillo AC, Roberts KK, Smith JK, Juelich TL,
Beasley DWC, Freiberg AN, Labute MX, Arani PN et al (2014)
Multiplexed digital mRNA profiling of the inflammatory
response in the west nile swiss webster mouse model. PLoS
Negl Trop Dis 8:e3216
Qian F, Chung L, Zheng W, Bruno V, Alexander RP, Montgomery
RR (2013) Identification of genes critical for resistance to
infection by West Nile virus using RNA-Seq analysis. Viruses
5:1664–1681
Ramensky V, Bork P, Sunyaev S (2002) Human non-synonymous
SNPs: server and survey. Nucl Acids Res 17:3894–3900
Sakata K, Yamashita T, Maeda M, Moriyama Y, Shimada S,
Tohyama M (2001) Cloning of a lymphatic peptide/histidine
transporter. Biochem J 356:53–60
Sasawatari S, Okamura T, Kasumi E, Tanaka-Furuyama K, Yanobu-
Takanashi R, Shirasawa S, Kato N, Toyama-Sorimachi N (2011)
The solute carrier family 15A4 regulates TLR9 and NOD1
functions in the innate immune system and promotes colitis in
mice. Gastroenterology 140:1513–1525
Scielzo C, Ghia P, Conti A, Bachi A, Guida G, Geuna M, Alessio M,
Cappio FC (2005) HS1 protein is differentially expressed in
chronic lymphocytic leukemia patient subsets with good or poor
prognoses. J Clin Invest 115:1644–1650
Skokowa J, Klimiankou M, Klimenkova O, Lan D, Gupta K, Hussein
K, Carrizosa E, Kusnetsova I, Li Z, Sustmann C et al (2012)
Interactions among HCLS1, HAX1 and LEF-1 proteins are
essential for G-CSF-triggered granulopoiesis. Nat Med
18:1550–1559
Smoot ME, Ono K, Ruscheinski J, Wang PL, Ideker T (2011)
Cytoscape 2.8: new features for data integration and network
visualization. Bioinformatics 27:431–432
Song HY, Rothe M, Goeddel DV (1996) The tumor necrosis factor-
inducible zinc finger protein A20 interacts with TRAF1/TRAF2
and inhibits NF-kappa B activation. Proc Natl Acad Sci USA
93:6721–6725
Su G, Kuchinsky A, Morris JH, States DJ, Meng F (2010) GLay:
community structure analysis of biological networks. Bioinfor-
matics 26:3135–3137
Suthar MS, Diamond MS, Gale M Jr (2013) West Nile virus infection
and immunity. Nat Rev Microbiol 11:115–128
Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, van Baren
MJ, Salzberg SL, Wold BJ, Pachter L (2010) Transcript
assembly and quantification by RNA-Seq reveals unannotated
transcripts and isoform switching during cell differentiation. Nat
Biotechnol 28:511–515
Trible RP, Emert-Sedlak L, Smithgall TE (2006) HIV-1 Nef
selectively activates Src family kinases Hck, Lyn, and c-Src
through direct SH3 domain interaction. J Biol Chem
281:27029–27038
Van Rossum AG, Schuuring-Scholtes E, Seggelen V, Kluin PM,
Schuuring E (2005) Comparative genome analysis of cortactin
and HS1: the significance of the F-actin binding repeat domain.
BMC Genom 6:15
Verrier T, Solhonne B, Sallenave JM, Garcia-Verdugo I (2012) The
WAP protein Trappin-2/Elafin: a handyman in the regulation of
inflammatory and immune responses. Int J Biochem Cell Biol
44:1377–1380
Yan H, Yuan W, Velculescu VE, Vogelstein B, Kinzler KW (2002)
Allelic variation in human gene expression. Science 297:1143
Yang Z, Nielsen R (1998) Synonymous and nonsynonymous rate
variation in nuclear genes of mammals. J Mol Evol 46:409–418
Young A, Berry R, Holloway AF, Blackburn NB, Dickinson JL,
Sakala M, Philips JL, Moore KHB (2014) RNA-seq profiling of
a radiation resistant and radiation sensitive prostate cancer cell
line highlights opposing regulation of DNA repair and targets for
radiosensitization. BMC Cancer 14:808
Yue P, Moult J (2006) Identification and analysis of deleterious
human SNPs. J Mol Biol 356:1263–1274
Ziegler SF, Marth JD, Lewis DB, Perlmutter RM (1987) Novel
protein-tyrosine kinase gene (hck) preferentially expressed in
cells of hematopoietic origin. Mol Cell Biol 7:2276–2285
Genes Genom (2015) 37:679–691 691
123