SlideShare a Scribd company logo
1 of 5
Download to read offline
International Journal of Computer Applications Technology and Research
Volume 3– Issue 7, 446 - 450, 2014
www.ijcat.com 446
Particle Swarm Optimization for Gene cluster
Identification
Jahagirdar Manasi
Department of Computer Engineering,
KKWIEER, Nashik,
University of Pune, India
S.M. Kamlapur
Department of Computer Engineering,
KWIEER, Nashik,
University of Pune, India
Abstract: The understanding of gene regulation is the most basic need for the classification of genes within a DNA. These genes
within the DNA are grouped together into clusters also known as Transcription Units. The genes are grouped into transcription units
for the purpose of construction and regulation of gene expression and synthesis of proteins. This knowledge further contributes as
essential information for the process of drug design and to determine the protein functions of newly sequenced genomes. It is possible
to use the diverse biological information across multiple genomes as an input to the classification problem. The purpose of this work is
to show that Particle Swarm Optimization may provide for more efficient classification as compared to other algorithms. To validate
the approach E.Coli complete genome is taken as the benchmark genome.
Keywords: Classification, Drug Design, Protein Synthesis, Particle Swarm Optimization, Transcription Units.
1. INTRODUCTION
The advances in Bio-technological studies, have led to the
design and implementation of a large number of computer
algorithms for bio-synthesis. Amongst which Genetic analysis
and synthesis constitutes of a major portion of research. The
genomic era has opened up opportunities for analysis of
complete gene organization, especially in prokaryotic
organisms (bacteria). These have led to interesting
conclusions about tendencies of genes and related functions.
Data clustering is the process of grouping together similar
multi-dimensional data into number of clusters or bins.
Clustering algorithms have been applied to a wide range of
problems, of which Gene analysis or Computational Biology
form a considerable part.
The availability of complete genome sequences give rise to
the need for more computational methods for discovering the
regulation and synthesis of genomes. Classifying genes into
different clusters or groups can thus enhance the knowledge
of gene function. The approach takes into account several data
sources including gene co-ordinates, regulatory control signals
etc. Knowledge of gene organization is becoming increasingly
important in the search for novel antibacterial targets and for
understanding the processes involved in bacterial
pathogenesis. Altogether, these facts point to the critical need
for gene classification in targeted organisms.
Based on the sequence and annotations of the E. coli genome,
the common features shared among pairs of adjacent genes
within operons are analyzed against pairs of adjacent genes
positioned at the boundaries of transcription units, but
transcribed in the same direction. Their differences in terms of
distances between genes, measured in base pairs, and in terms
of functional relationships are evaluated. It is also shown that
such differences can be used to develop a method to cluster
genes in the whole genome sequence. This method might help
the identification of transcription unit boundaries in other
prokaryotic genomes.
In this paper, an approach based on Swarm Intelligence is
presented to classify genes from target genome. The next
section contains the survey of the similar work done before.
Section III comprises of the proposed system description,
followed by Datasets used, Results and Conclusions.
2. LITERATURE REVIEW
Several computational methods have been devised to cluster
genes and group into a few general categories [9]. The first
one being clustering by detecting Promoters and Transcription
Terminators. A transcription unit can be identified if the
promoter and the terminator genes of a gene sequence are
identified [9]. Several algorithms have been developed to
predict rho-independent transcription terminators [2, 3]
efficient prokaryotic promoter-searching algorithm is not
available as yet, even for the model organism E.coli [3].
The drawbacks of the method mentioned above can be
overcome by the next method Construction of Hidden Markov
Model (HMM). This method was reported to classify 60% of
known genes in E.coli [4]. However, this method is difficult
to apply in organisms where promoters and terminators are
not as well characterized. The third method Probabilistic
Machine Learning Approach using Variety of Data, estimates
the probability of any consecutive sequence of genes on the
same strand to be in a transcription unit and yielded 67%
accuracy in E.coli [5]. With the generation of a large amount
of gene expression data, co-expression pattern has been used
as a tool to improve gene classification [6].
Bockhorst et al. [7] developed a Bayesian network approach
to cluster and showed the method was able to predict 78% of
E.coli transcription units with 10% false positives. However,
these methods again are only applicable to organisms in
which vast amounts of experimental data are available. The
fourth category of methods proposed using artificial
intelligence and genetic algorithms. This method was reported
to have a maximum of 88% accuracy in identification of
adjacent gene pairs to be in a transcription unit and found
75% of known transcription units in E.coli. This method has
opened the possibility of transcription unit identification in
bacterial genomes other than E.Coli.
International Journal of Computer Applications Technology and Research
Volume 3– Issue 7, 446 - 450, 2014
www.ijcat.com 447
3. DOMAIN CONCEPTS
3.1 Transcription Units
Transcription units are genetic regulatory system found in the
organisms in which genes for functionally related proteins is
clustered along a DNA. This feature allows protein synthesis
to be controlled and coordinated in response to the needs of
the cell [10]. By generating proteins only as and when
required, operons allows the cells to conserve energy. The
part of the chromosome containing genes under consideration
can be categorized into two regions: one that includes
structural genes (i.e. genes that code for protein structure) and
other is the regulatory region. This overall unit is known as an
operon.
The gene pairs can be categorized as (i) WO (Within Operon)
pair and (ii) TUB(Transcription Unit Border) pair. Adjacent
genes that fall into the same transcription unit can be termed
as WO gene pair. Whereas, the gene pair that lies at the
borders of the transcription units are termed as TUB pairs.
TRANSCRIPTION UNIT
Figure 1. WO and TUB gene Pairs
3.2 Features for Gene Classification
Five properties were originally considered for the prediction
of operons: (i) the intergenic distance, (ii) the metabolic
pathway, (iii) the COG gene function, (iv) the operon length
ratio.
Thus the intergenic distance, the gene length ratio, and the
COG gene function are generally selected to identify the
clusters of related genes. The intergenic distance property not
only plays an important role in the initial step, but also yields
good prediction results [20]. This property can be used to
universally predict operons in bacterial genomes with a
completed chromosomal sequence.
3.2.1 Intergenic Distance:
This property is defined as the distance (in bp i.e. base pairs)
between two ORF’s (Open Reading Frames). A drawback
with intergenic distance is the fact that every species has
different spacing. Also, some highly expressed operons are
exceptions to this rule, which can also lead to correct
identification of transcription units.
Distance = Gene2_start-(Gene1_end+1) (1)
3.2.2 Functional Relationship:
Operon contains genes that are often functionally related. The
Clusters of Orthologous Groups (COG) and Metabolic
Pathway are the most representative of the functional
relationship category. The proteins that are produced are often
present in the same pathway, or are a part of the same
complex. Improved clustering is expected when incorporating
this knowledge into the process.
3.2.3 Transcription Unit Length:
The length of a transcription unit is given by the number of
genes within that unit. If it contains of just one single gene,
then it is known as a singleton unit.
4. IMPLEMENTATION DETAILS
4.1 Calculation of Pair Score
The properties used in this study are the intergenic distance,
the metabolic pathway, and the COG gene function. The
fitness values of the three properties are calculated based on
the log-likelihood method as shown below.
4.1.1 Intergenic distance:
As shown, the equation given below is used to calculate the
pair-score of intergenic distance [20].
(2)
Where, NWO(property) and NTUB(property) correspond to the
number of WO and TUB pairs in the interval distance (10, 20,
30…). TNWO and TNTUB are the total pair numbers within WO
and TUB, respectively.
4.1.2 Gene length ratio:
The TUB pairs have been observed to have smaller gene
length ratio. Thus the length ratio influences the probability of
the gene pair to be in the same operon [20].
(3)
4.1.3 COG gene function:
Equation mentioned above along with the following equation
are used to calculate the COG pair-score [1].
(4)
where LLCOGd (genei, genej) represents the pair-score of
adjacent genes with a different COG gene function.
4.2 Fitness Calculation
4.2.1 Calculation of operon fitness value
While the pair-scores of each particle are calculated based on
the metabolic pathway and the COG function, the fitness
value of the operon in BPSO is calculated by multiplying the
pair-score average with the gene number in the same operon.
4.2.2 Calculation of particle fitness value
Finally, the fitness value of a particle is calculated as the sum
of the fitness values from all putative operons in the particle.
4.3 Particle Updating
Each particle is updated through an individual best (pbesti), a
global best (gbest) value, as well as other parameters. The
pbesti value represents the position of the ith
particle with the
highest fitness value at a given iteration, and gbest represents
the best position of all pbest particles.
International Journal of Computer Applications Technology and Research
Volume 3– Issue 7, 446 - 450, 2014
www.ijcat.com 448
Figure 2: Block Diagram
5. PARTICLE SWARM OPTIMIZATION
Particle swarm optimization (PSO) is a population-based
stochastic optimization technique developed by Kennedy and
Eberhart in 1995 [4]. PSO has been developed through
simulation of the social behavior of organisms, such as the
social behavior observed of birds in a flock or fish in a school.
It describes an automatically evolving system. In PSO, each
single solution is known as particle in the search space. Each
particle uses their memory and knowledge gained by the
swarm as a whole to find the optimal solution. The fitness
value of each particle is evaluated by an optimized fitness
function, and the particle velocity directs the movement of the
particles.
Each particle adjusts its position according to its own
experience during movement. In addition, each particle also
searches for the optimal solution in a search space based on
the experience of a neighboring particle, thus making use of
the best position encountered by itself and its neighbor.
The entire process is reiterated a predefined number of times
or until a minimum error is achieved. PSO has been
successfully employed to many application areas; it obtains
better results quickly and has a lower cost compared to other
methods. However, PSO is not suitable for optimization
problems in a discrete feature space. Hence, Kenney and
Eberhart developed binary PSO (BPSO) to overcome this
problem [20].
The basic elements of PSO are briefly introduced below:
(i) Population: A swarm (population) consists of N particles.
(ii) Particle position, xi: Each candidate solution can be
represented by a D-dimensional vector; the ith
particle can be
described as xi = (xi1,xi2, . . . , xiD), where xiD is the position of
the ith
particle with respect to the Dth
dimension.
(iii) Particle velocity, vi: The velocity of the ith particle is
represented by vi = (vi1,vi2, . . . ,viD), where viD is the velocity
of the ith
particle with respect to the Dth
dimension. In
addition, the velocity of a particle is limited within
[Vmin,Vmax]D
.
(iv) Inertia weight, w: The inertia weight is used to control the
impact of the previous velocity of a particle on the current
velocity.
(v) Individual best, pbesti: pbesti is the position of the ith
particle with the highest fitness value at a given iteration.
(vi) Global best, gbest: The best position of all pbest particles
is called global best.
(vii) Stopping criteria: The process is stopped after the
maximum allowed number of iterations is reached.
In the PSO algorithm, each particle represents a candidate
solution to the problem, and a swarm consists of N particles
moving around a D-dimension search space until the
computational limitations are reached.
6. RESULTS AND DISCUSSION
6.1 Data set Preparation
The entire microbial genome data were downloaded from
the GenBank database (http://www.ncbi.nlm.nih.gov/). The
related genomic information contains the gene name, the gene
ID, the position, the strand, and the product. The experimental
operon data set of the E. coli genome was obtained from
RegulonDB (http://regulondb.ccg.unam.mx/) [1], which
contains highly reliable data of validated experimental
operons of the E. coli genome. The metabolic pathway and
COG data of the genomes were obtained from KEGG
(http://www.genome.ad.jp/kegg/pathway.html) and NCBI
(http://www.ncbi.nlm.nih.gov/COG/), respectively.
International Journal of Computer Applications Technology and Research
Volume 3– Issue 7, 446 - 450, 2014
www.ijcat.com 449
Figure 2: Dataset for E.Coli
6.2 Result Set
The result set shows the pair score calculation results for the
genome data available. Each table shows the values calculated
for every property.
Table 1. Pair Scores for Intergenic Distances between the range [-100,300]
[-100,-90) 0 [0,10) 0.9906 [100,110) -0.1167 [200,210) -0.9366
[-90,-80) 0 [10,20) 2.6333 [110,120) -0.0611 [210,220) -0.5982
[-80,-70) 0 [20,30) 1.4239 [120,130) -0.2405 [220,230) -0.7404
[-70,-60) 0 [30,40) -0.4859 [130,140) -0.5619 [230,240) -1.0121
[-60,-50) 0 [40,50) -0.9955 [140,150) -0.2805 [240,250) -1.0239
[-50,-40) 0.1312 [50,60) -0.2334 [150,160) -0.7442 [250,260) -0.8495
[-40,-30) -0.7160 [60,70) 0.2374 [160,170) -0.6777 [260,270) -0.7160
[-30,-20) -0.0510 [70,80) 0.3204 [170,180) -0.2832 [270,280) -0.9893
[-20,-10) 1.0963 [80,90) 0.1059 [180,190) -0.7000 [280,290) -0.6672
[-10,0) 1.5120 [90,100) -0.0286 [190,200) -0.6615 [290,300] 0.5366
Table 2. Pair Scores for COG functions
COG function Pair Score
Information Storage and
Processing (ISP)
0.021268710361031777
Cellular Processes and
Signaling (CPAS)
-0.01032989945363101
Metabolism (M) 0.13398627322082188
Poorly Characterized
(PC)
-0.45171651491780973
International Journal of Computer Applications Technology and Research
Volume 3– Issue 7, 446 - 450, 2014
www.ijcat.com 450
7. CONCLUSION
The gene encoding that is output from the algorithm is the
closest to the actual organization of operons in the genome.
The analysis of gene data is gaining increasing importance.
This study proposes a method to identify operons at complete
genome level. The gene features like Intergenic distance,
Gene length ratio, gene clusters of orthologous groups make
feasible input parameters for identification process. This
identification can be very valuable contribution for various
genetic applications.
8. ACKNOWLEDGMENTS
We wish to express our gratitude to Prof. Swati Bhavsar,
Dept. of Microbiology, R. Y. K. college of science Nashik for
her valuable help in the data interpretation. Also sincere
thanks to Dr. Neelima Kulkarni for domain concepts.
9. REFERENCES
[1] Bockhorst,J., Craven,M., Page,D., Shavlik,J. and
Glasner,J. "A Bayesian network approach to operon
prediction." Bioinformatics, 19, 1227-1135(2003).
[2] Brendel,V. and Trifonov,E.N. "A computer algorithm for
testing potential prokaryotic terminators." Nucleic Acids Res.,
12, 4411-4427.(1984)
[3] Brendel,V. and Trifonov,E.N. "Computer-aided mapping
of DNA-protein interaction sites." Proceedings of the Ninth
International CODATA Conference, Jerusalem, Israel, pp. 17-
20, 115-118.(1984)
[4] Craven,M., Page,D., Shavlik,J., Bockhorst,J. and
Glasner,J. "A probabilistic learning approach to whole-
genome operon prediction." Proceedings of the Eighth
International Conference on Intelligent Systems for Molecular
Biology, La Jolla, CA, pp.116-127(2000).
[5] Ermolaeva,M.D., Khalak,H.G., White,O., Smith,H.O. and
Salzberg,S.L. "Prediction of transcription terminators in
bacterial genomes." J. Mol. Biol., 301, 27-33(2000).
[6] Ermolaeva,M., White,O. and Salzberg,S.L. "Prediction of
operons in microbial genomes." Nucleic Acids Res., 29, 1216-
1221(2001).
[7] J. Kennedy and R. Eberhart, "Particle swarm
optimization," in IEEE International Joint Conference on
Neural Network. vol. 4 Perth, Australia, 1995, pp. 1942-1948.
[8] Li-Yeh Chuang, Cheng-Huei Yang, Jui-Hung Tsai, and
Cheng-Hong Yang," Operon Prediction using Chaos
Embedded Particle Swarm Optimization", IEEE-ACM
TRANS- ACTIONS ON COMPUTATIONAL BIOLOGY
AND BIOINFORMATICS(2013).
[9] L. Wang, J. D. Trawick, R. Yamamoto, and C. Zamudio,
"Genome-wide operon prediction in Staphylococcus aureus,"
Nucleic Acids Res., vol. 32, pp. 3689-702, 2004.
[10] L. Y. Chuang, J. H. Tsai, and C. H. Yang, "Binary
particle swarm optimization for operon prediction," Nucleic
acids research, vol. 38, p. e128(2010).
[11] L. Y. Chuang, J. H. Tsai, and C. H. Yang,
"Complementary Binary particle swarm optimization for
operon prediction," Nucleic acids research, (2010).
[12] Mironov,A.A., Koonin,E.V., Roytberg,M.A. and
Gelfand,M.S. "Computer analysis of transcription regulatory
patterns in completely sequenced bacterial genomes." Nucleic
Acids Res., 27, 2981-2989(1999).
[13] Overbeek,R., Fonstein,M., D'Souza,M., Pusch,G.D. and
Maltsev,N. "The use of gene clusters to infer functional
coupling." Proc. Natl Acad. Sci. USA, 96, 2896-2901(2002).
[14] Ozoline,O.N., Deev,A.A. and Arkhipova,M.V. "Non-
canonical sequence elements in the promoter structure.
Cluster analysis of promoters recognized by Escherichia coli
RNA polymerase." Nucleic Acids Res., 23, 4703 4709(1997).
[15] Sabatti,C., Rohlin,L., Oh,M. and Liao,J.C. "Co-
expression pattern from DNA microarray experiments as a
tool for operon prediction." Nucleic Acids Res., 30, 2886-
2893(2002).
[16] Salgado,H., Moreno-Hagelsieb,G., Smith,T.F. and
Collado-Vides,J. "Operons in Escherichia coli: genomic
analyses and predictions." Proc. Natl Acad. Sci. USA, 97,
6652-6657(2000).
[17] Unniraman,S., Prakash,R. and Nagaraja,V. "Conserved
economics of transcription termination in eubacteria." Nucleic
Acids Res., 30, 675-684(2002).
[18] Vitreschak,A.G, Rodionov,D.A., Mironov,A.A. and
Gelfand,M.S. "Regulation of riboavin biosynthesis and
transport genes in bacteria by transcriptional and translational
attenuation." Nucleic Acids Res., 30, 3141-3151(2002)
[19] Wolf,Y.I., Rogozin,I.B., Kondrashov,A.S. and
Koonin,E.V. "Genome alignment, evolution of prokaryotic
genome organization, and prediction of gene function using
genomic context." Genome Res.,11, 356-372(2002).
[20] Yada,T., Nakao,M., Totoki,Y. and Nakai,K. "Modeling
and predicting transcriptional units of Escherichia coli genes
using hidden Markov models". Bioinformatics, 15, 987-
993(1999).
[21] Zheng,Y., Szustakowski,J.D., Fortnow,L., Roberts,R.J.
and Kasif,S. "Computational identi_cation of operons in
microbial genomes". Genome Res., 12, 1221-1230(2002).

More Related Content

What's hot

Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.sangeeta jadav
 
Protein protein interaction
Protein protein interactionProtein protein interaction
Protein protein interactionAashish Patel
 
Bioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sirBioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sirKAUSHAL SAHU
 
Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)Sijo A
 
A Review of Various Methods Used in the Analysis of Functional Gene Expressio...
A Review of Various Methods Used in the Analysis of Functional Gene Expressio...A Review of Various Methods Used in the Analysis of Functional Gene Expressio...
A Review of Various Methods Used in the Analysis of Functional Gene Expressio...ijitcs
 
Functional genomics, a conceptual approach
Functional genomics, a conceptual approachFunctional genomics, a conceptual approach
Functional genomics, a conceptual approachKAUSHAL SAHU
 
Bioinformatics and functional genomics
Bioinformatics and functional genomicsBioinformatics and functional genomics
Bioinformatics and functional genomicsAisha Kalsoom
 
Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)Sai Ram
 
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSIONCOMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSIONcsandit
 
BioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsBioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsAyeshaYousaf20
 
Report on System Biology Funding from BMBF
Report on System Biology Funding from BMBFReport on System Biology Funding from BMBF
Report on System Biology Funding from BMBFEuroBioForum
 
Construction of phylogenetic tree from multiple gene trees using principal co...
Construction of phylogenetic tree from multiple gene trees using principal co...Construction of phylogenetic tree from multiple gene trees using principal co...
Construction of phylogenetic tree from multiple gene trees using principal co...IAEME Publication
 
Experimental methods and the big data sets
Experimental methods and the big data sets Experimental methods and the big data sets
Experimental methods and the big data sets improvemed
 
A Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products ResearchA Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products ResearchHuda Nazeer
 

What's hot (20)

Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.
 
genomic comparison
genomic comparison genomic comparison
genomic comparison
 
Protein protein interaction
Protein protein interactionProtein protein interaction
Protein protein interaction
 
Bioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sirBioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sir
 
Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)
 
A Review of Various Methods Used in the Analysis of Functional Gene Expressio...
A Review of Various Methods Used in the Analysis of Functional Gene Expressio...A Review of Various Methods Used in the Analysis of Functional Gene Expressio...
A Review of Various Methods Used in the Analysis of Functional Gene Expressio...
 
Functional genomics, a conceptual approach
Functional genomics, a conceptual approachFunctional genomics, a conceptual approach
Functional genomics, a conceptual approach
 
Bioinformatics and functional genomics
Bioinformatics and functional genomicsBioinformatics and functional genomics
Bioinformatics and functional genomics
 
Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)
 
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSIONCOMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
 
BioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsBioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomics
 
Report on System Biology Funding from BMBF
Report on System Biology Funding from BMBFReport on System Biology Funding from BMBF
Report on System Biology Funding from BMBF
 
Construction of phylogenetic tree from multiple gene trees using principal co...
Construction of phylogenetic tree from multiple gene trees using principal co...Construction of phylogenetic tree from multiple gene trees using principal co...
Construction of phylogenetic tree from multiple gene trees using principal co...
 
Techniques in proteomics
Techniques in proteomicsTechniques in proteomics
Techniques in proteomics
 
1207.2600
1207.26001207.2600
1207.2600
 
Salisha ppt (1) (1)
Salisha ppt (1) (1)Salisha ppt (1) (1)
Salisha ppt (1) (1)
 
protein microarray
protein microarray protein microarray
protein microarray
 
Experimental methods and the big data sets
Experimental methods and the big data sets Experimental methods and the big data sets
Experimental methods and the big data sets
 
A Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products ResearchA Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products Research
 

Viewers also liked

Enhanced Detection System for Trust Aware P2P Communication Networks
Enhanced Detection System for Trust Aware P2P Communication NetworksEnhanced Detection System for Trust Aware P2P Communication Networks
Enhanced Detection System for Trust Aware P2P Communication NetworksEditor IJCATR
 
Design and Implementation of Refresh and Timing Controller Unit for LPDDR2 Me...
Design and Implementation of Refresh and Timing Controller Unit for LPDDR2 Me...Design and Implementation of Refresh and Timing Controller Unit for LPDDR2 Me...
Design and Implementation of Refresh and Timing Controller Unit for LPDDR2 Me...Editor IJCATR
 
Knowledge Management in the Cloud: Benefits and Risks
Knowledge Management in the Cloud: Benefits and RisksKnowledge Management in the Cloud: Benefits and Risks
Knowledge Management in the Cloud: Benefits and RisksEditor IJCATR
 
Dynamic Chunks Distribution Scheme for Multiservice Load Balancing Using Fibo...
Dynamic Chunks Distribution Scheme for Multiservice Load Balancing Using Fibo...Dynamic Chunks Distribution Scheme for Multiservice Load Balancing Using Fibo...
Dynamic Chunks Distribution Scheme for Multiservice Load Balancing Using Fibo...Editor IJCATR
 
RED: A HIGH LINK UTILIZATION AND FAIR ALGORITHM
RED: A HIGH LINK UTILIZATION AND FAIR ALGORITHMRED: A HIGH LINK UTILIZATION AND FAIR ALGORITHM
RED: A HIGH LINK UTILIZATION AND FAIR ALGORITHMEditor IJCATR
 
A Review on Classification Based Approaches for STEGanalysis Detection
A Review on Classification Based Approaches for STEGanalysis DetectionA Review on Classification Based Approaches for STEGanalysis Detection
A Review on Classification Based Approaches for STEGanalysis DetectionEditor IJCATR
 
BRAIN MACHINE INTERFACE SYSTEM FOR PERSON WITH QUADRIPLEGIA DISEASE
BRAIN MACHINE INTERFACE SYSTEM FOR PERSON WITH QUADRIPLEGIA DISEASEBRAIN MACHINE INTERFACE SYSTEM FOR PERSON WITH QUADRIPLEGIA DISEASE
BRAIN MACHINE INTERFACE SYSTEM FOR PERSON WITH QUADRIPLEGIA DISEASEEditor IJCATR
 
Assistive System Using Eye Gaze Estimation for Amyotrophic Lateral Sclerosis ...
Assistive System Using Eye Gaze Estimation for Amyotrophic Lateral Sclerosis ...Assistive System Using Eye Gaze Estimation for Amyotrophic Lateral Sclerosis ...
Assistive System Using Eye Gaze Estimation for Amyotrophic Lateral Sclerosis ...Editor IJCATR
 

Viewers also liked (8)

Enhanced Detection System for Trust Aware P2P Communication Networks
Enhanced Detection System for Trust Aware P2P Communication NetworksEnhanced Detection System for Trust Aware P2P Communication Networks
Enhanced Detection System for Trust Aware P2P Communication Networks
 
Design and Implementation of Refresh and Timing Controller Unit for LPDDR2 Me...
Design and Implementation of Refresh and Timing Controller Unit for LPDDR2 Me...Design and Implementation of Refresh and Timing Controller Unit for LPDDR2 Me...
Design and Implementation of Refresh and Timing Controller Unit for LPDDR2 Me...
 
Knowledge Management in the Cloud: Benefits and Risks
Knowledge Management in the Cloud: Benefits and RisksKnowledge Management in the Cloud: Benefits and Risks
Knowledge Management in the Cloud: Benefits and Risks
 
Dynamic Chunks Distribution Scheme for Multiservice Load Balancing Using Fibo...
Dynamic Chunks Distribution Scheme for Multiservice Load Balancing Using Fibo...Dynamic Chunks Distribution Scheme for Multiservice Load Balancing Using Fibo...
Dynamic Chunks Distribution Scheme for Multiservice Load Balancing Using Fibo...
 
RED: A HIGH LINK UTILIZATION AND FAIR ALGORITHM
RED: A HIGH LINK UTILIZATION AND FAIR ALGORITHMRED: A HIGH LINK UTILIZATION AND FAIR ALGORITHM
RED: A HIGH LINK UTILIZATION AND FAIR ALGORITHM
 
A Review on Classification Based Approaches for STEGanalysis Detection
A Review on Classification Based Approaches for STEGanalysis DetectionA Review on Classification Based Approaches for STEGanalysis Detection
A Review on Classification Based Approaches for STEGanalysis Detection
 
BRAIN MACHINE INTERFACE SYSTEM FOR PERSON WITH QUADRIPLEGIA DISEASE
BRAIN MACHINE INTERFACE SYSTEM FOR PERSON WITH QUADRIPLEGIA DISEASEBRAIN MACHINE INTERFACE SYSTEM FOR PERSON WITH QUADRIPLEGIA DISEASE
BRAIN MACHINE INTERFACE SYSTEM FOR PERSON WITH QUADRIPLEGIA DISEASE
 
Assistive System Using Eye Gaze Estimation for Amyotrophic Lateral Sclerosis ...
Assistive System Using Eye Gaze Estimation for Amyotrophic Lateral Sclerosis ...Assistive System Using Eye Gaze Estimation for Amyotrophic Lateral Sclerosis ...
Assistive System Using Eye Gaze Estimation for Amyotrophic Lateral Sclerosis ...
 

Similar to Particle Swarm Optimization for Gene cluster Identification

International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentIJERD Editor
 
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSTRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSHEETHUMOLKS
 
IRJET- Disease Identification using Proteins Values and Regulatory Modules
IRJET-  	  Disease Identification using Proteins Values and Regulatory  ModulesIRJET-  	  Disease Identification using Proteins Values and Regulatory  Modules
IRJET- Disease Identification using Proteins Values and Regulatory ModulesIRJET Journal
 
Classification of Microarray Gene Expression Data by Gene Combinations using ...
Classification of Microarray Gene Expression Data by Gene Combinations using ...Classification of Microarray Gene Expression Data by Gene Combinations using ...
Classification of Microarray Gene Expression Data by Gene Combinations using ...IJCSEA Journal
 
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptxBTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptxChijiokeNsofor
 
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...rahulmonikasharma
 
Chapter 20 ppt
Chapter 20 pptChapter 20 ppt
Chapter 20 pptrehman2009
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicsAthira RG
 
Functional proteomics, and tools
Functional proteomics, and toolsFunctional proteomics, and tools
Functional proteomics, and toolsKAUSHAL SAHU
 
Biological Significance of Gene Expression Data Using Similarity Based Biclus...
Biological Significance of Gene Expression Data Using Similarity Based Biclus...Biological Significance of Gene Expression Data Using Similarity Based Biclus...
Biological Significance of Gene Expression Data Using Similarity Based Biclus...CSCJournals
 
Bio Scope
Bio ScopeBio Scope
Bio ScopeStartup
 
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...ijbbjournal
 
Current trends in pseduogene detection and characterization
Current trends in pseduogene detection and characterizationCurrent trends in pseduogene detection and characterization
Current trends in pseduogene detection and characterizationShreya Feliz
 
Improving the effectiveness of information retrieval system using adaptive ge...
Improving the effectiveness of information retrieval system using adaptive ge...Improving the effectiveness of information retrieval system using adaptive ge...
Improving the effectiveness of information retrieval system using adaptive ge...ijcsit
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformaticsmaulikchaudhary8
 
Gene Selection for Sample Classification in Microarray: Clustering Based Method
Gene Selection for Sample Classification in Microarray: Clustering Based MethodGene Selection for Sample Classification in Microarray: Clustering Based Method
Gene Selection for Sample Classification in Microarray: Clustering Based MethodIOSR Journals
 
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...IRJET Journal
 
Gene identification and discovery
Gene identification and discoveryGene identification and discovery
Gene identification and discoveryAmit Ruchi Yadav
 
Bioinformatics.Practical Notebook
Bioinformatics.Practical NotebookBioinformatics.Practical Notebook
Bioinformatics.Practical NotebookNaima Tahsin
 

Similar to Particle Swarm Optimization for Gene cluster Identification (20)

International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSTRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
 
IRJET- Disease Identification using Proteins Values and Regulatory Modules
IRJET-  	  Disease Identification using Proteins Values and Regulatory  ModulesIRJET-  	  Disease Identification using Proteins Values and Regulatory  Modules
IRJET- Disease Identification using Proteins Values and Regulatory Modules
 
Classification of Microarray Gene Expression Data by Gene Combinations using ...
Classification of Microarray Gene Expression Data by Gene Combinations using ...Classification of Microarray Gene Expression Data by Gene Combinations using ...
Classification of Microarray Gene Expression Data by Gene Combinations using ...
 
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptxBTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
 
Gene identification using bioinformatic tools.pptx
Gene identification using bioinformatic tools.pptxGene identification using bioinformatic tools.pptx
Gene identification using bioinformatic tools.pptx
 
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
 
Chapter 20 ppt
Chapter 20 pptChapter 20 ppt
Chapter 20 ppt
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Functional proteomics, and tools
Functional proteomics, and toolsFunctional proteomics, and tools
Functional proteomics, and tools
 
Biological Significance of Gene Expression Data Using Similarity Based Biclus...
Biological Significance of Gene Expression Data Using Similarity Based Biclus...Biological Significance of Gene Expression Data Using Similarity Based Biclus...
Biological Significance of Gene Expression Data Using Similarity Based Biclus...
 
Bio Scope
Bio ScopeBio Scope
Bio Scope
 
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
 
Current trends in pseduogene detection and characterization
Current trends in pseduogene detection and characterizationCurrent trends in pseduogene detection and characterization
Current trends in pseduogene detection and characterization
 
Improving the effectiveness of information retrieval system using adaptive ge...
Improving the effectiveness of information retrieval system using adaptive ge...Improving the effectiveness of information retrieval system using adaptive ge...
Improving the effectiveness of information retrieval system using adaptive ge...
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Gene Selection for Sample Classification in Microarray: Clustering Based Method
Gene Selection for Sample Classification in Microarray: Clustering Based MethodGene Selection for Sample Classification in Microarray: Clustering Based Method
Gene Selection for Sample Classification in Microarray: Clustering Based Method
 
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...
 
Gene identification and discovery
Gene identification and discoveryGene identification and discovery
Gene identification and discovery
 
Bioinformatics.Practical Notebook
Bioinformatics.Practical NotebookBioinformatics.Practical Notebook
Bioinformatics.Practical Notebook
 

More from Editor IJCATR

Text Mining in Digital Libraries using OKAPI BM25 Model
 Text Mining in Digital Libraries using OKAPI BM25 Model Text Mining in Digital Libraries using OKAPI BM25 Model
Text Mining in Digital Libraries using OKAPI BM25 ModelEditor IJCATR
 
Green Computing, eco trends, climate change, e-waste and eco-friendly
Green Computing, eco trends, climate change, e-waste and eco-friendlyGreen Computing, eco trends, climate change, e-waste and eco-friendly
Green Computing, eco trends, climate change, e-waste and eco-friendlyEditor IJCATR
 
Policies for Green Computing and E-Waste in Nigeria
 Policies for Green Computing and E-Waste in Nigeria Policies for Green Computing and E-Waste in Nigeria
Policies for Green Computing and E-Waste in NigeriaEditor IJCATR
 
Performance Evaluation of VANETs for Evaluating Node Stability in Dynamic Sce...
Performance Evaluation of VANETs for Evaluating Node Stability in Dynamic Sce...Performance Evaluation of VANETs for Evaluating Node Stability in Dynamic Sce...
Performance Evaluation of VANETs for Evaluating Node Stability in Dynamic Sce...Editor IJCATR
 
Optimum Location of DG Units Considering Operation Conditions
Optimum Location of DG Units Considering Operation ConditionsOptimum Location of DG Units Considering Operation Conditions
Optimum Location of DG Units Considering Operation ConditionsEditor IJCATR
 
Analysis of Comparison of Fuzzy Knn, C4.5 Algorithm, and Naïve Bayes Classifi...
Analysis of Comparison of Fuzzy Knn, C4.5 Algorithm, and Naïve Bayes Classifi...Analysis of Comparison of Fuzzy Knn, C4.5 Algorithm, and Naïve Bayes Classifi...
Analysis of Comparison of Fuzzy Knn, C4.5 Algorithm, and Naïve Bayes Classifi...Editor IJCATR
 
Web Scraping for Estimating new Record from Source Site
Web Scraping for Estimating new Record from Source SiteWeb Scraping for Estimating new Record from Source Site
Web Scraping for Estimating new Record from Source SiteEditor IJCATR
 
Evaluating Semantic Similarity between Biomedical Concepts/Classes through S...
 Evaluating Semantic Similarity between Biomedical Concepts/Classes through S... Evaluating Semantic Similarity between Biomedical Concepts/Classes through S...
Evaluating Semantic Similarity between Biomedical Concepts/Classes through S...Editor IJCATR
 
Semantic Similarity Measures between Terms in the Biomedical Domain within f...
 Semantic Similarity Measures between Terms in the Biomedical Domain within f... Semantic Similarity Measures between Terms in the Biomedical Domain within f...
Semantic Similarity Measures between Terms in the Biomedical Domain within f...Editor IJCATR
 
A Strategy for Improving the Performance of Small Files in Openstack Swift
 A Strategy for Improving the Performance of Small Files in Openstack Swift  A Strategy for Improving the Performance of Small Files in Openstack Swift
A Strategy for Improving the Performance of Small Files in Openstack Swift Editor IJCATR
 
Integrated System for Vehicle Clearance and Registration
Integrated System for Vehicle Clearance and RegistrationIntegrated System for Vehicle Clearance and Registration
Integrated System for Vehicle Clearance and RegistrationEditor IJCATR
 
Assessment of the Efficiency of Customer Order Management System: A Case Stu...
 Assessment of the Efficiency of Customer Order Management System: A Case Stu... Assessment of the Efficiency of Customer Order Management System: A Case Stu...
Assessment of the Efficiency of Customer Order Management System: A Case Stu...Editor IJCATR
 
Energy-Aware Routing in Wireless Sensor Network Using Modified Bi-Directional A*
Energy-Aware Routing in Wireless Sensor Network Using Modified Bi-Directional A*Energy-Aware Routing in Wireless Sensor Network Using Modified Bi-Directional A*
Energy-Aware Routing in Wireless Sensor Network Using Modified Bi-Directional A*Editor IJCATR
 
Security in Software Defined Networks (SDN): Challenges and Research Opportun...
Security in Software Defined Networks (SDN): Challenges and Research Opportun...Security in Software Defined Networks (SDN): Challenges and Research Opportun...
Security in Software Defined Networks (SDN): Challenges and Research Opportun...Editor IJCATR
 
Measure the Similarity of Complaint Document Using Cosine Similarity Based on...
Measure the Similarity of Complaint Document Using Cosine Similarity Based on...Measure the Similarity of Complaint Document Using Cosine Similarity Based on...
Measure the Similarity of Complaint Document Using Cosine Similarity Based on...Editor IJCATR
 
Hangul Recognition Using Support Vector Machine
Hangul Recognition Using Support Vector MachineHangul Recognition Using Support Vector Machine
Hangul Recognition Using Support Vector MachineEditor IJCATR
 
Application of 3D Printing in Education
Application of 3D Printing in EducationApplication of 3D Printing in Education
Application of 3D Printing in EducationEditor IJCATR
 
Survey on Energy-Efficient Routing Algorithms for Underwater Wireless Sensor ...
Survey on Energy-Efficient Routing Algorithms for Underwater Wireless Sensor ...Survey on Energy-Efficient Routing Algorithms for Underwater Wireless Sensor ...
Survey on Energy-Efficient Routing Algorithms for Underwater Wireless Sensor ...Editor IJCATR
 
Comparative analysis on Void Node Removal Routing algorithms for Underwater W...
Comparative analysis on Void Node Removal Routing algorithms for Underwater W...Comparative analysis on Void Node Removal Routing algorithms for Underwater W...
Comparative analysis on Void Node Removal Routing algorithms for Underwater W...Editor IJCATR
 
Decay Property for Solutions to Plate Type Equations with Variable Coefficients
Decay Property for Solutions to Plate Type Equations with Variable CoefficientsDecay Property for Solutions to Plate Type Equations with Variable Coefficients
Decay Property for Solutions to Plate Type Equations with Variable CoefficientsEditor IJCATR
 

More from Editor IJCATR (20)

Text Mining in Digital Libraries using OKAPI BM25 Model
 Text Mining in Digital Libraries using OKAPI BM25 Model Text Mining in Digital Libraries using OKAPI BM25 Model
Text Mining in Digital Libraries using OKAPI BM25 Model
 
Green Computing, eco trends, climate change, e-waste and eco-friendly
Green Computing, eco trends, climate change, e-waste and eco-friendlyGreen Computing, eco trends, climate change, e-waste and eco-friendly
Green Computing, eco trends, climate change, e-waste and eco-friendly
 
Policies for Green Computing and E-Waste in Nigeria
 Policies for Green Computing and E-Waste in Nigeria Policies for Green Computing and E-Waste in Nigeria
Policies for Green Computing and E-Waste in Nigeria
 
Performance Evaluation of VANETs for Evaluating Node Stability in Dynamic Sce...
Performance Evaluation of VANETs for Evaluating Node Stability in Dynamic Sce...Performance Evaluation of VANETs for Evaluating Node Stability in Dynamic Sce...
Performance Evaluation of VANETs for Evaluating Node Stability in Dynamic Sce...
 
Optimum Location of DG Units Considering Operation Conditions
Optimum Location of DG Units Considering Operation ConditionsOptimum Location of DG Units Considering Operation Conditions
Optimum Location of DG Units Considering Operation Conditions
 
Analysis of Comparison of Fuzzy Knn, C4.5 Algorithm, and Naïve Bayes Classifi...
Analysis of Comparison of Fuzzy Knn, C4.5 Algorithm, and Naïve Bayes Classifi...Analysis of Comparison of Fuzzy Knn, C4.5 Algorithm, and Naïve Bayes Classifi...
Analysis of Comparison of Fuzzy Knn, C4.5 Algorithm, and Naïve Bayes Classifi...
 
Web Scraping for Estimating new Record from Source Site
Web Scraping for Estimating new Record from Source SiteWeb Scraping for Estimating new Record from Source Site
Web Scraping for Estimating new Record from Source Site
 
Evaluating Semantic Similarity between Biomedical Concepts/Classes through S...
 Evaluating Semantic Similarity between Biomedical Concepts/Classes through S... Evaluating Semantic Similarity between Biomedical Concepts/Classes through S...
Evaluating Semantic Similarity between Biomedical Concepts/Classes through S...
 
Semantic Similarity Measures between Terms in the Biomedical Domain within f...
 Semantic Similarity Measures between Terms in the Biomedical Domain within f... Semantic Similarity Measures between Terms in the Biomedical Domain within f...
Semantic Similarity Measures between Terms in the Biomedical Domain within f...
 
A Strategy for Improving the Performance of Small Files in Openstack Swift
 A Strategy for Improving the Performance of Small Files in Openstack Swift  A Strategy for Improving the Performance of Small Files in Openstack Swift
A Strategy for Improving the Performance of Small Files in Openstack Swift
 
Integrated System for Vehicle Clearance and Registration
Integrated System for Vehicle Clearance and RegistrationIntegrated System for Vehicle Clearance and Registration
Integrated System for Vehicle Clearance and Registration
 
Assessment of the Efficiency of Customer Order Management System: A Case Stu...
 Assessment of the Efficiency of Customer Order Management System: A Case Stu... Assessment of the Efficiency of Customer Order Management System: A Case Stu...
Assessment of the Efficiency of Customer Order Management System: A Case Stu...
 
Energy-Aware Routing in Wireless Sensor Network Using Modified Bi-Directional A*
Energy-Aware Routing in Wireless Sensor Network Using Modified Bi-Directional A*Energy-Aware Routing in Wireless Sensor Network Using Modified Bi-Directional A*
Energy-Aware Routing in Wireless Sensor Network Using Modified Bi-Directional A*
 
Security in Software Defined Networks (SDN): Challenges and Research Opportun...
Security in Software Defined Networks (SDN): Challenges and Research Opportun...Security in Software Defined Networks (SDN): Challenges and Research Opportun...
Security in Software Defined Networks (SDN): Challenges and Research Opportun...
 
Measure the Similarity of Complaint Document Using Cosine Similarity Based on...
Measure the Similarity of Complaint Document Using Cosine Similarity Based on...Measure the Similarity of Complaint Document Using Cosine Similarity Based on...
Measure the Similarity of Complaint Document Using Cosine Similarity Based on...
 
Hangul Recognition Using Support Vector Machine
Hangul Recognition Using Support Vector MachineHangul Recognition Using Support Vector Machine
Hangul Recognition Using Support Vector Machine
 
Application of 3D Printing in Education
Application of 3D Printing in EducationApplication of 3D Printing in Education
Application of 3D Printing in Education
 
Survey on Energy-Efficient Routing Algorithms for Underwater Wireless Sensor ...
Survey on Energy-Efficient Routing Algorithms for Underwater Wireless Sensor ...Survey on Energy-Efficient Routing Algorithms for Underwater Wireless Sensor ...
Survey on Energy-Efficient Routing Algorithms for Underwater Wireless Sensor ...
 
Comparative analysis on Void Node Removal Routing algorithms for Underwater W...
Comparative analysis on Void Node Removal Routing algorithms for Underwater W...Comparative analysis on Void Node Removal Routing algorithms for Underwater W...
Comparative analysis on Void Node Removal Routing algorithms for Underwater W...
 
Decay Property for Solutions to Plate Type Equations with Variable Coefficients
Decay Property for Solutions to Plate Type Equations with Variable CoefficientsDecay Property for Solutions to Plate Type Equations with Variable Coefficients
Decay Property for Solutions to Plate Type Equations with Variable Coefficients
 

Recently uploaded

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 

Recently uploaded (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 

Particle Swarm Optimization for Gene cluster Identification

  • 1. International Journal of Computer Applications Technology and Research Volume 3– Issue 7, 446 - 450, 2014 www.ijcat.com 446 Particle Swarm Optimization for Gene cluster Identification Jahagirdar Manasi Department of Computer Engineering, KKWIEER, Nashik, University of Pune, India S.M. Kamlapur Department of Computer Engineering, KWIEER, Nashik, University of Pune, India Abstract: The understanding of gene regulation is the most basic need for the classification of genes within a DNA. These genes within the DNA are grouped together into clusters also known as Transcription Units. The genes are grouped into transcription units for the purpose of construction and regulation of gene expression and synthesis of proteins. This knowledge further contributes as essential information for the process of drug design and to determine the protein functions of newly sequenced genomes. It is possible to use the diverse biological information across multiple genomes as an input to the classification problem. The purpose of this work is to show that Particle Swarm Optimization may provide for more efficient classification as compared to other algorithms. To validate the approach E.Coli complete genome is taken as the benchmark genome. Keywords: Classification, Drug Design, Protein Synthesis, Particle Swarm Optimization, Transcription Units. 1. INTRODUCTION The advances in Bio-technological studies, have led to the design and implementation of a large number of computer algorithms for bio-synthesis. Amongst which Genetic analysis and synthesis constitutes of a major portion of research. The genomic era has opened up opportunities for analysis of complete gene organization, especially in prokaryotic organisms (bacteria). These have led to interesting conclusions about tendencies of genes and related functions. Data clustering is the process of grouping together similar multi-dimensional data into number of clusters or bins. Clustering algorithms have been applied to a wide range of problems, of which Gene analysis or Computational Biology form a considerable part. The availability of complete genome sequences give rise to the need for more computational methods for discovering the regulation and synthesis of genomes. Classifying genes into different clusters or groups can thus enhance the knowledge of gene function. The approach takes into account several data sources including gene co-ordinates, regulatory control signals etc. Knowledge of gene organization is becoming increasingly important in the search for novel antibacterial targets and for understanding the processes involved in bacterial pathogenesis. Altogether, these facts point to the critical need for gene classification in targeted organisms. Based on the sequence and annotations of the E. coli genome, the common features shared among pairs of adjacent genes within operons are analyzed against pairs of adjacent genes positioned at the boundaries of transcription units, but transcribed in the same direction. Their differences in terms of distances between genes, measured in base pairs, and in terms of functional relationships are evaluated. It is also shown that such differences can be used to develop a method to cluster genes in the whole genome sequence. This method might help the identification of transcription unit boundaries in other prokaryotic genomes. In this paper, an approach based on Swarm Intelligence is presented to classify genes from target genome. The next section contains the survey of the similar work done before. Section III comprises of the proposed system description, followed by Datasets used, Results and Conclusions. 2. LITERATURE REVIEW Several computational methods have been devised to cluster genes and group into a few general categories [9]. The first one being clustering by detecting Promoters and Transcription Terminators. A transcription unit can be identified if the promoter and the terminator genes of a gene sequence are identified [9]. Several algorithms have been developed to predict rho-independent transcription terminators [2, 3] efficient prokaryotic promoter-searching algorithm is not available as yet, even for the model organism E.coli [3]. The drawbacks of the method mentioned above can be overcome by the next method Construction of Hidden Markov Model (HMM). This method was reported to classify 60% of known genes in E.coli [4]. However, this method is difficult to apply in organisms where promoters and terminators are not as well characterized. The third method Probabilistic Machine Learning Approach using Variety of Data, estimates the probability of any consecutive sequence of genes on the same strand to be in a transcription unit and yielded 67% accuracy in E.coli [5]. With the generation of a large amount of gene expression data, co-expression pattern has been used as a tool to improve gene classification [6]. Bockhorst et al. [7] developed a Bayesian network approach to cluster and showed the method was able to predict 78% of E.coli transcription units with 10% false positives. However, these methods again are only applicable to organisms in which vast amounts of experimental data are available. The fourth category of methods proposed using artificial intelligence and genetic algorithms. This method was reported to have a maximum of 88% accuracy in identification of adjacent gene pairs to be in a transcription unit and found 75% of known transcription units in E.coli. This method has opened the possibility of transcription unit identification in bacterial genomes other than E.Coli.
  • 2. International Journal of Computer Applications Technology and Research Volume 3– Issue 7, 446 - 450, 2014 www.ijcat.com 447 3. DOMAIN CONCEPTS 3.1 Transcription Units Transcription units are genetic regulatory system found in the organisms in which genes for functionally related proteins is clustered along a DNA. This feature allows protein synthesis to be controlled and coordinated in response to the needs of the cell [10]. By generating proteins only as and when required, operons allows the cells to conserve energy. The part of the chromosome containing genes under consideration can be categorized into two regions: one that includes structural genes (i.e. genes that code for protein structure) and other is the regulatory region. This overall unit is known as an operon. The gene pairs can be categorized as (i) WO (Within Operon) pair and (ii) TUB(Transcription Unit Border) pair. Adjacent genes that fall into the same transcription unit can be termed as WO gene pair. Whereas, the gene pair that lies at the borders of the transcription units are termed as TUB pairs. TRANSCRIPTION UNIT Figure 1. WO and TUB gene Pairs 3.2 Features for Gene Classification Five properties were originally considered for the prediction of operons: (i) the intergenic distance, (ii) the metabolic pathway, (iii) the COG gene function, (iv) the operon length ratio. Thus the intergenic distance, the gene length ratio, and the COG gene function are generally selected to identify the clusters of related genes. The intergenic distance property not only plays an important role in the initial step, but also yields good prediction results [20]. This property can be used to universally predict operons in bacterial genomes with a completed chromosomal sequence. 3.2.1 Intergenic Distance: This property is defined as the distance (in bp i.e. base pairs) between two ORF’s (Open Reading Frames). A drawback with intergenic distance is the fact that every species has different spacing. Also, some highly expressed operons are exceptions to this rule, which can also lead to correct identification of transcription units. Distance = Gene2_start-(Gene1_end+1) (1) 3.2.2 Functional Relationship: Operon contains genes that are often functionally related. The Clusters of Orthologous Groups (COG) and Metabolic Pathway are the most representative of the functional relationship category. The proteins that are produced are often present in the same pathway, or are a part of the same complex. Improved clustering is expected when incorporating this knowledge into the process. 3.2.3 Transcription Unit Length: The length of a transcription unit is given by the number of genes within that unit. If it contains of just one single gene, then it is known as a singleton unit. 4. IMPLEMENTATION DETAILS 4.1 Calculation of Pair Score The properties used in this study are the intergenic distance, the metabolic pathway, and the COG gene function. The fitness values of the three properties are calculated based on the log-likelihood method as shown below. 4.1.1 Intergenic distance: As shown, the equation given below is used to calculate the pair-score of intergenic distance [20]. (2) Where, NWO(property) and NTUB(property) correspond to the number of WO and TUB pairs in the interval distance (10, 20, 30…). TNWO and TNTUB are the total pair numbers within WO and TUB, respectively. 4.1.2 Gene length ratio: The TUB pairs have been observed to have smaller gene length ratio. Thus the length ratio influences the probability of the gene pair to be in the same operon [20]. (3) 4.1.3 COG gene function: Equation mentioned above along with the following equation are used to calculate the COG pair-score [1]. (4) where LLCOGd (genei, genej) represents the pair-score of adjacent genes with a different COG gene function. 4.2 Fitness Calculation 4.2.1 Calculation of operon fitness value While the pair-scores of each particle are calculated based on the metabolic pathway and the COG function, the fitness value of the operon in BPSO is calculated by multiplying the pair-score average with the gene number in the same operon. 4.2.2 Calculation of particle fitness value Finally, the fitness value of a particle is calculated as the sum of the fitness values from all putative operons in the particle. 4.3 Particle Updating Each particle is updated through an individual best (pbesti), a global best (gbest) value, as well as other parameters. The pbesti value represents the position of the ith particle with the highest fitness value at a given iteration, and gbest represents the best position of all pbest particles.
  • 3. International Journal of Computer Applications Technology and Research Volume 3– Issue 7, 446 - 450, 2014 www.ijcat.com 448 Figure 2: Block Diagram 5. PARTICLE SWARM OPTIMIZATION Particle swarm optimization (PSO) is a population-based stochastic optimization technique developed by Kennedy and Eberhart in 1995 [4]. PSO has been developed through simulation of the social behavior of organisms, such as the social behavior observed of birds in a flock or fish in a school. It describes an automatically evolving system. In PSO, each single solution is known as particle in the search space. Each particle uses their memory and knowledge gained by the swarm as a whole to find the optimal solution. The fitness value of each particle is evaluated by an optimized fitness function, and the particle velocity directs the movement of the particles. Each particle adjusts its position according to its own experience during movement. In addition, each particle also searches for the optimal solution in a search space based on the experience of a neighboring particle, thus making use of the best position encountered by itself and its neighbor. The entire process is reiterated a predefined number of times or until a minimum error is achieved. PSO has been successfully employed to many application areas; it obtains better results quickly and has a lower cost compared to other methods. However, PSO is not suitable for optimization problems in a discrete feature space. Hence, Kenney and Eberhart developed binary PSO (BPSO) to overcome this problem [20]. The basic elements of PSO are briefly introduced below: (i) Population: A swarm (population) consists of N particles. (ii) Particle position, xi: Each candidate solution can be represented by a D-dimensional vector; the ith particle can be described as xi = (xi1,xi2, . . . , xiD), where xiD is the position of the ith particle with respect to the Dth dimension. (iii) Particle velocity, vi: The velocity of the ith particle is represented by vi = (vi1,vi2, . . . ,viD), where viD is the velocity of the ith particle with respect to the Dth dimension. In addition, the velocity of a particle is limited within [Vmin,Vmax]D . (iv) Inertia weight, w: The inertia weight is used to control the impact of the previous velocity of a particle on the current velocity. (v) Individual best, pbesti: pbesti is the position of the ith particle with the highest fitness value at a given iteration. (vi) Global best, gbest: The best position of all pbest particles is called global best. (vii) Stopping criteria: The process is stopped after the maximum allowed number of iterations is reached. In the PSO algorithm, each particle represents a candidate solution to the problem, and a swarm consists of N particles moving around a D-dimension search space until the computational limitations are reached. 6. RESULTS AND DISCUSSION 6.1 Data set Preparation The entire microbial genome data were downloaded from the GenBank database (http://www.ncbi.nlm.nih.gov/). The related genomic information contains the gene name, the gene ID, the position, the strand, and the product. The experimental operon data set of the E. coli genome was obtained from RegulonDB (http://regulondb.ccg.unam.mx/) [1], which contains highly reliable data of validated experimental operons of the E. coli genome. The metabolic pathway and COG data of the genomes were obtained from KEGG (http://www.genome.ad.jp/kegg/pathway.html) and NCBI (http://www.ncbi.nlm.nih.gov/COG/), respectively.
  • 4. International Journal of Computer Applications Technology and Research Volume 3– Issue 7, 446 - 450, 2014 www.ijcat.com 449 Figure 2: Dataset for E.Coli 6.2 Result Set The result set shows the pair score calculation results for the genome data available. Each table shows the values calculated for every property. Table 1. Pair Scores for Intergenic Distances between the range [-100,300] [-100,-90) 0 [0,10) 0.9906 [100,110) -0.1167 [200,210) -0.9366 [-90,-80) 0 [10,20) 2.6333 [110,120) -0.0611 [210,220) -0.5982 [-80,-70) 0 [20,30) 1.4239 [120,130) -0.2405 [220,230) -0.7404 [-70,-60) 0 [30,40) -0.4859 [130,140) -0.5619 [230,240) -1.0121 [-60,-50) 0 [40,50) -0.9955 [140,150) -0.2805 [240,250) -1.0239 [-50,-40) 0.1312 [50,60) -0.2334 [150,160) -0.7442 [250,260) -0.8495 [-40,-30) -0.7160 [60,70) 0.2374 [160,170) -0.6777 [260,270) -0.7160 [-30,-20) -0.0510 [70,80) 0.3204 [170,180) -0.2832 [270,280) -0.9893 [-20,-10) 1.0963 [80,90) 0.1059 [180,190) -0.7000 [280,290) -0.6672 [-10,0) 1.5120 [90,100) -0.0286 [190,200) -0.6615 [290,300] 0.5366 Table 2. Pair Scores for COG functions COG function Pair Score Information Storage and Processing (ISP) 0.021268710361031777 Cellular Processes and Signaling (CPAS) -0.01032989945363101 Metabolism (M) 0.13398627322082188 Poorly Characterized (PC) -0.45171651491780973
  • 5. International Journal of Computer Applications Technology and Research Volume 3– Issue 7, 446 - 450, 2014 www.ijcat.com 450 7. CONCLUSION The gene encoding that is output from the algorithm is the closest to the actual organization of operons in the genome. The analysis of gene data is gaining increasing importance. This study proposes a method to identify operons at complete genome level. The gene features like Intergenic distance, Gene length ratio, gene clusters of orthologous groups make feasible input parameters for identification process. This identification can be very valuable contribution for various genetic applications. 8. ACKNOWLEDGMENTS We wish to express our gratitude to Prof. Swati Bhavsar, Dept. of Microbiology, R. Y. K. college of science Nashik for her valuable help in the data interpretation. Also sincere thanks to Dr. Neelima Kulkarni for domain concepts. 9. REFERENCES [1] Bockhorst,J., Craven,M., Page,D., Shavlik,J. and Glasner,J. "A Bayesian network approach to operon prediction." Bioinformatics, 19, 1227-1135(2003). [2] Brendel,V. and Trifonov,E.N. "A computer algorithm for testing potential prokaryotic terminators." Nucleic Acids Res., 12, 4411-4427.(1984) [3] Brendel,V. and Trifonov,E.N. "Computer-aided mapping of DNA-protein interaction sites." Proceedings of the Ninth International CODATA Conference, Jerusalem, Israel, pp. 17- 20, 115-118.(1984) [4] Craven,M., Page,D., Shavlik,J., Bockhorst,J. and Glasner,J. "A probabilistic learning approach to whole- genome operon prediction." Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology, La Jolla, CA, pp.116-127(2000). [5] Ermolaeva,M.D., Khalak,H.G., White,O., Smith,H.O. and Salzberg,S.L. "Prediction of transcription terminators in bacterial genomes." J. Mol. Biol., 301, 27-33(2000). [6] Ermolaeva,M., White,O. and Salzberg,S.L. "Prediction of operons in microbial genomes." Nucleic Acids Res., 29, 1216- 1221(2001). [7] J. Kennedy and R. Eberhart, "Particle swarm optimization," in IEEE International Joint Conference on Neural Network. vol. 4 Perth, Australia, 1995, pp. 1942-1948. [8] Li-Yeh Chuang, Cheng-Huei Yang, Jui-Hung Tsai, and Cheng-Hong Yang," Operon Prediction using Chaos Embedded Particle Swarm Optimization", IEEE-ACM TRANS- ACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS(2013). [9] L. Wang, J. D. Trawick, R. Yamamoto, and C. Zamudio, "Genome-wide operon prediction in Staphylococcus aureus," Nucleic Acids Res., vol. 32, pp. 3689-702, 2004. [10] L. Y. Chuang, J. H. Tsai, and C. H. Yang, "Binary particle swarm optimization for operon prediction," Nucleic acids research, vol. 38, p. e128(2010). [11] L. Y. Chuang, J. H. Tsai, and C. H. Yang, "Complementary Binary particle swarm optimization for operon prediction," Nucleic acids research, (2010). [12] Mironov,A.A., Koonin,E.V., Roytberg,M.A. and Gelfand,M.S. "Computer analysis of transcription regulatory patterns in completely sequenced bacterial genomes." Nucleic Acids Res., 27, 2981-2989(1999). [13] Overbeek,R., Fonstein,M., D'Souza,M., Pusch,G.D. and Maltsev,N. "The use of gene clusters to infer functional coupling." Proc. Natl Acad. Sci. USA, 96, 2896-2901(2002). [14] Ozoline,O.N., Deev,A.A. and Arkhipova,M.V. "Non- canonical sequence elements in the promoter structure. Cluster analysis of promoters recognized by Escherichia coli RNA polymerase." Nucleic Acids Res., 23, 4703 4709(1997). [15] Sabatti,C., Rohlin,L., Oh,M. and Liao,J.C. "Co- expression pattern from DNA microarray experiments as a tool for operon prediction." Nucleic Acids Res., 30, 2886- 2893(2002). [16] Salgado,H., Moreno-Hagelsieb,G., Smith,T.F. and Collado-Vides,J. "Operons in Escherichia coli: genomic analyses and predictions." Proc. Natl Acad. Sci. USA, 97, 6652-6657(2000). [17] Unniraman,S., Prakash,R. and Nagaraja,V. "Conserved economics of transcription termination in eubacteria." Nucleic Acids Res., 30, 675-684(2002). [18] Vitreschak,A.G, Rodionov,D.A., Mironov,A.A. and Gelfand,M.S. "Regulation of riboavin biosynthesis and transport genes in bacteria by transcriptional and translational attenuation." Nucleic Acids Res., 30, 3141-3151(2002) [19] Wolf,Y.I., Rogozin,I.B., Kondrashov,A.S. and Koonin,E.V. "Genome alignment, evolution of prokaryotic genome organization, and prediction of gene function using genomic context." Genome Res.,11, 356-372(2002). [20] Yada,T., Nakao,M., Totoki,Y. and Nakai,K. "Modeling and predicting transcriptional units of Escherichia coli genes using hidden Markov models". Bioinformatics, 15, 987- 993(1999). [21] Zheng,Y., Szustakowski,J.D., Fortnow,L., Roberts,R.J. and Kasif,S. "Computational identi_cation of operons in microbial genomes". Genome Res., 12, 1221-1230(2002).