Comparative Genomics is an important aspect of bioinformatics. By comparing the genome sequence of a newly sequenced organism against the genome sequence of organisms with known functionality, the functionality of newly sequenced organism can be identified and studied. This computational technique complements wet lab techniques due to its efficiency and cost effecive nature.
In this research comparative genomics has been used to identify functionally equivalent and functionally similar genes, operons - groups of genes involved in a common functionality, metabolic pathways, genes with conserved functionality, and genes which are unique to a particular organism. The study has also been applied to analyze the regulation mechanism for translation and the evolution of microorganisms.
Automated identification of metabolic pathways, identification of genes unique to an organism, and the identification of conserved genes are all important for the development of specific anti-bacterial and specific bacteriostatic agents (drugs which arrest the growth of bacteria) and the identification of new pathways. There are many advantages of more specific antibacterial drugs such as reduced toxicity, reduced side effects, and smaller probability of resistant strains of pathogens (disease causing strains of bacteria).
Related Collaborators
Putative orthologs Database (sample version)
Gene-group Databases (sample version)
Number of genomes | Number of Genes | Genes ( E. coli gene name / B. subtilis gene names for different gene names otherwise E. coli gene names) |
---|---|---|
17 | 55 |
{asnS, aspS}, alaS, cysS, dnaX, {efp, yeiP}, {ffh, ftsy}, fusA/fus, gltX, glyA, hisS,
infB, ksgA, lon, metG, mopA/groEL, nusA, pheS, prlA/secY,
recA, rplA, rplB, rplD, rplE, rplF, rplK, rplM, rplN, rplX,
rplV, rpsB, rpsC, rpsD, rpsE, rpsG, rpsH, rpsI, rpsJ, rpsK, rpsL, rpsM,
rpsQ, rpsS, rpoB, rpoC, serS, {tufA, tufB}, topA, {trxA, yfiG}, ygjD/ydiE, yhbZ/obg
|
16 | 24 |
adk, argS, clpB/clpC, eno, ftsZ, glyA, hflb, leuS,
mrsA/ybbT, pepP/yqhT, pgk,
pheT, pyrH/smbA, rplD, rplO, rplW, rpsM, rpsO, ruvB, secF, tmk, truA, uvrB, yfjB/yjbN
|
15 | 16 |
apt, ispB/gerCC, ndk, nth, orf.174/yluA, pnp/pnpA, prsA/prs, pyrG/ctrA,
rplJ, rplR, rpsO, tpiA/tpi, mesJ/yacA, ycfF/hit, ycfH/yabD, ychF/yyaF
|
Specific Genes (incomplete - under construction) (with respect to E. coli compared to seventeen complete genomes mentioned above)
NOTE: The genes which are specific to more
restricted group (group containing more genomes) is also specific to any
subgroup. For example, hflc is also specific to the subgroup
Ec-Hi-Rp
References
Ec-Bb-Hi-Hp-Rp-Tp: hflC
Ec-Bb-Ct-Hi-Hp: yjjT
Ec-Bb-Ct-Hi-Rp: dacA
Ec-Bb-Hi-Hp-Rp: pal
Ec-Ct-Hi-Mp-Rp: kdtA
Ec-Ct-Hi-Tb-Rp: cydB, orf.2883, yceC
Ec-Ct-Hi-Tp-Rp: nrdB
Ec-Ct-Mp-Tp-Rp: phnL
Ec-Bb-Hi-Tp: hflk, hrpA
Ec-Bb-Ct-Hi: greB
Ec-Bb-Ct-Rp: orf.762
Ec-Ct-Mg-Mp: yjcU
Ec-Ct-Hi-Tp: udp
Ec-Ct-Tb-Rp: uhpC
Ec-Hi-Tb-Rp: hupA
Ec-Hi-Hp-Tb: orf.1839
Ec-Hi-Tb-Tp: secE
Pathogens Genes ( E. coli name )
Ec-Mp-Tb: yhfV
Ec-Hi-Tb: fic, recC
Ec-Hi-Rp: hscA
Ec-Hi-Hp: ydeA
Ec-HI-Tp: asnA
Ec-Tb-Tp: add
Ec-Ct-Hi: aspC, orf.1597, orf.1600, orf.1602, trpR, tyrP, yigN, ytfL
Ec-Ct-Hp: cls, yejE, yjbC
Ec-Ct-Rp: orf.1142
Ec-Ct-Tb: appC, hisJ, orf.1575, orf.1724, xerC
Ec-Ct-Tp: fhiA, orf.3357, yrfE
Ec-Bb-Hi: pepD, orf.3153, yibQ
Ec-Hi-Rp:
vacJ, secB, yfhE, bolA, cyaY, orf.2833
Ec-Hi-Hp:
fucP, orf.634, sdaC, ykgB, phnA, orf.2936
Ec-Hi-Tb:
plsB, glnD, tesB, orf.606, nadR, menC, tag, yijC, frdC
frdD
Automated Genome Comparison, Evolution, and Pathogenicity
Automated Reconstruction of Pathways