GENOME DATABASES FORMATS AND THEIR TOOLS -THE STATE OF THE ART SURVEY

Authors

  • More Shivaprasad Department of Computer Science and Engineering, Assistant Prof, Sou.Sushila Danchand Ghodawat Charitable Trust’s Sanjay Ghodawat Group of Institutions, Atigre, Kolhapur, (MS) India
  • Dr. Kulhalli Kshama Department of Information and Technology, Prof, Dr. D.Y. Patil College of Engineering & Technology, Kolhapur, (MS)

Keywords:

VCF, BAM, SAMtools, GFF, BEDtool, BigWig, BigBed, Bwtool, Tabix, SAM tools, NCList, Ensembl, Indexing

Abstract

 Biological sciences have large amount of genomic data and there is challenge to deal with this huge amount of
data for the researchers. Genomic data are commonly represented in tables stored as plain text files and requires parsing for
analysis, which is very time consuming and error prone method. The indexing facilities provide efficient access to data along
with providing useful methods of summarizing columns. Analysis of code can also be substantially simpler as well as being
uniform across different data formats. These benefits of reduced code complexity and greatly increased performance allow
users much greater freedom to explore their data.

Published

2015-01-25

How to Cite

More Shivaprasad, & Dr. Kulhalli Kshama. (2015). GENOME DATABASES FORMATS AND THEIR TOOLS -THE STATE OF THE ART SURVEY. International Journal of Advance Engineering and Research Development (IJAERD), 2(1), 12–18. Retrieved from https://ijaerd.org/index.php/IJAERD/article/view/422