    Gredos. Repositorio documental de la Universidad de Salamanca
    Title
    File formats used in next generation sequencing: A literature review
    Author(s)
    Canal-Alonso, Ángel
    Jiménez, Pedro
    Egido, Noelia
    Prieto Tejedor, Javier
    Corchado Rodríguez, Juan Manuel
    Keywords
    Next-Generation sequencing
    File format
    Data sharing
    UNESCO classification
    1203.17 Computer Science
    2410.07 Human Genetics
    Publication date
    2022
    Abstract
    Next-generation sequencing (NGS) has revolutionized the field of genomics, allowing a detailed and precise look at DNA. As this technology advanced, the need arose for standardized file formats to represent, analyze and store the vast data sets produced. In this article, we review the key file formats used in NGS: FASTA, FASTQ, BED, GFF, and VCF. The FASTA format, one of the oldest, provides a basic representation of genomic and protein sequences, identifiable by unique headers. FASTQ is essential for NGS, as it stores both the sequence and the associated quality information. BED provides a tabular representation of genomic loci, while GFF details the localization and structure of genomic features in reference sequences. Finally, VCF has emerged as the predominant standard for documenting genetic variants, from simple SNPs to complex structural variants. The adoption and adaptation of these formats have been fundamental for progress in bioinformatics and genomics. They provide a foundation on which to build sophisticated analyses, from gene discovery and function prediction to the identification of disease-associated variants. With a clear understanding of these formats, researchers and practitioners are better equipped to harness the power and potential of next-generation sequencing.
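    As a rough illustration of the two sequence formats named in the abstract, the sketch below reads FASTA records (a header line starting with ">" followed by sequence lines) and FASTQ records (four lines: an "@" header, the sequence, a "+" separator, and a quality string). It is not taken from the article. The function names, the example reads, and the Phred+33 quality offset are assumptions made for this example; older data may use a different offset.

```python
from typing import Dict, Iterator, List, NamedTuple


class FastqRecord(NamedTuple):
    name: str
    sequence: str
    qualities: List[int]  # one Phred score per base


def parse_fasta(text: str) -> Dict[str, str]:
    """Map each FASTA header (without '>') to its concatenated sequence."""
    records: Dict[str, str] = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:]
            records[name] = ""
        elif name is None:
            raise ValueError("sequence data found before any FASTA header")
        else:
            # Sequences may be wrapped over several lines; join them.
            records[name] += line
    return records


def parse_fastq(text: str, offset: int = 33) -> Iterator[FastqRecord]:
    """Yield FASTQ records, decoding qualities with an assumed Phred offset."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if len(lines) % 4 != 0:
        raise ValueError("FASTQ input must contain four lines per record")
    for i in range(0, len(lines), 4):
        header, seq, sep, qual = lines[i:i + 4]
        if not header.startswith("@") or not sep.startswith("+"):
            raise ValueError(f"malformed FASTQ record starting at line {i + 1}")
        if len(seq) != len(qual):
            raise ValueError("sequence and quality strings differ in length")
        yield FastqRecord(header[1:], seq, [ord(c) - offset for c in qual])


# Hypothetical example data, invented for this sketch.
EXAMPLE_FASTA = ">seq1\nACGT\nACGT\n"
EXAMPLE_FASTQ = "@read1\nACGT\n+\nII5!\n"

if __name__ == "__main__":
    print(parse_fasta(EXAMPLE_FASTA))
    for record in parse_fastq(EXAMPLE_FASTQ):
        print(record.name, record.sequence, record.qualities)
```

    The sketch assumes unwrapped FASTQ records (exactly four lines each), which is the common case for sequencer output. For real data, an established library is usually a safer choice than a hand-written parser.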
    URI
    https://hdl.handle.net/10366/153123
    Collections
    • BISITE. Artículos
    Files in this item
    Name: Format_NGS_en.pdf
    Size: 253.7 KB
    Format: Adobe PDF
     
    Universidad de Salamanca
    LEGAL NOTICE AND PRIVACY POLICY
    2024 © UNIVERSIDAD DE SALAMANCA