Egyszerű nézet

dc.contributor.author Gézsi, András
dc.contributor.author Bolgár, Bence Márton
dc.contributor.author Marx, Péter
dc.contributor.author Sárközy, Péter
dc.contributor.author Szalai, Csaba
dc.contributor.author Antal, Péter
dc.date.accessioned 2018-09-18T06:46:50Z
dc.date.available 2018-09-18T06:46:50Z
dc.date.issued 2015
dc.identifier 84945534349
dc.identifier.citation pagination=875, pages: 15; journalVolume=16; journalIssueNumber=1; journalTitle=BMC GENOMICS;
dc.identifier.uri http://repo.lib.semmelweis.hu//handle/123456789/5953
dc.identifier.uri doi:10.1186/s12864-015-2050-y
dc.description.abstract BACKGROUND: The low concordance between different variant calling methods still poses a challenge for the wide-spread application of next-generation sequencing in research and clinical practice. A wide range of variant annotations can be used for filtering call sets in order to improve the precision of the variant calls, but the choice of the appropriate filtering thresholds is not straightforward. Variant quality score recalibration provides an alternative solution to hard filtering, but it requires large-scale, genomic data. RESULTS: We evaluated germline variant calling pipelines based on BWA and Bowtie 2 aligners in combination with GATK UnifiedGenotyper, GATK HaplotypeCaller, FreeBayes and SAMtools variant callers, using simulated and real benchmark sequencing data (NA12878 with Illumina Platinum Genomes). We argue that these pipelines are not merely discordant, but they extract complementary useful information. We introduce VariantMetaCaller to test the hypothesis that the automated fusion of measurement related information allows better performance than the recommended hard-filtering settings or recalibration and the fusion of the individual call sets without using annotations. VariantMetaCaller uses Support Vector Machines to combine multiple information sources generated by variant calling pipelines and estimates probabilities of variants. This novel method had significantly higher sensitivity and precision than the individual variant callers in all target region sizes, ranging from a few hundred kilobases to whole exomes. We also demonstrated that VariantMetaCaller supports a quantitative, precision based filtering of variants under wider conditions. Specifically, the computed probabilities of the variants can be used to order the variants, and for a given threshold, probabilities can be used to estimate precision. Precision then can be directly translated to the number of true called variants, or equivalently, to the number of false calls, which allows finding problem-specific balance between sensitivity and precision. CONCLUSIONS: VariantMetaCaller can be applied to small target regions and whole exomes as well, and it can be used in cases of organisms for which highly accurate variant call sets are not yet available, therefore it can be a viable alternative to hard filtering in cases where variant quality score recalibration cannot be used. VariantMetaCaller is freely available at http://bioinformatics.mit.bme.hu/VariantMetaCaller .
dc.relation.ispartof urn:issn:1471-2164
dc.title VariantMetaCaller: automated fusion of variant calling pipelines for quantitative, precision-based filtering
dc.type Journal Article
dc.date.updated 2018-07-19T11:38:31Z
dc.language.rfc3066 en
dc.identifier.mtmt 2966077
dc.identifier.wos 000363504500005
dc.identifier.pubmed 26510841
dc.contributor.department BME/VIK/Méréstechnika és Információs Rendszerek Tanszék
dc.contributor.department SE/AOK/I/Genetikai, Sejt- és Immunbiológiai Intézet
dc.contributor.institution Budapesti Műszaki és Gazdaságtudományi Egyetem
dc.contributor.institution Semmelweis Egyetem


Kapcsolódó fájlok:

A fájl jelenleg csak egyetemi IP címről érhető el.

Megtekintés/Megnyitás

Ez a rekord az alábbi gyűjteményekben szerepel:

Egyszerű nézet