Jul 20, 2023

Toolset for genome set arithmetic such as intersect, union

The bedtools utilities are a suite of tools for performing a wide range of genomics analysis tasks. The most widely-used of these tools enable genome arithmetic, i.e., set theory on the genome. For example, with bedtools one can intersect, merge, count, complement, and shuffle genomic intervals from multiple files in common genomic formats such as BAM, BED, GFF/GTF, and VCF.

Although each individual utility is designed to do a relatively simple task, e.g., intersect two interval files, more sophisticated analyses can be conducted by stringing together multiple bedtools operations on the command line or in shell scripts.

Checkout these related ports:
  • Wise - Intelligent algorithms for DNA searches
  • Wfa2-lib - Exact gap-affine algorithm using homology to accelerate alignment
  • Vt - Discovers short variants from Next Generation Sequencing data
  • Vsearch - Versatile open-source tool for metagenomics
  • Viennarna - Alignment tools for the structural analysis of RNA
  • Velvet - Sequence assembler for very short reads
  • Vcftools - Tools for working with VCF genomics files
  • Vcflib - C++ library and CLI tools for parsing and manipulating VCF files
  • Vcf2hap - Generate .hap file from VCF for haplohseq
  • Vcf-split - Split a multi-sample VCF into single-sample VCFs
  • Unikmer - Toolkit for nucleic acid k-mer analysis, set operations on k-mers
  • Unanimity - Pacific Biosciences consensus library and applications
  • Ugene - Integrated bioinformatics toolkit
  • Ucsc-userapps - Command line tools from the UCSC Genome Browser project
  • Trimmomatic - Flexible read trimming tool for Illumina NGS data