Note: Most software published by our group is also available on


Hobbes is a tool for very fast, very accurate short-read sequence alignment. Hobbes has its own homepage.


Parallel genome assembly using Hyracks, available at


An online EM implementation of the MEME model for fast motif discovery in large ChIP-Seq and DNase-Seq Footprinting data, available at


PyLOH is a tool for discovering copy number variations in cancer genomes. PyLOH is available at


A tool for transcripts abundances estimation from heterogeneous tissue sample of RNA-Seq data, available at


GBMCI is a tool for survival analysis by direct concordance index learning using gradient boosting, available at


AREM is a tool for ChIP-seq analysis, and is able to discover peaks in repeat regions of the genomes. AREM is available at Aligning Reads from ChIP-seq data by expectation-maximization


Tree-HMM is an extension of the popular ChromHMM for doing clustering of epigenetic markers. Given a complete set of epigenetic markers for several cell types, and the phylogeny describing their relationship, Tree-HMM will perform approximate inference on the graphical model induced by connecting genomic positions of a certain cell type in a chain while also connecting different species according to the tree. Thus, each internal node's state depends on its “vertical parent” (parent cell type) and its “horizontal parent” (adjacent genomic region).

Code is available at and some sample data is available at


SGD-RJ Stochastic gradient descent based on reversible jump MCMC.


SSEA SNP set enrichment analysis for genome-wide association studies.


SiPhy site-specific phylogenetic analysis for identifying conserved DNA sequences.


DNAzip DNA sequence compression


MDOS motif discovery using orthologous sequences (alignment independent)


iPubMed: instant PubMed, featuring interactive and fuzzy search


MotifMap: A comprehensive map of regulatory motif sites in human and model organisms.

An older version of MotifMap, as published in our first paper and specific to the human dataset is available at

You are here: startsoftware