Readme file for Dirichlet process software accompanying

Estimating transcription start site sequencing coverage and transcriptional repertoires for different tissues

Albin Sandelin, Eivind Valen, Piero Carninci, Anders Krogh and Ole Winther

The MATLAB script DPrun.m runs the analysis.

The data used in the paper are located in ./data/

The data files have the following format:
1st column: the count
2nd column: the multiplicity, i.e. the number of species with that count.

Figures (eps format) summarizing the predictions are saved in ./figures/ using the file name of the data file.
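
To illustrate the two-column data format described above, here is a minimal Python sketch that reads count/multiplicity rows and derives two basic totals. It is not part of the MATLAB software: the function names, the whitespace-separated layout, and the example values are assumptions made for illustration only, and are not taken from the files in ./data/.

```python
def parse_count_table(text):
    """Parse rows of 'count multiplicity' into a list of (count, multiplicity).

    Assumes whitespace-separated columns (an assumption; the README does not
    state the delimiter). Lines with fewer than two fields are skipped.
    """
    rows = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 2:
            continue
        # int(float(...)) also accepts values written as e.g. "3.0" or "3e+00"
        rows.append((int(float(fields[0])), int(float(fields[1]))))
    return rows


def summarize(rows):
    """Return (number of observed species, total number of counts)."""
    n_species = sum(mult for _, mult in rows)
    n_total = sum(count * mult for count, mult in rows)
    return n_species, n_total


# Made-up example: 5 species seen once, 3 seen twice, 1 seen four times.
example = "1 5\n2 3\n4 1\n"
rows = parse_count_table(example)
n_species, n_total = summarize(rows)
print(n_species, n_total)
```

The multiplicities sum to the number of distinct species observed, and the sum of count times multiplicity is the total number of counts in the sample.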