identifying sampling bias in bacterial genome databases

When analyzing the diversity of bacterial populations it is important to consider a potential sampling bias that can distort your pangenome analysis.
Our tool PhyloThin is a coalescent-theory-based approach to identify prokaryotic genomes that can be considered as oversampled.

We are currently writing the manuscript for this tool. If you need preliminary access to PhyloThin, check out our github repository.