Daniel C. Jones - Staff Scientist | Fred Hutchinson Cancer Center

Projects

Proseg: probabilistic cell segmentation

Fast and accurate cell segmentation for spatial transcriptomics borrowing methods from cell simulation.

GitHub
Maxspin: Maximization of Spatial Information

An information theoretic approach to quantifying the degree of spatial organization in spatial transcriptomics (or other spatial omics) data. This method computes spatial information scores for genes to identify spatially varying expression patterns.

GitHub
COITrees: Cache Oblivious Interval Trees

A very fast interval tree data structure for overlap queries of static integer intervals, with genomic intervals in mind. Implemented in Rust, COITrees uses a van Emde Boas layout for improved cache locality and features SIMD optimizations through AVX2 and Neon instructions when available.

GitHub
Gadfly: Grammar of Graphics for Julia

A plotting and data visualization system for Julia based on the Grammar of Graphics. Gadfly creates publication quality graphics with an intuitive, consistent interface and tight integration with DataFrames.jl. Features include interactive zooming/panning, SVG/PNG/PDF/PS output formats, and support for a wide range of plot types.

GitHub Website

View all my projects on GitHub

Notable Publications

Cell Simulation as Cell Segmentation

Daniel C Jones, Anna E Elz, Azadeh Hadadianpour, Heeju Ryu, David R Glass, Evan W Newell

Nature Methods (2025)

Single-cell spatial transcriptomics promises a highly detailed view of a cell's transcriptional state and microenvironment, yet inaccurate cell segmentation can render this data murky. We adopt methods from ab initio cell simulation to rapidly infer morphologically plausible cell boundaries that preserve cell type heterogeneity.
Read Paper
An Information Theoretic Approach to Detecting Spatially Varying Genes

Daniel C. Jones, Patrick Danaher, Youngmi Kim, Joseph M. Beechem, Raphael Gottardo, Evan W. Newell

Cell Reports Methods (2023)

A key step in spatial transcriptomics is identifying genes with spatially varying expression patterns. We adopt an information theoretic perspective to this problem by equating the degree of spatial coherence with the Jensen-Shannon divergence between pairs of nearby cells and pairs of distant cells.
Read Paper
Polee: RNA-Seq analysis using approximate likelihood

Daniel C. Jones, Walter L. Ruzzo

NAR Genomics and Bioinformatics (2021)

We propose a new method of approximating the likelihood function of a sparse mixture model for RNA-Seq analysis, using a technique called the Pólya tree transformation. This approximation achieves most of the benefits of full probabilistic models with a fraction of the computational costs, leading to more accurate detection of differential transcript expression and transcript coexpression.
Read Paper
Compression of next-generation sequencing reads aided by highly efficient de novo assembly

Daniel C. Jones, Walter L. Ruzzo, Xinxia Peng, Michael G. Katze

Nucleic Acids Research (2012)

We present Quip, a lossless compression algorithm for next-generation sequencing data in the FASTQ and SAM/BAM formats. Using a novel de novo assembly algorithm with a probabilistic data structure to dramatically reduce memory requirements, we developed the first assembly-based compressor, effectively reducing dataset sizes to <15% of their original size.
Read Paper
A new approach to bias correction in RNA-Seq

Daniel C. Jones, Walter L. Ruzzo, Xinxia Peng, Michael G. Katze

Bioinformatics (2012)

We present a new method to measure and correct for sequence bias in RNA-Seq experiments using a simple graphical model. Our model does not rely on existing gene annotations, and model selection is performed automatically, making it applicable with few assumptions and effectively decreasing bias while increasing uniformity in read coverage.
Read Paper

View all publications on Google Scholar

Projects

Proseg: probabilistic cell segmentation

Maxspin: Maximization of Spatial Information

COITrees: Cache Oblivious Interval Trees

Gadfly: Grammar of Graphics for Julia

Notable Publications

Cell Simulation as Cell Segmentation

An Information Theoretic Approach to Detecting Spatially Varying Genes

Polee: RNA-Seq analysis using approximate likelihood

Compression of next-generation sequencing reads aided by highly efficient de novo assembly

A new approach to bias correction in RNA-Seq