We specialize in the often-overlooked step between raw data and usable data. We take raw, messy crime and policing data, figure out what it is, clean it, and deliver something the client can actually use.

Data Cleaning and Documentation

We take raw, poorly documented crime and policing data and deliver a clean, analysis-ready dataset. Every project includes an audit of data quality, documentation explaining the data and what the variables mean, and an assessment of limitations so you know what you have and what you can do with it. We communicate regularly with clients throughout the project and our deliverables are flexible to your needs.

Dataset Merging

We merge data across agencies, jurisdictions, time periods, and reporting systems. This includes linking records across databases, standardizing inconsistent coding schemes, and identifying where differences in reporting standards make datasets incomparable.

PDF and Web Scraping

Not all data are available for download. We extract data from PDFs and websites and deliver it in a format you can work with, such as an Excel file or a clean dataset ready for analysis. Whether it is a single table or thousands of pages of PDFs, we can handle it.

Litigation Support

We help legal teams make sense of data produced in discovery. Opposing parties often know the flaws in their own data and are unlikely to volunteer that information. We find those problems, document them, and make sure you understand what the data can and cannot support before you build your case on it.

R Training and Workshops

We offer R training and workshops for teams that want to build internal capacity for crime data work. Sessions are practical and built around your team’s data and workflows. Participants leave with reusable code they can apply immediately, drawing on our published textbook A Criminologist’s Guide to R (CRC Press).

For a formal summary of our capabilities and past performance, download our Capability Statement.

Ready to get started? Contact us to discuss your project.