Please Excuse the Dust!

This website is currently under construction. Please forgive a few broken links, weird formatting, or generic text until we get up and running!

Resource Round-Up: R in Industry Edition

One of the ways that practices of reproducible research can be brought into industry is through the development of custom R packages and data tools for one’s company / organization. Not only can these tools deliver large efficiency gains and standardization, they ideally infuse corporate culture with the shared passion and mission found in open source communities. Similar to [my favorite readings on reproducible research], the benefits of internal R packages are a common theme in my talks and on my Twitter feed1. [Read More]

Resource Round-Up: Reproducible Research Edition

I frequently talk, think, and tweet about how open science principles apply to data analysts and scientists in industry. However, despite my interest in reproducibility, my process for sharing my favorite articles on this topic has been anything but reproducible – repeatedly searching my browser history, past “Sent” mails, or simply typing out the same summaries over and over. Going forward, this post serves as a brief annotates bibliography for some of my favorite reads. [Read More]

Rtistic: A package-by-numbers repo

Last winter, I attended a holiday party at a “paint-and-sip” venue. For those unfamiliar, “paint-and-sip” is a semi-trendy cottage industry offering evenings of music, wine, and a guided painting activity. For example, my group painted sasquatch on a snowy winter’s eve: A very bad painting of a sasquatch in a forest As often happens, this completely unrelated thing set me thinking about R. What made painting fun when we lacked the talent and experience to excel, when it almost surely wouldn’t turn out very well, and when “failure” would be very visibly and undeniably on display? [Read More]

Notes on supporting conference speakers

When I joined the team of co-organizers for satRday Chicago, Chicago supeRlady Angela Li tasked me with managing all-things speaker. This set me reflecting on the best and worst of my own speaker experiences and I began to list out highs and lows I’d experienced. When I solicited feedback and more ideas on Twitter, the topic seemed to strike a chord: Inspired by @earino 's great GitHub on how to host a good panel, I started making some notes on how to create a good experience for speakers at satRdays Chicago [Read More]

RMarkdown Driven Development (RmdDD)

Introduction RMarkdown is an excellent platform for capturing narrative analysis and code to create reproducible reports, blogs, slides, books, and more. One benefit of RMarkdown is its abilities to keep an analyst in the “flow” of their work and to capture their thought process along the way. However, thought processes are rarely linear; as a result, first-draft RMarkdown scripts rarely are either. This is fine for some individual analysis and preliminary exploration but can significantly decrease how understandable and resilient an RMarkdown will be in the future. [Read More]

Notes on preparing a tech talk

As part of co-organizing satRday Chicago, we wanted to offer a mentorship program for first-time speakers. To avoid totally putting the onus on our volunteer mentors, it seemed like it might be useful to break down the steps of writing and preparing for a tech talk into more modular stages where one can seek help. To that end, I wrote up the following advice to share with our speakers. [Read More]