Summary

Experienced data scientist and software engineer. My research is on theory and methodology for sequential testing and estimation, particularly for causal inference in randomized experiments, best-arm identification in multi-armed bandit problems, quantile estimation, and matched observational studies. Prior to graduate school, I was a software engineer and technical lead at Google and then Thumbtack, where I also did quite a bit of data science and spent a year as a manager. My software background tilts towards backend engineering with a focus on software design and testability. My recent experience centers on Python, R and Linux, with a bit of C++.

Work Experience

2022-present
Data Scientist, Experimentation and Causal Inference
LinkedIn
2020-2022
Member of Research Staff
The Voleon Group
Summer 2019
Statistical Researcher (contract)
Optimizely
  • Investigated the potential for variance reduction within Stats Engine, the sequential A/B analysis engine behind Optimizely’s platform. This involved some theory, some simulations (Python), and some analysis of past experiments (Spark, Scala, Python).
  • Initiated research on general justification of the normal mixture sequential probability ratio test, the underlying methodology in Stats Engine.
  • Consulted on customer questions related to statistical methodology, particularly sequential testing, frequentist vs. Bayesian approaches to A/B testing, and false discovery rate control.
2018-2019
Graduate Student Instructor, Statistical Models (Stat 215B)
University of California, Berkeley

Assisted students, graded assignments, devised midterm questions, planned some lectures.

Summer 2018
Statistical Researcher (contract)
The Voleon Group

Researched methodology related to the design and analysis of sequential experiments with heavy-tailed data.

2016-2017
Graduate Student Researcher, Celeste Project
Lawrence Berkeley National Laboratory
  • Developed a suite of accuracy benchmarks for Celeste, a variational Bayesian inference engine for astronomical imagery written in Julia.
  • Tracked down and fixed bugs in the Celeste model and implementation. Wrote extensive documentation based on my learnings.
2015
Data Scientist (contract)
Thumbtack

Defined metrics to capture the long-term value of service providers and developed predictive models for these metrics based on information available at signup time.

2010-2014
Software Engineer / Director of Engineering
Thumbtack
  • Wrote best practices, developed tools, and educated coworkers on randomized experiments. We recorded ~500 product experiments under this framework during my tenure.
  • Over ~18 months, developed and ran a distributed pipeline to crawl a billion web pages, identifying and categorizing local services using SVM text classification
  • Designed and analyzed large, randomized, controlled SEO experiments.
  • Convinced early team to adopt mandatory code reviews and continually pushed for better automated testing and design review, preferably by example
  • Designed and implemented much of our Python application framework, early build system and source organization
  • Led engineering team for a year; launched initiatives to migrate from PHP towards Python and from dedicated hardware to AWS; composed job ladder and compensation bands; managed ~seven engineers
  • Conducted ~300 interviews; devised interview questions and at-home challenges for software engineers, data scientists and site reliability engineers; trained new interviewers
  • Built our systems monitoring and reporting infrastructure (Python, Tornado, Graphite)
  • Designed randomized experiment to measure effect of content creation on SEO landing page traffic, and implemented optimization algorithm to construct new landing pages, increasing overall revenue by about 20%
  • Administered our ~dozen production servers for a year, writing new Puppet configs, responding on-call to outages and publishing postmortems
2007-2010
Software Engineer
Google

Android Team, 2010

  • Wrote complete automated test suite for Android system download provider, refactoring existing code as necessary to achieve testability; received a peer bonus for my final design documentation
  • Designed and implemented public API for Android download manager and implemented system “Downloads” app, both released in Gingerbread
  • Revised accelerometer filtering logic for Android platform screen rotation

Platforms Team, 2007-2010

  • Technical lead for about a year of the Autotest project, a kernel and hardware qualification platform; mentored summer SWE intern who joined full-time the following summer
  • Ported Ruby on Rails prototype of Autotest scheduling web app to Python/Django, vastly extended over the following two years
  • Designed and implemented new Autotest reporting web app; frontend rendered complex, spreadsheet-style reports in browser (Google Web Toolkit frontend, Python/Django backend)
  • Planned and executed a refactoring of the Autotest job scheduler to replace all multithreading with an asynchronous, event-loop style implementation, ending months of quality issues
  • Designed and led implementation of distributed scheduling support to add redundancy for job scheduling and increase capacity for simultaneously executing test machines
Summer 2006
Software Development Engineer Intern, Direct3D Group
Microsoft
  • Productized a novel technique for resolution-independent font rendering using GPU pixel shaders
  • Reimplemented researchers’ C# prototype in C++, extending its capabilities and solving many issues not addressed in prior research
  • Participated in preliminary API design for this work
Summer 2005
Software Engineer Intern, Embedded Solutions Group
NVIDIA
  • Worked on the emerging OpenGL-ES 1.1 driver, adding support for various frame swapping modes and for 2X and 4X full-scene antialiasing
  • Began the implementation of a new OpenGL-ES 2.0 (programmable pipeline) driver.
2005-2007
Research Assistant
Horowitz Group, Department of Physics, Harvard University
  • Worked on kilopixel optical SETI project for a 72” telescope
  • Led design + implementation of central control software (Python)
  • Designed and wired daughterboard simulator and wrote 8051 microcontroller code
Spring 2005
Teaching Fellow, Physics 123, Laboratory Electronics
Department of Physics, Harvard University

Oversaw weekly labs and help sessions, graded assignments

Summer 2004
Software Engineer Intern
Ember

Developed system for testing throughput and latency of multihop wireless networks, including clock synchronization, data collection and PDF report generation (Java, C, Atmel AVR)

Summer 2002
Research Science Institute
  • Created object recognition software using edge detection and a database of templates
  • Wrote pipelined implementation using the group’s Real-Time Specification for Java library.
2001-2003
Software Engineer
eStara
  • Developed PHP account administration applications
  • Developed Javascript volume bar for browser-based VoIP
  • Developed extensions for large software IP phone application (C++)
2000-2001
Software Engineer
Roadfly.com

Spent most of my time developing a PHP message board application and being amazed that someone would pay me to code!

Publications

The uniform general signed rank test and its design sensitivity

S. R. Howard, S. D. Pimentel

Biometrika, 108(2), 381-396, 2021

Time-uniform, nonparametric, non-asymptotic confidence sequences

S. R. Howard, A. Ramdas, J. McAuliffe, J. Sekhon

Annals of Statistics, 49(2), 1055-1080, 2021

Time-uniform Chernoff bounds via nonnegative supermartingales

S. R. Howard, A. Ramdas, J. McAuliffe, J. Sekhon

Probability Surveys, Volume 17, 257-317, 2020

Cataloging the visible universe through Bayesian inference at petascale

J. Regier, K. Pamnany, K. Fischer, A. Noack, M. Lam, J. Revels, S. R. Howard, R. Giordano, D. Schlegel, J. McAuliffe, R. Thomas, and Prabhat

International Parallel and Distributed Processing Symposium (IPDPS), 2018

Education

2015 - 2019

Ph.D. in Statistics

University of California, Berkeley

Coursework in probability theory, theoretical statistics, high-dimensional statistics, applied statistical models, convex optimization, causal inference, and statistical consulting.

2003 - 2007

B.A. in Computer Science

Harvard University

Projects

ConfSeq (https://www.github.com/gostevehoward/confseq)
A library implementing many techniques from my research to computing confidence sequences for means, quantiles, and cumulative distribution functions. Written in C++ with Python and R wrappers.
Quantile best-arm identification (https://github.com/gostevehoward/quantilecs)
This repository contains all code to reproduce the simulations and plots from the paper "Sequential estimation of quantiles with applications to A/B-testing and best-arm identification". Of particular interest is an efficient C++ simulation harness for quantile best-arm identification.

Presentations

Asilomar Conference on Signals, Systems, and Computers, November 2019
CMU Stat/ML reading group, September 2018
Guest lecture, Principles and Techniques of Data Science (DS 100), UC Berkeley, Spring 2018
Thumbtack Tech Talk, December 2013
Thumbtack Tech Talk, March 2012

Peer review service

Journal of Machine Learning Research (2019)
Neural Information Processing Systems (2019, 2018)
Conference on Learning Theory (2019)
International Conference on Machine Learning (2018)