Sam Zeitlin PhD

Director at Radically Different Data Science, LLC

San Francisco, CA

2

Office Hours

How does this work?
About

I currently provide consulting services including: analytics and machine learning, market research, technical design and implementation, developer relations, leadership, hiring, technical mentorship, career coaching, and management. I'm interested in difficult problems. I like looking for better ways to see invisible things. I'm a former cancer research scientist (biochemistry, cell biology, and biophysics), and self-taught pythonista. I've worked in a variety of roles at both large and small companies. Favorites: Python, Pandas, Seaborn, Pachyderm Recent: Google Pubsub, BigQuery, Kubernetes, Pachyderm https://github.com/szeitlin Find my papers on google scholar: http://bit.ly/zeitlin_papers

Ask me about
Data Team Management
Career Coaching
MLOps
Competitive Analysis
Work experience
logo

May 2021 - Present

Elastic

Principal Machine Learning Engineer, Security Protections

At Elastic, I'm the technical lead for the Security Data Science team. Responsibilities include writing and reviewing code, architectural design and implementation for large machine learning projects, project management, and last but not least, mentoring and hiring machine learning engineers.

Feb 2021 - Present

Radically Different Data Science, LLC

Director

Radically Different Data Science provides consultation services including: analytics and machine learning, market research, technical design, implementation, developer relations, leadership, hiring, mentorship, career coaching, and management.

Dec 2020 - Present

U.S. Digital Response

Volunteer

Dec 2019 - Oct 2020

Sentry (sentry.io)

Lead Data Scientist and Data Team Manager

Built 3 end-to-end prototypes of machine learning systems for anomaly detection. Managed the data team (3 engineers). Tech Stack: GCP including GKE (kubernetes), GKB (docker builds), GCS (S3-equivalent storage), PubSub (streaming), BigQuery (data warehouse), Firestore (for model data). Python. Pachyderm (self-hosted on kubernetes with terraform). Slack webhook (for posting model results). Looker (dashboards). Weights and Biases (ML monitoring).

May 2018 - May 2019

Denali Publishing LLC

Lead Data Scientist

• Founded data science team and built data infrastructure for game data (Guns, Cars, Zombies!). • Sourced, recruited, and managed a team of four data scientist-engineers. Advised Product and Engineering on troubleshooting existing products. Supervised and built ETL pipelines for data from Apple, Google, Flurry. Identified cases of cheating using internal databases (MySQL). Stack: Python using Docker to deploy with Pachyderm in Kubernetes on AWS, set up and managed the Redshift data warehouse, and supervised building dashboards in Looker.

logo

May 2018 - May 2019

Triller

Lead Data Scientist and Data Team Manager

• Founded data science team and built data infrastructure for an AI-powered social media mobile app. • Sourced, recruited, and managed a team of four data scientist-engineers. Advised Product and Engineering on hiring, designed metrics for new data-driven features as well as troubleshooting existing products. Designed and prototyped ML features, A/B testing system. Supervised and built ETL pipelines for data from Apple, Google, Localytics, and internal databases (Postgres). >15M rows of data daily with automated pipelines. ~50 users of >24 dashboards for critical business decisions. Stack: Python and Pyspark using Docker to deploy with Pachyderm in Kubernetes on AWS, set up and managed the Redshift data warehouse, and supervised building dashboards in Looker.

logo

Nov 2016 - Apr 2018

Yahoo!

Product Hacker (Senior Software Engineer)

Data scientist working with the Yahoo BrightRoll team (video ad exchange) (2016-2017) and the AOL ONE Video team (2017-2018). The stack: python (pandas, matplotlib, seaborn, numpy, scikit-learn, Airflow) on AWS (s3, Redshift, Spark). We also used Looker extensively for data sharing across teams. Our team had direct revenue impact. We worked directly with the Product, Engineering, and business teams to optimize supply (inventory quality) and monetization.

logo

Aug 2015 - Apr 2016

Sighten

Software Engineer, Data Science

Data pipelines from postgres using django and pandas. File parsing, time series, pub/sub patterns, dynamic naming, data comparison and extrapolation. Increased test coverage and updated legacy code. Built a Slackbot.

logo

Aug 2014 - Aug 2015

self-employed

Data Science/Consultant

• Market Intelligence related to Healthcare. • Data Science and Visualization related to Energy. • pandas • python • matplotlib, seaborn • Google maps • django

logo

Oct 2012 - Jun 2013

Geron

Scientist II, Discovery Biology

High-content imaging and statistical analysis. Assay development and optimization for target and chemistry platform validation

May 2011 - Oct 2012

UCSF

Specialist II

High-throughput screening (HTS) and statistical analysis. High-content imaging screens conducted in collaboration with Peter Walter's lab, Kevan Shokat's lab, and Genentech, studying the unfolded protein response (UPR). Addtionally, helped develop a screen for compounds that promote expansion of hematopoetic stem cells, in collaboration with Andy Leavitt's lab, and GE. Screened a panel of human cancer cell lines for sensitivity to loss of a putative oncogene, using siRNA (collaboration with Genentech).

logo

Jan 2010 - Mar 2010

University of California, Irvine

Lecturer, Department of Biomedical Engineering

BME50A is an introductory-level course in cell and molecular biology for engineering students. For this class, I was in charge of 150 students. I supervised two teaching assistants, who ran recitation sections and helped with grading homework, quizzes, and exams. I gave two lectures a week, 1-2 hours each, met with students during weekly office hours, posted assignments and grading rubrics to the class websites, and answered numerous student emails.

May 2002 - Feb 2010

UCSD

Postdoctoral Fellow

Independently defined my own project, identified collaborators, developed methods, and collaborated to develop custom software for big data analysis (CellFinder). Collaborated to use a state-of-the-art robotic laser system, which helped identify a DNA damage-dependent epigenetic mechanism for chromatin assembly at centromeres. Multiple first-author and corresponding-author publications. Research techniques: Xenopus egg extracts, protein purification, antibody generation and purification, Western blotting, siRNA, transient transfection, clonal selection, real-time quantitative PCR, recombinant DNA methods, immunofluorescence, high-content imaging, and live-cell fluorescence microscopy in human and mouse normal, cancer, and stem cell lines. Managerial: Ensured laboratory was in compliance with Environmental Health and Safety regulations. Ordered supplies and chemicals for the laboratory. Mentored and trained students, technicians, visiting faculty, and other postdocs.

1997 - 2002

Scripps Research Institute

Graduate Student

Performed independent research and obtained reagents by contacting scientists all over the world. Skills used: Hypothesis testing, immunofluorescence and live-cell imaging, quantitative image analysis, Western blotting, antibody generation and purification, PCR, recombinant DNA methods, transient transfection, clonal selection, population statistics, immunoprecipitation, quantitative kinase assays, technical writing, public speaking. Research topic: CENP-A is phosphorylated by Aurora B (Zeitlin et al., Journal of Cell Biology, 2001) Detailed kinetics of H3 phosphorylation beginning in G2 (Zeitlin et al., Journal of Cell Science, 2001) Managerial: Ensured laboratory was in compliance with Environmental Health and Safety regulations. Ordered supplies and chemicals for the laboratory.

Education

1997 - 2002

The Scripps Research Institute

PhD, Molecular and Cellular Structure and Chemistry

Find my papers on google scholar: http://bit.ly/zeitlin_papers

Talk to Sam

@ Copyright 2020 OfficeHours Technologies Co.