Building a Data Pipeline from Scratch - Joe Crobak
Increase Performance with Automatic Keyword Recommendation - Eliot Brenner
Introduction to InfluxDB, an Open Source Distributed Time Series Database by Paul Dix
Intuidex - To be or not to be iid by William M. Pottenger (NYC Machine Learning Group)
Netflix - Pig with Lipstick by Jeff Magnusson
Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele
Obtaining, Scrubbing, and Exploring Data at the Command Line by Jeroen Janssens
Outlier Selection and One Class Classification by Jeroen Janssens
Deployment Tools and Techniques at Spotify: Virtualenv in Debian by Chris Angove
Python on Rails - Victory Levy
Square's Machine Learning Infrastructure and Applications - Rong Yan
A Tour of Cryptography Packages in Go - Kyle Isom
Spotify's Ad Targeting Infrastructure: Achieving Real-time Personalization for 24 million+ Users - Kinshuk Mishra
A Look at Vert.x’s Improved Clojure Language Support - John Chapin
Indexing and Searching Logs with Elasticsearch/Solr by Radu Gheorghe from Sematext
Scaling HBase (NoSQL store) to handle massive loads at Pinterest by Jeremy Carol
Scalr: Setting Up Automated Scaling
Stop Hiring DevOps Experts and Start Growing Them by Jez Humble
Digging into the Dirichlet Distribution by Max Sklar
Agents and Agency in the Internet by Greg Meredith