Published inTeads EngineeringRunning Spark Pipelines on EMR Using Spots InstancesA compilation of good practices and lessons learned in a production environmentFeb 22, 2022Feb 22, 2022
Published inTeads EngineeringInvestigating a network issue encountered with Kafka on AWSTL;DR we ended up tuning the ARP cache on our EC2 instancesSep 16, 2021Sep 16, 2021
Published inTeads EngineeringUpdating to Spark 3.0 in productionBreaking changes and expected improvements: a production point of viewDec 3, 2020Dec 3, 2020
Published inTowards Data ScienceCompact prediction treeA Lossless Model for Accurate Sequence Prediction over a finite alphabetOct 29, 20203Oct 29, 20203