We are very excited to be presenting and attending this year’s Data and AI Summit which will be hosted virtually and physically in San Francisco from June 27th-30th. Throughout the course of 2021 we completed a number of really interesting projects built around delta-rs and the Databricks platform which we are thrilled to share with a broader audience. In addition to the presentations listed below, a number of Scribd engineers who are responsible for data and ML platform, machine learning systems, and more, will be in attendance if you want to meet up and learn more about how Scribd uses data and ML to change the way the world reads!
- Christian Williams will be sharing some of the work he has done developing kafka-delta-ingest in his talk: Streaming Data into Delta Lake with Rust and Kafka
- QP Hou, Scribd Emeritus, will be presenting on his foundational work to ensure correctness within delta-rs during his session: Ensuring Correct Distributed Writes to Delta Lake in Rust with Formal Verification
- R Tyler Croy will be co-presenting with Gavin Edgley from Databricks on the cost analysis work Scribd has done to efficiently grow our data platform with Doubling the size of the data lake without doubling the cost
There are so many great sessions to watch in person or online during the event, particularly around Delta Lake, which is one of our favorite technologies and powers our entire data platform. We are also expecting some great ML related talks as data and ML begin to overlap more and more. We hope to see you there!