Welcome to Percona Live Online 2021
Online Open Source Database Conference
Back To Schedule
Wednesday, May 12 • 16:00 - 16:30
Multi-colo Async Replication at LinkedIn

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.

LinkedIn is a global site is served from multiple data centers (a.k.a colos). Member Data written at each data center is globally replicated to other data centers. To avoid write latency, we choose the replication to be async which has lead to a lot of problems related to conflicts. This talk is about why global replication is needed, how are we leveraging the multi-colo replication for site-up using [traffic shift](https://engineering.linkedin.com/blog/2017/05/trafficshift--load-testing-at-scale), how are we using Kafka to do multi-colo replication, how we architected our applications and the schema to minimize conflicts and finally how to handle conflicts in case if they arise.

So far LinkedIn has been using [Espresso](https://engineering.linkedin.com/espresso/introducing-espresso-linkedins-hot-new-distributed-document-store) and Oracle as primary data stores. There are already tools developed for handling the multi-colo replication which are covered in this talk. MySQL is growing very rapidly at LinkedIn and we are in search of an open source and reliable async multi-colo replication. I hope this talk may stress the need for it and let the open source community come up with good solutions.

avatar for Karthik Appigatla

Karthik Appigatla

Staff SRE, LinkedIn
Karthik Appigatla has been working on various large scale data stores for a decade primarily focused on MySQL. Currently, he has been working for LinkedIn for the last 5 years. Prior to LinkedIn, he worked for Yahoo, Pythian and Percona where he was responsible for helping clients... Read More →

Wednesday May 12, 2021 16:00 - 16:30 EDT
Room #1