Loading…
Welcome to Percona Live Online 2021
Online Open Source Database Conference
REGISTER HERE!
Back To Schedule
Wednesday, May 12 • 15:30 - 16:30
Trino on Ice: Using Iceberg To Replace the Hive Table Format

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
Trino (formerly PrestoSQL) is a ludicrously fast query engine that evolved from the need to replace the slow query turnaround speeds of the Hive engine. Trino grew in popularity under the label of Presto for years as an interactive query engine that lives over your data lake. While this operation was certainly a step away from the initial big data days of waiting hours to days for queries to complete, there were still many tedious rules engineers had to follow in order to correctly create, manage, and use their data in the datalake due to the Hive table format.

Apache Iceberg, a table format created at Netix, aims to address many of these issues. Iceberg simplifies the life of the engineer by decoupling the logical view of the data from the physical layout of the data using techniques like hidden partitioning and allows for in-place schema migration of your tables. Iceberg also increases the speed at which you can query your system by tracking files at the le level versus the partition level and so much more. 2 of 3 Speakers

With the marrying of Trino and Iceberg, companies can take advantage of a full replacement of the big data days of old and move into the next generation of datalakes that simplify the mental load of their data engineers and focus on building out the business logic and other tasks. In this talk I will cover some of the examples of the issues Iceberg solves from the lens of Trino.

Speakers
avatar for Brian Olsen

Brian Olsen

Developer Advocate, Starburst
Brian is a U.S. Marine turned software engineer and developer advocate working to foster the open-source Trino community. Brian spent four years as a data engineer at a cybersecurity company working on pipeline maintenance and query optimization. While in this role, Brian was responsible... Read More →


Wednesday May 12, 2021 15:30 - 16:30 EDT
Room #7