Loading…
Welcome to Percona Live Online 2021
Online Open Source Database Conference
REGISTER HERE!
Back To Schedule
Thursday, May 13 • 07:00 - 08:00
How Machine Learning Inside Databases Solves Significant Data-Science Challenges

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.


Machine Learning inside databases is becoming a hot trend. Last time at Percona Live 2020, our team presented AI Tables - an open-source solution that enables automated machine learning capabilities inside databases. The main idea of AI Tables is to allow anyone who works with databases to implement ML projects in a matter of hours without requiring data science skills.

It is as simple as using SQL queries!

In the journey of bringing AI Tables to the community, we have discovered and solved Machine Learning problems that are hard even for ML engineers but are common for data inside databases.

For example:
Forecasting inventory for all products in all stores (**GROUP BY store, product_id**), given a table that contains all inventory updates over time (**ORDER BY time**).

This problem is complex even for experienced ML engineering teams. In a traditional ML approach, you would need to train one model for each product at each store, which can mean thousands or hundreds of thousands of models, not even thinking of the logistic nightmare to bring such many models to production.

Another example of a challenge solved is creating views that do **joins between data tables and ML models**. It significantly streamlines using machine learning inside BI tools to forecast data trends. Also, it opens broader possibilities for anomaly detection and much more!

We have made significant progress in solving those problems automatically through AI-Tables, and we would like to share with you our approach and discuss some interesting insights that we have made in the process.

**Agenda:**
- 5 min | Advantages of ML inside a database over the traditional approach
- 15 min | Machine learning workflows inside databases
- 15 min | Automated multivariate time-series forecasting
- 15 min | Joining tables with ML models
- 10 min | Q&A

Speakers
avatar for Jorge Torres

Jorge Torres

CEO, MindsDB
Jorge Torres is the Co-founder & CEO of MindsDB. He is also a visiting scholar at UC Berkeley researching machine learning automation and explainability. Before founding MindsDB, he worked for a number of data-intensive start-ups, most recently working with Aneesh Chopra (the first... Read More →
avatar for Patricio Cerda-Mardini

Patricio Cerda-Mardini

Machine Learning Research Engineer, MindsDB
Patricio Cerda-Mardini is a Machine Learning Research Engineer. As a masters student at PUC Chile, he focused on machine learning methods for human-robot interaction and recommendation systems, areas in which he holds a couple of academic publications. Prior to joining MindsDB, he... Read More →


Thursday May 13, 2021 07:00 - 08:00 EDT
Room #1