All posts by
Michał Gozdera

May 19 2025

Unlock faster data processing for Machine Learning: reducing pivoting time from hours to minutes

Training Machine Learning models on big data isn’t just about fitting the model itself — it’s about efficiency at every stage of the process. While much attention is given to optimizing model training itself, the earlier phases can be just as, if not more, critical to the overall performance. In this article, we take a deep dive into what happens before we actually invoke model.fit(), focusing on the data pivoting stage. We are taking you on a journey through various pivoting solutions, exploring both pitfalls and interesting optimizations. The goal is simple: make this process highly efficient — in terms of processing time and memory usage. So, buckle up!


Michał Gozdera

Michał is a Machine Learning Engineer with a background in application development. At Allegro, he focuses on building a platform that streamlines the training and deployment of machine learning models.