EuroPython

PyArrow and the future of data analytics

Room:
Liffey Hall 1
Start:
09:30 on 14 July 2022
Duration:
45 minutes

Abstract

In this talk we will introduce PyArrow and talk about the transformation that the Arrow format is enabling in the data analytics world.

PyArrow provides an in-memory format, a disk format, a network exchange protocol, a dataframe library and a query engine, all integrated in a single library. The Arrow ecosystem doesn't stop there: it also lets you integrate multiple different technologies. It can be a Swiss Army knife for data engineers, and in many cases it integrates with NumPy and pandas at zero cost.

Talk
PyData: Data Engineering


The speaker

Alessandro Molina

Having relied on Python as his primary development language for more than 15 years, Alessandro has always been interested in Python as a development platform.

He has worked as CTO and team leader of Python teams for the past 10 years and is currently a core developer of the TurboGears2 web framework and a contributor to the Apache Arrow project.

Alessandro is the author of Crafting Test-Driven Software with Python and Modern Python Standard Library Cookbook, and has authored many open source Python projects such as the DEPOT file storage framework and the DukPy JavaScript interpreter for Python.

Alessandro has been an active speaker at tens of European conferences since 2012.


