PyArrow and the future of data analytics
- Room: Liffey Hall 1
- Duration: 45 minutes
Abstract
In this talk we will introduce PyArrow and discuss the transformation that the Arrow format is enabling in the data analytics world.
PyArrow provides an in-memory format, a disk format, a network exchange protocol, a dataframe library and a query engine, all integrated in a single library. But the Arrow ecosystem doesn't stop there: it also lets you integrate multiple different technologies. It can be a Swiss Army knife for data engineers, and in many cases it integrates with NumPy and pandas at zero cost.
Talk
PyData: Data Engineering