Correlating messy data with "correlate"
- Room: Liffey B
- Duration: 30 minutes
Abstract
An introduction to the correlate Python library. You tell correlate about two datasets that should map to each other, and it determines the best matches for you. The novel scoring algorithm at the heart of correlate means it copes exceedingly well with messy real-world data. correlate supports fuzzy matching, weighted matching, and ordering.
Talk | Python Libraries
Description
Data correlation! What could be more computer science-y? Ever needed to find matching items between two sets of data? Maybe even messy real-world data, with inexact string matches? Come find out how the novel scoring algorithm and clever heuristics at the heart of correlate solve this problem with ease!
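
To make the problem concrete, here is a minimal sketch of matching two small, messy datasets using only Python's standard library. It does not use correlate's API and does not reproduce its scoring algorithm: it swaps in difflib.SequenceMatcher for string similarity and a simple greedy pairing. The names similarity, best_matches, and minimum_score are hypothetical, chosen only for this illustration.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Return a 0..1 similarity score for two strings, ignoring case."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def best_matches(left, right, minimum_score=0.6):
    """Greedily pair items from two lists by descending similarity.

    Illustrative stand-in only: this is NOT correlate's algorithm.
    Each item is used in at most one pair, and pairs scoring below
    minimum_score (a hypothetical cutoff) are discarded.
    """
    # Score every possible pair, best first.
    scored = sorted(
        ((similarity(a, b), a, b) for a in left for b in right),
        key=lambda item: item[0],
        reverse=True,
    )
    used_left, used_right = set(), set()
    matches = []
    for score, a, b in scored:
        if score < minimum_score:
            break  # everything after this point scores even lower
        if a in used_left or b in used_right:
            continue  # one of the two items is already paired
        matches.append((a, b, score))
        used_left.add(a)
        used_right.add(b)
    return matches


# Two made-up datasets that should map to each other.
left = ["Jon Smith", "Maria Garcia", "Wei Chen"]
right = ["maria garcia", "John Smith", "Priya Patel"]

matches = best_matches(left, right)
for a, b, score in matches:
    print(f"{score:.3f}  {a!r} -> {b!r}")
```

A greedy pass like this is easy to follow but can make poor choices when several items are nearly identical, which is the kind of ambiguity a dedicated scoring algorithm is meant to handle. "Wei Chen" and "Priya Patel" stay unmatched here because no pairing involving them reaches the cutoff.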