4. System Overview

The Febrl system follows an object-oriented design and is split into a handful of modules, each containing routines for specific tasks. The overall system is configured and controlled by a project.py module, which is explained in detail in Chapter 5.

Record linkage consists of two main steps. The first cleans and standardises the data, while the second performs the actual linkage (or deduplication). The user therefore needs to specify various settings before running a cleaning/standardisation process, a linkage/deduplication/geocoding process, or both.
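
The two steps can be pictured with a minimal sketch. It is illustrative only: the function names standardise and deduplicate, the field names, and the sample records are all assumptions made for this example and are not part of Febrl's modules. Exact agreement on cleaned fields stands in for whatever comparison and classification method a real linkage would use.

```python
# Illustrative two-step pipeline. None of these names are Febrl's API;
# they are hypothetical helpers used only to show the order of the steps.
import re
from itertools import combinations


def standardise(record):
    """Step 1: clean one raw record (lower-case, drop punctuation, collapse whitespace)."""
    cleaned = {}
    for field, value in record.items():
        value = re.sub(r"[^\w\s]", "", value.lower())
        cleaned[field] = " ".join(value.split())
    return cleaned


def deduplicate(records, fields):
    """Step 2: return index pairs whose standardised fields agree exactly."""
    pairs = []
    for (i, a), (j, b) in combinations(enumerate(records), 2):
        if all(a[f] == b[f] for f in fields):
            pairs.append((i, j))
    return pairs


# Hypothetical sample data: the first two records differ only in formatting.
raw = [
    {"given_name": "Peter ", "surname": "O'Brien"},
    {"given_name": "peter", "surname": "OBrien"},
    {"given_name": "Mary", "surname": "Smith"},
]

clean = [standardise(r) for r in raw]
matches = deduplicate(clean, ["given_name", "surname"])
print(matches)  # [(0, 1)]
```

The sketch also shows why the first step matters: without standardisation, the first two records would not compare as equal, so the second step would miss the duplicate.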