DeltaPy: A Framework for Tabular Data Augmentation in Python

3 Pages Posted: 19 May 2020

See all articles by Derek Snow

Derek Snow

The Alan Turing Institute

Date Written: April 22, 2020


A range of data abstractions have come to the fore since the re-emergence of machine learning. This includes procedures like feature engineering, extraction, transformation, and selection, as well as data pre-processing, generation, synthesisation, and augmentation. This report attempts to unify some of this terminology with the development of a bare-bones Python package, DeltaPy.

Keywords: Tabular Data, Augmentation Methods, Machine Learning, Data Science, Feature Engineering, Synthetic Data, Colab Notebook

JEL Classification: C02, C13, C21, C38, C53, C87

Suggested Citation

Snow, Derek, DeltaPy: A Framework for Tabular Data Augmentation in Python (April 22, 2020). Available at SSRN: or

Derek Snow (Contact Author)

The Alan Turing Institute ( email )

British Library, 96 Euston Rd
London, NW1 2DB
United Kingdom


Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Abstract Views
PlumX Metrics