DeltaPy: A Framework for Tabular Data Augmentation in Python

3 Pages Posted: 19 May 2020

See all articles by Derek Snow

Derek Snow

New York University (NYU) - Finance and Risk Engineering Department; The Alan Turing Institute; University of Oxford - Oxford-Man Institute of Quantitative Finance

Date Written: April 22, 2020

Abstract

A range of data abstractions have come to the fore since the re-emergence of machine learning. This includes procedures like feature engineering, extraction, transformation, and selection, as well as data pre-processing, generation, synthesisation, and augmentation. This report attempts to unify some of this terminology with the development of a bare-bones Python package, DeltaPy.

Keywords: Tabular Data, Augmentation Methods, Machine Learning, Data Science, Feature Engineering, Synthetic Data, Colab Notebook

JEL Classification: C02, C13, C21, C38, C53, C87

Suggested Citation

Snow, Derek, DeltaPy: A Framework for Tabular Data Augmentation in Python (April 22, 2020). Available at SSRN: https://ssrn.com/abstract=3582219 or http://dx.doi.org/10.2139/ssrn.3582219

Derek Snow (Contact Author)

New York University (NYU) - Finance and Risk Engineering Department ( email )

6 Metrotech Center
New York, NY 11201
United States

The Alan Turing Institute ( email )

British Library, 96 Euston Rd
London, NW1 2DB
United Kingdom

HOME PAGE: http://www.turing.ac.uk/

University of Oxford - Oxford-Man Institute of Quantitative Finance ( email )

Eagle House
Walton Well Road
Oxford, Oxfordshire OX2 6ED
United Kingdom

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
1,915
Abstract Views
5,406
Rank
21,634
PlumX Metrics