DeltaPy: A Framework for Tabular Data Augmentation in Python
3 Pages Posted: 19 May 2020
Date Written: April 22, 2020
Abstract
A range of data abstractions have come to the fore since the re-emergence of machine learning. This includes procedures like feature engineering, extraction, transformation, and selection, as well as data pre-processing, generation, synthesisation, and augmentation. This report attempts to unify some of this terminology with the development of a bare-bones Python package, DeltaPy.
Keywords: Tabular Data, Augmentation Methods, Machine Learning, Data Science, Feature Engineering, Synthetic Data, Colab Notebook
JEL Classification: C02, C13, C21, C38, C53, C87
Suggested Citation: Suggested Citation
Snow, Derek, DeltaPy: A Framework for Tabular Data Augmentation in Python (April 22, 2020). Available at SSRN: https://ssrn.com/abstract=3582219 or http://dx.doi.org/10.2139/ssrn.3582219
Do you have a job opening that you would like to promote on SSRN?
Feedback
Feedback to SSRN
If you need immediate assistance, call 877-SSRNHelp (877 777 6435) in the United States, or +1 212 448 2500 outside of the United States, 8:30AM to 6:00PM U.S. Eastern, Monday - Friday.