Write the code once
Prototype, develop, test and deploy with the same tool
Parallelize the code automatically
Don't worry about pipelines
Don't reinvent the wheel
Use the SOIL Library for typical modules such as filtering, clustering, basic statistics, and more
Run cloud and local modules seamlessly
import soil
from soil.modules.preprocessing.filters import row_filter
from soil.modules.simple_module import my_module

patients = soil.data('my_data')
women, = row_filter.RowFilter(patients, sex={'eql': '1'})
statistics, = my_module(women, aggregation_column='age')
print(statistics.data)
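
To make the data flow in the snippet above concrete, here is a minimal plain-Python sketch of the same two steps: keep the rows where sex equals '1', then aggregate over the age column. It does not use SOIL and does not show how SOIL works internally. The sample records, the helper names row_filter_eql and basic_statistics, and the choice of statistics (count, mean, min, max) are all assumptions made for illustration.

```python
from statistics import mean

# Hypothetical stand-in for soil.data('my_data'); the records are made up.
patients = [
    {"sex": "1", "age": 34},
    {"sex": "0", "age": 51},
    {"sex": "1", "age": 46},
    {"sex": "0", "age": 29},
]


def row_filter_eql(rows, **conditions):
    """Keep the rows where every named column equals the given value."""
    return [
        row for row in rows
        if all(row[column] == value for column, value in conditions.items())
    ]


def basic_statistics(rows, aggregation_column):
    """Summarize one column; the chosen statistics are an assumption."""
    values = [row[aggregation_column] for row in rows]
    return {
        "count": len(values),
        "mean": mean(values),
        "min": min(values),
        "max": max(values),
    }


# Same shape as the SOIL example: filter first, then aggregate on 'age'.
women = row_filter_eql(patients, sex="1")
stats = basic_statistics(women, aggregation_column="age")
print(stats)
```

In the SOIL snippet each module call is unpacked with a trailing comma (women, = ...), which suggests that a module returns a sequence of outputs. The helpers in this sketch return a single value instead, so no unpacking is needed.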