Subject: [Dbworld] Introducing ReproZip!
Date: Thursday 9th June 2016 04:32:08 UTC (about 1 year ago)
This message introduces ReproZip (https://vida-nyu.github.io/reprozip/), a tool developed at NYU that makes reproducibility quite easy, even after an experiment or software has been developed without reproducibility in mind. ReproZip enables a researcher to create a compendium of his/her Linux experiment by automatically tracking and identifying all its required dependencies (data files, libraries, configuration files, etc.). After creating the package with at most two commands, the researcher can share it with others, who can then use ReproZip to unpack the experiment and reproduce the findings independent of their operating system, again with little to no effort. ReproZip can also be used to simply detect all the dependencies used by the experiment, which allows for better documentation and debugging. ReproZip can pack and unpack a variety of experiments, including client-server scenarios, databases, and interactive and visualization tools. Our examples repository (https://github.com/ViDA-NYU/reprozip-examples/) contains a few examples of experiments packed and unpacked with ReproZip, including: - A simulation in statistical physics (https://github.com/ViDA-NYU/reprozip-examples/tree/master/ising-model); - Data analysis examples in digital humanities (https://github.com/ViDA-NYU/reprozip-examples/tree/master/irish-schools) and data journalism (https://github.com/ViDA-NYU/reprozip-examples/tree/master/bechdel-test); - A real Web app called Stacked Up (https://github.com/ViDA-NYU/reprozip-examples/tree/master/stacked-up), which includes a PostgreSQL database; - A tool for visualizing the bus transportation system in the city of Konstanz (https://github.com/ViDA-NYU/reprozip-examples/tree/master/bus-vis). ReproZip will be demoed at SIGMOD this year, so please drop by, check our demo examples, and ask us any questions you might have! Also, if you have an experiment that can be executed in a reasonable time (i.e., that takes at most 2 minutes), feel free to bring it for us to pack and unpack! We will have USB drivers to move ReproZip packages between computers. For more information on ReproZip, please check the preprint of our demo paper (http://bigdata.poly.edu/~fchirigati/papers/reprozip-sigmod206.pdf), our official website (https://vida-nyu.github.io/reprozip/), and our documentation (http://reprozip.readthedocs.io/). See you at SIGMOD! Fernando Chirigati Remi Rampin Vicky Steeves Dennis Shasha Juliana Freire (New York University) _______________________________________________ Please do not post msgs that are not relevant to the database community at large. Go to www.cs.wisc.edu/dbworld for guidelines and posting forms. To unsubscribe, go to https://lists.cs.wisc.edu/mailman/listinfo/dbworld