Paper Reading #4 : Gestalt: Integrated Support for Implementation and Analysis in Machine Learning

Reference
Authors: Kayur Patel†, Naomi Bancroft†, Steven M. Drucker, Andrew J. Ko+, James A. Landay, James Fogarty*
Affiliations: DUB group, University Of Washington, Seattle, WA and Microsoft Research*, Redmond, WA respectively.
Presentation: UIST’10, October 3–6, 2010, New York, New York, USA.
Summary
Hypothesis
The paper presents Gestalt, a development environment designed to support the process of applying machine learning. It hypothesizes Gestalt to improve user's ability to find and fix errors in machine learning code by implementing a classification pipeline,analyzing data as it moves through that pipeline, and easily transitioning between implementation and analysis.
Methods and Results
In one of such pipeline implementation, a)data is transformed into discrete examples, (b) attributes† are computed over each example, (c) a learning algorithm is used to train a model, and (d) the accuracy of that model is evaluated. Gestalt can help developers locate execution errors within specific steps which is implemented using color schemes of red, yellow, green or grey.Gestalt uses a single unified table to store data at pipelines, hence developers are freed from managing data conversion or moving data between tools. The paper further presents different tool works for machine learning and how Gestalt can fit with those platforms.
User study was done among 8 computer science graduate students on gesture recognition and sentiment analysis problems with precoded bugs to compare the performance of Gestalt to baseline.Participants unanimously preferred Gestalt and were able to find and fix more bugs using Gestalt than using the baseline. The explicit structure provided by viewing and interacting with the classification pipeline was one of the most useful components of Gestalt to most of them.Their experiment shows Gestalt significantly improves the ability of developers to find and fix bugs in machine learning systems.
Discussion
Based on the quantitative results and as well the qualitative data presented by the paper, I was satisfied with the research and implementation of Gestalt. As I am not aware of machine learning algorithms, nor have I used one till now, I am not so sure why one would want a system like this. I would say the way it finds bugs and the way people were impressed about the progress in the pipeline is what made Gestalt a better tool. I am not sure if such techniques are desired in SDK's like Eclipse and Visual Studio, but may be if available, it can be beneficial.
Comments
Post a Comment