Retrieval of Experiments by Efficient Comparison of Marginal Likelihoods

Abstract

We study the task of retrieving relevant experiments given a query experiment. By experi- ment, we mean a collection of measurements from a set of ‘covariates’ and the associated ‘out- comes’. While similar experiments can be retrieved by comparing available ‘annotations’, this approach ignores the valuable information available in the measurements themselves. To incor- porate this information in the retrieval task, we suggest employing a retrieval metric that utilizes probabilistic models learned from the measurements. We argue that such a metric is a sensible measure of similarity between two experiments since it permits inclusion of experiment-specific prior knowledge. However, accurate models are often not analytical, and one must resort to storing posterior samples which demands considerable resources. Therefore, we study strate- gies to select informative posterior samples to reduce the computational load while maintaining the retrieval performance. We demonstrate the efficacy of our approach on simulated data with simple linear regression as the models, and real world datasets.

Date
Sep 11, 2014 5:00 PM — 5:30 PM
Event
Large Scale Online Learning and Decision Making Workshop
Sohan Seth
Sohan Seth
Lead Data Scientist

Lead Data Scientist (Senior Research Fellow equivalent) at the School of Informatics, University of Edinburgh.