Mathematical programming and statistical learning approaches for multiple instance learning

Lök, Emel Şeyma.

Archives and Documentation Center Digital Archives Home
→
Boğaziçi Üniversitesi Tezleri
→
Fen Bilimleri Enstitüsü
→
Endüstri Mühendisliği
→
Ph.D. Theses
→
View Item

Mathematical programming and statistical learning approaches for multiple instance learning

Lök, Emel Şeyma.

URI: http://digitalarchive.boun.edu.tr/handle/123456789/13574

Date: 2018.

Abstract:

Many real-world applications of classiﬁcation require ﬂexibility in representing complex objects to preserve the relevant information for class separation. Multiple instance learning (MIL) aims to solve classiﬁcation problem where each object is rep resented with a bag of instances, and class labels are provided for the bags rather than individual instances. The aim is to learn a function that correctly labels new bags. In this thesis, we propose statistical learning and mathematical optimization methods to solve MIL problems from diversiﬁed application domains. We ﬁrst present bag encoding strategies to obtain bag-level feature vectors for MIL. Simple instance space partition ing approaches are utilized to learn representative feature vectors for the bags. Our experiments on a large database of MIL problems show that random tree-based encod ing is scalable and its performance is competitive with the state-of-the-art methods. Mathematical programming-based approaches to MIL problem construct a bag-level decision function. In this context, we formulate MIL problem as a linear programming model to optimize bag orderings for correct classiﬁcation. Proposed formulation com bines instance-level scores to return an estimate on the bag label. All instances are solved to optimality on various data representations in a reasonable computation time. At last, we develop a quadratic programming formulation that is superior to previous MIL formulations on underlying assumptions and computational diﬃculties. Proposed MIL framework models contributions of instances to the bag class labels, and provide a bag class decision threshold. Experimental results verify that proposed formulation enables eﬀective classiﬁcation in various MIL applications.

Show full item record