Archives and Documentation Center
Digital Archives

Neural Sign Language Translation by learning tokenization

Show simple item record

dc.contributor Graduate Program in Computer Engineering.
dc.contributor.advisor Akarun, Lale.
dc.contributor.author Orbay, Alptekin.
dc.date.accessioned 2023-03-16T10:04:40Z
dc.date.available 2023-03-16T10:04:40Z
dc.date.issued 2020.
dc.identifier.other CMPE 2020 O73
dc.identifier.uri http://digitalarchive.boun.edu.tr/handle/123456789/12428
dc.description.abstract In this thesis, we propose a multitask learning based method to improve Neural Sign Language Translation (NSLT) consisting of two parts, a tokenization layer and Neural Machine Translation (NMT). The tokenization part focuses on how Sign Language (SL) videos should be represented to be fed into the other part. It has not been studied elaborately whereas NMT research has attracted several researchers contributing enormous advancements. Up to now, there are two main input tokenization levels, namely frame-level and gloss-level tokenization. Glosses are world-like intermediate presentation and unique to SLs. Therefore, we aim to develop a generic sign-level tokenization layer so that it is applicable to other domains without further e ort. We begin with investigating current tokenization approaches and explain their weaknesses with several experiments. To provide a solution, we adapt Transfer Learning, Multitask Learning and Unsupervised Domain Adaptation into this research to leverage additional supervision. We succeed in enabling knowledge transfer between SLs and improve translation quality by 5 points in BLEU-4 and 8 points in ROUGE scores. Secondly, we show the e ects of body parts by extensive experiments in all the tokenization approaches. Apart from these, we adopt 3D-CNNs to improve e ciency in terms of time and space. Lastly, we discuss the advantages of sign-level tokenization over gloss-level tokenization. To sum up, our proposed method eliminates the need for gloss level annotation to obtain higher scores by providing additional supervision by utilizing weak supervision sources.
dc.format.extent 30 cm.
dc.publisher Thesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2020.
dc.subject.lcsh Sign language -- Study and teaching.
dc.subject.lcsh Learning strategies.
dc.title Neural Sign Language Translation by learning tokenization
dc.format.pages xiii, 71 leaves ;


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Digital Archive


Browse

My Account