Arşiv ve Dokümantasyon Merkezi
Dijital Arşivi

A knowledge-graph based graph neural network model to identify topics in short texts

Basit öğe kaydını göster

dc.contributor Graduate Program in Computer Engineering.
dc.contributor.advisor Üsküdarlı, Suzan.
dc.contributor.author Güney, Abdullah Atakan.
dc.date.accessioned 2023-10-15T06:43:04Z
dc.date.available 2023-10-15T06:43:04Z
dc.date.issued 2022
dc.identifier.other CMPE 2022 G86
dc.identifier.uri http://digitalarchive.boun.edu.tr/handle/123456789/19695
dc.description.abstract Topic models are probabilistic generative models used to analyze a collection of documents. People have leveraged topic models for many years to extract hidden structures from documents. However, classical topic models such as Latent Dirichlet Allocation (LDA) have issues with short texts typical in user- generated social media content. Due to the limited context of short texts, they fail to learn interpretable topics from extensive vocabularies with the bag of word representations that do not repre sent them well. This thesis proposes a topic model based on Graph Neural Networks (GNN) where documents are represented as graphs with entity-specific relations using Wikidata as a knowledge graph. A graph attention network learns the embeddings of these documents whose outputs are passed to the probabilistic generative topic model Entity Embedded Topic Modeling (EETM) as probability distribution parameters to yield the topics. We evaluate our model with various short text collections fetched from Twitter related to politics, sports, pandemics, and trending news events. We provide a detailed discussion regarding our observations related to the learned embeddings and qualities of topics resulting from our model.
dc.publisher Thesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2022.
dc.subject.lcsh Latent structure analysis.
dc.subject.lcsh Latent Dirichlet Allocation.
dc.title A knowledge-graph based graph neural network model to identify topics in short texts
dc.format.pages xiv, 91 leaves


Bu öğenin dosyaları

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster

Dijital Arşivde Ara


Göz at

Hesabım