Evaluating Architectural Choices for Deep Learning Approaches for Question Answering over Knowledge Bases

Hakimov, Sherzod; Jebbara, Soufian; Cimiano, Philipp

Computer Science > Computation and Language

arXiv:1812.02536 (cs)

[Submitted on 6 Dec 2018 (v1), last revised 13 Dec 2018 (this version, v2)]

Title:Evaluating Architectural Choices for Deep Learning Approaches for Question Answering over Knowledge Bases

Authors:Sherzod Hakimov, Soufian Jebbara, Philipp Cimiano

View PDF

Abstract:The task of answering natural language questions over knowledge bases has received wide attention in recent years. Various deep learning architectures have been proposed for this task. However, architectural design choices are typically not systematically compared nor evaluated under the same conditions. In this paper, we contribute to a better understanding of the impact of architectural design choices by evaluating four different architectures under the same conditions. We address the task of answering simple questions, consisting in predicting the subject and predicate of a triple given a question. In order to provide a fair comparison of different architectures, we evaluate them under the same strategy for inferring the subject, and compare different architectures for inferring the predicate. The architecture for inferring the subject is based on a standard LSTM model trained to recognize the span of the subject in the question and on a linking component that links the subject span to an entity in the knowledge base. The architectures for predicate inference are based on i) a standard softmax classifier ranging over all predicates as output, iii) a model that predicts a low-dimensional encoding of the property given entity representation and question, iii) a model that learns to score a pair of subject and predicate given the question as well as iv) a model based on the well-known FastText model. The comparison of architectures shows that FastText provides better results than other architectures.

Comments:	the longer version than the original publication at ICSC 2019
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:1812.02536 [cs.CL]
	(or arXiv:1812.02536v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1812.02536

Submission history

From: Sherzod Hakimov [view email]
[v1] Thu, 6 Dec 2018 14:11:25 UTC (453 KB)
[v2] Thu, 13 Dec 2018 09:36:17 UTC (577 KB)

Computer Science > Computation and Language

Title:Evaluating Architectural Choices for Deep Learning Approaches for Question Answering over Knowledge Bases

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Evaluating Architectural Choices for Deep Learning Approaches for Question Answering over Knowledge Bases

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators