Learning Open Domain Multi-hop Search Using Reinforcement Learning

Noriega-Atala, Enrique; Surdeanu, Mihai; Morrison, Clayton T.

Computer Science > Computation and Language

arXiv:2205.15281 (cs)

[Submitted on 30 May 2022]

Title:Learning Open Domain Multi-hop Search Using Reinforcement Learning

Authors:Enrique Noriega-Atala, Mihai Surdeanu, Clayton T. Morrison

View PDF

Abstract:We propose a method to teach an automated agent to learn how to search for multi-hop paths of relations between entities in an open domain. The method learns a policy for directing existing information retrieval and machine reading resources to focus on relevant regions of a corpus. The approach formulates the learning problem as a Markov decision process with a state representation that encodes the dynamics of the search process and a reward structure that minimizes the number of documents that must be processed while still finding multi-hop paths. We implement the method in an actor-critic reinforcement learning algorithm and evaluate it on a dataset of search problems derived from a subset of English Wikipedia. The algorithm finds a family of policies that succeeds in extracting the desired information while processing fewer documents compared to several baseline heuristic algorithms.

Comments:	Accepted for publication at the Structured and Unstructured Knowledge Integration (SUKI) workshop, held at NAACL-HLT 2022
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2205.15281 [cs.CL]
	(or arXiv:2205.15281v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.15281

Submission history

From: Enrique Noriega-Atala [view email]
[v1] Mon, 30 May 2022 17:44:19 UTC (6,859 KB)

Computer Science > Computation and Language

Title:Learning Open Domain Multi-hop Search Using Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Learning Open Domain Multi-hop Search Using Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators