Reinforcement Learning Based Meta-Path Discovery in Large-Scale Heterogeneous Information Networks

Loading...
Thumbnail Image
File version

Accepted Manuscript (AM)

Author(s)
Wan, Guojia
Du, Bo
Pan, Shirui
Haffari, Gholamreza
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
2020
Size
File type(s)
Location

New York, USA

License
Abstract

Meta-paths are important tools for a wide variety of data mining and network analysis tasks in Heterogeneous Information Networks (HINs), due to their flexibility and interpretability to capture the complex semantic relation among objects. To date, most HIN analysis still relies on handcrafting meta-paths, which requires rich domain knowledge that is extremely difficult to obtain in complex, large-scale, and schema-rich HINs. In this work, we present a novel framework, Meta-path Discovery with Reinforcement Learning (MPDRL), to identify informative meta-paths from complex and large-scale HINs. To capture different semantic information between objects, we propose a novel multi-hop reasoning strategy in a reinforcement learning framework which aims to infer the next promising relation that links a source entity to a target entity. To improve the efficiency, moreover, we develop a type context representation embedded approach to scale the RL framework to handle million-scale HINs. As multi-hop reasoning generates rich meta-paths with various length, we further perform a meta-path induction step to summarize the important meta-paths using Lowest Common Ancestor principle. Experimental results on two large-scale HINs, Yago and NELL, validate our approach and demonstrate that our algorithm not only achieves superior performance in the link prediction task, but also identifies useful meta-paths that would have been ignored by human experts.

Journal Title
Conference Title

Proceedings of The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20)

Book Title
Edition
Volume

34

Issue

4

Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement

© 2020 AAAI Press. This is the author-manuscript version of this paper. Reproduced in accordance with the copyright policy of the publisher. Please refer to the conference's website for access to the definitive, published version.

Item Access Status
Note
Access the data
Related item(s)
Subject

Science & Technology

Social Sciences

Computer Science, Artificial Intelligence

Computer Science, Interdisciplinary Applications

Persistent link to this record
Citation

Wan, G; Du, B; Pan, S; Haffari, G, Reinforcement Learning Based Meta-Path Discovery in Large-Scale Heterogeneous Information Networks, Proceedings of The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20), 2020, 34, pp. 6094-6101