Free keywords:
Computer Science, Learning (cs.LG); Computer Science, Databases (cs.DB)
Abstract:
Much of the world's most valued data is stored in relational databases and
data warehouses, where the data is organized into many tables connected by
primary-foreign key relations. However, building machine learning models using
this data is both challenging and time-consuming. The core problem is that no
machine learning method is capable of learning on multiple tables
interconnected by primary-foreign key relations. Current methods can only learn
from a single table, so the data must first be manually joined and aggregated
into a single training table, a process known as feature engineering. Feature
engineering is slow, error-prone, and leads to suboptimal models. Here we
introduce an end-to-end deep representation learning approach to directly learn
on data laid out across multiple tables. We name our approach Relational Deep
Learning (RDL). The core idea is to view relational databases as a temporal,
heterogeneous graph, with a node for each row in each table, and edges
specified by primary-foreign key links. Message Passing Graph Neural Networks
can then automatically learn across the graph to extract representations that
leverage all input data, without any manual feature engineering. Relational
Deep Learning leads to more accurate models that can be built much faster. To
facilitate research in this area, we develop RelBench, a set of benchmark
datasets and an implementation of Relational Deep Learning. The data covers a
wide spectrum, from discussions on Stack Exchange to book reviews on the Amazon
Product Catalog. Overall, we define a new research area that generalizes graph
machine learning and broadens its applicability to a wide set of AI use cases.
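The row-to-node, key-to-edge construction described in the abstract can be sketched in a few lines. The following is an illustrative toy example, not the paper's implementation: the table names, keys, and feature vectors are invented, and a simple mean aggregation stands in for a learned message-passing layer.

```python
# Two toy tables linked by a primary-foreign key: each review row points at
# a user row. Every row becomes a node; every key link becomes an edge.
users = {  # primary key -> feature vector
    1: [1.0, 0.0],
    2: [0.0, 1.0],
}
reviews = {  # primary key -> (foreign key into users, feature vector)
    10: (1, [2.0, 2.0]),
    11: (1, [4.0, 0.0]),
    12: (2, [1.0, 3.0]),
}

# Edges of the heterogeneous graph, typed by the tables they connect.
edges = [(("review", rid), ("user", uid)) for rid, (uid, _) in reviews.items()]

def mean_aggregate(users, reviews, edges):
    """One message-passing round: each user node averages the feature
    vectors of its incoming review nodes and concatenates that summary
    onto its own features -- no manual join or aggregation query needed."""
    incoming = {uid: [] for uid in users}
    for (_, rid), (_, uid) in edges:
        incoming[uid].append(reviews[rid][1])
    out = {}
    for uid, feats in users.items():
        msgs = incoming[uid]
        agg = [sum(col) / len(msgs) for col in zip(*msgs)] if msgs else [0.0, 0.0]
        out[uid] = feats + agg  # self features + aggregated neighbor features
    return out

embeddings = mean_aggregate(users, reviews, edges)
print(embeddings[1])  # user 1 summarizes reviews 10 and 11
```

In a real GNN the fixed mean would be replaced by learned, trainable message and update functions applied over several rounds, but the graph construction step is the same idea.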