Scalable Distributed Machine Learning for Knowledge Graphs

Medientyp: Dissertation; Elektronische Hochschulschrift; E-Book

Titel: Scalable Distributed Machine Learning for Knowledge Graphs

Beteiligte: Draschner, Carsten Felix [Verfasser:in]

Erschienen: Universitäts- und Landesbibliothek Bonn, 2023-07-17

Sprache: Englisch

DOI: https://doi.org/20.500.11811/10945

Schlagwörter: Knowledge Graphs ; SANSA ; Artificial Intelligence ; AI Ethics ; Machine Learning ; Scalable Semantic Analytics ; Distributed Computing

Entstehung:

Anmerkungen: Diese Datenquelle enthält auch Bestandsnachweise, die nicht zu einem Volltext führen.

Beschreibung: Due to the increasing progress of digitization, immense amounts of data are accumulating, which can be summarized under the term Big Data and form an exciting basis for data analyses. Since the data are heterogeneous and come from many different sources, data integration techniques are beneficial to perform analytics. Knowledge Graphs (KG) link the heterogeneous data within a directed multi-graph by unique resource identifiers. These data can be used for data analytics and prediction methods. One subbranch of Artificial Intelligence (AI) is Machine Learning (ML). ML models are developed and trained, which, based on the available training data, should approximate the target data as closely as possible. The samples in the training data are usually represented by features. For most data analytics and ML approaches, these features are fixed-length numeric feature vectors. However, in the context of KGs, there is no native representation within fixed-length numeric feature vectors. Depending on the use case, these problems can also require the concrete use and inclusion of individual actual values from the KG. The sheer size of some large-scale KG data does not fit into the memory of today's computers. One solution is to use cluster computation through distributed execution, which distributes the data and processing tasks across multiple computers. Both the technologies and the algorithms for this distributed computation must be designated. Due to the possible impact of the results from these data analysis pipelines, special technical implementation of accessible, reproducible, reusable, and explainable approaches is beneficial. These ML and AI development meta-dimensions belong to Ethical AI and Sustainable AI concepts. Within this work, we developed novel approaches for ML on KGs while considering ethical and sustainability dimensions. In particular, we developed technologies that create fixed-length numeric feature vectors. These include methods that, like graph kernels, extract features from the graph in the ...

Zugangsstatus: Freier Zugang

Rechte-/Nutzungshinweise: Urheberrechtsschutz

Nur in Feld suchen:

Zuletzt gesuchte Begriffe: