• Medientyp: E-Artikel; Sonstige Veröffentlichung
  • Titel: API Comparison of CPU-To-GPU Command Offloading Latency on Embedded Platforms (Artifact)
  • Beteiligte: Cavicchioli, Roberto [VerfasserIn]; Capodieci, Nicola [VerfasserIn]; Solieri, Marco [VerfasserIn]; Bertogna, Marko [VerfasserIn]
  • Erschienen: Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019
  • Sprache: Englisch
  • DOI: https://doi.org/10.4230/DARTS.5.1.4
  • Schlagwörter: Heterogeneus systems ; Applications ; GPU
  • Entstehung:
  • Anmerkungen: Diese Datenquelle enthält auch Bestandsnachweise, die nicht zu einem Volltext führen.
  • Beschreibung: High-performance heterogeneous embedded platforms allow offloading of parallel workloads to an integrated accelerator, such as General Purpose-Graphic Processing Units (GP-GPUs). A time-predictable characterization of task submission is a must in real-time applications. We provide a profiler of the time spent by the CPU for submitting stereotypical GP-GPU workload shaped as a Deep Neural Network of parameterized complexity. The submission is performed using the latest API available: NVIDIA CUDA, including its various techniques, and Vulkan. Complete automation for the test on Jetson Xavier is also provided by scripts that install software dependencies, run the experiments, and collect results in a PDF report.
  • Zugangsstatus: Freier Zugang
  • Rechte-/Nutzungshinweise: Namensnennung (CC BY)