Posts by Tags

Academic Paper

Enabling Transparent Acceleration of Big Data Frameworks Using Heterogeneous Hardware

21 minute read

Published:

Exploiting heterogeneous hardware for Big Data workloads is usually done by introducing new APIs, resulting in more complex programs to develop, understand, and maintain. But, what if we do not change/extend the original programming model? Is it possible? This post discusses a new approach to do so.

CPUs

CUDA

Installing CUDA, OpenCL and Level Zero in OpenSUSE Leap 15

6 minute read

Published:

In this post, we show how to install the NVIDIA drivers to get access to CUDA and OpenCL parallel programming frameworks and utilities for NVIDIA GPUs. We also show how to install the Intel compute-runtime drivers for accessing, via OpenCL and Level Zero, Intel Integrated Graphics.

Co-design Approach

Enabling Transparent Acceleration of Big Data Frameworks Using Heterogeneous Hardware

21 minute read

Published:

Exploiting heterogeneous hardware for Big Data workloads is usually done by introducing new APIs, resulting in more complex programs to develop, understand, and maintain. But, what if we do not change/extend the original programming model? Is it possible? This post discusses a new approach to do so.

Code Generation

Deep Learning

Running TornadoVM on NVIDIA Jetson Nano

7 minute read

Published:

Did you know that TornadoVM can also run on ARM-based systems with NVIDIA GPUs? In this post, we will show how TornadoVM can be used on an NVIDIA Jetson Nano, a small, powerful computer designed for embedded artificial intelligence (AI) and machine learning (ML) applications.

Docker

Drivers

Installing CUDA, OpenCL and Level Zero in OpenSUSE Leap 15

6 minute read

Published:

In this post, we show how to install the NVIDIA drivers to get access to CUDA and OpenCL parallel programming frameworks and utilities for NVIDIA GPUs. We also show how to install the Intel compute-runtime drivers for accessing, via OpenCL and Level Zero, Intel Integrated Graphics.

FPGAs

GPGPU

GPU Drivers

GPU Profiling

GPUs

Hardware Acceleration

Enabling Transparent Acceleration of Big Data Frameworks Using Heterogeneous Hardware

21 minute read

Published:

Exploiting heterogeneous hardware for Big Data workloads is usually done by introducing new APIs, resulting in more complex programs to develop, understand, and maintain. But, what if we do not change/extend the original programming model? Is it possible? This post discusses a new approach to do so.

Heterogeneous Programming

Installation

Installing CUDA, OpenCL and Level Zero in OpenSUSE Leap 15

6 minute read

Published:

In this post, we show how to install the NVIDIA drivers to get access to CUDA and OpenCL parallel programming frameworks and utilities for NVIDIA GPUs. We also show how to install the Intel compute-runtime drivers for accessing, via OpenCL and Level Zero, Intel Integrated Graphics.

Intel HD Graphics

Intel Integrated GPUs

Intel Level Zero

Intel OpenCL

Intel VTune

JIT Compilation

Level Zero

Exploring Level Zero resources: repositories and purpose

2 minute read

Published:

Sometimes, it is not clear which Level Zero repository is the right one for our needs. In this post, we will explain each of the Level Zero public resources and what they are intended to be.

Installing CUDA, OpenCL and Level Zero in OpenSUSE Leap 15

6 minute read

Published:

In this post, we show how to install the NVIDIA drivers to get access to CUDA and OpenCL parallel programming frameworks and utilities for NVIDIA GPUs. We also show how to install the Intel compute-runtime drivers for accessing, via OpenCL and Level Zero, Intel Integrated Graphics.

Machine Learning

Running TornadoVM on NVIDIA Jetson Nano

7 minute read

Published:

Did you know that TornadoVM can also run on ARM-based systems with NVIDIA GPUs? In this post, we will show how TornadoVM can be used on an NVIDIA Jetson Nano, a small, powerful computer designed for embedded artificial intelligence (AI) and machine learning (ML) applications.

Managed Runtime Systems

Enabling Transparent Acceleration of Big Data Frameworks Using Heterogeneous Hardware

21 minute read

Published:

Exploiting heterogeneous hardware for Big Data workloads is usually done by introducing new APIs, resulting in more complex programs to develop, understand, and maintain. But, what if we do not change/extend the original programming model? Is it possible? This post discusses a new approach to do so.

MaxineVM

Memory Allocation

Memory Types

NVIDIA CUDA

NVIDIA Jetson Nano

Running TornadoVM on NVIDIA Jetson Nano

7 minute read

Published:

Did you know that TornadoVM can also run on ARM-based systems with NVIDIA GPUs? In this post, we will show how TornadoVM can be used on an NVIDIA Jetson Nano, a small, powerful computer designed for embedded artificial intelligence (AI) and machine learning (ML) applications.

OpenCL

Installing CUDA, OpenCL and Level Zero in OpenSUSE Leap 15

6 minute read

Published:

In this post, we show how to install the NVIDIA drivers to get access to CUDA and OpenCL parallel programming frameworks and utilities for NVIDIA GPUs. We also show how to install the Intel compute-runtime drivers for accessing, via OpenCL and Level Zero, Intel Integrated Graphics.

OpenSUSE Leap 15

Installing CUDA, OpenCL and Level Zero in OpenSUSE Leap 15

6 minute read

Published:

In this post, we show how to install the NVIDIA drivers to get access to CUDA and OpenCL parallel programming frameworks and utilities for NVIDIA GPUs. We also show how to install the Intel compute-runtime drivers for accessing, via OpenCL and Level Zero, Intel Integrated Graphics.

Profiling

RHEL

Research

Resources

Exploring Level Zero resources: repositories and purpose

2 minute read

Published:

Sometimes, it is not clear which Level Zero repository is the right one for our needs. In this post, we will explain each of the Level Zero public resources and what they are intended to be.

Runtime System

SPEC

Exploring Level Zero resources: repositories and purpose

2 minute read

Published:

Sometimes, it is not clear which Level Zero repository is the right one for our needs. In this post, we will explain each of the Level Zero public resources and what they are intended to be.

SPIR-V

Timers

Toolkit

TornadoVM

Running TornadoVM on NVIDIA Jetson Nano

7 minute read

Published:

Did you know that TornadoVM can also run on ARM-based systems with NVIDIA GPUs? In this post, we will show how TornadoVM can be used on an NVIDIA Jetson Nano, a small, powerful computer designed for embedded artificial intelligence (AI) and machine learning (ML) applications.

Enabling Transparent Acceleration of Big Data Frameworks Using Heterogeneous Hardware

21 minute read

Published:

Exploiting heterogeneous hardware for Big Data workloads is usually done by introducing new APIs, resulting in more complex programs to develop, understand, and maintain. But, what if we do not change/extend the original programming model? Is it possible? This post discusses a new approach to do so.

Unified Memory

Unified Shared Memory

WSL

Windows 11

Windows WSL

level-zero

oneAPI

Exploring Level Zero resources: repositories and purpose

2 minute read

Published:

Sometimes, it is not clear which Level Zero repository is the right one for our needs. In this post, we will explain each of the Level Zero public resources and what they are intended to be.