
Talk: RAPIDS - Open GPU Data Science


Last week I was fortunate to speak at the PyData Cardiff meetup.

I presented an overview of RAPIDS, a suite of open source software libraries that lets you execute end-to-end data science and analytics pipelines entirely on GPUs. Much can change between RAPIDS releases, so the latest overview of libraries, benchmarks and updates is consolidated in a release deck. The deck I presented at PyData Cardiff was a variation of the 0.11 release deck, which I remixed for the meetup's audience of mainly data scientists, individual researchers and students.

Abstract

The RAPIDS suite of open source software libraries (https://rapids.ai/) allows you to run data science and analytics pipelines entirely on GPUs while following familiar Python APIs, including NumPy, pandas and scikit-learn.
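To give a rough feel for what "familiar Python APIs" means in practice, here is a minimal sketch that was not part of the talk. It assumes cuDF (the RAPIDS DataFrame library) mirrors the small slice of the pandas API used here, and it falls back to pandas when cuDF is not installed so that it runs on any machine. The function name `mean_reading_per_sensor` and the column names are hypothetical, chosen only for illustration.

```python
# Prefer the RAPIDS GPU DataFrame library when it is installed; otherwise
# fall back to pandas so the sketch still runs without RAPIDS or a GPU.
try:
    import cudf as xdf
    BACKEND = "cudf"
except ImportError:
    import pandas as xdf
    BACKEND = "pandas"


def mean_reading_per_sensor(records):
    """Return {sensor: mean reading} using whichever backend was imported.

    `records` is a dict of equal-length columns with the (hypothetical)
    keys "sensor" and "reading".
    """
    df = xdf.DataFrame(records)
    # The same groupby/mean call is used for both backends.
    means = df.groupby("sensor")["reading"].mean()
    if BACKEND == "cudf":
        # Copy the (small) result from GPU memory back to the host.
        means = means.to_pandas()
    return {key: float(value) for key, value in means.items()}


if __name__ == "__main__":
    data = {"sensor": ["a", "a", "b"], "reading": [1.0, 3.0, 5.0]}
    print(BACKEND, mean_reading_per_sensor(data))
```

The only backend-specific line is the `to_pandas()` call that brings the result back to the host. Everything else is written once against the pandas-style API, which is the point of the design: existing pandas code can often be moved to the GPU by changing the import, although API coverage varies between RAPIDS releases.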

RAPIDS relies on NVIDIA® CUDA® primitives for low-level compute optimization, but exposes that GPU parallelism and high-bandwidth memory speed through user-friendly Python interfaces.

RAPIDS also focuses on common data preparation tasks for analytics and data science. This includes a familiar DataFrame API that integrates with a variety of machine learning algorithms for end-to-end pipeline acceleration without paying typical serialization costs. RAPIDS also supports multi-node, multi-GPU deployments, enabling much faster processing and training on much larger datasets.

Video

Slides

Python, RAPIDS, GPUs, Talk, Public Speaking, Slides, PyData Cardiff