Resources

We have collected presentations from IXPUG workshops, annual meetings, and BOF sessions, and made them accessible here to view or download. You may search by event, keyword, science domain or author’s name. The database will be updated as new talks are made available.

  • CategoriesClear All
    • Toggle ImageToggle Image
    • Toggle ImageToggle Image
    • Toggle ImageToggle Image
    • Toggle ImageToggle Image
    • Toggle ImageToggle Image
    • Toggle ImageToggle Image
    • Toggle ImageToggle Image
    • Toggle ImageToggle Image
    • Toggle ImageToggle Image

Search ResultShowing 1 - 10 of 497 Results

IXPUG Mid-Year Workshop 2021 Jul 15, 2021

Artificial Intelligence-Enabled Multi-Scale Simulations for COVID-19 Drug Discovery, IXPUG Mid-Year Workshop 2021 keynote presentation by Arvind Ramanathan, Argonne National Lab

Keyword(s): COVID-19,,AI/ML,RNA processing,accelerated simulations,statistical inference,DeepDrive MD,weighted ensemble simulations,heterogenous hardware,Stream-AI-MD,artificial intelligence

Author(s): Arvind Ramanathan
Video(s):
Read more | |
IXPUG Mid-Year Workshop 2021 Jun 21, 2021

SPMD / SIMD on GPUs

Keyword(s): SPMD,SIMD on GPUs

Author(s): Patrick Steinbrecher
Video(s):
Read more | |
IXPUG Mid-Year Workshop 2021 Jun 21, 2021

I have been collecting things which aren't taught in our SYCL (DPC++) book, and you won't get in a full day tutorial on SYCL (at least as taught thus far). In 30 minutes, I will share the collection as it stands today - and I will explain why these matter, and how they underscore the value of SYCL over CUDA because many of the nuances are new to us all because they help SYCL address XPUs (more than 'just GPUs') and they address the needs of being open for multivendor support.

Keyword(s): SYCL,DPC++,nuances

Author(s): James Reinders
Video(s):
Read more | |
IXPUG Mid-Year Workshop 2021 Jun 21, 2021

The high-performance computing (HPC) ecosystem increasingly supports heterogeneous architectures and customization. Field programmable gate arrays (FPGA) are among the options being considered due to their ability to both adapt to individual workloads and serve as prototype vehicles for application-specific accelerators. However, adoption has been limited due to the difficulty in programming these devices. To mitigate this, vendors are introducing frameworks based on embedded domain specific languages (eDSLs), such as SYCL. This work takes the first step in evaluating one of these new DSLs, DPC++, using DOE proxy applications to identify programmability gaps and performance on Intel FPGAs. Initial testing is being done with the MiniAMR application from the Mantevo suite, focusing on the 7-point stencil.

Keyword(s): FPGA Evaluation,oneAPI,Mantevo,SYCL,DPC++

Author(s): Nicholas Miller
Video(s):
Read more | |
IXPUG Mid-Year Workshop 2021 Jun 21, 2021

This lightning talk presents our work on parallelizing the epidemiological simulation model Episim. Episim combines a large-scale person-centric human mobility model with a mechanistic model of infection and a person-centric disease progression model. Movements of persons, including the intervening activities in which they may interact with others, can be directly taken from data. Episim can be used to evaluate different intervention strategies, such as closing educational facilities, reducing other out-of-home activities, wearing masks, or contact tracing. In general, the parallelization of agent-based models is challenging. Especially when the underlying social network is irregular and dynamic, they require frequent communication between the processing elements. In Episim, however, we were able to take advantage of the fact that people are not contagious on the same day they become infected, and therefore immediate health synchronization is not required.

Keyword(s): Epidemics,Agent-based modeling,Parallel programming

Author(s): Steffen Fuerst
Video(s):
Read more | |
IXPUG Mid-Year Workshop 2021 Jun 21, 2021

With discrete Intel GPUs entering the high performance computing landscape, there is an urgent need for production-ready software stacksfor these platforms. In this paper, we report how we prepare the Ginkgo math library for Intel GPUs by developing a kernel backed based on the DPC++ programming environment. We discuss conceptual differences to the CUDA and HIP programming models and describe workflows for simplified code conversion. We benchmark advanced sparse linear algebra routines utilizing the converted kernels to assess the efficiency of the DPC++ backend in the hardware-specific performance bounds, and compare the performance of basic building blocks against routines providing the same functionality that ship with Intel's oneMKL vendor library.

Keyword(s): oneAPI,Intel GPUs,Ginkgo,Math library,Ginkgo Math library,SpMV

Author(s): Terry Cojean
Video(s):
Read more | |
IXPUG Mid-Year Workshop 2021 Jun 21, 2021

This paper explores the process and results of converting an existing Fortran kernel to a Data Parallel C++ (DPC++) kernel with offload functionality while maintaining the main driver routine as Fortran. Obstacles and chosen solutions are discussed with intent to provide guidance to future developers. Initial performance comparisons are made between the original Fortran code, Fortran with OpenMP* offload, and DPC++ offload.

Keyword(s): DPC++,Mixed Language,Fortran,Heterogeneous Compute

Author(s): James Tullos, Christoph Bauinger
Video(s):
Read more | |
IXPUG Webinar Series Mar 15, 2021

The trifecta of high volumes of data, abundant compute availability on cloud and on-premise, and rapid algorithmic innovations enable data scientists and AI researchers to do fast experiments, prototyping, and model development at an accelerated pace that was never possible before. In this talk, we will touch upon a variety of software packages, libraries, and tools that can also help HPC practitioners push the envelope of applying AI in their application domains and simulations at-scale. We will cover examples and talk about how to create efficient end-to-end AI pipelines with large data sets in-memory, security, and other features through Intel-optimized software packages such as Intel® Distribution of Python, Intel® Optimized Modin, Intel® Optimized Sklearn, and XGBoost, as well as DL Frameworks such as Intel® Optimized Tensorflow and Intel® Optimized PyTorch tuned and enabled with new hardware features and instructions every new CPU generation.

Keyword(s): oneAPI,Modin,Intel® AI Analytics Toolkit,Intel® Distribution of Modin,Scikit-learn,XGBoost,Machine Learning,Census,PLAsTiCC,SigOpt

Author(s): Meena Arunachalam, Vrushabh Sanghavi
Video(s): Performance Optimizations for End-to-End AI Pipelines
Read more | |
IXPUG Webinar Series Mar 08, 2021

In this webinar we will demonstrate how an existing CUDA stencil application code can be migrated to DPC++ with the help of the Compatibility Tool. We will highlight and discuss the crucial differences between the two programming environments in the context of migrating the tsunami simulation easyWave. The discussion also includes steps for making the code to compliant with the SYCL standard. During the talk, we will also show that the migrated code can run on a wide range of platforms starting from CPUs, over GPUs, to FPGAs.

Keyword(s): oneAPI,SYCL,easyWave,heterogeneous architectures,Data Parallel C++,stencil kernels,Compatibility Tool,Unified Shared Memory,DPC++

Author(s): Marius Knaust, Steffen Christgau
Video(s): Migrating from CUDA-only to Multi-Platform DPC++
Read more | |
IXPUG Workshop at HPC Asia 2021 Feb 08, 2021

HPC industry is undergoing a seismic shift and growth due to global Exascale initiatives, emergence of AI and accelerated migration of workloads to the Cloud. At the same time, increasing demands for high-performance data analytics and computational workloads have resulted in expanding ecosystems of diverse general purpose processors and accelerator technologies. In this talk, we discuss how Intel is addressing the needs of the HPC community with a comprehensive portfolio of products and technologies that are built on top of an open, scalable and standards-based ecosystem in order for the community to advance HPC together.

Keyword(s): Exascale,Cloud,XPU,Heterogeneous Acceleration,oneAPI,DevCloud

Author(s): John K. Lee
Video(s): Keynote Address: Advancing HPC Together
Read more | |