CS Events Monthly View

Qualifying Exam

Observations and Opportunities in Architecting Shared Virtual Memory for Heterogeneous Systems

 

Download as iCal file

Wednesday, November 09, 2016, 10:15am

 

Computing is becoming increasingly heterogeneous with accelerators like GPUs being tightly integrated with CPUs on the same die. Extending the CPU’s virtual addressing mechanism to these accelerators is a key step in making accelerators easily programmable. In this work, we analyze, using real-system measurements, shared virtual memory across the CPU and an integrated GPU. We make several key observations and highlight consequent research
opportunities: (1) servicing a TLB miss from the GPU can be an order of magnitude slower than that from the CPU and consequently it is imperative to enable many concurrent TLB misses to hide this larger latency; (2) divergence in memory accesses impacts the GPU’s address translation more than the rest of the memory hierarchy, and research in designing address translation mechanisms tolerant to this effect is imperative; and (3) page faults from the GPU are considerably slower than that from the CPU and software-hardware co-design is essential for efficient implementation of page faults from throughput-oriented accelerators like GPUs. We present a detailed measurement study of a commercially available integrated APU that illustrates these effects and motivates future research opportunities.

Speaker: Jan Vesely

Bio

NULL

Location : CoRE B (305)

Committee

Prof. Abhishek Bhattacharjee (chair), Prof. Thu Nguyen, Prof. Ulrich Kremer, and Prof. Konstantinos Michmizos

Event Type: Qualifying Exam

Organization

Dept. of Computer Science