IISWC-2011

November 6-8, 2011

 Austin, TX, USA


  • Tutorial I:   (Half-day, morning of November 6)   

    • Heterogeneous Computing with GPU Ocelot (by Georgia Tech )

  •  Tutorial II:  (Full Day, November 6) 

    • Energy Efficient Datacenters and Systems (by IBM Research - Austin)

Tutorial Descriptions

Tutorial I: Heterogeneous Computing with GPU Ocelot

 

  GPU Ocelot is an open-source dynamic JIT compilation framework for GPU compute applications targeting a > range of GPU and non-GPU execution targets. Ocelot supports CUDA applications and provides an implementation of the CUDA Runtime API enabling seamless integration with existing CUDA applications. Its JIT compiler supports four backend execution targets - (1) an emulator that implements NVIDIAs Parallel Thread Execution (PTX) instruction set architecture, (2) NVIDIA, (3) AMD GPUs, and (4) a translator to LLVM for efficient execution of GPU kernels on multicore CPUs. A back-end targeting vector cores is in development. Existing CUDA applications are seamlessly supported. Ocelot facilitates research and development on several fronts. First, Ocelot improves developer productivity of GPU compute applications by providing an infrastructure for building event trace analyzers using the emulator. This tutorial will describe how analyzers can be built and cover several existing analyzers for (1) correctness checks, (2) debugging and (3) performance tuning. Second, as a JIT compiler infrastructure, Ocelot provides facilities for compiler research including interfaces to an internal representation of PTX programs in support of optimization passes for massively data parallel computer kernels. Third, with an open source re-implementation of the CUDA runtime, Ocelot enables research into scheduling, resource allocation, and operating systems. Finally, Ocelot enables research in heterogeneous architectures via trace generation interfaces, PTX emulation and support for detailed workload characterization on GPU and CPU devices.

 

 

Tutorial II: Energy Efficient Datacenters and Systems

 

This tutorial will discuss the chief characteristics of power and cooling problems of modern data centers and the servers that they support. The tutorial will present an in-depth survey of the problems associated with data center and system energy consumption and substantial material on existing solutions to these problems. It also will cover emerging industrial solutions, including how they address different aspects of the problem and what technical ideas they exploit. Finally, the tutorial will identify key remaining challenges, opportunities for future research in the area, and a survey of the methodologies available for this research. Students and researchers interested in learning more about the challenges of designing energy-efficient data centers and systems, existing solutions to these challenges, and remaining open research problems. No prior knowledge of power management will be required.

Outline of Tutorial II (Energy-Efficient Datacenters and Systems)

  • Fundamentals of Data Center and Server-Systems Power Management (~2 hr)
    • What is the problem? (peak power, average power, operating costs, etc.)
    • Relevant metrics (SPECpower, Green500, PUE, etc.)
    • Solution categories (energy-proportional computing, consolidation, workload-optimized systems, etc.)
    • Data center facilities (power and cooling components)
    • Server power management (poltage scaling, DRAM power modes, power supply trends, etc.)
    • Industry standards and government regulation
  • Special Topics (~4 hr)
    • Storage and Data Center Networking
      • Where is the power consumed in storage servers?
      • New storage technologies (flash, PCM)
      • Techniques for saving storage energy and power with existing hardware (tiering, caching, spin-down, etc.)
      • Emerging industry storage solutions
      • Data Center Networking
        • Where is the power consumed? (optical, copper, etc.)
        • Energy-Efficient Ethernet
        • Review research studies on shutting down links and switches
    • Cloud computing: virtualized resources and energy management
      • How does it differ from a traditional Enterprise or HPC/Scientific computing environment?
      • How does it enable energy-efficient computing?
      • Challenges to address (scaling, energy accounting, benchmarks, etc.)
      • Review research studies in virtualized/cloud environments
    • Energy-efficient software
      • Sources of inefficiencies in software - some results on impact
      • Energy-aware systems software
      • Energy-efficiency in the middle-ware (JVM, databases, etc.)
      • Workload management systems (HPC runtimes, etc.)
    • Modeling for energy-efficient data centers
      • State of the art in data center modeling
      • Recent practices (enclosed aisle, container DC, free cooling)
      • Predicted near-term DC trends (modularity, scalability, wimpy nodes, etc.)
      • Future research topics (model integration, off-line vs. real-time modeling)
    • Integrating IT and facilities management
      • Thermal balancing
      • Hierarchical power capping across DC
      • Optimizing energy cost across DC
      • Joint thermal, power and performance co-optimization
    • Reliability-aware power management
      • Data centers: redundancy, branch circuit oversubscription, branch circuit to machine mapping..
      • Systems: power supply oversubscription, over-current protection, fail-in-place regulator designs
      • Devices: power on/off cycles for disks, temperature sensing in disks
      • Microprocessors: removing circuit timing guardband, thermal cycling impacts on packaging, etc.
  • Research in emerging technologies and solutions (~1 hr)
    • Storage-class memory (alternatives, energy-efficiency implications and challenges)
    • Workload-optimized computing (accelerators, low-power processor clusters, etc.)
    • New cooling/packaging/power delivery technologies (3D, micro-channel cooling, etc.)
    • Facilities innovations (energy-cycle management, immersion cooling, dynamic power distribution)