GVProf
stable

GVProf Basics

  • Preface
  • Install
  • Manual
  • FAQ

GVProf Development

  • Workflow
  • Roadmap

GVProf Samples

  • Unit Tests
  • Rodinia GPU Benchmark
  • QMCPACK
  • Castro
  • Deepwave
  • Darknet
  • PyTorch
  • NAMD
  • BarraCUDA
GVProf
  • »
  • GVProf: A Value Profiler for GPUs
  • Edit on GitHub

GVProf: A Value Profiler for GPUs

GVProf is an advanced value profiler that locates value redundancy problems in GPU-accelerated applications. GVProf’s code is available on Github.

GVProf Basics

  • Preface
    • HPCToolkit (Profiling Runtime)
    • Redshow
    • GPU Patch
    • Program Analyzer and Aggregator
  • Install
    • GPU Patch
    • Dependencies
    • Redshow
    • HPCToolkit
    • Setup and Test
  • Manual
    • Compile with Line Information
    • Profile Using GVProf
    • Profile Using HPCToolkit
      • First pass
      • Second pass
      • HPCToolkit separate pass
    • Control Knobs
    • Interpret Profile Data
      • Calling context view
      • Data flow view
      • Fine grain pattern views
    • Example
  • FAQ
    • Profile Python applications
    • Accelerate data flow profiling
    • Accelerate value pattern profiling

GVProf Development

  • Workflow
  • Roadmap

GVProf Samples

  • Unit Tests
    • interval_merge
    • op_graph_simple
    • op_pattern_simple
    • stress
    • vectorAdd
  • Rodinia GPU Benchmark
    • backprop
    • bfs
    • cfd
    • hotspot
    • hotspot3D
    • huffman
    • lavaMD
    • pathfinder
    • srad
    • streamcluster
  • QMCPACK
    • Introduction
    • Profiling
    • Optimization
  • Castro
    • Introduction
    • Profiling
    • Optimization
  • Deepwave
    • Introduction
    • Profiling
    • Optimization
  • Darknet
    • Introduction
    • Profiling
    • Optimization
  • PyTorch
    • Introduction
    • Profiling
    • Optimization
  • NAMD
    • Introduction
    • Profiling
    • Optimization
  • BarraCUDA
    • Introduction
    • Profiling
    • Optimizations

Indices and Tables

  • Index

  • Search Page

Next

© Copyright 2021, Rice University and Scalable Machines Research. Revision 28f0707f.

Built with Sphinx using a theme provided by Read the Docs.