Awamer awamer.ai

NVIDIA/TensorRT-LLM/perf-torch-cuda-graphs

TensorRT-LLM NVIDIA

>- Apply CUDA Graphs to PyTorch workloads — API selection (torch.compile, PyTorch make_graphed_callables, TE make_graphed_callables, MCore CudaGraphManager, FullCudaGraphWrapper, m...

How to get this skill

Agent Skill by NVIDIA. Download or clone it, then install it in your agent.

Setup & Installation

  1. Clone the repository: git clone https://github.com/NVIDIA/skills.git
  2. Copy the skill folder (which contains SKILL.md) into your agent skills folder, e.g. .claude/skills/.
  3. Restart or reload the agent to auto-discover the skill.
  4. Check SKILL.md for any special instructions or requirements.