NVIDIA/Megatron-Bridge/perf-cuda-graphs
Validate and use CUDA graph capture in Megatron Bridge, including local full-iteration graphs and Transformer Engine scoped graphs for attention, MLP, and MoE modules.
How to get this skill
Agent Skill by NVIDIA. Download or clone it, then install it in your agent.
Setup & Installation
- Clone the repository:
git clone https://github.com/NVIDIA/skills.git - Copy the skill folder (which contains
SKILL.md) into your agent skills folder, e.g..claude/skills/. - Restart or reload the agent to auto-discover the skill.
- Check
SKILL.mdfor any special instructions or requirements.
Related skills
NVIDIA/Megatron-Bridge/adding-model-support
Guide for adding support for new LLM or VLM models in Megatron-Bridge.
NVIDIA/Megatron-Bridge/build-and-dependency
Dev environment setup for Megatron Bridge — container-based development, uv package management, lockfile regeneration, adding dependencies, Slurm container usage, and common build...
NVIDIA/Megatron-Bridge/bump-dependency
Bump a pinned dependency (TransformerEngine, Megatron-LM, NRX, etc.), regenerate the lockfile, open a PR, and drive it to green by attaching a watchdog to the "CICD NeMo" workflow...
NVIDIA/Megatron-Bridge/cicd
CI/CD reference for Megatron Bridge — pipeline structure, commit and PR workflow, CI failure investigation, and common failure patterns.