NVIDIA/TensorRT-LLM/ad-model-onboard


> Translates a HuggingFace model into a prefill-only AutoDeploy custom model using reference custom ops, then validates it with hierarchical equivalence tests.

How to get this skill

Agent Skill by NVIDIA. Download or clone it, then install it in your agent.

Publisher: NVIDIA

Setup & Installation

1. Clone the repository: `git clone https://github.com/NVIDIA/skills.git`
2. Copy the skill folder (which contains `SKILL.md`) into your agent skills folder, e.g. `.claude/skills/`.
3. Restart or reload the agent to auto-discover the skill.
4. Check `SKILL.md` for any special instructions or requirements.
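
The copy step above can be sketched as a small shell helper. This is a minimal sketch, not an official installer: the function name `install_skill` is hypothetical, and the location of the skill folder inside the cloned repository is an assumption you should confirm by browsing the checkout.

```shell
#!/usr/bin/env bash

# Hypothetical helper (not part of the NVIDIA repository):
# copy a skill folder into the agent's skills directory.
install_skill() {
  local src="${1%/}"   # folder that contains SKILL.md (trailing slash stripped)
  local dest="${2%/}"  # agent skills folder, e.g. .claude/skills

  # Refuse to copy a folder that is not a skill.
  if [ ! -f "$src/SKILL.md" ]; then
    echo "error: $src does not contain SKILL.md" >&2
    return 1
  fi

  mkdir -p "$dest"
  cp -R "$src" "$dest/"
  echo "installed $(basename "$src") into $dest"
}

# Example usage (the path inside the clone is a placeholder, not a verified path):
#   git clone https://github.com/NVIDIA/skills.git
#   install_skill path/to/ad-model-onboard .claude/skills
```

Checking for `SKILL.md` before copying guards against pointing the helper at a parent directory by mistake. After it runs, restart or reload the agent so the skill is discovered.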


## NVIDIA/TensorRT-LLM/ad-model-onboard

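Use this skill when you need an AI agent to follow a repeatable workflow for onboarding a HuggingFace model into AutoDeploy as a prefill-only custom model and validating it with hierarchical equivalence tests.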

1. Review the source or repository details.
2. Copy the skill folder that contains `SKILL.md` into your agent skills directory.
3. Restart or reload the agent so it can discover the skill.
4. Test the skill on a small task before using it in production.

Related skills

TensorRT-LLM

NVIDIA/TensorRT-LLM/ad-accuracy-debug

> Debug AutoDeploy accuracy regressions vs a reference score (PyTorch backend or published baseline).


TensorRT-LLM

NVIDIA/TensorRT-LLM/ad-add-fusion-transformation

> Claude Code skill (trtllm-agent-toolkit): implement or extend TensorRT-LLM AutoDeploy fusion transforms under transform/library/ in a TensorRT-LLM checkout.


TensorRT-LLM

NVIDIA/TensorRT-LLM/ad-conf-check

> Check whether AutoDeploy YAML configs were actually applied by analyzing server logs and optionally graph dumps (AD_DUMP_GRAPHS_DIR).


TensorRT-LLM

NVIDIA/TensorRT-LLM/ad-graph-dump

> Enable and interpret TensorRT-LLM AutoDeploy FX graph text dumps via AD_DUMP_GRAPHS_DIR.
