NVIDIA DLFW Inspect
Free · 19 GitHub stars
Agent Tool · Agnostic · File System
Overview
NVIDIA DLFW Inspect is a tool for debugging convergence issues and testing new algorithms when training large language models. It is aimed at developers and researchers working with NVIDIA libraries such as Transformer Engine, Megatron-LM, and NeMo.