Systems-level contributor focused on PyTorch, AMD ROCm on Windows, and ML infrastructure.
Auto-updated via GitHub Actions
Systems-level contributor focused on PyTorch, AMD ROCm on Windows, and ML infrastructure.
Auto-updated via GitHub Actions
Forked from facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction. - Windows ROCm Support Fork for RDNA (CK-less)
Python 3
Forked from pytorch/ao
PyTorch native quantization and sparsity for training and inference - Windows ROCm build support via hipBLASLt
Python
Forked from guinmoon/bitsandbytes_win_rocm
Accessible large language models via k-bit quantization for PyTorch - Windows ROCm Support Fork
Forked from Comfy-Org/comfy-kitchen
Fast kernel library for Diffusion inference with multiple compute backends. - Windows ROCm Backend Fork (primarily for RDNA4)
Python 1
ComfyUI custom node to patch default attention with Flash Attention 2
ComfyUI custom node to patch the default attention in VAE to a specific implementation
Python 2