Reddit r/StableDiffusion via Reddit May 18th 2026

AMD RX 9060 XT Confirmed Viable for LoRA Training

AMD AMD RDNA4 LoRA fine-tuning Linux AI consumer GPU ROCm open-source AI

Key insights

AMD RX 9060 XT (RDNA4 / gfx1200) has been confirmed working for LoRA training under native Linux via ROCm.
This is among the first publicly documented AI training runs on RDNA4 architecture, filling a practical gap in community knowledge.
ROCm compatibility with gfx1200 was the key unresolved variable; this post provides a reproducible workflow for other practitioners.

Why this matters

RDNA4 cards have been on the market without clear community validation for AI training, leaving AMD-hardware users unable to confidently invest in the architecture for fine-tuning workloads. This result gives ML practitioners a concrete, reproducible reference point before purchasing or deploying RX 9060 XT cards for local model training. It also signals that AMD's ROCm stack is keeping pace with new GPU families fast enough to be practically useful, which matters for anyone evaluating whether AMD can serve as a credible alternative to Nvidia in the local and hobbyist AI training market.

Summary

AMD's new RX 9060 XT (RDNA4 / gfx1200) has been confirmed working for Portrait LoRA fine-tuning on native Linux, with a community member posting one of the first documented training runs on the architecture. The significance is in what was unresolved until now: RDNA4 cards have been available since launch but largely untested for AI training workloads, with ROCm compatibility for gfx1200 remaining a practical unknown for most ML practitioners. This post walks through the ROCm setup and workflow steps that make it work, closing the gap between hardware availability and usable documentation. Essentially: (AMD, ROCm) now have community-validated signal that RDNA4 is a trainable GPU family for local fine-tuning, not just inference. - The RX 9060 XT runs on the gfx1200 architecture, which required specific ROCm compatibility verification before training workflows could be trusted. - Portrait LoRA training is a meaningful benchmark because it is memory and compute-intensive, giving practitioners a realistic proxy for other fine-tuning workloads. - Native Linux support, not a container workaround, is what makes this result portable and reproducible for the broader AMD user base. For the significant slice of the open-source ML community that builds on AMD hardware to avoid Nvidia pricing, this is the confirmation they have been waiting for before committing to RDNA4 cards for production fine-tuning.

Potential risks and opportunities

Risks

If ROCm stability on gfx1200 is limited to narrow workloads like portrait LoRA, practitioners who invest in RX 9060 XT hardware for broader fine-tuning pipelines may encounter blocking incompatibilities with no near-term fix from AMD.
Community-validated workflows built on a single undocumented ROCm configuration could fragment quickly as AMD releases new ROCm versions, forcing repeated re-validation for each gfx1200 training setup.
AMD's developer relations and ROCm support teams face heightened community expectations now that RDNA4 training is confirmed possible, with any regression in future ROCm releases likely to generate significant negative signal among open-source ML practitioners.

Opportunities

AMD has a concrete community success story to amplify through its ROCm developer program, potentially accelerating RDNA4 adoption among hobbyist and indie ML practitioners who have been on the fence.
Training framework maintainers (kohya-ss, OneTrainer, Axolotl) could attract AMD users by officially documenting and testing gfx1200 support, given that demand signal is now established.
Local AI hardware resellers and system integrators targeting the open-source fine-tuning market can now position RX 9060 XT as a validated training card, opening a market segment previously locked to Nvidia by lack of documentation.

What we don't know yet

Training throughput and iteration time on the RX 9060 XT versus comparable Nvidia cards (RTX 4070, RTX 5070) were not benchmarked in the post.
Whether larger fine-tuning workloads (full SDXL or Flux training, not just portrait LoRA) are stable on gfx1200 under ROCm remains untested as of this report.
The specific ROCm version and any required patches or workarounds used in this run were not fully detailed, leaving reproducibility partially dependent on follow-up community documentation.

Originally reported by Reddit r/StableDiffusion

Read the original article →

Original headline: Training a Portrait LoRA on AMD RX 9060 XT (RDNA4 / gfx1200) on Native Linux