Need help: can't run Wan2.1 on 12GB VRAM

I'm having issues running anything heavier than image generation in ComfyUI. I'm a bit new to this stuff and was hoping for some guidance. Specifically, I'm having trouble running video generators like Hunyuan and now Wan2.1.

I followed a couple of guides and posts for setting up Wan2.1 and can get text-to-video working, but I have to crank the resolution all the way down to something tiny like 320x240. Otherwise I get Out of Memory errors during the KSampler step. The error pop-ups say PyTorch is already using 10+ GB of VRAM on my 12GB card, so KSampler can't allocate the memory it needs.

For reference, my machine has an i7-14700K, 64GB of system RAM, and an RX 7700 XT with 12GB of VRAM.

I've tried using --lowvram and --disable-smart-memory to no effect.

I've tried a workflow or two from Civitai that claimed to have Hunyuan and Flow working on 12GB or less of VRAM, but no luck.

I've followed a few of the guides from the ComfyUI Examples GitHub pages. One blog post suggested setting VAE Decode to tiled, but I don't even make it to that step before getting OOM :(

I'll admit I'm a bit lost and not sure where to start looking. I've seen quite a few posts about Wan2.1 and Hunyuan (and Flow) running on 12GB of VRAM or less, but I can't seem to figure it out. Any suggestions, or pointers to reading material that might help me?