Fork of city96/ComfyUI-GGUF — MPS-focused

This repository is a fork of the original https://github.com/city96/ComfyUI-GGUF with changes primarily targeting Apple Silicon (MPS) environments.

Key change in this fork:

Added a load_device option to Advanced loaders. Use it if you explicitly want to CHANGE the device. E.g. you know model is loaded to cpu but you want it to use mps. It still may increase memory usage.
Added a stream_from_disk option to Advanced loaders to keep GGUF weights memory-mapped and dequantize/page-in on demand to reduce peak RAM usage. In order to work changes to ComfyUI need to be applied: https://github.com/Comfy-Org/ComfyUI/pull/14622/changes. Alternatively you may download fork https://github.com/FromFirstByte/ComfyUI or add changes manually to your installation of ComfyUI (it's only one line).

After changes applied start ComfyUI in low VRAM mode:

python3 main.py --lowvram

Basic workflow could be found in workflows folder.

ComfyUI-GGUF

GGUF Quantization support for native ComfyUI models

This is currently very much WIP. These custom nodes provide support for model files stored in the GGUF format popularized by llama.cpp.

While quantization wasn't feasible for regular UNET models (conv2d), transformer/DiT models such as flux seem less affected by quantization. This allows running it in much lower bits per weight variable bitrate quants on low-end GPUs. For further VRAM savings, a node to load a quantized version of the T5 text encoder is also included.

Note: The "Force/Set CLIP Device" is NOT part of this node pack. Do not install it if you only have one GPU. Do not set it to cuda:0 then complain about OOM errors if you do not undestand what it is for. There is not need to copy the workflow above, just use your own workflow and replace the stock "Load Diffusion Model" with the "Unet Loader (GGUF)" node.

Installation

Important

Make sure your ComfyUI is on a recent-enough version to support custom ops when loading the UNET-only.

To install the custom node normally, git clone this repository into your custom nodes folder (ComfyUI/custom_nodes) and install the only dependency for inference (pip install --upgrade gguf)

git clone https://github.com/city96/ComfyUI-GGUF

To install the custom node on a standalone ComfyUI release, open a CMD inside the "ComfyUI_windows_portable" folder (where your run_nvidia_gpu.bat file is) and use the following commands:

git clone https://github.com/city96/ComfyUI-GGUF ComfyUI/custom_nodes/ComfyUI-GGUF
.\python_embeded\python.exe -s -m pip install -r .\ComfyUI\custom_nodes\ComfyUI-GGUF\requirements.txt

On MacOS sequoia, torch 2.4.1 seems to be required, as 2.6.X nightly versions cause a "M1 buffer is not large enough" error. See this issue for more information/workarounds.

Usage

Simply use the GGUF Unet loader found under the bootleg category. Place the .gguf model files in your ComfyUI/models/unet folder.

LoRA loading is experimental but it should work with just the built-in LoRA loader node(s).

Pre-quantized models:

Initial support for quantizing T5 has also been added recently, these can be used using the various *CLIPLoader (gguf) nodes which can be used inplace of the regular ones. For the CLIP model, use whatever model you were using before for CLIP. The loader can handle both types of files - gguf and regular safetensors/bin.

t5_v1.1-xxl GGUF

See the instructions in the tools folder for how to create your own quants.

Name		Name	Last commit message	Last commit date
Latest commit History 197 Commits
tools		tools
workflows		workflows
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
dequant.py		dequant.py
loader.py		loader.py
nodes.py		nodes.py
ops.py		ops.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fork of city96/ComfyUI-GGUF — MPS-focused

ComfyUI-GGUF

Installation

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Fork of city96/ComfyUI-GGUF — MPS-focused

ComfyUI-GGUF

Installation

Usage

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages