ComfyUI Node for moondream1, a 1.6B parameter visual language model
Thin wrapper custom node for moondream.
moondream1 is a tiny (1.6B parameter) vision language model that performs on par with models twice its size. It is trained on the LLaVa training dataset, and initialized with SigLIP as the vision tower and Phi-1.5 as the text encoder.
Comments
None