db0, 10 months ago
Yes, it’s using the CLIP image2text model.