🐿️
13

Spent three days trying to get a simple image caption model to work right

I was building a small tool to auto-describe product photos for my online shop, using a basic open-source model. The captions kept coming out weird, like calling a blue shirt 'a sad sky' or a coffee mug 'a ceramic cylinder of despair'. I finally got it working by feeding it 200 of my own labeled images, but man, that took way longer than the 'quick setup' guide said. Has anyone else had to train a model on their own stuff just to make it talk normal?
2 comments

Log in to join the discussion

Log In
2 Comments
spencerw72
spencerw721mo ago
Oh man, totally. I tried using one for my blog photos and it kept calling my dog "a brown floor rug with eyes." I had to make a tiny set of like 50 pictures just labeled "dog" to get it to stop. The guides make it sound like five minutes and you're done, lol.
4
brooke533
brooke5331mo ago
Right, the guides are always way too optimistic.
1