Cradawx@alien.topB to LocalLLaMAEnglish · 2 years agoShareGPT4V - New multi-modal model, improves on LLaVAsharegpt4v.github.ioexternal-linkmessage-square17linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkShareGPT4V - New multi-modal model, improves on LLaVAsharegpt4v.github.ioCradawx@alien.topB to LocalLLaMAEnglish · 2 years agomessage-square17linkfedilink
minus-squareGeraltOfRiga@alien.topBlinkfedilinkEnglisharrow-up1·2 years agoThis is kinda nuts (first time I try a LLM + vision) Tried with a first person shooter screenshot, enemy on screen. Asked to give me the 2D coordinates of the enemy and it did, precisely.
This is kinda nuts (first time I try a LLM + vision)
Tried with a first person shooter screenshot, enemy on screen. Asked to give me the 2D coordinates of the enemy and it did, precisely.