TIPSv2: Advancing Vision-Language Pretraining with Enhanced Patch-Text Alignment

(gdm-tipsv2.github.io)

21 points | by gmays 13 hours ago ago

1 comments