Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation

(github.com)

13 points | by neehao 2 days ago

No comments yet.