21 points | by franze 13 hours ago ago
3 comments
This is neat. I wonder… Is there a more comprehensive analysis of how well the Apple Vision framework compares to other multi-modal AI? Would there be any benefit to pre-processing images via Auge before handing them to Claude, GPT?
The ocr example says it recognizes Chinese, but output ignores it - maybe just AI bug in generated examples
Very cool
This is neat. I wonder… Is there a more comprehensive analysis of how well the Apple Vision framework compares to other multi-modal AI? Would there be any benefit to pre-processing images via Auge before handing them to Claude, GPT?
The ocr example says it recognizes Chinese, but output ignores it - maybe just AI bug in generated examples
Very cool