EdgeSync-LLM – KV cache fragment engine for on-device LLM inference (Go/Android)

(github.com)

2 points | by bossandboss 4 hours ago

1 comment