local-inference — AI Digest

11 июн Google Releases DiffusionGemma: 26B Open Model with 4× Faster Text Generation Google DeepMind models-llm
22 июн llama.cpp b9754: Real-Time Model Load Progress via SSE and PEG Grammar Parser tools