#local-inference 2 items 11 июн Google Releases DiffusionGemma: 26B Open Model with 4× Faster Text Generation Google DeepMind models-llm 22 июн llama.cpp b9754: Real-Time Model Load Progress via SSE and PEG Grammar Parser tools