Z80-μLM is a 'conversational AI' that generates short character-by-character sequences, with quantization-aware training (QAT) to run on a Z80 processor with 64kb of ram. The root behind this project ...
Qwen3.5 comes in an open-weight and hosted API version, with the company advertising improvements in performance and costs from previous versions. Qwen3.5 supports new agentic capabilities and is ...
Restart your editor — done. Your AI assistant can now use local Ollama models.