Given how quickly things evolve, it’s easy to get lost in the numerous offerings and hard to get the best deal. So, what do you use? Both clients/harnesses and LLM providers or local setups would be interesting.
Personally, I’ve been using opencode with Github copilot for work. I’m currently looking for cost-effective provider for personal work. Maybe openrouter with one of the cheap models?


How do you get your LLM credits? Or do you run Gemma and Qwen locally? With which hardware?
For Claude I have the lowest tier subscription through work. I also have openrouter to use occasionally when I need it. Gemma and Qwen I run locally on a strix halo framework desktop I bought just before ram prices went to the moon.
So, CPU only?
No, Strix halo is AMD’s integrated CPU GPU using unified ram. Its the non apple tax equivalent of the mac minis people have been using to run local models. On one hand its a bit slower as it has lower memory bandwidth and that’s the limiting factor, but on the other its less than half the price for more memory and runs linux rather than osx.