For the longest time, I’ve been trying to figure out a way to “survive” in this new AI age without having to fork over a ton of money just to keep up. I’ve tried using local models via Ollama, and while they definitely work to a degree, they’re (unsurprisingly) not as good as the big model providers.

The local models tend to

  • Forget what they’re doing
  • Struggle to break larger tasks into smaller ones
  • Lose focus easily
  • Have weaker coding performance
  • Drift over longer sessions

So to improve the reliability of fully local, smaller models (and to keep all my data local and in my own network), I created Coyote.

It’s a local-first, batteries-included command line tool and runtime for building and running LLM workflows locally. It’s model agnostic and supports things like

  • Agents and agent delegation
  • Roles/personas
  • MCP Servers
  • RAG
  • Custom tools
  • Macros
  • Workflow Scripting

A lot of the features it supports are specifically designed to compensate for weaknesses in smaller local models. For example:

  • Auto continuation to keep pushing models to completion instead of stopping halfway through problems
  • Parallel agent delegation so tasks can be split into smaller, focused scopes
  • Workflow-based execution (“If this, do that”) for building more reliable and repeatable automations

It also supports the major cloud providers if you want them (which definitely helped while testing 😄), but my long-term goal is simple:

Get as close as possible to Claude Code-style reliability using fully local models.

I’m always open to feedback, questions, or ideas.

Repo: https://github.com/Dark-Alex-17/coyote

  • Ricky Rigatoni@piefed.zip
    link
    fedilink
    English
    arrow-up
    0
    ·
    23 days ago

    Does it have built-in protections so it doesn’t randomly decide to delete every file it has permissions to?

    • Dark-Alex-17@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      0
      ·
      23 days ago

      Yes it does. By default, any of the execute_command or fs_write/fs_patch/etc. tools all have guards around them that prompt for user confirmation before doing things. They can be disabled via the AUTO_APPROVE environment variable if necessary (like they are when using the sisyphus agent). For bash tools, I’ve included functions that can help do this when you write your own tools. For Python tools, you can use the usual input methods.

      • Ricky Rigatoni@piefed.zip
        link
        fedilink
        English
        arrow-up
        0
        ·
        23 days ago

        As usual, leave it to the random developers on the internet to put more care and thought into something than the multibillion dollar companies.

  • Dark-Alex-17@lemmy.worldOP
    link
    fedilink
    English
    arrow-up
    0
    ·
    edit-2
    22 days ago

    By popular demand, Loki is now Coyote!

    I’ve just released a new version with the full rename.


    Thanks to @minticecream for the suggestion!


    All the docs are updated except for the screenshots and gifs. I’ll update those tomorrow (since they don’t do any harm right now and don’t require another release to be updated).

    For almost a year I’ve been using Loki as the name but it never really satisfied me. Not to mention it’s already the name of an extremely popular logging system by Grafana, which makes Loki hard to discover.

    Got some great feedback and the rename was born! Thanks again everyone who gave suggestions and for all the positive feedback!

    • Meron35@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      ·
      23 days ago

      Ditto. I don’t see how this is different/better from existing harnesses such as Opencode, Pi, and even “commercial” open source offerings such as the CLIs for Codex, Copilot, and Gemini, especially once tricked out with plugins and extensions.

    • Dark-Alex-17@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      0
      ·
      23 days ago

      OpenCode is specific to coding workflows. Loki is built to be a general LLM runtine/workflow engine for any problem domain, not just code. An example use I have for it is a cron job that runs at boot to

      • See if the cause of the reboot was power loss (LLM)
      • If it was, check all services to ensure they’re up and running (tool)
      • If a service isn’t up, then use an LLM to see what happened (LLM)
      • Try out the usual methods for getting that service started (tool + RAG)
      • If none of those work, try figuring out what’s ultimately wrong (LLM)
      • Send me a ntfy notification on my phone to let me know what service isn’t running, and the suspected cause with some context (tool)
    • naught@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      0
      ·
      23 days ago

      Opencode isn’t very fun to set up with local LLMs and I’ve had issues with tool calling, but it’s very doable! That said, OpenCode is my go-to, absolutely love it compared to all alternatives I’ve tried

      • Dark-Alex-17@lemmy.worldOP
        link
        fedilink
        English
        arrow-up
        0
        ·
        23 days ago

        When it comes to writing code, OpenCode is my go-to as well. It’s my ultimate benchmark for how well optimized and reliable I can make local models function in Loki.

        • naught@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          0
          ·
          23 days ago

          Is that included in the token & cost counter? I haven’t really noticed that yet. It’s just the most reliable and best harness i’ve yet used. For context i’ve only otherwise tried claude, gemini, and aider. More if you count non cli apps

          • Evotech@lemmy.world
            link
            fedilink
            English
            arrow-up
            0
            ·
            23 days ago

            Yes of course. It’s all tools and skills and system stuff

            But 10k isn’t much in the grand scope. But it can be a big hurdle if you want to use opencode with local small models

      • alehc@slrpnk.net
        link
        fedilink
        English
        arrow-up
        0
        ·
        23 days ago

        Haven’t configured much beyond this but what’s wrong with ollama launch opencode <model>? Haven’t had an issue yet.

        • naught@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          0
          ·
          23 days ago

          yo… what? i was configuring json files n shit with custom sources. granted i used lmstudio as ollama doesn’t support mlx models or something for mac. This is definitely the easy way if you’re using ollama.

  • Decronym@lemmy.decronym.xyzB
    link
    fedilink
    English
    arrow-up
    0
    ·
    edit-2
    23 days ago

    Acronyms, initialisms, abbreviations, contractions, and other phrases which expand to something larger, that I’ve seen in this thread:

    Fewer Letters More Letters
    DNS Domain Name Service/System
    IP Internet Protocol
    SSL Secure Sockets Layer, for transparent encryption
    TLS Transport Layer Security, supersedes SSL

    3 acronyms in this thread; the most compressed thread commented on today has 16 acronyms.

    [Thread #318 for this comm, first seen 26th May 2026, 19:30] [FAQ] [Full list] [Contact] [Source code]

  • merc@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    0
    ·
    23 days ago

    The demo would be a lot more impressive if the questions you were asking weren’t longer than the extremely simple SQL queries it generates.

  • CIA_chatbot@lemmy.world
    cake
    link
    fedilink
    English
    arrow-up
    0
    ·
    23 days ago

    Just an fyi, Loki is also an extremely popular logging system by Grafana, might want a rename if you don’t want to deal with people not finding your project due to having a larger project named the same thing

    • Dark-Alex-17@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      0
      ·
      23 days ago

      So actually, this was the original purpose of it. But all the help I tried to get on it didn’t really have much interest in doing anything outside of the usual big model providers, so I tried advertising a more general use case to attract more input. I can’t deny that agnostic support for even the big providers is helpful when you’re trying to stay current with the rapid advances in LLMs.

      After that, I kind of gave up on getting feedback on local-first models. So, instead, I just dove in head-first the way I wanted;Trying new things, building new agents to try and rival Claude Code, adding features as I found them useful and necessary to improve that reliability, etc., and iterating. Then, with the most recent release on Friday, I had done so many changes and improvements specifically for local models that I thought I finally had a strong enough tool to maybe pique enough people’s interest to get some feedback and input. 🙂

      Oh, and the config example shows how to add Ollama models here

        • JollyForeheadRidges@lemmy.zip
          link
          fedilink
          English
          arrow-up
          0
          ·
          23 days ago

          Crap. I was just starting to play with Ollama and thought it might be a good balance between running local models and using one of the proprietary services.

          Could you elaborate on what’s happening with them / what to watch out for?

          • MalReynolds@slrpnk.net
            link
            fedilink
            English
            arrow-up
            0
            ·
            23 days ago

            If it gets you started with local models, by all means go ahead, their onboarding is the easiest and it works. Also a lot of 3rd party stuff uses it as a first class citizen allowing you to try out other things (e.g. Open WebUI) easily as you explore what’s possible. Currently try the Qwen 3.6 and Gemma4 models as best bang for buck, somewhere there’s a does it fit in my machine website that can help (search for it).

            That said, basically all roads in local LLM lead to llama.cpp, which gets the innovations first and then others copy their homework. Ollama (looks like they’re angling to go commercial) for a long time used it internally without attribution, now they use a bodged up engine of their own that is less performant and almost certainly a copy (possibly vibe coded) of llama.cpp. They heavily encourage using their own models / quantizations and don’t let you play with a lot of parameters without a lot of friction (possibly because they’re not implemented yet, but who knows, low transparency). You get the picture, wannabe techbros. That’s off the top of my head, search for more authoritative sources.

            After you’ve gotten the hang of things, have a look at llama-swap which just wraps llama.cpp, lemonade if you’re on AMD, vLLM for nvidia, LM Studio for mac.

        • Dark-Alex-17@lemmy.worldOP
          link
          fedilink
          English
          arrow-up
          0
          ·
          23 days ago

          Looking at Llama-swap, since it says it supports OpenAI-compatible API, it should just work natively already. Just set up the client to be type: openai-compatible and fill in the URL and provide the models. Should work out of the box!

          • MalReynolds@slrpnk.net
            link
            fedilink
            English
            arrow-up
            0
            ·
            23 days ago

            Hope so, bet it doesn’t without some tweaking though, OpenAI-compatible seldom is, and ollama is bad for that. Still, worth checking out, I’ll have a go at it sometime soonish and perhaps you’ll see a PR (or some doco in the best case scenario).

            • Dark-Alex-17@lemmy.worldOP
              link
              fedilink
              English
              arrow-up
              0
              ·
              23 days ago

              Looking forward to it! Heads up in case you missed it: I had settled on renaming it to Coyote, so sometime this week will be a breaking change and release to get that done.

              Biggest pains are just going to be updating the repo tokens for Crates.io and renaming the homebrew repo.

    • Dark-Alex-17@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      0
      ·
      23 days ago

      Yeah… 😅 I originally named it Loki because, well…if you leave LLMs unsupervised they just create mischief. Any ideas of a good rename? I’ve gotten this comment before and I just couldn’t think of anything good.

      • [object Object]@lemmy.ca
        link
        fedilink
        English
        arrow-up
        0
        ·
        23 days ago

        I feel like that well describes a border collie.

        Wants to do stuff, but if you don’t attend they’ll find stuff to do.

        • Dark-Alex-17@lemmy.worldOP
          link
          fedilink
          English
          arrow-up
          0
          ·
          23 days ago

          Ooh I like Coyote! That’s definitely in the running now. Not to mention that’s really a really cool allusion to Native American mythology!

          • MalReynolds@slrpnk.net
            link
            fedilink
            English
            arrow-up
            0
            ·
            edit-2
            23 days ago

            Not to mention road-runners (humans) and ACME (OpenAi, Anthropic etc.) extending the metaphor in a different direction… Wile.E. (coyote, suupergenius) might be another name option.

            Ahh, ninja’d, I’ll leave it as another vote.

          • Helix 🧬@feddit.org
            link
            fedilink
            English
            arrow-up
            0
            ·
            23 days ago

            Or Coyode (mixture between code and coyote, could be written co[yo]de for extra yo). Only has 4 duckduckgo results so easily searchable and distinguishable.

            • Dark-Alex-17@lemmy.worldOP
              link
              fedilink
              English
              arrow-up
              0
              ·
              23 days ago

              After sitting with Coyote for a while, I’m really liking the name. Before I get too attached, any other ideas? (Just to make sure I stay objective 😛)

              • jackal@infosec.pub
                link
                fedilink
                English
                arrow-up
                0
                ·
                23 days ago

                Stop overthinking it and use it. It seems like the consensus is in the approval of your choice.

                • Dark-Alex-17@lemmy.worldOP
                  link
                  fedilink
                  English
                  arrow-up
                  0
                  ·
                  23 days ago

                  Works for me. I’ll refactor that and rename it tomorrow and hopefully have a new minor release sometime this week. It’ll be another baking change release so I’ll need to attach a couple commands to the release notes to make it easy to migrate.

        • Dark-Alex-17@lemmy.worldOP
          link
          fedilink
          English
          arrow-up
          0
          ·
          23 days ago

          Do you have a GitHub and would you be willing to share it with me so I could credit you with the name? No worries if not, I can least link to your Lemmy profile instead of you prefer. I just don’t want to change it without giving credit.