• Kuinox@lemmy.world
    link
    fedilink
    arrow-up
    24
    arrow-down
    1
    ·
    edit-2
    13 days ago

    I’m someone who use LLM, including Claude Code a lot.
    But come on, this could be a tiny script that do the same, write it with claude code if you dont want to write it, instead of having claude code run each time, you’ll be deterministic this way !

    • kautau@lemmy.world
      link
      fedilink
      arrow-up
      14
      ·
      13 days ago

      Yeah this should just be a standard GitHub action. It’s a waste of energy to have an LLM do this over and over.

      I see this trend happening a lot lately, where instead of getting the LLM to write code that does a thing, the LLM is repeatedly asked to do the thing, which leads to it just writing the same or very nearly the same code over and over.

    • dormedas@lemmy.dormedas.com
      link
      fedilink
      arrow-up
      12
      ·
      13 days ago

      Their company is an AI assistant for shopping, so trying to put AI everywhere including places it shouldn’t be is gonna happen.

      I like my build scripts dependable, debuggable, and deterministic. This is wild. When the bot makes a pull request, and the user (who may be someone else at some point) doesn’t respond with exactly what the prompt wants, what happens? What happens when Claude Code updates, or has an outage? Don’t change that GitHub action at the end’s name without remembering to update the prompt as well.

      • kautau@lemmy.world
        link
        fedilink
        arrow-up
        2
        ·
        12 days ago

        Or worse. A single bad actor (according to the company) poisoned grok to be white supremacist. How many unsupervised, privileged LLM commands could run in a short time if an angry employee at Anthropic poisons the LLM to cause malicious damage to servers, environments, or pipelines it has access to?

    • wise_pancake@lemmy.ca
      link
      fedilink
      arrow-up
      2
      ·
      13 days ago

      They only actually need Claude to skip QA and hope that reading the code is a good enough substitute.

      This really could be a script to create a PR for the merge, request a review from Claude, then automate the rest.

  • TehPers@beehaw.org
    link
    fedilink
    English
    arrow-up
    18
    ·
    13 days ago

    The only part of this process I’d consider automating with a LLM is summarizing the changes, and even then I’d only be interested looking at a suggested changelog, not something fully automated.

    It’s amazing to me how far people will go to avoid writing a simple script. Thankfully determinism isn’t a requirement for a release pipeline. You wouldn’t want all of your releases to go smoothly. That would be no fun.

    • FizzyOrange@programming.dev
      link
      fedilink
      arrow-up
      1
      ·
      13 days ago

      I agree. The right way to integrate AI into this process is to pre-fill the “release notes” box with an AI suggestion, that you then edit.

  • Nate Cox@programming.dev
    link
    fedilink
    English
    arrow-up
    16
    arrow-down
    1
    ·
    13 days ago

    So this is Jenkins except it guzzles water and ruins the lives of people near data centers?

  • cronenthal@discuss.tchncs.de
    link
    fedilink
    arrow-up
    10
    ·
    13 days ago

    Uhm, we have release pipelines in Azure DevOps that do all of this and much more with precision and reliability. Relying on an LLM for actual releases to production of all things seems like a pretty big and wholly unnecessary gamble.

  • jjjalljs@ttrpg.network
    link
    fedilink
    arrow-up
    5
    ·
    13 days ago

    This doesn’t seem like a good idea.

    One, releasing should be easy. At my last job, you clicked “new release” or whatever on GitHub. It then listed all the commits for you. If you “need” an Ai to summarize the commits, you fucked up earlier. Write better commit messages. Review the changes. Use your brain (something the AI can’t do) to make sure you actually want all of this to go out. Click the button. GitHub runs checks and you’re done.

    Most of the time it took a couple minutes at most to do this process.

  • Colonel Panic@programming.dev
    link
    fedilink
    arrow-up
    1
    ·
    edit-2
    13 days ago

    this seems overengineered to me - it’s easy enough to have fully automated builds, deployments and releases already. you can even have ai sketch your github actions or similar config. anything beyond that is simply a downgrade or reinventing the wheel. a git commit hook might be just enough