notes.jays.net

"AI" for computer programming

Experience: 30 years of computer programming. In 2023 my career veered off of large scale back-end development (DBs, APIs, etc.) and into "DevOps" (gluing dataflow tools together w/o building large bespoke ones in house). My title is "Data Engineer" but I wouldn't say I'm doing much of that.

2026-05-07

VSCode -> GitHub Copilot -> Claude Sonnet 4.6 is extremely good at AWS monkeying + Terraform. The majority of the time. I am more valuable to my employer (faster at finishing typical tasks) with that tool than without it.

Context: The AWS stack I'm responsible for: GitHub Actions (CI/CD) -> Terraform -> AWS: MWAA (Apache Airflow) (Step Functions, dbt), Docker -> ECR -> ECS, Glue (Jobs, Connectors, Crawlers) (Apache Spark (Java)), Athena, S3 general purpose buckets, Redshift (PostgreSQL). IAM, VPCs, Secrets, CloudWatch (w/ Dashboards), EventBridge, SNS, Lambda, CloudTrail. Experimental: S3 table buckets (Apache Iceberg), CloudFormation.

How I use Claude: My git repo contains zero secrets. New branch in git. Claude has direct access to read and write all the files in my LOCAL git clone (via VSCode). Every suggested change to every file pops up a pretty red/green diff display with "Approve" and "Reject" buttons. I tell Claude to run read-only AWS CLI commands itself so I don't have to copy/paste responses back to Claude. Every CLI command it wants to run pops up an "Allow" button for me to confirm manually.

This makes troubleshooting my cascading AWS environments much faster. Claude chases ~20 layers of AWS junk and suggests Terraform / source code changes. AWS is byzantine and constantly changing, and you frequently need 3-6 CLI commands chained together perfectly to find a root cause in whatever layer went sideways.

I frequently "argue" with Claude until I'm satisfied with a change set.

Sometimes Claude "death spirals" -- it gets stuck looping bad ideas, or circular hypotheses. I Abort Claude and re-guide it's approach, and/or chase the problem myself for a while. This is infrequent recently.

I perform all git operations myself, manually. Claude has zero permission to run any CLI autonomously. So theoretically all it could blow up is my local git clone. Which is trivial to git reset --hard.

When I think a task is finished, I push my branch as a Pull Request for additional not-me human review and QA. I take personal responsibility for every line of code in my PR. Whether I wrote it, or Claude did.

OOPS: On Jun 1 GitHub Copilot -> Claude Sonnet 4.6 is getting 9X more expensive? I have no idea how much USD my employer is currently paying for my recent use of Claude Sonnet 4.6. I've been told Claude Opus is much better, but I've never tried Opus because it's 3X more expensive than Sonnet and Sonnet works fine for what I do?

2026-01-19

It now appears that Claude Sonnet 4.5 (in VSCode) is stunningly stronger (faster) at many of the processes I've been doing manually for 30 years. With a low mistake / disastrous suggestion rate. I've gotten pretty good at harnessing Claude for my tasks.

Overall, in the domain I'm working in (AWS cloud monkey), I may or may not be more productive than other humans.

But it's becoming inarguable, it seems to me, that a human with my level of experience doing my specific job w/ Claude Sonnet 4.5 can be (note: is not guaranteed to be) more productive than a human w/o Claude or equivalent.

My professional experience now includes many months of harnessing "AI" tools.

Presumably in the next 1-5 years, the licensing costs of the tools my job is using will skyrocket as those companies eventually have to stop losing tens / hundreds of billions of dollars every year?


Side-note: For 25 years I'd use Perl to do little one-off things. They'd take me 20-90 minutes. Now Claude writes that thing for me from scratch in Python faster than I can write good prompts. This makes my old programmer brain sad. But my get shit done brain is happy the shit got done.


On Mastodon, someone asked:

Oh FFS, why is it so hard for people to understand that LLMs/genAI don't THINK?! 💀

Doing my specific job, at this point, I don't know how a blind Turing test would / could prove I "think" about problems more than Claude Sonnet 4.5 does.

Could you detect it was Claude? Absolutely.

Claude isn't just a one-time word salad generator anymore. It generates deep trees of hypothesis and conclusions only partially based on LLM predictions. It's "logical" "grounding" in technical specs / docs is... I didn't think it was going to be possible for at least 10 more years.

Or I'm a sucker / stupid / tricked. I dropped out of my Philosophy major decades ago, so maybe my thinking about thinking is weak. 🙂

2025-07-09

I cancelled all my personal "AI" subscriptions because my new job is flooded with those tools. By policy I'm not supposed to use those tools for personal use. I'm not making enough time for my hobbies to "need" "AI" for them anyway. My hobbies are shifting from software to physical.

2025-05-11

I'm often very impressed with ChatGPT 4o in specific situations. When I have no idea how to do X, ChatGPT is often excellent at quickly putting me in a functional ballpark of one way to start. Saving me 10-90 minutes of trying to find good docs / a single working example.

So I always have a ChatGPT 4o browser tab open. I go back and forth between command-line docs, web reference docs, Kagi searches, and asking ChatGPT 4o.

I've also been playing with Cursor for small hobby projects. It seems excellent at "writing things for me" and "explaining" what it's doing, how it works, and why to do those things. I've paid $0 for Cursor so far. The free tier is 125 whatevers/month, which I haven't hit yet.

e.g. It's been years since I've attempted any NLP (Natural Language Processing), so I had ChatGPT 4o sketch out some code for me in whatever language / libraries it recommend. I've had Cursor refining it for me. I'm still doing a lot of it manually myself, but from a tool-assisted initial scaffolding. I have to understand the new code myself before I accept anything the tools spit out. I commit each step to git manually after I understand what it did.

I've been very surprised that I can ask it questions about non-obvious, mildly janky code I wrote and it explains what I wrote in 3 seconds in very well written docs. If asked to document what I wrote, I probably would have written similar (or slightly worse?) docs of that thing, and it would have taken me 30 minutes.

I haven't attempted any AI integrations into large scale real-world contexts. I assume that if, a few months ago, I had attempted to have Cursor help me with the ~30K lines of Terraform configuration I was working on, that maybe that would have been useful? At the time I wasn't using Cursor yet, so I didn't try. Cursor is apparently super great at feeding your code context to it. It manipulates your code in-line inside a VSCode fork. Very fancy. I have no idea how well it scales.

I'm a baby "AI" tool user, not a power user (so far). Everything I've done is the intro stuff. I've been very impressed recently. It also kicks out total nightmare code occassionally. Or completely non-functional nonsense. But not very often in my recent experience.

2024 - early 2025

All these "AIs" (LLMs) are pretty much crap as far as I can tell. But I feel professional pressure to have one available to me, so apparently I've been personally paying OpenAI $20/month. Just to have a browser tab available in case it happens to be useful occasionally. Sometimes to be able to say "ya, I tried that, but it sucked at that."

As a consultant, it appears to me you always have to be open to at least trying new tech. Blanket refusal to even try something is frowned upon. If my $20/month destroys the entire planet / humanity, that's my bad. I don't feel massively complicit (yet?), I believe they're losing money on my usage? (Burn that venture capitol, burn. 🔥) If the price increases much (even enough to cover their costs?) I predict I'll drop my subscription.