
Fragments: December 16


Gitanjali Venkatraman does great illustrations of complex topics (which is why I was so happy to work with her on our Expert Generalists article). She has now published the latest in her series of illustrated guides, tackling the complex subject of Mainframe Modernization.


In it she illustrates the history and value of mainframes, why modernization is so difficult, and how to tackle the problem by breaking it down into tractable pieces. I love the clarity of her explanations, and smile regularly at her way of enhancing her words with her quirky pictures.

 ❄                ❄                ❄                ❄                ❄

Gergely Orosz on social media

Unpopular opinion:

Current code review tools just don’t make much sense for AI-generated code

When reviewing code I really want to know:

  • The prompt written by the dev
  • What corrections the other dev made to the code
  • Clear marking of AI-generated code not modified by a human

Some people pushed back saying they don’t (and shouldn’t) care whether it was written by a human, generated by an LLM, or copy-pasted from Stack Overflow.

In my view it matters a lot, because of the second main purpose of code review.

When asked why we do code reviews, most people will reply with the first main purpose: quality control. We want to ensure bad code gets blocked before it hits mainline. We do this to avoid bugs and to avoid other quality problems, particularly comprehensibility and ease of change.

But I hear the second main purpose less often: code review is a mechanism to communicate and educate. If I’m submitting some sub-standard code, and it gets rejected, I want to know why so that I can improve my programming. Maybe I’m unaware of some library features, or maybe there are some project-specific standards I haven’t run into yet, or maybe my naming isn’t as clear as I thought it was. Whatever the reasons, I need to know in order to learn. And my employer needs me to learn, so I can be more effective.

We need to know the author of the code we review both so we can communicate better practice to them, and also so we know how to improve things. With a human, it’s a conversation, and perhaps some documentation if we notice we’ve needed to explain things repeatedly. But with an LLM it’s about how to modify its context, as well as humans learning how to better drive the LLM.

 ❄                ❄                ❄                ❄                ❄

Wondering why I’ve been making a lot of posts like this recently? I explain why I’ve been reviving the link blog.

 ❄                ❄                ❄                ❄                ❄

Simon Willison describes how he uses LLMs to build disposable but useful web apps

These are the characteristics I’ve found to be most effective in building tools of this nature:

  1. A single file: inline JavaScript and CSS in a single HTML file means the least hassle in hosting or distributing them, and crucially means you can copy and paste them out of an LLM response.
  2. Avoid React, or anything with a build step. The problem with React is that JSX requires a build step, which makes everything massively less convenient. I prompt “no react” and skip that whole rabbit hole entirely.
  3. Load dependencies from a CDN. The fewer dependencies the better, but if there’s a well-known library that helps solve a problem I’m happy to load it from CDNjs or jsdelivr or similar.
  4. Keep them small. A few hundred lines means the maintainability of the code doesn’t matter too much: any good LLM can read them and understand what they’re doing, and rewriting them from scratch with help from an LLM takes just a few minutes.

His repository includes all these tools, together with transcripts of the chats that got the LLMs to build them.

 ❄                ❄                ❄                ❄                ❄

Obie Fernandez: while many engineers are underwhelmed by AI tools, some senior engineers are finding them really valuable. He feels that senior engineers have an oft-unspoken mindset, which, in conjunction with an LLM, allows the LLM to be far more valuable.

Levels of abstraction and generalization get talked about a lot because they’re easy to name. But they’re far from the whole story.

Other tools show up just as often in real work:

  • A sense for blast radius. Knowing which changes are safe to make loudly and which need to be quiet and contained.
  • A feel for sequencing. Knowing when a technically correct change is still wrong because the system or the team isn’t ready for it yet.
  • An instinct for reversibility. Preferring moves that keep options open, even when they look less elegant in the moment.
  • An awareness of social cost. Recognizing when a clever solution will confuse more people than it helps.
  • An allergy to false confidence. Noticing places where tests are green but the model is wrong.

 ❄                ❄                ❄                ❄                ❄

Emil Stenström built an HTML5 parser in Python using coding agents, using GitHub Copilot in Agent mode with Claude Sonnet 3.7. He automatically approved most commands. It took him “a couple of months on off-hours”, including at least one restart from scratch. The parser now passes all the tests in the html5lib test suite.

After writing the parser, I still don’t know HTML5 properly. The agent wrote it for me. I guided it when it came to API design and corrected bad decisions at the high level, but it did ALL of the gruntwork and wrote all the code.

I handled all git commits myself, reviewing code as it went in. I didn’t understand all the algorithmic choices, but I understood when it didn’t do the right thing.

Although he gives an overview of what happened, there isn’t much information on his workflow and how he interacted with the LLM. There’s certainly not enough detail here to try to replicate his approach. This is in contrast to Simon Willison (above), who has detailed links to his chat transcripts, although they are much smaller tools and I haven’t looked at them properly to see how useful they are.

One thing that is clear, however, is the essential need for a comprehensive test suite. Much of his work is driven by having that suite as a clear guide for him and the LLM agents.
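To make that role concrete, here is a minimal sketch (not Emil’s actual harness) of how the html5lib-tests tree-construction files could drive a parser. The `.dat` format with its `#data`, `#errors`, and `#document` sections is real; `parse_to_test_format` is a hypothetical stand-in for whatever parser is under test.

```python
# Sketch: run html5lib-tests tree-construction cases against a parser.
# Each .dat file holds cases with "#data", "#errors", and "#document"
# sections; the expected tree is rendered in an indented "| ..." format.
from pathlib import Path

def load_cases(dat_file: Path):
    """Yield (input_html, expected_tree) pairs from one .dat file."""
    chunks = dat_file.read_text(encoding="utf-8").split("#data\n")
    for raw in chunks[1:]:                      # first chunk is empty
        data, _, rest = raw.partition("#errors\n")
        expected = rest.partition("#document\n")[2]
        yield data.rstrip("\n"), expected.rstrip("\n")

def run_suite(test_dir: Path, parse_to_test_format):
    """Return the cases where the parser's output differs from the suite."""
    failures = []
    for dat_file in sorted(test_dir.glob("*.dat")):
        for html, expected in load_cases(dat_file):
            actual = parse_to_test_format(html)  # hypothetical parser call
            if actual != expected:
                failures.append((dat_file.name, html, expected, actual))
    return failures
```

A harness like this gives both the human and the agent an unambiguous definition of done: the failing cases are the to-do list.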

JustHTML is about 3,000 lines of Python with 8,500+ tests passing. I couldn’t have written it this quickly without the agent.

But “quickly” doesn’t mean “without thinking.” I spent a lot of time reviewing code, making design decisions, and steering the agent in the right direction. The agent did the typing; I did the thinking.

 ❄                ❄

Then Simon Willison ported the library to JavaScript:

Time elapsed from project idea to finished library: about 4 hours, during which I also bought and decorated a Christmas tree with family and watched the latest Knives Out movie.

One of his lessons:

If you can reduce a problem to a robust test suite you can set a coding agent loose on it with a high degree of confidence that it will eventually succeed. I called this designing the agentic loop a few months ago. I think it’s the key skill to unlocking the potential of LLMs for complex tasks.

Our experience at Thoughtworks backs this up. We’ve been doing a fair bit of work recently in legacy modernization (mainframe and otherwise), using AI to migrate substantial software systems. Having a robust test suite is essential (but not sufficient) to making this work. I hope to share my colleagues’ experiences on this in the coming months.
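As a rough illustration of what such an agentic loop can look like, here is a minimal sketch under a couple of assumptions: the suite runs with pytest, and `ask_agent_to_fix` is a hypothetical hook that hands the failure output to whatever coding agent is doing the editing. It is not an API from either of these projects.

```python
# Sketch: a test-suite-driven agentic loop. Keep running the suite and
# feeding failures back to a coding agent until everything is green.
import subprocess

MAX_ITERATIONS = 20

def run_tests() -> tuple[bool, str]:
    """Run the project's test suite and return (passed, combined output)."""
    result = subprocess.run(
        ["pytest", "--maxfail=20", "-q"],
        capture_output=True, text=True,
    )
    return result.returncode == 0, result.stdout + result.stderr

def agentic_loop(ask_agent_to_fix) -> bool:
    for i in range(MAX_ITERATIONS):
        passed, output = run_tests()
        if passed:
            print(f"All tests green after {i} agent iterations")
            return True
        # Hand the failing output back to the agent and let it edit the code.
        ask_agent_to_fix(
            "The test suite is failing. Fix the code (not the tests) "
            "so these failures pass:\n\n" + output
        )
    return False  # still red after the budget is spent: a human steps in
```

The loop only works as well as the suite that drives it, which is why a robust test suite is the precondition rather than an afterthought.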

But before I leave Willison’s post, I should highlight his final open questions on the legalities, ethics, and effectiveness of all this; they are well worth pondering.
