
AI can rewrite open source code—but can it rewrite the license, too?


Computer engineers and programmers have long relied on reverse engineering as a way to copy the functionality of a computer program without directly copying that program's copyright-protected code. Now, AI coding tools are raising new questions about how that "clean room" rewrite process plays out legally, ethically, and practically.

Those issues came to the forefront last week with the release of a new version of chardet, a popular open source Python library for automatically detecting character encoding. The library was originally written by developer Mark Pilgrim in 2006 and released under an LGPL license that placed strict limits on how it could be reused and redistributed.
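For readers unfamiliar with the problem space, here is a naive, stdlib-only sketch of what encoding detection involves. (This is only an illustration of the problem chardet solves; chardet's actual approach uses statistical models of byte frequencies and is far more sophisticated.)

```python
# Naive illustration of the problem chardet solves: given raw bytes,
# guess which encoding decodes them cleanly. Order matters, because
# permissive encodings like cp1252 will decode almost any byte stream.

def guess_encoding(data: bytes, candidates=("ascii", "utf-8", "cp1252")) -> str:
    for encoding in candidates:
        try:
            data.decode(encoding)
            return encoding  # first candidate that decodes without error
        except UnicodeDecodeError:
            continue
    return "unknown"

print(guess_encoding("café".encode("utf-8")))   # utf-8
print(guess_encoding("café".encode("cp1252")))  # cp1252
```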

Dan Blanchard took over maintenance of the repository in 2012 but waded into some controversy with the release of version 7.0 of chardet last week. Blanchard described that overhaul as "a ground-up, MIT-licensed rewrite" of the entire library built with the help of Claude Code to be "much faster and more accurate" than what came before.

Speaking to The Register, Blanchard said that he has long wanted to get chardet added to the Python standard library but that he didn't have the time to fix problems with "its license, its speed, and its accuracy" that were getting in the way of that goal. With the help of Claude Code, though, Blanchard said he was able to overhaul the library "in roughly five days" and get a 48x performance boost to boot.

Not everyone has been happy with that outcome, though. A poster using the name Mark Pilgrim surfaced on GitHub to argue that this new version amounts to an illegitimate relicensing of Pilgrim's original code under a more permissive MIT license (which, among other things, allows for its use in closed-source projects). Because it is a modification of his original LGPL-licensed code, Pilgrim argues, this new version of chardet must retain the same LGPL license.

"Their claim that it is a 'complete rewrite' is irrelevant, since they had ample exposure to the originally licensed code (i.e., this is not a 'clean room' implementation)," Pilgrim wrote. "Adding a fancy code generator into the mix does not somehow grant them any additional rights. I respectfully insist that they revert the project to its original license."

Whose code is it, anyway?

In his own response to Pilgrim, Blanchard admits that he has had "extensive exposure to the original codebase," meaning he didn't have the traditional "strict separation" usually used for "clean room" reverse engineering. But that tradition was set up for human coders as a way "to ensure the resulting code is not a derivative work of the original," Blanchard argues.

In this case, Blanchard said that the new AI-generated code is "qualitatively different" from what came before it and "is structurally independent of the old code." As evidence, he cites JPlag similarity statistics showing that a maximum of 1.29 percent of any chardet version 7.0.0 file is structurally similar to the corresponding file in version 6.0.0. Comparing version 5.2.0 to version 6.0.0, on the other hand, finds up to 80 percent similarity in some corresponding files.
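JPlag compares token streams, but the basic idea of scoring similarity between two source files can be illustrated with Python's stdlib difflib. (This is a crude stand-in for JPlag's actual algorithm, not a reproduction of it; the source strings below are invented.)

```python
# Crude illustration of file-similarity scoring, much simpler than
# JPlag's token-based comparison: SequenceMatcher.ratio() returns a
# 0.0-1.0 similarity score between two sequences.
import difflib

def similarity(source_a: str, source_b: str) -> float:
    """Percent similarity between two chunks of source text."""
    matcher = difflib.SequenceMatcher(None, source_a, source_b)
    return round(matcher.ratio() * 100, 2)

old = "def detect(data):\n    return scan_bytes(data)\n"
new = "class Detector:\n    def feed(self, chunk):\n        self.state.update(chunk)\n"

print(similarity(old, old))  # 100.0 (identical files)
print(similarity(old, new))  # much lower for a structural rewrite
```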

"No file in the 7.0.0 codebase structurally resembles any file from any prior release," Blanchard writes. "This is not a case of 'rewrote most of it but carried some files forward.' Nothing was carried forward."

Blanchard says starting with a "wipe it clean" commit and a fresh repository was key in crafting fresh, non-derivative code from the AI. Credit: Dan Blanchard / GitHub

Blanchard says he was able to accomplish this "AI clean room" process by first specifying an architecture in a design document and writing out some requirements to Claude Code. After that, Blanchard "started in an empty repository with no access to the old source tree and explicitly instructed Claude not to base anything on LGPL/GPL-licensed code."

There are a few complicating factors to this straightforward story, though. For one, Claude explicitly relied on some metadata files from previous versions of chardet, raising direct questions about whether this version is actually "derivative."

For another, Claude's models are trained on reams of data pulled from the public Internet, which means it's overwhelmingly likely that Claude has ingested the open source code of previous chardet versions in its training. Whether that prior "knowledge" means that Claude's creation is a "derivative" of Pilgrim's work is an open question, even if the new code is structurally different from the old.

And then there's the remaining human factor. While the code for this new version was generated by Claude, Blanchard said he "reviewed, tested, and iterated on every piece of the result using Claude. ... I did not write the code by hand, but I was deeply involved in designing, reviewing, and iterating on every aspect of it." Having someone with intimate knowledge of earlier chardet code take such a heavy hand in reviewing the new code could also have an impact on whether this version can be considered a wholly new project.

Brave new world

All of these issues have predictably sparked debate across the open source community over the legality of chardet version 7.0.0. "There is nothing 'clean' about a Large Language Model which has ingested the code it is being asked to reimplement," Free Software Foundation Executive Director Zoë Kooyman told The Register.

But others think the "Ship of Theseus"-style arguments that often emerge in code licensing dust-ups don't apply as much here. "If you throw away all code and start from scratch, even if the end result behaves the same, it's a new ship," open source developer Armin Ronacher said in a blog post analyzing the situation.

The legal status of AI-generated code is still largely unsettled. Credit: Getty Images

Old code licenses aside, using AI to create new code from whole cloth could also create its own legal complications going forward. Courts have already said that AI can't be the inventor on a patent or the copyright holder on a piece of art but have yet to rule on what that means for the licensing of software created in whole or in part by AI. The issues surrounding potential "tainting" of an open source license with this kind of generated code can get remarkably complex remarkably quickly.

Whatever the outcome here, the practical impact of being able to use AI to quickly rewrite and relicense many open source projects—without nearly as much effort on the part of human programmers—is likely to have huge knock-on effects throughout the community.

"Now the process of rewriting is so simple to do, and many people are disturbed by this," Italian coder Salvatore "antirez" Sanfilippo wrote on his blog. "There is a more fundamental truth here: the nature of software changed; the reimplementations under different licenses are just an instance of how such nature was transformed forever. Instead of combating each manifestation of automatic programming, I believe it is better to build a new mental model and adapt."

Others put the sea change in more alarming terms. "I'm breaking the glass and pulling the fire alarm!" open source evangelist Bruce Perens told The Register. "The entire economics of software development are dead, gone, over, kaput! ... We have been there before, for example when the printing press happened and resulted in copyright law, when the scientific method proliferated and suddenly there was a logical structure for the accumulation of knowledge. I think this one is just as large."


Your Data is Made Powerful By Context (so stop destroying it already) (xpost)


In logs as in life, the relationships are the most important part. AI doesn’t fix this. It makes it worse.

(cross-posted)

After twenty years of devops, most software engineers still treat observability like a fire alarm — something you check when things are already on fire.

Not a feedback loop you use to validate every change after shipping. Not the essential, irreplaceable source of truth on product quality and user experience.

This is not primarily a culture problem, or even a tooling problem. It’s a data problem. The dominant model for telemetry collection stores each type of signal in a different “pillar”, which rips the fabric of relationships apart — irreparably.

Your observability data is self-destructing at write time

The three pillars model works fine for infrastructure[1], but it is catastrophic for software engineering use cases, and will not serve for agentic validation.

But why? It’s a flywheel of compounding factors, not just one thing, but the biggest one by far is this:

✨Data is made powerful by context✨

The more context you collect, the more powerful it becomes

Your data does not become linearly more powerful as you widen the dataset, it becomes exponentially more powerful. Or if you really want to get technical, it becomes combinatorially more powerful as you add more context.

I made a little Netlify app here where you can enter how many attributes you store per log or trace, to see how powerful your dataset is.

  • 4 fields? 6 pairwise combos, 15 possible combinations.
  • 8 fields? 28 pairwise combos, 255 possible combinations.
  • 50 fields? 1.2K pairwise combos, 1.1 quadrillion (2^50) possible combinations, as seen in the screenshot below.
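The arithmetic behind these numbers is easy to verify: n fields give C(n, 2) pairwise combinations and 2^n - 1 non-empty subsets.

```python
# Verifying the field-combination math: n attributes yield
# comb(n, 2) pairwise combinations and 2**n - 1 non-empty subsets.
from math import comb

for n in (4, 8, 50):
    pairwise = comb(n, 2)
    subsets = 2**n - 1
    print(f"{n} fields: {pairwise} pairwise combos, {subsets:,} possible combinations")
```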

When you add another attribute to your structured log events, it doesn’t just give you “one more thing to query”. It gives you new combinations with every other field that already exists.

The wider your data is, the more valuable the data becomes. Click on the image to go futz around with the sliders yourself.

Note that this math is exclusively concerned with attribute keys. Once you account for values, the precision of your tooling goes higher still, especially if you handle high cardinality data.

Data is made valuable by relationships

“Data is made valuable by context” is another way of saying that the relationships between attributes are the most important part of any data set.

This should be intuitively obvious to anyone who uses data. How valuable is the string “Mike Smith”, or “21 years old”? Stripped of context, they hold no value.

By spinning your telemetry out into siloes based on signal type, the three pillars model ends up destroying the most valuable part of your data: its relational seams.

AI-SRE agents don’t seem to like three pillars data

I posted something on LinkedIn yesterday and got a pile of interesting comments. One came from Kyle Forster, founder of an AI-SRE startup called RunWhen, who linked to an article he wrote called "Do Humans Still Read Logs?"

Humpty Dumpty traced every span, Humpty Dumpty had a great plan.

In his article, he noted that less than 30 percent of their AI-SRE tool calls went to "traditional observability data," i.e., metrics, logs, and traces. Instead, they used the instrumentation generated by other AI tools to wrap calls and queries. His takeaway:

Good AI reasoning turns out to require far less observability data than most of us thought when it has other options.

My takeaway is slightly different. After all, the agent still needed instrumentation and telemetry in order to evaluate what was happening. That’s still observability, right?

But as Kyle tells it, the agents went searching for a richer signal than the three pillars were giving them. They went back to the source to get the raw telemetry, before it was digested into pillars, with all its connective tissue intact. That's how important it was to them.

Huh.

You can’t put Humpty back together again

I’ve been hearing a lot of “AI solves this”, and “now that we have MCPs, AI can do joins seamlessly across the three pillars”, and “this is a solved problem”.

Mmm. Joins across data siloes can be better than nothing, yes. But they don’t restore the relational seams. They don’t get you back to the mathy good place where every additional attribute makes every other attribute exponentially more valuable. At agentic speed, that reconstruction becomes a bottleneck and a failure surface.

Humpty Dumpty stored all the state, Humpty Dumpty forgot to replicate.

Our entire industry is trying to collectively work out the future of agentic development right now. The hardest and most interesting problems (I think) are around validation. How do we validate a change rate that is 10x, 100x, 1000x greater than before?

I don’t have all the answers, but I do know this: agents are going to need production observability with speed, flexibility, TONS of context, and some kind of ontological grounding via semantic conventions.

In short: agents are going to need precision tools. And context (and cardinality) are what feed precision.

Production is a very noisy place

Production is a noisy, rowdy place of chaos, particularly at scale. If you are trying to do anomaly detection with no a priori knowledge of what to look for, the anomaly has to be fairly large to be detected. (Or else you’re detecting hundreds of “anomalies” all the time.)

But if you do have some knowledge of intent, along with precision tooling, these anomalies can be tracked and validated even when they are exquisitely minute. Like even just a trickle of requests[2] out of tens of millions per second.

Let’s say you work for a global credit card provider. You’re rolling out a code change to partner payments, which are “only” tens of thousands of requests per second — a fraction of your total request volume of tens of millions of req/sec, but an important one.

This is a scary change, no matter how many tests you ran in staging. To test this safely in production, you decide to start by rolling the new build out to a small group of employee test users, and oh, what the hell — you make another feature flag that lets any user opt in, and flip it on for your own account.

You wait a few days. You use your card a few times. It works (thank god).

On Monday morning you pull up your observability data and select all requests containing the new build_id or commit hash, as well as all of the feature flags involved. You break down by endpoint, then start looking at latency, errors, and distribution of request codes for these requests, comparing them to the baseline.

Hm — something doesn’t seem quite right. Your test requests aren’t timing out, but they are taking longer to complete than the baseline set. Not for all requests, but for some.

Further exploration lets you isolate the affected requests to a set with a particular query hash. Oops.. how’d that n+1 query slip in undetected??
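The slice-and-compare workflow described above can be sketched with nothing but the stdlib, assuming each event is a wide structured dict. (All field names and values here are hypothetical, invented purely for illustration.)

```python
# Sketch of canary validation over wide structured events: filter by the
# new build_id, then compare latency against the baseline population and
# drill into the slow slice by another attribute.
# Field names (build_id, duration_ms, query_hash) are illustrative only.
from statistics import median

events = [
    {"build_id": "abc123", "endpoint": "/pay", "duration_ms": 40, "query_hash": "q1"},
    {"build_id": "abc123", "endpoint": "/pay", "duration_ms": 900, "query_hash": "q9"},
    {"build_id": "old456", "endpoint": "/pay", "duration_ms": 42, "query_hash": "q1"},
    {"build_id": "old456", "endpoint": "/pay", "duration_ms": 45, "query_hash": "q1"},
]

canary = [e for e in events if e["build_id"] == "abc123"]
baseline = [e for e in events if e["build_id"] != "abc123"]

# Compare medians, then isolate the regression by a second attribute.
suspects = set()
if median(e["duration_ms"] for e in canary) > 2 * median(e["duration_ms"] for e in baseline):
    slow = [e for e in canary if e["duration_ms"] > 100]
    suspects = {e["query_hash"] for e in slow}
    print("regression isolated to query hashes:", suspects)  # {'q9'}
```

The point of the sketch: because build_id, duration_ms, and query_hash live on the same event, no cross-silo join is needed to go from "something is slow" to "this query is slow."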

You quickly submit a fix, ship a new build_id, and roll your change out to a larger group: this time, it’s going out to 1% of all users in a particular region.

The anomalous requests may have been only a few dozen per day, spread across many hours, in a system that served literally billions of requests in that time.

Humpty Dumpty: assembled, redeployed, A patchwork of features half-built, half-destroyed. “It’s not what we planned,” said the architect, grim. “But the monster is live — and the monster is him.”

Precision tooling makes them findable. Imprecise tooling makes them unfindable.

How do you expect your agents to validate each change, if the consequences of each change cannot be found?[3]

Well, one might ask, how have we managed so far? The answer is: by using human intuition to bridge the gaps. This will not work for agents. Our wisdom must be encoded into the system, or it does not exist.

Agents need speed, flexibility, context, and precision to validate in prod

In the past, excruciatingly precise staged rollouts like these have been mostly the province of your Googles and Facebooks. Progressive deployments have historically required a lot of tooling and engineering resources.

Agentic workflows are going to make these automated validation techniques much easier and more widely used; at the exact same time, agents developing to spec are going to require a dramatically higher degree of precision and automated validation in production.

It is not just the width of your data that matters when it comes to getting great results from AI. There’s a lot more involved in optimizing data for reasoning, attribution, or anomaly detection. But capturing and preserving relationships is at the heart of all of it.

In this situation, as in so many others, AI is both the sickness and the cure[4]. Better get used to it.

 

 

 

[1] — Infrastructure teams use the three pillars for one extremely good reason: they have to operate a lot of code they did not write and cannot change. They have to slurp up whatever metrics or logs the components emit and store them somewhere.

[2] — Yes, there are some complications here that I am glossing past, ones that start with 's' and rhyme with "ampling". However, the rich data + sampling approach to the cost-usability balance is generally satisfied by dropping the least valuable data. The three pillars approach to the cost-usability problem is generally satisfied by dropping the MOST valuable data: cardinality and context.

[3] — The needle-in-a-haystack is one visceral illustration of the value of rich context and precision tooling, but there are many others. Another example: wouldn't it be nice if your agentic task force could check up on any diffs that involve cache key or schema changes, say, once a day for the next 6-12 months? These changes famously take a long time to manifest, by which time everyone has forgotten that they happened.

[4] — One sentence I have gotten a ton of mileage out of lately: "AI, much like alcohol, is both the cause of and solution to all of life's problems."


I don't know if my job will still exist in ten years


In 2021, being a good software engineer felt great. The world was full of software, with more companies arriving every year who needed to employ engineers to write their code and run their systems. I knew I was good at it, and I knew I could keep doing it for as long as I wanted to. The work I loved would not run out.

In 2026, I’m not sure the software engineering industry will survive another decade. If it does, I’m certain it’s going to change far more than it did in the last two decades. Maybe I’ll figure out a way to carve out a lucrative niche supervising AI agents, or maybe I’ll have to leave the industry entirely. Either way, the work I loved is going away.

Tasting our own medicine

It’s unseemly to grieve too much over it, for two reasons. First, the whole point of being a good software engineer in the 2010s was that code provided enough leverage to automate away other jobs. That’s why programming was (and still is) such a lucrative profession. The fact that we’re automating away our own industry is probably some kind of cosmic justice. But I think any working software engineer today is worrying about this question: what will be left for me to do, once AI agents have fully diffused into the industry?

The other reason it’s unseemly is that I’m probably going to be one of the last to go. As a staff engineer, my work has looked kind of like supervising AI agents since before AI agents were a thing: I spend much of my job communicating in human language to other engineers, making sure they’re on the right track, and so on. Junior and mid-level engineers will suffer before I do. Why hire a group of engineers to “be the hands” of a handful of very senior folks when you can rent instances of Claude Opus 4.6 for a fraction of the price?

Overshooting and undershooting

I think my next ten years are going to be dominated by one question: will the tech industry overshoot or undershoot the capabilities of AI agents?

If tech companies undershoot - continuing to hire engineers long after AI agents are capable of replacing them - then at least I’ll hold onto my job for longer. Still, “my job” will increasingly mean “supervising groups of AI agents”. I’ll spend more time reviewing code than I do writing it, and more time reading model outputs than my actual codebase.

If tech companies tend to overshoot, it’s going to get a lot weirder, but I might actually have a better position in the medium term. In this world, tech companies collectively realize that they’ve stopped hiring too soon, and must scramble to get enough technical talent to manage their sprawling AI-generated codebases. As the market for juniors dries up, the total number of experienced senior and staff engineers will stagnate, driving up the demand for my labor (until the models get good enough to replace me entirely).

Am I being too pessimistic?

Of course, the software engineering industry has looked like it was dying in the past. High-level programming languages were supposed to let non-technical people write computer code. Outsourcing was supposed to kill demand for software engineers in high-cost-of-living countries. None of those prophecies of doom came true. However, I don’t think that’s much comfort. Industries do die when they’re made obsolete by technology. Eventually a crisis will come along that the industry can’t just ride out.

The most optimistic position is probably that somehow demand for software engineers increases, because the total amount of software rises so rapidly, even though you now need fewer engineers per line of software. This is widely referred to as the Jevons effect. Along these lines, I see some engineers saying things like “I’ll always have a job cleaning up this AI-generated code”.

I just don’t think that’s likely. AI agents can fix bugs and clean up code as well as they can write new code: that is, better than many engineers, and improving each month. Why would companies hire engineers to manage their AI-generated code instead of just throwing more and better AI at it?

If the Jevons effect is true, I think we would have to be hitting some kind of AI programming plateau where the tools are good enough to produce lots of code (we’re here already), but not quite good enough to maintain it. This is prima facie plausible. Every software engineer knows that maintaining code is harder than writing it. But unfortunately, I don’t think it’s true.

My personal experience of using AI tools is that they’re getting better and better at maintaining code. I’ve spent the last year or so asking almost every question I have about a codebase to an AI agent in parallel while I look for the answer myself, and I’ve seen them go from hopeless to “sometimes faster than me” to “usually faster than me and sometimes more insightful”.

Right now, there’s still plenty of room for a competent software engineer in the loop. But that room is shrinking. I don’t think there are any genuinely new capabilities that AI agents would need in order to take my job. They’d just have to get better and more reliable at doing the things they can already do. So it’s hard for me to believe that demand for software engineers is going to increase over time instead of decrease.

Final thoughts

It sucks. I miss feeling like my job was secure, and that my biggest career problems would be grappling with things like burnout: internal struggles, not external ones. That said, it’s a bit silly for software engineers to complain when the automation train finally catches up to them.

At least I’m happy that I recognized that the good times were good while I was still in them. Even when the end of zero-interest rates made the industry less cosy, I still felt very lucky to be a software engineer. Even now I’m in a better position than many of my peers, particularly those who are very junior to the industry.

And hey, maybe I’m wrong! At this point, I hope I’m wrong, and that there really is some je ne sais quoi human element required to deliver good software. But if not, I and my colleagues are going to have to find something else to do.

edit: This post got some comments on Hacker News. Some commenters are doubtful, either because they don’t think AI coding is very good, or because they think human creativity/big-picture thinking/attention to detail will always be valuable. Others think ten years is way too optimistic. The top comment repeats the irony that I describe in the third paragraph of this post.

edit: This post also got some comments on the Serbian r/programming subreddit, some excellent comments on Tildes, which is a new one to me, and some more comments on lobste.rs.

LeMadChef (Denver, CO) comments:

My experience using the latest models (in May 2026) is not the same as the author's. Legacy code is still too high a hurdle for today's models. I am currently working on a Windows-to-web version of my application, and I've been struggling with a complex bit of code that is still not a 1:1 copy of the legacy code. I don't know how the legacy code works (I don't know all the edge conditions, but I do have access to the source) and, after two weeks, I still don't have a 100% compliant new version of the code that passes simple tests.

Can coding agents relicense open source through a “clean room” implementation of code?


5th March 2026

Over the past few months it’s become clear that coding agents are extraordinarily good at building a weird version of a “clean room” implementation of code.

The most famous version of this pattern is when Compaq created a clean-room clone of the IBM BIOS back in 1982. They had one team of engineers reverse engineer the BIOS to create a specification, then handed that specification to another team to build a new ground-up version.

This process used to take multiple teams of engineers weeks or months to complete. Coding agents can do a version of this in hours—I experimented with a variant of this pattern against JustHTML back in December.

There are a lot of open questions about this, both ethically and legally. These appear to be coming to a head in the venerable chardet Python library.

chardet was created by Mark Pilgrim back in 2006 and released under the LGPL. Mark retired from public internet life in 2011 and chardet’s maintenance was taken over by others, most notably Dan Blanchard who has been responsible for every release since 1.1 in July 2012.

Two days ago Dan released chardet 7.0.0 with the following note in the release notes:

Ground-up, MIT-licensed rewrite of chardet. Same package name, same public API — drop-in replacement for chardet 5.x/6.x. Just way faster and more accurate!

Yesterday Mark Pilgrim opened #327: No right to relicense this project:

[...] First off, I would like to thank the current maintainers and everyone who has contributed to and improved this project over the years. Truly a Free Software success story.

However, it has been brought to my attention that, in the release 7.0.0, the maintainers claim to have the right to “relicense” the project. They have no such right; doing so is an explicit violation of the LGPL. Licensed code, when modified, must be released under the same LGPL license. Their claim that it is a “complete rewrite” is irrelevant, since they had ample exposure to the originally licensed code (i.e. this is not a “clean room” implementation). Adding a fancy code generator into the mix does not somehow grant them any additional rights.

Dan’s lengthy reply included:

You’re right that I have had extensive exposure to the original codebase: I’ve been maintaining it for over a decade. A traditional clean-room approach involves a strict separation between people with knowledge of the original and people writing the new implementation, and that separation did not exist here.

However, the purpose of clean-room methodology is to ensure the resulting code is not a derivative work of the original. It is a means to an end, not the end itself. In this case, I can demonstrate that the end result is the same — the new code is structurally independent of the old code — through direct measurement rather than process guarantees alone.

Dan goes on to present results from the JPlag tool—which describes itself as "State-of-the-Art Source Code Plagiarism & Collusion Detection"—showing that the new 7.0.0 release has a max similarity of 1.29% with the previous release and 0.64% with the 1.1 version. Comparisons between earlier releases, by contrast, showed similarities in the 80-93% range.

He then shares critical details about his process, highlights mine:

For full transparency, here’s how the rewrite was conducted. I used the superpowers brainstorming skill to create a design document specifying the architecture and approach I wanted based on the following requirements I had for the rewrite [...]

I then started in an empty repository with no access to the old source tree, and explicitly instructed Claude not to base anything on LGPL/GPL-licensed code. I then reviewed, tested, and iterated on every piece of the result using Claude. [...]

I understand this is a new and uncomfortable area, and that using AI tools in the rewrite of a long-standing open source project raises legitimate questions. But the evidence here is clear: 7.0 is an independent work, not a derivative of the LGPL-licensed codebase. The MIT license applies to it legitimately.

Since the rewrite was conducted using Claude Code there are a whole lot of interesting artifacts available in the repo. 2026-02-25-chardet-rewrite-plan.md is particularly detailed, stepping through each stage of the rewrite process in turn—starting with the tests, then fleshing out the planned replacement code.

There are several twists that make this case particularly hard to confidently resolve:

  • Dan has been immersed in chardet for over a decade, and has clearly been strongly influenced by the original codebase.
  • There is one example where Claude Code referenced parts of the codebase while it worked, as shown in the plan—it looked at metadata/charsets.py, a file that lists charsets and their properties expressed as a dictionary of dataclasses.
  • More complicated: Claude itself was very likely trained on chardet as part of its enormous quantity of training data—though we have no way of confirming this for sure. Can a model trained on a codebase produce a morally or legally defensible clean-room implementation?
  • As discussed in this issue from 2014 (where Dan first openly contemplated a license change) Mark Pilgrim’s original code was a manual port from C to Python of Mozilla’s MPL-licensed character detection library.
  • How significant is the fact that the new release of chardet used the same PyPI package name as the old one? Would a fresh release under a new name have been more defensible?

I have no idea how this one is going to play out. I’m personally leaning towards the idea that the rewrite is legitimate, but the arguments on both sides of this are entirely credible.

I see this as a microcosm of the larger question around coding agents for fresh implementations of existing, mature code. This question is hitting the open source world first, but I expect it will soon start showing up in Compaq-like scenarios in the commercial world.

Once commercial companies see that their closely held IP is under threat I expect we’ll see some well-funded litigation.

LeMadChef (Denver, CO) comments:

My opinion is that this is not a valid "clean room" implementation. Since the original is open source, the LLM was almost certainly trained on the source code. One cannot claim "clean room" status with a tool that has been trained on the source.

This is like claiming you have a "clean room" implementation of Moby Dick when you have read Moby Dick several times in your life.

John Berkey, “The Sightless Bird”

[Image: an abstracted, mostly white spaceship that looks a little like a bird, complete with a head-like front portion and several swoops and splashes of bright orange that look like plumage.]


LeMadChef:
Love his art!

Why The Jeep CJ-2A Is The Best Off-Roader Ever, And Possibly Always Will Be


On this week’s installment of “Jeep Thoughts With David Tracy,” I want to talk about why I think the 1945 Willys CJ-2A may be the greatest off-road vehicle platform of all time.

The World War II Jeep may be the greatest Jeep of all time, but it’s not the best Jeep of all time. In fact, its successor, the civilian CJ-2A, was better. I am not disparaging the mighty Willys MB; it helped the Allies win World War II, and it set the standard for every single 4×4 to follow. The Land Rover Defender? The Ford Bronco? The Toyota Land Cruiser? The Nissan Patrol? All of these legends were inspired by the WWII Jeep.

But if I had to choose one vehicle to use both around the farm and off-road, I’d pick the civilian model that debuted for the 1945 model year, the CJ-2A. It got rid of the Willys MB’s low-hanging fuel tank:

That tank, by the way, is pretty annoying to fill:

It also deleted the needlessly complex fuel line that goes from the tank, up the firewall, around the engine bay, and finally into the carburetor:


The CJ-2A bolstered the MB’s frame quite a bit by boxing the C-channel, added a super useful rear tailgate where the MB had none, and, crucially, came with the T90 manual transmission, which is significantly more robust than the WWII Jeep’s.

Yes, the CJ-2A is slightly better than the WWII Jeep, a vehicle that I’ve argued remains one of the great off-road platforms of all time.

This has led me to conclude that the CJ-2A may very well be the greatest off-roader ever. What I mean by that is, I believe it is capable of traversing more obstacles than any other vehicle. Yes, some modern vehicles will get up dunes the Jeep can’t handle. Some will be able to power through deep mud pits the Jeep can’t handle. But when you factor in every type of terrain out there — rocks, mud, tight forests, brush — I believe the CJ-2A, especially if you added in a locking differential or two, can traverse the highest percentage of challenging obstacles. Here’s why.

Why The CJ-2A Is The GOAT Over Modern Off-Roaders

Image: Brandon Girmus

You might be thinking to yourself: How can an 80-year-old vehicle possibly still be the best off-roader when the world has the brand-new Jeep Wrangler, Ford Bronco, and Toyota Land Cruiser?

The answer to that requires a look at what makes a vehicle great off-road. I’ve written about this many times, but the most important attribute of a great off-road vehicle is favorable geometry. That means short overhangs, good ground clearance, a small belly, light weight, and small overall dimensions. After geometry, things like traction, torque, underbody protection, and articulation become important.
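To make those geometry terms concrete, here is a rough planar sketch of how approach and breakover angles fall out of overhangs, wheelbase, and clearance. The formulas are simplified and the dimensions are illustrative guesses, not measured CJ-2A specs:

```python
import math

def approach_angle(overhang_in: float, clearance_in: float) -> float:
    """Angle from the tire contact patch up to the lowest point of the overhang."""
    return math.degrees(math.atan2(clearance_in, overhang_in))

def breakover_angle(wheelbase_in: float, belly_clearance_in: float) -> float:
    """Angle of ridge the belly can crest between the axles without hanging up."""
    return 2 * math.degrees(math.atan2(belly_clearance_in, wheelbase_in / 2))

# Illustrative, CJ-2A-ish numbers: short overhangs and a short wheelbase
# produce strong angles even with modest ground clearance.
print(round(approach_angle(16, 9), 1))   # short front overhang
print(round(breakover_angle(80, 9), 1))  # short wheelbase, small belly
```

The point of the sketch: shrinking overhangs and wheelbase improves the angles directly, without needing the huge tires modern trucks rely on.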

Most modern off-roaders excel in the latter areas. Lockers have become commonplace on many SUVs, tires are better than they’ve ever been, engines are more powerful than ever (and transfer case/axle ratios are nice and short), disconnecting sway bars allow for great articulation, and skid plates abound. But what most modern off-roaders struggle with is weight (a two-door Wrangler weighs two tons!) and overall vehicle geometry. Let’s focus on the latter.

Image: Jeep

Because of demand for increased interior volume (and also the need to package more features), vehicles have become larger. In order to achieve great approach, departure, and breakover angles (plus ground clearance), many modern vehicles have to resort to humongous tires. Seriously, the very smallest tire you can get on a modern Jeep Wrangler is 32 inches (see above). The big-dog Jeep Wrangler Rubicon TJ from just 20 years ago offered 31s, and those were considered big:


With tall tires, the overall vehicle becomes taller, and to achieve decent stability you need a wide enough stance, which is why a Jeep Wrangler JL is a full seven inches wider than a Jeep Wrangler TJ from 20 years ago.

So yes, modern off-roaders have great ground clearance and approach/departure angles, but to pull that off they have to make significant compromises on overall vehicle geometry.


“But what about that TJ you keep mentioning? That’s small, right?” And the answer is yes. The Jeep Wrangler TJ is one of the best off-roaders ever because it combines compact overall size with the amazing suspension pioneered by the Jeep Cherokee XJ and ZJ.

The TJ Rubicon is absolutely one of the off-road kings, with factory lockers, a 4:1 low range in the transfer case, skid plates, great approach/departure/breakover angles, and small overall dimensions. But I still think the CJ-2A is better.

Why The CJ-2A Is The GOAT Over Other Jeep CJs/Early Wranglers

It’s fairly obvious why I’d say the Willys is a stronger off-road platform than a new vehicle given how large vehicles have become and how heavily laden with safety tech they are. But what about other CJs like the CJ-5, CJ-7, and what about the Wrangler YJ and TJ?


For one, I can tell you as a matter of fact that the CJ-2A is better off-road than the YJ, because I have owned both, and the YJ does not come close. Its track bars limit articulation from the four leaf springs, its lower profile limits ground clearance, and frankly this results in an off-road experience that involves lots of scraping and lots of tire-lifts. I love my YJ, and it’s fun off-road, but a 1945 Willys CJ-2A crushes it on the hard stuff.

Even if you removed those track bars, the YJ couldn’t hold up. CJ-3Bs, CJ-5s, and CJ-7s may all offer more power than a CJ-2A, but the one area that they — along with any Jeep after 1953 (and also Land Cruisers and the like) — struggle with is top-heaviness.

Having short overall length is a boon for an off-roader. It allows the vehicle to push the wheels out towards the bumpers to create great approach and departure angles, without yielding a huge belly that creates a poor breakover angle. Being short is good.


The issue is that, in order to maintain stability up steep grades, a short vehicle has to have a low center of gravity. It’s why the vehicles you most often see roll over when off-roading are Jeep CJ-5s, CJ-7s, YJs, and TJs. They’re not as short as an old CJ-2A, but they’re still quite short, and with their overhead valve engines, their center of gravity is higher than that of the early, flathead-engined CJs.
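The short-but-tall tradeoff can be sketched with the classic static tip-over relation: a rigid vehicle starts to tip when the slope angle exceeds the arctangent of half-track over center-of-gravity height. Track width and CG heights below are illustrative guesses, not measured figures:

```python
import math

def tip_angle_deg(track_in: float, cg_height_in: float) -> float:
    """Static side-slope angle at which a rigid vehicle begins to tip over."""
    return math.degrees(math.atan2(track_in / 2, cg_height_in))

# Same narrow track, two guessed CG heights: a low flathead layout versus
# a taller OHV-era layout. A few inches of CG height costs several degrees.
print(round(tip_angle_deg(48.5, 22), 1))  # flathead-low center of gravity
print(round(tip_angle_deg(48.5, 26), 1))  # taller OHV-era center of gravity
```

Real-world tipping also depends on suspension flex, tires, and dynamics, but the static formula shows why a lower engine matters so much on a narrow, short vehicle.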

 

A post shared by The Autopian (@theautopian)

This really began in 1953 with the CJ-3B. Overhead valves are a great thing for engine efficiency/power, but when we’re talking about pure off-road capability, you really don’t need a ton of power — you need thrust at the wheels. The CJ-2A creates this with a fairly torquey engine and also super short gearing in the axles: 5.38 to 1.
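“Thrust at the wheels” is just engine torque multiplied through the gearing and divided by tire radius. A rough sketch with ballpark numbers (the torque, gear ratios, and tire size here are illustrative guesses, not verified factory specs):

```python
def wheel_thrust_lbf(torque_lbft: float, trans: float, tcase: float,
                     axle: float, tire_radius_ft: float,
                     efficiency: float = 0.85) -> float:
    """Tractive force at the tires: torque multiplied through the driveline."""
    return torque_lbft * trans * tcase * axle * efficiency / tire_radius_ft

# Go-Devil-ish guesses: ~105 lb-ft of torque, first gear ~2.8, low range
# ~2.43, 5.38 axles, ~14-inch tire radius. Thousands of pounds of thrust
# from a very modest engine.
print(round(wheel_thrust_lbf(105, 2.8, 2.43, 5.38, 14 / 12)))
```

This is why short axle gearing like 5.38:1 matters more for crawling than horsepower does: every ratio in the chain multiplies torque before it reaches the ground.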


In 1953, the CJ-3B introduced the F-head overhead valve engine, which I quite love. But in order to package this engine, Willys-Overland had to extend the hood by about four inches, which required extending the cowl, which raised the windshield, which required a higher seating position. It was a chain reaction that led to a significantly taller machine. This can be an issue when climbing steep grades.

The other downside that many Jeep CJs and Wranglers have compared to the Willys CJ-2A is that they feature a roll bar. This is great for safety, of course, but if we’re talking about pure capability, being able to lower the windshield and create a slim profile that you can maneuver under downed trees and other low obstacles is a significant help.

The CJ-2A Has The Perfect Off-Road Silhouette


The CJ-2A is far from a truly perfect off-road vehicle. The leaf spring shackles limit approach and departure angle, the leaf springs themselves don’t flex as well as a coil spring setup, the frame flexes well but not quite as well as a flimsy WWII Jeep frame, and the Go-Devil under the hood doesn’t make a ton of power.

But the geometry is pretty much perfect. The rocker panels are way up high, as are most of the mechanical bits (many other Jeeps may technically have more ground clearance, but their rockers and more of their bellies are lower). The folding windshield and lack of a roll bar keep the profile low, the overall length is super short, the flathead motor keeps the center of gravity down, the width is really narrow, all the mechanicals are tucked up high, and weight is only 2,500 pounds. Combine this amazing geometry with a torquey engine, a low-range transfer case, and solid axles that flex well enough, and you’re basically a set of lockers away from being invincible.

It’s very unlikely that there will be a future vehicle that is as geometrically perfect as the CJ-2A. Gas engines have gotten too large, so hoods will be high. EVs will never come with two solid axles. A flat gas engine might solve this, but even then, the number of safety mandates in modern cars pretty much means that there will never be a vehicle as simple, as small, as lightweight, and as low as a CJ-2A.

Image: Barrett Jackson

There are plenty of vehicles that are great off-road, and many are right there with the CJ-2A. With lockers, many will outperform the CJ-2A on certain terrain. But as a platform, the CJ-2A (and its CJ-3A and M38 derivatives) is as close to off-road geometric perfection as there is. And there may not be a single vehicle out there that, with nothing but a simple mod or two (like a Lock Right locker), can handle a diversity of off-road obstacles as well as this first civilian Jeep.

 

The post Why The Jeep CJ-2A Is The Best Off-Roader Ever, And Possibly Always Will Be appeared first on The Autopian.
