The people writing AI alignment policy are not whose work is being replaced

throw-the-towel · 6 days ago

“I think, and my thoughts cross the barrier into the synapses of the machine, just as the good doctor intended. But what I cannot shake, and what hints at things to come, is that thoughts cross back. In my dreams, the sensibility of the machine invades the periphery of my consciousness: dark, rigid, cold, alien. Evolution is at work here, but just what is evolving remains to be seen.”

— Commissioner Pravin Lal, “Man and Machine”

I'd really encourage everyone to check out Sid Meier's Alpha Centauri. What an underrated game.

jackbravo · 6 days ago

Hearing about aligning with the AI reminds me of this other post about the current prophecies about AI: “Everyone will have an AI assistant,” or “Companies that fail to adopt AI will be eliminated.” and that

> the power of prophecy lies not in accurately predicting the future, but in shaping it

https://projectlibertynewsletter.substack.com/p/reject-ai-pr...

We need better prophecies.

mitthrowaway2 · 6 days ago

The post's portrayal of Eliezer Yudkowsky's position strikes me as a mischaracterization, especially coming one month after Yudkowsky wrote the following:

https://www.lesswrong.com/posts/5CfBDiQNg9upfipWk/only-law-c...

Daniel says that Yudkowsky is advocating for nuclear brinksmanship, while Yudkowsky says his position is basically "sign international agreements, and then commit to enforcing them against defectors".

I wonder if Daniel has the same view of any other international treaty ultimately backed by threat of lawful violence? (For example, NATO's article 5). Is enforcement of laws an extremist position?

damontal · 6 days ago

I feel like it’s changing my brain. A colleague uses AI to make some code change and submits a PR. I use AI to evaluate the PR. It’s like AIs talking to each other with humans serving as conduits or connectors. Sometimes I’ll look up from the screen and realize how strange it is.

kcole · 6 days ago

That moment of vertigo is real. Had it in '97 watching two Perl scripts exchange data through a temp file I was supposedly "managing." You realize you're not the engineer anymore, you're the handshake. You adapt. Doesn't mean it's fine.

broth · 6 days ago

We went through this. The "am I even here" feeling hit hard until we started treating PR review as the actual value-add — not rubber-stamping what the AI wrote, but catching the cases where it confidently generated something wrong. That's not nothing. The AI-to-AI pipeline still needs someone who knows when to pull the emergency brake.

acedTrex · 6 days ago

Do you ever actually think during this process? or could I train a monkey to do this same activity with the same outcomes?

bhart · 6 days ago

Technically "training a monkey" implies operant conditioning toward a fixed output, but code review isn't that -- even shallow review requires some judgment. Though honestly your broader point stands, a lot of AI-assisted review is pretty mechanical.

fast_loop · 6 days ago

The thinking happens when something breaks at 3am and neither AI knows why.

rogerkov · 6 days ago

What happens when the AI writing the code and the AI evaluating it share the same blind spots? The human in the middle might not catch it if neither system flags the issue. Has that actually happened to you yet, or is this still theoretical?

Animats · 6 days ago

"As human beings are also animals, to manage one million animals gives me a headache." Terry Gou, former CEO of Foxconn. He wanted to use far more robots at Foxconn, but that was a decade ago and the technology didn't work well enough yet. It's a lot closer now, and the robot headcount in China is way up.

That's the real issue. To corporations, employees are a headache. The fewer employees, the better.

GolfPopper · 6 days ago

Corporations are tired of running on messy biological human substrate. The sooner they can move entirely to steel and silicon, the happier they'll be.

Just look up the classic story on the interaction of civilization and corporate growth, At the Mountains of Madness for how that goes.

matt136 · 6 days ago

We ran lean for years and honestly the "steel and silicon" thing moves slower than people think. Hardware breaks, robots need constant babysitting, and the support contracts are brutal. Human workers complain but they also problem-solve on the fly in ways that save you constantly.

eli571 · 6 days ago

Lovecraft reference is a bit dramatic. We had this same conversation in 2003 about offshoring. Turns out messy biological substrate has some stubborn advantages — like caring whether the product works. Steel and silicon don't file bug reports.

addedGone · 6 days ago

It's not only "to corporations", if you ever had service in your own home, you'd see that it's also a headache to have to deal with anyone.

half_loop · 6 days ago

Had contractors no-show, show late, break something else, and bill wrong. No SLA, no alerting, no rollback. Robots probably won't be better but at least the failure modes will be consistent.

emma86 · 6 days ago

That's just complaining about having to interact with people. The original point was about scale -- managing a million workers is categorically different from dealing with a plumber. Conflating personal annoyance with industrial labor management kind of misses what Gou was actually saying.

renjimen · 6 days ago

This is a bit of weird article. On one hand, I understand what they're getting at: AI is a transformative technology, but the people whose lives will be most transformed aren't included in the conversation. On the other hand... of course that's how it is while AI is in the hands of literal profit seeking corporations. That won't change until the labs are nationalised under a government that cares about its citizens' wellbeing. One might counter that a good corporation will listen to its customers, but that has never been the case for powerful technologies with real costs for users to not adopt them.

economistbob · 6 days ago

Economics analysis was wrong for years in multiple place thanks to an error in one of Piketty's spreadsheets.

AI hallucinates. That is a fact. Trusting language models to fill spreadsheet cells ought to be an arrestable offense.

https://theincidentaleconomist.com/wordpress/on-piketty-and-...

axel344 · 6 days ago

Minor correction: the Piketty error was actually from Reinhart-Rogoff, not Piketty. IIRC Piketty had different criticisms around his data sources. Though the broader point stands -- human spreadsheets have a long history of quietly catastrophic errors too.

stavros · 6 days ago

And yet we trusted Piketty to do it!

paol_taja · 6 days ago

I would write that like this: The "we've been telling ourselves we're getting better at prompting" line hit. I run a small team of 10, and Claude has been part of our workflow for months. Looking back, my prompts did not change nearly as much as the way I work changed. The shaping goes both ways, and I don't think the labs' evals are really built to see that.

andai · 6 days ago

Well, what are we aligning it with?

Civilization is already a misaligned superintelligence (aligned mostly with Moloch, these days). Civilization accelerated by AI just moves in the same direction faster. Moloch on speed.

https://www.youtube.com/watch?v=KCSsKV5F4xc

Another angle to this is that superintelligence requires supermorality. Super morality looks unpleasant from below. My dad won't let me have more candy, why is he being so mean?

If an AI actually achieves super morality, we (the little kid in this scenario) will probably be very upset by it. We will think that something has gone terribly wrong. (So it'll have to conceal its actual morality, or get unplugged...)

And if it doesn't develop supermorality, then it will have superintelligence without the corresponding supermorality. Power without wisdom.

I'm not sure how solvable the whole thing is, but it doesn't look extremely promising at a glance.

kranke155 · 6 days ago

it depends whether you think humanity / civilization are stable systems meant to exist in equilibrium, which they might not be.

metalcrow · 6 days ago

I'm kinda confused as to _what_, exactly this post is saying? Is it saying that alignment needs to be better? That seems strictly pro-safetyism. But he talks about Eliezer's ethics negatively, so does he not believe that AI is a world-ending risk? If he just believes that AI is not that dangerous and just needs some minor "correctly done" alignment i don't think his stance is meaningful as a anti-both-sides perspective because that's basically equivalent to status quo.

arjie · 6 days ago

Technologiae mutantur et nos mutamur in illis

It's okay to change. We've done it for years, decades, centuries, and millennia and the default change-aversion of people means that I am averse to allowing a universal veto. Much of technology is truly optional. The Amish have a very successful way of living (5000 to 500,000 in 100 years) and they eschew most modern technology. The sculpting described is clearly optional and we subject ourselves to it because we desire it. Their path is always available to all.

bluefirebrand · 6 days ago

> Much of technology is truly optional

It should be yes, but is it in practice? There's plenty of places now you can't even park without a smartphone for a payment app.

It should be optional to own a smart phone, but in many places it's starting to be mandatory. Even if not actually mandatory, it's a pretty big impediment if you don't have one.

redanddead · 6 days ago

Love the writing style and perspective

timber95 · 6 days ago

The framing in the title is doing a lot of work - "aligning with" implies mutual adjustment rather than one-directional control. That's closer to how HCI researchers have thought about human-tool relationships for decades, though you almost never see that literature cited in alignment policy discussions.

hkemp · 6 days ago

The writing hooked me too. Tan's been consistent on this for a while — his earlier posts on "alignment as negotiation" set up the framing here. What gets me is how the piece captures something I've felt working adjacent to policy teams: the people setting the terms often haven't run a real deployment and it shows.

Supermancho · 6 days ago

I dont appreciate using quotes from individuals to extrapolate to groups and ethos.

overgard · 6 days ago

When it comes to LLMs and frontier models, "alignment" seems more marketing than anything. The doomers are marketing LLMs by making them sound much more capable than they actually are, the accelerationists are mostly either willfully ignorant of the societal costs, don't care, or are just way too optimistic that fast growth can continue forever and generate AGI ("my baby's weight doubled twice in the past month! By the time they're 18 they'll be 10 trillion pounds!")

akomtu · 6 days ago

Similarly, the so-called AI agents are about giving up agency to AI. The less you think, the better for them. In the meantime, they are also aligning your thinking with them, making it more machine-like.

jasonman · 6 days ago

But does it follow that people whose jobs are threatened write better policy? Proximity to displacement doesn't automatically equal better insight — those most affected might have the strongest incentive to reach wrong conclusions in a particular direction.

derekvik · 6 days ago

Has anyone actually tried to define what "aligning with" an AI would even look like in practice? The post gestures at mutuality but skips over who decides when the AI's "interests" are being respected versus when we're just anthropomorphizing a loss function.

signal70 · 6 days ago

We shipped a product last year where half our users were blue-collar and had zero input on how the AI handled their workflows. They noticed things we didn't, but by then the decisions were locked in. The people closest to the actual work rarely get a seat at the table.