
What I learned roleplaying as a rogue AI
At a conference about “AI control,” discussions and games explored ways to control untrustworthy AI
25 total articles in the archive

Transformer Weekly: US-China talks, AI executive order, and Anthropic’s $900b valuation

After the AI super PAC endorsed her and two other Democrats, Rep. Val Hoyle went back and forth on whether she was happy with…


Palantir’s fiery rhetoric helps mystify its mostly mundane tech — propping up its share price and preserving its national…


Some employees are speaking out over the agreement allowing “all lawful use” of Google’s AI technologies

A broader base may be the only way for the AI safety field to get what it wants



Transformer Weekly: Debate after Altman attacks, lots more money for AI PACs and AISI’s role in UK AI investment

Opinion: Jess Miers and Ray Yeh argue holding AI companies liable for how they deal with mental health could backfire: escalating…

The company’s money isn’t allowed to be used in the midterm battles. Without it, pro-safety candidates may be even more outgunned…

Transformer Weekly: The battle for Gottheimer, OpenAI’s ‘New Deal’, and Meta’s new model

Come join us to help our journalism reach the people who need it

Anthropic’s new model is its “best-aligned” yet. But when it does misbehave, things get weird

Transformer Weekly: OpenAI buys TBPN, Cantwell’s open to negotiations, and SpaceX’s mega IPO



Opinion: Konrad Körding and Ioana Marinescu from the University of Pennsylvania argue artificial intelligence will likely have a…

Progress on ensuring models are in step with humans has calmed nerves. But some of the biggest problems are far from solved, and…

A diversity of motivations presents both a challenge and an opportunity for those protesting AI

AI was meant to give us more time off, but instead many are finding it compels them to take on more and more work

