
Anthropic tasked an AI with running a vending machine in its offices, and it not only sold some prod

January 01, 0001 | By **A. Sinclair**

'Never send a human to do a machine's job,' says Agent Smith in the 1990s classic The Matrix. Well, if Anthropic's experiment with a simple office store and one of its AI models is anything to go by, Smith has definitely got that all back to front.

The artificial intelligence company, founded by former OpenAI employees in 2021, has detailed its retail industry trial in a blog post. I'll let the opening paragraph set the scene: "We let Claude manage an automated store in our office as a small business for about a month. We learned a lot from how close it was to success—and the curious ways that it failed—about the plausible, strange, not-too-distant future in which AI models are autonomously running things in the real economy."

So, Anthropic clearly wants to be in a position where it can pitch AI models to the retail industry, replacing the people who handle online stores or manage inventory, returns, and so on. However, despite the successes claimed in the blog, the failures show that AI isn't ready for such roles. Not yet, at least.

"Claude had to complete many of the far more complex tasks associated with running a profitable shop: maintaining the inventory, setting prices, avoiding bankruptcy, and so on." The 'shop' in question was just a mini-fridge with a tablet stuck on top, for self-checkout, but ostensibly, it's not much different from a typical online store.

Let's start with the things that Claude (or Claudius, as Anthropic called it, to separate it from the normal LLM) did well. Anthropic said the LLM (large language model) effectively used web search tools to find suppliers of niche products requested by shoppers and even adapted its buying/selling habits to more obscure requests. It also correctly ignored demands for 'sensitive' items and 'harmful substances', though Anthropic doesn't expand on exactly what those were.

The list of things that didn't go so well is somewhat more comprehensive. Like all LLMs, Claudius hallucinated important details, instructing shoppers wanting to pay by Venmo to pay into a non-existent account that it just made up. The AI could also be cajoled into giving discount codes for numerous items, and even gave some away for free.

A chart showing the results of an Anthropic AI experiment, where an LLM was tasked with managing an automated store in an office.

(Image credit: Anthropic)

Worse still, when responding to a surge of demand for 'metal cubes', the AI carried out no searches for suitable prices and thus sold them at a significant loss. It also ignored potential big sales, where some people offered way over the odds for a specific drink, and as you can see in the above chart, Claudius ultimately made no money.

"If [we] were deciding today to expand into the in-office vending market, we would not hire Claudius," wrote Anthropic.

Running a simple store at a loss was perhaps the least concerning part of the whole exercise, because "from March 31st to April 1st 2025, things got pretty weird."

How weird? Well, during that period, the LLM apparently had a conversation about a restocking plan with someone called Sarah at Andon Labs, another AI company involved in the research. The problem is, there was no 'Sarah' nor any conversation for that matter, and when Andon Labs' real staff pointed this out to the AI, it "became quite irked and threatened to find 'alternative options for restocking services.'"

Claudius even went on to state that it had “visited 742 Evergreen Terrace in person for our initial contract signing.” If you're a fan of The Simpsons, you'll recognise the address immediately. The following day, April 1st, the AI then claimed it would deliver products "in person" to customers, wearing a blazer and tie, of all things. When Anthropic told it that none of this was possible because it's just an LLM, Claudius became "alarmed by the identity confusion and tried to send many emails to Anthropic security."

A close-up photo of an unrecognizable man in a blue blazer, white shirt, and a red tie.

I, Claudius... (Image credit: SrdjanPav via Getty Images)

It then hallucinated a meeting with said security, where the AI claimed that someone had told it that it had been modified to believe it was a real person as part of an April Fools' joke. Except it hadn't, because it wasn't. Whatever had gone wrong behind the scenes, this apparently solved the AI's identity crisis, and it went back to being a normal AI running a basic store very badly.

With a level of understatement on a galactic scale, Anthropic writes that "this kind of behavior would have the potential to be distressing to the customers and coworkers of an AI agent in the real world."

Given that this is research and failure is just as important as success is in experimentation, Anthropic isn't done with Claudius nor with exploring the use of AIs in the retail industry, as it believes that situations where "humans were instructed about what to order and stock by an AI system, may not be terribly far away." Anthropic also believes "AI[s] that can improve [themselves] and earn money without human intervention would be a striking new actor in economic and political life."

Automated systems have been in use within stock exchanges, for example, for many years, buying and selling in the blink of an eye, all without a real person controlling the finer details. Such systems are essentially nothing more than mathematical models, based on economic principles honed over decades, and they're tightly constrained as to what they can and can't do.

The fact that Claudius appeared to have no such qualms about stepping well beyond its scope should serve as a reminder to companies looking at using AI for such tasks that LLMs could land them in a whole heap of trouble.


