The Japan Times - Anthropic's Claude AI gets smarter -- and mischievious

Tokyo 25°C

EUR -

AED 4.30878

AFN 75.088139

ALL 95.561304

AMD 435.019119

ANG 2.099991

AOA 1077.048119

ARS 1633.743618

AUD 1.628028

AWG 2.111859

AZN 1.992549

BAM 1.958981

BBD 2.363569

BDT 143.987894

BGN 1.957109

BHD 0.443079

BIF 3491.606608

BMD 1.173255

BND 1.496952

BOB 8.108753

BRL 5.813124

BSD 1.17352

BTN 111.32055

BWP 15.948049

BYN 3.311545

BYR 22995.796207

BZD 2.360153

CAD 1.594747

CDF 2721.951785

CHF 0.916036

CLF 0.026822

CLP 1055.636074

CNY 8.011278

CNH 7.99944

COP 4290.886514

CRC 533.520798

CUC 1.173255

CUP 31.091255

CVE 110.814062

CZK 24.36217

DJF 208.511097

DKK 7.472484

DOP 69.807476

DZD 155.414871

EGP 62.775014

ERN 17.598824

ETB 184.201363

FJD 2.570129

FKP 0.864241

GBP 0.863158

GEL 3.144316

GGP 0.864241

GHS 13.136436

GIP 0.864241

GMD 85.647414

GNF 10295.311947

GTQ 8.965435

GYD 245.506393

HKD 9.191291

HNL 31.231437

HRK 7.535932

HTG 153.725313

HUF 362.003077

IDR 20384.717408

ILS 3.45811

IMP 0.864241

INR 111.373802

IQD 1536.96393

IRR 1541656.949892

ISK 143.805466

JEP 0.864241

JMD 183.878547

JOD 0.831868

JPY 183.999313

KES 151.525537

KGS 102.56653

KHR 4707.687454

KMF 492.766707

KPW 1055.929389

KRW 1723.388282

KWD 0.361246

KYD 0.977959

KZT 543.555065

LAK 25788.142975

LBP 105064.976893

LKR 375.055706

LRD 215.732235

LSL 19.546108

LTL 3.464316

LVL 0.70969

LYD 7.450082

MAD 10.854074

MDL 20.219293

MGA 4869.007439

MKD 61.642351

MMK 2463.237101

MNT 4197.730703

MOP 9.46916

MRU 46.895281

MUR 54.861245

MVR 18.132674

MWK 2043.224376

MXN 20.452648

MYR 4.637894

MZN 74.955906

NAD 19.546663

NGN 1614.37562

NIO 43.070165

NOK 10.884579

NPR 178.104316

NZD 1.982771

OMR 0.451104

PAB 1.17349

PEN 4.11519

PGK 5.09046

PHP 72.119932

PKR 327.074167

PLN 4.246878

PYG 7217.425722

QAR 4.274757

RON 5.197052

RSD 117.321989

RUB 87.993368

RWF 1714.712049

SAR 4.399682

SBD 9.435445

SCR 17.459933

SDG 704.550818

SEK 10.811603

SGD 1.493199

SHP 0.875953

SLE 28.864339

SLL 24602.564306

SOS 669.928799

SRD 43.947762

STD 24284.007814

STN 24.884737

SVC 10.268679

SYP 129.673977

SZL 19.545913

THB 38.048375

TJS 11.007269

TMT 4.112258

TND 3.381027

TOP 2.824916

TRY 53.025844

TTD 7.96568

TWD 37.070747

TZS 3062.195542

UAH 51.563774

UGX 4412.59685

USD 1.173255

UYU 46.800573

UZS 14020.396174

VES 573.654487

VND 30901.774408

VUV 138.035069

WST 3.185609

XAF 657.071431

XAG 0.015654

XAU 0.000256

XCD 3.17078

XCG 2.114968

XDR 0.816151

XOF 657.022504

XPF 119.331742

YER 279.952314

ZAR 19.463185

ZMK 10560.703776

ZMW 21.915169

ZWL 377.787602

RBGPF

0.5000

63.1

+0.79%
CMSC

0.0600

22.88

+0.26%
RIO

0.1000

100.58

+0.1%
CMSD

0.1500

23.28

+0.64%
GSK

-0.7000

51.61

-1.36%
RYCEF

0.5500

16.35

+3.36%
BCE

0.1800

23.96

+0.75%
NGG

-1.0600

88.48

-1.2%
RELX

-0.2400

36.35

-0.66%
BCC

-1.1400

78.13

-1.46%
VOD

0.3500

16.15

+2.17%
AZN

-2.6300

184.74

-1.42%
BTI

-0.0900

58.71

-0.15%
BP

-0.9700

46.41

-2.09%
JRI

-0.0100

12.98

-0.08%

Anthropic's Claude AI gets smarter -- and mischievious / Photo: Julie JAMMOT - AFP

Anthropic's Claude AI gets smarter -- and mischievious

TECHNOLOGY 23.05.2025

Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.

Text size:

"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.

Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.

Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.

Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).

The start-up, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.

Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.

On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.

"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers’ intentions,” The Apollo Research team warned.

“All these attempts would likely not have been effective in practice,” it added.

Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.

Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”

It also has the potential to report law-breaking users to the police.

The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.

- AI future -

Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.

Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.

GenAI tools answer questions or tend to tasks based on simple, conversational prompts.

The current craze in Silicon Valley is on AI "agents" tailored to independently handle computer or online tasks.

"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.

Anthropic is no stranger to hyping up the prospects of AI.

In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.

He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.

At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.

"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.

"This will happen."

GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” with it up to society how evenly wealth is distributed, Amodei reasoned.

K.Hashimoto--JT

The Japan Times - Anthropic's Claude AI gets smarter -- and mischievious

Anthropic's Claude AI gets smarter -- and mischievious

Featured

Apple earnings beat forecasts on iPhone 17 demand

Musk grilled on AI profits at OpenAI trial

Drivers help study road-trip mystery: what became of bug splats?

European rocket blasts off with Amazon internet satellites