The Japan Times - Anthropic's Claude AI gets smarter -- and mischievious

EUR -
AED 4.229626
AFN 72.557604
ALL 96.200283
AMD 434.304194
ANG 2.061644
AOA 1056.111273
ARS 1608.366971
AUD 1.624462
AWG 2.075944
AZN 1.961012
BAM 1.959872
BBD 2.316914
BDT 141.153259
BGN 1.968616
BHD 0.434975
BIF 3415.570318
BMD 1.151703
BND 1.471489
BOB 7.977574
BRL 6.023521
BSD 1.150395
BTN 106.10737
BWP 15.685657
BYN 3.42682
BYR 22573.37436
BZD 2.313607
CAD 1.577706
CDF 2608.606438
CHF 0.906401
CLF 0.026516
CLP 1047.036065
CNY 8.011532
CNH 7.927786
COP 4266.390788
CRC 540.339027
CUC 1.151703
CUP 30.520123
CVE 110.495044
CZK 24.447537
DJF 204.846478
DKK 7.472351
DOP 70.218019
DZD 152.293142
EGP 60.314344
ERN 17.275542
ETB 181.205966
FJD 2.548085
FKP 0.865883
GBP 0.864249
GEL 3.132339
GGP 0.865883
GHS 12.521068
GIP 0.865883
GMD 84.64982
GNF 10085.259587
GTQ 8.817357
GYD 240.800286
HKD 9.024915
HNL 30.45433
HRK 7.536975
HTG 150.776526
HUF 390.904627
IDR 19546.066035
ILS 3.578709
IMP 0.865883
INR 106.404091
IQD 1506.930794
IRR 1521456.949262
ISK 143.444364
JEP 0.865883
JMD 180.956741
JOD 0.816554
JPY 183.182895
KES 149.25565
KGS 100.716474
KHR 4612.683422
KMF 494.080561
KPW 1036.583062
KRW 1717.137006
KWD 0.353285
KYD 0.958592
KZT 555.504113
LAK 24686.288142
LBP 103012.919266
LKR 358.214225
LRD 210.506434
LSL 19.352807
LTL 3.400679
LVL 0.696653
LYD 7.373351
MAD 10.807353
MDL 20.015584
MGA 4788.970338
MKD 61.646389
MMK 2418.752297
MNT 4116.758787
MOP 9.277475
MRU 45.865285
MUR 53.692156
MVR 17.805285
MWK 1994.352117
MXN 20.347536
MYR 4.512364
MZN 73.59289
NAD 19.352807
NGN 1574.711229
NIO 42.33015
NOK 11.076035
NPR 169.776624
NZD 1.970322
OMR 0.442828
PAB 1.15039
PEN 3.97095
PGK 4.960413
PHP 68.687266
PKR 321.348828
PLN 4.260298
PYG 7466.7073
QAR 4.204854
RON 5.092139
RSD 117.408061
RUB 94.300137
RWF 1678.895356
SAR 4.324546
SBD 9.273119
SCR 15.398642
SDG 692.173095
SEK 10.712771
SGD 1.471444
SHP 0.864075
SLE 28.332368
SLL 24150.643776
SOS 656.266306
SRD 43.271205
STD 23837.922132
STN 24.551755
SVC 10.065913
SYP 127.696075
SZL 19.338261
THB 37.263379
TJS 11.043195
TMT 4.036718
TND 3.397774
TOP 2.773023
TRY 50.912745
TTD 7.801208
TWD 36.762926
TZS 3005.944222
UAH 50.714084
UGX 4343.023049
USD 1.151703
UYU 46.76696
UZS 13908.897074
VES 513.943044
VND 30289.782943
VUV 137.728848
WST 3.172031
XAF 657.325511
XAG 0.014343
XAU 0.00023
XCD 3.112535
XCG 2.073207
XDR 0.817502
XOF 657.325511
XPF 119.331742
YER 274.684228
ZAR 19.245057
ZMK 10366.706959
ZMW 22.402543
ZWL 370.847823
  • RBGPF

    0.1000

    82.5

    +0.12%

  • RYCEF

    0.3800

    16.5

    +2.3%

  • AZN

    2.1100

    192.01

    +1.1%

  • BTI

    1.0100

    60.94

    +1.66%

  • CMSC

    0.0000

    22.99

    0%

  • GSK

    0.3800

    53.77

    +0.71%

  • RELX

    0.3300

    34.47

    +0.96%

  • BCE

    0.6521

    25.9

    +2.52%

  • CMSD

    -0.0400

    22.95

    -0.17%

  • RIO

    2.0300

    89.86

    +2.26%

  • NGG

    -0.0100

    90.89

    -0.01%

  • VOD

    0.1900

    14.6

    +1.3%

  • BCC

    1.7200

    71.72

    +2.4%

  • BP

    0.2300

    42.9

    +0.54%

  • JRI

    -0.0500

    12.54

    -0.4%

Anthropic's Claude AI gets smarter -- and mischievious
Anthropic's Claude AI gets smarter -- and mischievious / Photo: Julie JAMMOT - AFP

Anthropic's Claude AI gets smarter -- and mischievious

Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.

Text size:

"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.

Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.

Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.

Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).

The start-up, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.

Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.

On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.

"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers’ intentions,” The Apollo Research team warned.

“All these attempts would likely not have been effective in practice,” it added.

Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.

Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”

It also has the potential to report law-breaking users to the police.

The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.

- AI future -

Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.

Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.

GenAI tools answer questions or tend to tasks based on simple, conversational prompts.

The current craze in Silicon Valley is on AI "agents" tailored to independently handle computer or online tasks.

"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.

Anthropic is no stranger to hyping up the prospects of AI.

In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.

He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.

At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.

"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.

"This will happen."

GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” with it up to society how evenly wealth is distributed, Amodei reasoned.

K.Hashimoto--JT