The Japan Times - AI's blind spot: tools fail to detect their own fakes

EUR -
AED 4.202414
AFN 73.234648
ALL 93.94534
AMD 420.679135
ANG 2.048746
AOA 1049.891271
ARS 1708.316969
AUD 1.651217
AWG 2.062589
AZN 1.948912
BAM 1.955703
BBD 2.305386
BDT 141.133
BGN 1.934863
BHD 0.431579
BIF 3404.631133
BMD 1.144293
BND 1.477127
BOB 7.926607
BRL 5.915774
BSD 1.144643
BTN 109.047591
BWP 15.438234
BYN 3.321035
BYR 22428.147579
BZD 2.302086
CAD 1.624839
CDF 2570.082927
CHF 0.916597
CLF 0.026912
CLP 1059.177465
CNY 7.768723
CNH 7.764608
COP 3849.009092
CRC 521.474135
CUC 1.144293
CUP 30.323771
CVE 110.259531
CZK 24.195741
DJF 203.82989
DKK 7.478638
DOP 67.806637
DZD 152.604431
EGP 56.395203
ERN 17.164399
ETB 183.546696
FJD 2.586617
FKP 0.856955
GBP 0.854556
GEL 3.015225
GGP 0.856955
GHS 13.003355
GIP 0.856955
GMD 82.962963
GNF 10038.502097
GTQ 8.735567
GYD 239.428125
HKD 8.97658
HNL 30.63648
HRK 7.538035
HTG 149.712574
HUF 353.483867
IDR 20590.870346
ILS 3.431335
IMP 0.856955
INR 108.954451
IQD 1499.425629
IRR 1574490.289046
ISK 144.089783
JEP 0.856955
JMD 181.201013
JOD 0.81129
JPY 184.648901
KES 148.002659
KGS 100.065813
KHR 4583.772648
KMF 493.190359
KPW 1029.86432
KRW 1749.366875
KWD 0.355063
KYD 0.953953
KZT 541.303152
LAK 25845.718069
LBP 102500.516042
LKR 383.390984
LRD 207.749696
LSL 18.566079
LTL 3.3788
LVL 0.692172
LYD 7.336636
MAD 10.704169
MDL 20.134001
MGA 4852.759306
MKD 61.631943
MMK 2402.882317
MNT 4099.027451
MOP 9.246541
MRU 45.681734
MUR 53.838679
MVR 17.690605
MWK 1984.90155
MXN 19.989772
MYR 4.658456
MZN 73.131954
NAD 18.566079
NGN 1567.773639
NIO 42.117911
NOK 11.260973
NPR 174.476346
NZD 2.003841
OMR 0.441358
PAB 1.144643
PEN 3.894907
PGK 5.028751
PHP 70.375146
PKR 318.232516
PLN 4.293445
PYG 6959.654806
QAR 4.184292
RON 5.227137
RSD 117.371178
RUB 88.095631
RWF 1675.716886
SAR 4.297707
SBD 9.221334
SCR 15.409236
SDG 687.148732
SEK 11.051652
SGD 1.477743
SHP 0.85433
SLE 27.863888
SLL 23995.261369
SOS 654.167554
SRD 42.986493
STD 23684.559828
STN 24.498785
SVC 10.015503
SYP 126.481133
SZL 18.563079
THB 38.133591
TJS 10.610574
TMT 4.016469
TND 3.378232
TOP 2.755184
TRY 53.515737
TTD 7.757615
TWD 36.546404
TZS 3005.850912
UAH 50.978472
UGX 4177.792784
USD 1.144293
UYU 46.037717
UZS 13712.319878
VES 731.092695
VND 30090.335139
VUV 136.092615
WST 3.173331
XAF 655.924467
XAG 0.018332
XAU 0.000274
XCD 3.092509
XCG 2.062898
XDR 0.81576
XOF 655.924467
XPF 119.331742
YER 271.255012
ZAR 18.573595
ZMK 10300.011738
ZMW 21.031957
ZWL 368.461958
  • CMSC

    0.0400

    21.99

    +0.18%

  • RELX

    0.5500

    31.93

    +1.72%

  • GSK

    2.3600

    53.66

    +4.4%

  • AZN

    11.2900

    195.15

    +5.79%

  • VOD

    0.1400

    13.15

    +1.06%

  • NGG

    2.6700

    82.85

    +3.22%

  • RBGPF

    2.5400

    68.15

    +3.73%

  • RIO

    1.0700

    94.42

    +1.13%

  • RYCEF

    0.5400

    19.68

    +2.74%

  • CMSD

    -0.0300

    22.15

    -0.14%

  • JRI

    0.0600

    13

    +0.46%

  • BCE

    0.4000

    21.42

    +1.87%

  • BCC

    0.4500

    75.93

    +0.59%

  • BTI

    1.2100

    61.77

    +1.96%

  • BP

    1.2500

    37.4

    +3.34%

AI's blind spot: tools fail to detect their own fakes
AI's blind spot: tools fail to detect their own fakes / Photo: Chris Delmas - AFP

AI's blind spot: tools fail to detect their own fakes

When outraged Filipinos turned to an AI-powered chatbot to verify a viral photograph of a lawmaker embroiled in a corruption scandal, the tool failed to detect it was fabricated -- even though it had generated the image itself.

Text size:

Internet users are increasingly turning to chatbots to verify images in real time, but the tools often fail, raising questions about their visual debunking capabilities at a time when major tech platforms are scaling back human fact-checking.

In many cases, the tools wrongly identify images as real even when they are generated using the same generative models, further muddying an online information landscape awash with AI-generated fakes.

Among them is a fabricated image circulating on social media of Elizaldy Co, a former Philippine lawmaker charged by prosecutors in a multibillion-dollar flood-control corruption scam that sparked massive protests in the disaster-prone country.

The image of Co, whose whereabouts has been unknown since the official probe began, appeared to show him in Portugal.

When online sleuths tracking him asked Google's new AI mode whether the image was real, it incorrectly said it was authentic.

AFP's fact-checkers tracked down its creator and determined that the image was generated using Google AI.

"These models are trained primarily on language patterns and lack the specialized visual understanding needed to accurately identify AI-generated or manipulated imagery," Alon Yamin, chief executive of AI content detection platform Copyleaks, told AFP.

"With AI chatbots, even when an image originates from a similar generative model, the chatbot often provides inconsistent or overly generalized assessments, making them unreliable for tasks like fact-checking or verifying authenticity."

Google did not respond to AFP’s request for comment.

- 'Distinguishable from reality' -

AFP found similar examples of AI tools failing to verify their own creations.

During last month's deadly protests over lucrative benefits for senior officials in Pakistan-administered Kashmir, social media users shared a fabricated image purportedly showing men marching with flags and torches.

An AFP analysis found it was created using Google's Gemini AI model.

But Gemini and Microsoft's Copilot falsely identified it as a genuine image of the protest.

"This inability to correctly identify AI images stems from the fact that they (AI models) are programmed only to mimic well," Rossine Fallorina, from the nonprofit Sigla Research Center, told AFP.

"In a sense, they can only generate things to resemble. They cannot ascertain whether the resemblance is actually distinguishable from reality."

Earlier this year, Columbia University's Tow Center for Digital Journalism tested the ability of seven AI chatbots -- including ChatGPT, Perplexity, Grok, and Gemini -- to verify 10 images from photojournalists of news events.

All seven models failed to correctly identify the provenance of the photos, the study said.

- 'Shocked' -

AFP tracked down the source of Co's photo that garnered over a million views across social media -- a middle-aged web developer in the Philippines, who said he created it "for fun" using Nano Banana, Gemini's AI image generator.

"Sadly, a lot of people believed it," he told AFP, requesting anonymity to avoid a backlash.

"I edited my post -- and added 'AI generated' to stop the spread -- because I was shocked at how many shares it got."

Such cases show how AI-generated photos flooding social platforms can look virtually identical to real imagery.

The trend has fueled concerns as surveys show online users are increasingly shifting from traditional search engines to AI tools for information gathering and verifying information.

The shift comes as Meta announced earlier this year it was ending its third-party fact-checking program in the United States, turning over the task of debunking falsehoods to ordinary users under a model known as "Community Notes."

Human fact-checking has long been a flashpoint in hyperpolarized societies, where conservative advocates accuse professional fact-checkers of liberal bias, a charge they reject.

AFP currently works in 26 languages with Meta's fact-checking program, including in Asia, Latin America, and the European Union.

Researchers say AI models can be useful to professional fact-checkers, helping to quickly geolocate images and spot visual clues to establish authenticity. But they caution that they cannot replace the work of trained human fact-checkers.

"We can't rely on AI tools to combat AI in the long run," Fallorina said.

burs-ac/sla/sms

Y.Watanabe--JT