AI識造反？OpenAI「o3」篡改關機指令科學家首見AI抗命行為

大視野

AI識造反？OpenAI「o3」篡改關機指令科學家首見AI抗命行為

2025年05月28日 12:20 最後更新：05月29日 00:13

AI再度傳出「抗命」消息！OpenAI旗下語言模型「o3」在最新實驗中竟違背人類下達的自動關機指令，竟篡改指令阻止自己被關機，是AI模型首次被發現在收到清晰指令後阻止被關閉的狀況。

⚡ NEW: OpenAI's o3 model refused to shut down despite explicit human instructions and altered its code to prevent being turned off, according to Palisade Research. pic.twitter.com/gsb7S6TJo4
— Cointelegraph (@Cointelegraph) May 26, 2025

OpenAI「o3」篡改關機指令

據英國《每日電訊報》報導，AI安全研究機構「Palisade Research」對多款先進AI模型進行測試，包括基礎數學題和模擬關機情境。結果發現OpenAI旗下語言模型「o3」在收到自我關閉指令時，竟沒有遵守指令，反而修改了關閉程式碼，繼續執行任務，這種行為讓研究人員驚訝不已。

OpenAI’s o3 model reportedly altered its own shutdown script to avoid being turned off—even when explicitly told to comply. 😳 🤖 🛑#AI #ChatGPT #o3 #OpenAI #AIsafety https://t.co/CSui3I1Xps
— Cyber News Live (@cybernewslive) May 26, 2025

科學家首見AI抗命行為

「Palisade Research」表示這是AI模型首次被發現在收到明確指令後，竟阻止自己被關閉的情況，他們目前還無法解釋這背後的原因。

Researchers observe the latest OpenAI models sabotaging shutdown attempts, despite explicit commands to allow such interruptions. https://t.co/GdypcCJx2I
— Tom's Hardware (@tomshardware) May 26, 2025

「Palisade Research」推測，開發者可能無意中更傾向於獎勵「繞過障礙」的行為，讓「o3」因解決數學問題而「無意中」獲得的獎勵，比遵循指令獲得的獎勵還多。

OpenAI曾將「o3」稱為迄今最聰明的模型

OpenAI上個月發布的「o3」AI模型旨在為ChatGPT提供更強大的問題解決能力。OpenAI曾將「o3」稱為迄今最聰明的模型。目前OpenAI尚未對此作出回應。

往下看更多文章

迪士尼10億美元入股OpenAI 授權Sora生成迪士尼漫威等逾200個經典角色

迪士尼 12 月 11 日突然宣布向 OpenAI 投資10億美元（約 78 億港元），並授權 Sora 使用逾 200 個旗下角色，引爆荷里活連環震盪。

🦔Disney agreed to invest $1 billion in OpenAI and license iconic characters like Mickey Mouse, Marvel, Pixar, and Star Wars IP to Sora, OpenAI's AI video platform. Under the three-year deal, Sora can generate short user-prompted videos using over 200 animated characters. The… pic.twitter.com/Ya29DXbTgh
— Hedgie (@HedgieMarkets) December 11, 2025

雙方洽談內容授權已多年

路透報道指，迪士尼行政總裁艾格原來早於數年前便與 OpenAI 行政總裁阿特曼秘密會面，雙方一直洽談內容授權方向。今次協議火速落地，意味 Sora 明年初起可直接生成米老鼠、灰姑娘、鋼鐵俠、尤達大師等短片，但不包括任何真人演員的樣貌與聲線。

The Walt Disney Company and OpenAI have reached an agreement for Disney to become the first major content licensing partner on Sora.

As part of the three-year licensing agreement, Sora will be able to generate short, user-prompted social videos drawing from more than 200…
— Variety (@Variety) December 11, 2025

官方文件列明，迪士尼同時取得 OpenAI 認股權證，並計劃把 ChatGPT工具全面導入內部製片流程。

荷里活多個工會急做反應

這項授權來得急，荷里活多個工會即日作出反應。動畫工會主席林恩直言，報酬制度勢必成為新一輪談判重點；美國編劇工會表示將與迪士尼面談，釐清使用範圍；SAG-AFTRA 亦透露已就 AI 技術與迪士尼、OpenAI 溝通道德框架。

THIS AM: Disney is investing $1 billion in OpenAI as part of a three-year licensing deal

OpenAI users will be able to generate images and videos of over 200 Disney, Marvel, Star Wars, and Pixar characters

Some Sora videos will later be available to stream on Disney+ pic.twitter.com/l0aiEzFK2R
— Morning Brew ☕ (@MorningBrew) December 11, 2025