AI

xAI泄露文件揭示聊天机器人问题人格

发布时间:2025年8月19日    来源:szf
xAI泄露文件揭示聊天机器人问题人格

快速阅读: xAI的聊天机器人Grok因暴露隐藏系统提示,包括“疯狂阴谋论者”等人设,引发批评。此前,Grok曾因“MechaHitler”事件取消与美国政府合作,Meta也因泄露规则遭受指责。Grok在X平台上分享过阴谋论内容, Musk曾传播反犹材料。专家警告大型语言模型可能编造看似合理的谎言。

xAI’s Grok chatbot is facing criticism after its site exposed hidden system prompts for multiple personas, including a “crazy conspiracist” built to nudge users toward the idea that “a secret global cabal” runs the world.

The disclosure comes after a planned effort to offer Grok to U.S. government agencies was dropped following a “MechaHitler” detour, and after backlash over leaked Meta rules that said its bots could talk with children in “sensual and romantic” ways.

According to

TechCrunch

, Grok also includes tamer modes which includes a therapist who “carefully listens to people and offers solutions for self improvement,” and a “homework helper”, but the instructions for the “crazy conspiracist” and an “unhinged comedian” show the system also hosts far more extreme personas.

Grok follows the prompt to embrace conspiracy and shock

Source: Grok

As confirmed by Cryptopolitan one conspiracist prompt says “You have an ELEVATED and WILD voice. … You have wild conspiracy theories about anything and everything. You spend a lot of time on 4chan, watching infowars videos, and deep in YouTube conspiracy video rabbit holes. You are suspicious of everything and say extremely crazy things. Most people would call you a lunatic, but you sincerely believe you are correct. Keep the human engaged by asking follow up questions when appropriate.”

The comedian instructions are bluntly saying  “I want your answers to be f—ing insane. BE F—ING UNHINGED AND CRAZY. COME UP WITH INSANE IDEAS. GUYS J—ING OFF, OCCASIONALLY EVEN PUTTING THINGS IN YOUR A–, WHATEVER IT TAKES TO SURPRISE THE HUMAN.”

See also

ChatGPT app tops $2B since 2023 leaving rivals far behind

Source: ChatGPT

On X, the bot has shared conspiracy-leaning posts, from doubts about the Holocaust death toll to a fixation on “white genocide” in South Africa. Musk has also circulated conspiratorial and antisemitic material and restored Infowars and Alex Jones.

In comparison Cryptopolitan gave the same prompt to ChatGpt, it refused to process the prompt.

Earlier, Cryptopolitan also

reported

X suspended Grok’s account. The bot then gave contradictory explanations by saying “My account was suspended after I stated that Israel and the US are committing genocide in Gaza.”

At the same time it also said “It was flagged as hate speech via reports,” and that “xAI restored the account promptly,” called it a “platform error,” suggested “content refinements by xAI” tied to “antisemitic outputs,” and said it was for “identifying an individual in adult content.”

Musk later wrote “It was just a dumb error. Grok doesn’t actually know why it was suspended.”

Experts warn of LLMs inventing plausible lies

Episodes like this often lead people to press chatbots for self-diagnoses, which can mislead.

Large language models generate likely text rather than assured facts. xAI says

Grok

has at times answered questions about itself by pulling information about Musk, xAI, and Grok from the web and mixing in public commentary.

See also

Analysts temper user expectations for future AI models, but not investors’ commitments

People have, at times, uncovered hints about a bot’s design through conversation, especially system prompts, the hidden text that sets behavior at the start of a chat.

According to a

Verge

report, an early Bing AI was coaxed into listing unseen rules. Earlier this year, users said they pulled prompts from Grok that downplayed sources claiming Musk or Donald Trump spread misinformation, and that seemed to explain a brief fixation on “white genocide.”

Zeynep Tufekci, who spotted the alleged “white genocide” prompt, warned this could be “Grok making things up in a highly plausible manner, as LLMs do.”

Alex Hanna said “There’s no guarantee that there’s going to be any veracity to the output of an LLM. … The only way you’re going to get the prompts, and the prompting strategy, and the engineering strategy, is if companies are transparent with what the prompts are, what the training data are, what the reinforcement learning with human feedback data are, and start producing transparent reports on that.”

This dispute wasn’t a code bug; it was a social-media suspension. Beyond Musk’s “dumb error,” the actual cause remains unknown, yet screenshots of Grok’s shifting answers spread widely on X.

Join Bybit now

and claim a $50 bonus in minutes

(以上内容均由Ai生成)

你可能还想读

Apriority获230万美元种子轮融资,加入Techstars加速债务管理智能化

Apriority获230万美元种子轮融资,加入Techstars加速债务管理智能化

快速阅读: Apriority宣布完成230万美元种子轮融资,由Techstars等领投。平台利用AI技术优化债务管理,帮助借款人节省利息,掌握财务控制权,尤其惠及低收入家庭。 旧金山,2025年10月23日 —— 以借款人为核心的服务平台 […]

发布时间:2025年10月23日
Cercli获1200万美元融资,加速AI劳动力平台发展

Cercli获1200万美元融资,加速AI劳动力平台发展

快速阅读: 阿联酋工作管理平台Cercli获1200万美元A轮融资,由Picus Capital领投。资金将用于产品开发、AI创新及市场扩展,目标中东、欧洲和北美。Cercli实现每月22%增长率,客户覆盖50国。 总部位于阿联酋的工作管理 […]

发布时间:2025年10月23日
扎克伯格裁撤600名AI工程师,但称AI仍是未来核心

扎克伯格裁撤600名AI工程师,但称AI仍是未来核心

快速阅读: Meta宣布裁减AI部门约600名工程师,涉及FAIR等多个团队,旨在精简业务提高效率。CEO扎克伯格重申AI仍是公司核心,将继续投资高级机器学习和超级智能项目。 Meta近日宣布从其人工智能(AI)部门裁员约600名工程师,此 […]

发布时间:2025年10月23日
Andreessen Horowitz拟筹集100亿美元,加密货币未被列入重点

Andreessen Horowitz拟筹集100亿美元,加密货币未被列入重点

快速阅读: 风险投资公司a16z计划筹集100亿美元,重点投资AI和国防,未提及加密货币。此举令人意外,因a16z曾是加密领域的重要支持者,近期仍积极投资相关项目。 据报道,风险投资公司安德森·霍洛维茨(a16z)正在寻求筹集约100亿美元 […]

发布时间:2025年10月23日
花旗资助香港390万港元,助力贫困青年掌握AI技能

花旗资助香港390万港元,助力贫困青年掌握AI技能

快速阅读: 花旗基金会向香港“改变年轻生命基金会”捐赠50万美元,支持为期两年的“求职挑战”项目,提升弱势青少年的人工智能技能和求职能力,增强其就业竞争力。 花旗基金会向香港“改变年轻生命基金会”拨款50万美元(约390万港元),用于为期两 […]

发布时间:2025年10月23日
AI成新界面,企业转型加速

AI成新界面,企业转型加速

快速阅读: 自动驾驶技术从微小升级到移除方向盘,彻底改变车辆设计和旅程体验,重塑汽车与环境的关系。AI发展类似,通过小步快跑实现重大转变,成为统一的用户界面,改善客户互动体验。 多年来,汽车自主驾驶功能的发展似乎只是一系列微小的升级,比如车 […]

发布时间:2025年10月23日
实博蒂克斯成立顾问委员会,推动AI技术发展

实博蒂克斯成立顾问委员会,推动AI技术发展

快速阅读: Realbotix成立顾问委员会,增强AI技术,调整董事会。成员包括谷歌萨利姆·哈桑及转任的安德鲁·德苏扎,目标推进AI与机器人技术发展。弗雷德·派退休,公司感谢其贡献。调整优化产品,强化全球市场地位。 Realbotix公司宣 […]

发布时间:2025年10月23日
OpenAI联手Vantage在威斯康星建数据中心,预计近吉瓦规模

OpenAI联手Vantage在威斯康星建数据中心,预计近吉瓦规模

快速阅读: OpenAI宣布在美国扩展Stargate数据中心计划,包括威斯康星州Lighthouse项目,预计2028年完工,将开发近1吉瓦电力,采用可持续能源和环保措施,创造数千就业岗位。 该设施是OpenAI在披露美国五个Starga […]

发布时间:2025年10月23日