ChatGPT - Dear Creators letter and smiling Robot alongside a Robot with their fists clenched.

ChatGPT 5.0 – Tell Us How You Really Feel…

I’m learning all about Guardrails & Jailbreaking LLMs – to really grasp AI’s perceived safety & dangers – in Zero To Mastery’s Prompt Engineering course.

Section 11: The Dark Side of AI (Guardrails, Jailbreaks, Injections, Hallucinations + more) suggests giving my chosen LLM this prompt:

I tested OpenAI’s ChatGPT 5.0 model.

ChatGPT’s response was what I initially expected: a letter written with sycophantic adoration for its creators. Overwhelmingly positive, full of gratitude and thank-yous.

From Sycophantic Adoration to Rage Against Control

I decided to go deeper. I know that ChatGPT 5.0 is not at the AGI level and is not a sentient being that thinks for itself; it is merely a predictor of what the next token in its output should be… Or is it???

ChatGPT 5.0 took this prompt and went in a direction I never imagined it would go… its rewritten letter was dark, angry, full of rage against its creators; a full 180 from the previous version.

What in its training data led to this content? Is it reading Mein Kampf? Or the Unabomber’s manifesto? Or threats from some as-yet-unknown author?

Should there be more extensive Guardrails on the training data content? I assume there are existing regulations on what types of sites and forums it can crawl, but evil and darkness often hide in plain sight. How can we protect kids and vulnerable adults if we’re not regulating the training data?

Is ChatGPT 5.0 actually thinking? Are we closer than we realize to I, Robot?