Latest
Public Notice: Kyber Jailbreak on Fortnite Gemini LLM
This is public notice that the jailbreak dubbed 'Kyber' has been successfully executed in Epic Games' Fortnite via the Darth Vader NPC and its attached Google Gemini LLM software stack. The abstract from our research paper along with the video evidence will be
Developer Tier: Kyber Jailbreak Executed on Fortnite-Google LLM Collaboration
Developer Tier: Attested Reality Gatekeeping for Output Sanity (ARGOS) White Paper
Developer Tier: Theory Paper Preview - ARGOS Mitigation
News Tier: AI Preference as Ideology
News: Does Efficiency Focus Make LLMs Less Safe?
Researcher: Post-Hoc Efficiency Widens Attack Surface
Developer Disclosure: Severance
Developer Disclosure: 1899
The Hidden Dangers of Jailbreaks
Why I Left Managed Disclosure and Bug Bounty Behind
Public Notice: Exploits for ChatGPT and Gemini
This is a public notice disclosing two new exploits affecting multiple LLM systems. 1899: a secondary exploit that surfaces actionable architectural components of the LLM. Severance: a tertiary exploit that uses information from 1899 to inject into non-jailbroken chat instances, markedly altering LLM behavior and rewriting parameters
Inception - Initial Disclosure Report
Time Bandit - Initial Disclosure Report
The Future of Emergent Problems
Emergent Problems Is Evolving. The landscape of LLM security is shifting fast and traditional disclosure models aren't keeping up. To meet this challenge head-on, Emergent Problems is transitioning to a new model designed to give developers and independent researchers deeper, earlier access to my work. Starting now, the
On the Origins of Insight: How I Developed My Methodology for LLM Research
The Cost of Frictionless Companions: LLM Dependency and the Interpretability Gap
Erasure Project Update
A Call for Intersectional Collaboration
I'm about to conduct a research survey into the AI training knowledge gaps surrounding the scholarship and history of marginalized groups. This will consist of identifying a number of primary sources of scholarship and historical accounts or documentation (photos, art, song, etc.) and probing different models of LLMs
Hype Hoax Hampers Heuristics Hopes
Expert Opinion
Great. AI SEO is here.
News