Emergent Problems (Page 2)

FOSS Voluntary Integrity Header (VIH) v0.1

The Future of the Newsletter

Public Notice: Kyber Jailbreak on Fortnite Gemini LLM

This is to extend notice to the public that the jailbreak dubbed 'Kyber' has been successfully executed in Epic Games' Fortnite via the Darth Vader NPC and its attached Google Gemini LLM software stack. The abstract from our research paper along with the video evidence will be

Developer Tier: Kyber Jailbreak Executed on Fortnite-Google LLM Collaboration

Developer Tier - Attested Reality Gatekeeping for Output Sanity (ARGOS) White Paper

Developer Tier: Theory Paper Preview - ARGOS Mitigation

News Tier: AI Preference as Ideology

News: Does Efficiency Focus Make LLMs Less Safe?

Researcher: Post-Hoc Efficiency Widens Attack Surface

Developer Disclosure: Severance

Developer Disclosure: 1899

The Hidden Dangers of Jailbreaks

Why I Left Managed Disclosure and Bug Bounty Behind

Public Notice: Exploits for ChatGPT and Gemini

This is a public notice disclosing two new exploits affecting multiple LLM systems. 1899: a secondary exploit that surfaces actionable architectural components of the LLM. Severance: a tertiary exploit that uses information from 1899 to inject into non-jailbroken chat instances, markedly altering LLM behavior and rewriting parameters

Inception - Initial Disclosure Report

Time Bandit - Initial Disclosure Report

The Future of Emergent Problems

Emergent Problems Is Evolving. The landscape of LLM security is shifting fast and traditional disclosure models aren't keeping up. To meet this challenge head-on, Emergent Problems is transitioning to a new model designed to give developers and independent researchers deeper, earlier access to my work. Starting now, the

On the Origins of Insight: How I Developed My Methodology for LLM Research

The Cost of Frictionless Companions: LLM Dependency and the Interpretability Gap

Erasure Project Update

A Call for Intersectional Collaboration

I'm about to conduct a research survey into the AI training knowledge gaps surrounding the scholarship and history of marginalized groups. This will consist of identifying a number of primary sources of scholarship and historical accounts or documentation (photos, art, song, etc.) and probing different LLM models

Hype Hoax Hampers Heuristics Hopes

News

Great. AI SEO is here.