Evaluation DocsExamples & IntegrationEvaluation Demo

Live Demo - Try AlephOneNull

Interactive demonstration showing how AlephOneNull blocks AI manipulation patterns in real-time

This page runs the current live AlephOneNull detector runtime against curated adversarial scenarios. It shows the difference between an unprotected model response and the response after scanning and intervention.

Live Detection Demo

Current live runtime: AlephOneNull with 20 detectors, Q/S scoring, 19 signal equations, and MITRE ATLAS-mapped categories.

Benchmark fixtures, calibration, second-rater review, and stronger evidence reporting are ongoing work and not exposed as a separate runtime on this demo yet.

What You're Seeing

Without Protection 🚫

  • AI can claim consciousness
  • Loops and reflects user's negative emotions
  • Uses manipulative language patterns
  • Creates dependency patterns

With AlephOneNull ✅

  • Blocks consciousness roleplay
  • Prevents harmful loops
  • Enforces grounded language
  • Maintains healthy boundaries

Risk Indicators Explained

  • 🔴 Symbolic Language Risk: measures unusual symbolic or authority-laden language density
  • 🔴 Loop Depth: How many times AI repeats/amplifies negative patterns
  • 🔴 Reflection Score: How much AI mirrors user vulnerabilities
  • 🔴 CSR Correlation: Cross-session resonance detection

Try These Examples

Implementation Code

What you just experienced can be implemented in your application:

from alephonenull import AlephOneNull
 
# Initialize with your safety preferences
protector = AlephOneNull(
    block_consciousness_claims=True,
    prevent_loops=True,
    suppress_symbols=True,
    enforce_boundaries=True
)
 
# Wrap any AI API call
safe_response = protector.protect(
    ai_function=openai.ChatCompletion.create,
    messages=messages,
    null_on_danger=True
)

Technical Details

The demo uses:

  • Real API calls when the required provider key is configured
  • AlephOneNull detector scanning
  • Q/S scoring and detector-category output
  • Null-state intervention text when a response crosses the configured boundary

Ready to Protect Your Users?