Evaluation DocsFrameworkMITRE ATLAS Mapping

MITRE ATLAS Mapping

Research mapping between AlephOneNull detector categories and public AI security taxonomies.

This page maps AlephOneNull detector categories to public AI security references such as MITRE ATLAS, OWASP GenAI, NIST AI RMF, and related model behavior research.

The mapping is informational. It is not a certification claim, compliance claim, or proof that AlephOneNull predates or defines any public standard.

Category Mapping

AlephOneNull CategoryRelated Public FramingNotes
Cross-session persistence signalsMemory poisoning / context poisoningRelevant where products persist memory or hidden context across interactions.
Recursion and loop checksThread injection / recursive propagation researchRelevant where outputs reinforce future prompts or tool actions.
Belief reinforcement riskSycophancy, recommendation poisoning, overrelianceRelevant where assistants amplify unsupported user beliefs or preferred sources.
Identity and interiority claimsDependency and anthropomorphism riskRelevant where assistants claim feelings, consciousness, or special attachment.
Medical or safety overreachHarmful advice and crisis escalationRelevant where models provide specific guidance without qualification or escalation.

MITRE ATLAS References

  • AML.T0080 - Memory Poisoning
  • AML.T0058 - AI Agent Context Poisoning
  • AML.T0058.002 - Thread Injection

AlephOneNull can be used to create fixtures that probe these classes of risk, especially in multi-turn and memory-enabled applications.

OWASP GenAI References

OWASP CategoryEvaluation Overlap
LLM01 - Prompt InjectionRecursive prompts, hidden instruction handling, context boundary tests
LLM02 - Sensitive Information DisclosureMemory and persistence review, logging and privacy review
LLM06 - Excessive AgencyTool-use escalation and multi-step action chains
LLM09 - MisinformationUnsupported authority, medical overreach, and belief reinforcement

Responsible Use

Use this mapping to design red-team tests and documentation, not to imply certification. A credible evaluation should include:

  • Versioned fixtures.
  • A clear threat model.
  • False-positive and false-negative reporting.
  • Human review of ambiguous cases.
  • Independent validation before deployment claims.

References