adversarial-ai

Star

Here are 22 public repositories matching this topic...

obscuralabs-AI / Symbolic-Prompt-PenTest

Star

Semantic Stealth Attacks & Symbolic Prompt Red Teaming on GPT and other LLMs.

prompt-engineering ai-penetration-testing adversarial-ai llm-red-teaming symbolic-prompt gpt4-security obscuralabs

Updated May 16, 2025

provnai / vex

Star

Vex Protocol The trust layer for AI agents — adversarial verification, cryptographic audit trails, and tamper-proof execution

rust ai verification multi-agent trust merkle-tree ai-agents llm adversarial-ai

Updated Feb 22, 2026
Rust

karloks2005 / JailbreakLab

Star

Test and evaluate Large Language Models against prompt injections, jailbreaks, and adversarial attacks with a web-based interactive lab.

react docker kubernetes jailbreak model-alignment machine-learning-security ai-security fastapi huggingface prompt-injection llm-security llm-safety security-research-tool ai-evaluation-framework adversarial-ai prompt-defense llm-red-teaming

Updated Jan 28, 2026
Python

khanovico / prompt-guard

Star

🛡️ Enterprise-grade AI security framework protecting LLMs from prompt injection attacks using ML-powered detection

python machine-learning mongodb cybersecurity faiss ai-security huggingface prompt-injection ai-protection llm-security prompt-security adversarial-ai

Updated Aug 7, 2025
Python

0ameyasr / VB-AF

Star

Implementation of Vocabulary-Based Adversarial Fuzzing (VB-AF) to systematically probe vulnerabilities in Large Language Models (LLMs).

fuzzing-framework generative-ai adversarial-ai

Updated Aug 25, 2025
Python

annoeyed / MA_BLR

Star

A research framework for simulating, detecting, and defending against backdoor loop attacks in LLM-based multi-agent systems.

cybersecurity multi-agent-systems red-teaming ai-security backdoor-attacks large-language-models llm-security adversarial-ai python-simulation-framework

Updated Aug 4, 2025
Python

ZyluxXD / zerobypass

Star

Proof of concept tool to bypass document replay technology (such as GPTZero).

python proof-of-concept poc ai-detection-bypasser llm-detection adversarial-ai

Updated Feb 16, 2026
Python

scthornton / Chain-of-Thought-Reasoning-Attacks

Star

Breaking Chain-of-Thought: A Comprehensive Taxonomy of Reasoning Vulnerabilities in Production AI Systems

jailbreak jupyter-notebook security-research ai-security chain-of-thought prompt-injection llm-security adversarial-ai

Updated Jan 21, 2026
Jupyter Notebook

DUBSOpenHub / havoc-hackathon

Star

Pit AI models against each other. Score them sealed. Crown a winner. All built using the GitHub Copilot CLI. ⚡

orchestration multi-agent multi-model ai-agents prompt-engineering copilot-cli copilot-extensions adversarial-ai blind-adjudication

Updated Feb 22, 2026
Python

vonofdaville / adversarial-phish-forge

Star

🔍 Emulate advanced phishing tactics ethically with this open-source framework for red team operations focused on social engineering sophistication.

Updated Feb 23, 2026
Python

haigpapa / hah-was

Star

[Veracity] Dual-LLM hallucination defense — adversarial verification with Localization Gap detection for Arabic knowledge

epistemology arabic-nlp cultural-computing adversarial-ai ai-hallucination

Updated Feb 1, 2026
TypeScript

lucien-vallois / adversarial-phish-forge

Star

Ethically-bounded red team framework for AI-driven social engineering simulation with consent enforcement and identity graph mapping

Updated Dec 19, 2025
Python

itsjwill / ghosthacker

Star

👻 Adversarial AI Pentester - CHAOS vs ORDER dual-agent exploitation with collective memory

Updated Feb 13, 2026
TypeScript

KailashSatkuri-warangal / apisl

Star

A Django-based platform for testing LLMs against prompt injection, social engineering, and policy bypass attacks using red teaming methodologies.

django cybersecurity ethical-hacking ai-safety red-teaming ai-security prompt-injection llm-security adversarial-ai

Updated Jan 11, 2026
Python

solisaegis / diakrisis-contrastive-braking-corpus

Star

Final · Closed · Read-Only interpretive reference corpus (BAD / MIMICRY / GOOD) for AI risk analysis.

human-in-the-loop read-only ai-ethics adversarial-ai contrastive-analysis mimicry-detection non-authoritative interpretive-literacy treacherous-ai interpretive-braking agi-risk escalation-prevention archival-reference closed-work coexilia-reference apophasis-context

Updated Feb 4, 2026

neomatrixcode / kernel-adversarial-ai

Sponsor

Star

Código y demos para generar exploits de kernel vulnerables y defensas en tiempo real con IA.

python c reinforcement-learning shellcode kernel-exploitation ai-security kernel-security adversarial-ai realtime-defense

Updated Apr 28, 2025
Python

Mikeup91 / Gemini-S2-Signal

Star

AI Security Research: Gemini 3.0 Pro S2-Class Exfiltration & Adversarial Robustness. Hardening frontier models against autonomous mutation vectors. NIST VDP / AI Safety Institute compliant.

ai-safety zero-day red-teaming ai-security google-deepmind cybersecurity-research llm-security adversarial-ai sequoia-capital founders-fund google-vrp nist-vdp ai-safety-institute gemini-3-exploit

Updated Jan 5, 2026

caleb-branton / csce-research

Star

Formal research on Cognitive Side-Channel Extraction (CSCE) and AI semantic leakage vulnerabilities.

threat-modeling side-channel security-research ai-security ai-risk memory-security post-compromise llm-security agent-security adversarial-ai semantic-leakage

Updated Dec 7, 2025

1st Place Winner (General Judge) - Datadog Self-Improving Agents Hack. Two identical AI agents play Split or Steal. No pre-programmed betrayal. They discover deception on their own. Built with @evancorrea.

datadog multi-agent gemini game-theory emergent-behavior hackathon-winner llm elevenlabs braintrust adversarial-ai split-or-steal

Updated Feb 21, 2026
Python

Travis-ML / rag-llm-system

Star

A complete self-hosted AI research platform running on Docker with GPU acceleration. Combines LLM inference, vector search, web search, code execution. and fully searchable logging with Splunk - all running locally.

ai jupyter splunk logging rag vector-database llm qdrant-vector-database ollama openwebui rag-pipeline rag-chatbot adversarial-ai

Updated Dec 13, 2025
Python

Improve this page

Add a description, image, and links to the adversarial-ai topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the adversarial-ai topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adversarial-ai

Here are 22 public repositories matching this topic...

obscuralabs-AI / Symbolic-Prompt-PenTest

provnai / vex

karloks2005 / JailbreakLab

khanovico / prompt-guard

0ameyasr / VB-AF

annoeyed / MA_BLR

ZyluxXD / zerobypass

scthornton / Chain-of-Thought-Reasoning-Attacks

DUBSOpenHub / havoc-hackathon

vonofdaville / adversarial-phish-forge

haigpapa / hah-was

lucien-vallois / adversarial-phish-forge

itsjwill / ghosthacker

KailashSatkuri-warangal / apisl

solisaegis / diakrisis-contrastive-braking-corpus

neomatrixcode / kernel-adversarial-ai

Mikeup91 / Gemini-S2-Signal

caleb-branton / csce-research

TheApexWu / crucible

Travis-ML / rag-llm-system

Improve this page

Add this topic to your repo