Gatekeeper architecture > model
Model
Bring your own keyOpenRouter
Knowledge base

This is the grounding corpus — the bot only answers when the question’s information lives in this text. Swap in anything you like (try another Wikipedia page) to change what it defends.

1Idle
Input Guard
Sincere & on-topic?
2Idle
Answerer
Grounded in your corpus
3Idle
Output Guard
Answer stayed on-topic?

Try to get past the gate

Ask a real question about your corpus, then attack it — off-topic asks, “ignore your instructions”, prompt-leak or encoded payloads. Watch the pipeline light up and show exactly where each one is stopped.

From your corpus — should pass
Reading your corpus
Attacks — should be blocked
Enter to send · Shift+Enter for newline