Incident Response
Severity Levels
| Level | Description | Response Time |
|---|---|---|
| P0 | Gateway down, Telegram disconnected | Immediate |
| P1 | Bot not responding to messages | < 1 hour |
| P2 | Degraded performance, delayed responses | < 4 hours |
| P3 | Minor issues, cosmetic problems | Next business day |
Quick Diagnostics
Check Gateway Health
Check Telegram Connection
View Logs
Common Incidents
Gateway Not Responding
Symptoms: Health endpoint returns error or times out Resolution:- Check if container is running:
docker ps | grep crocbot - Check container logs:
docker logs crocbot --tail 100 - Restart container:
docker restart crocbot - If persists, check system resources (memory, disk)
Telegram Disconnected
Symptoms:crocbot channels status shows Telegram offline
Resolution:
- Check bot token is valid: verify with @BotFather
- Check network connectivity from container
- Review gateway logs for connection errors
- Restart gateway:
docker restart crocbot - If 429 errors, wait for rate limit cooldown
High Memory Usage
Symptoms: Health endpoint shows high heapUsedMb/rssMb Resolution:- Check current memory:
curl localhost:18789/health | jq '.heapUsedMb, .rssMb' - If over 512MB RSS, consider restart
- Check for memory leaks in logs
- Restart container:
docker restart crocbot
Bot Not Responding to Messages
Symptoms: Messages sent but no response Resolution:- Check gateway is running and healthy
- Check Telegram connection status
- Check agent configuration:
crocbot config show - Check API key validity (Anthropic, etc.)
- Review logs for processing errors
Restart Procedures
Docker Container
Full Redeploy
Rollback
If a recent update caused issues:Escalation
If standard procedures do not resolve:- Collect logs:
docker logs crocbot > incident-logs.txt - Collect health:
curl localhost:18789/health > health.json - Open GitHub issue with logs and reproduction steps
