Runtime safety
- ensure `api_key` values are valid UUID strings
- keep payload size bounded before logging `input` and `output`
- avoid logging raw secrets, tokens, or PII
- apply timeout and retry strategy to any external trigger endpoint
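
The first three checks above can be sketched in a few small helpers. This is a minimal illustration, not the library's own code: the helper names, the 2000-character limit, and the redaction patterns are all assumptions to adapt to your application.

```python
import re
import uuid

MAX_LOGGED_CHARS = 2000  # assumed limit; tune for your log storage

# Hypothetical patterns; extend them for the secret formats your app handles.
_SECRET_PATTERNS = [
    re.compile(r"(?i)(bearer\s+)[A-Za-z0-9._\-]+"),
    re.compile(r"(?i)((?:api[_-]?key|token|password)\s*[=:]\s*)\S+"),
]


def is_valid_uuid(value) -> bool:
    """Return True only for canonical hyphenated UUID strings."""
    if not isinstance(value, str):
        return False
    try:
        parsed = uuid.UUID(value)
    except ValueError:
        return False
    # uuid.UUID also accepts braces and un-hyphenated hex; reject those forms.
    return str(parsed) == value.lower()


def redact(text: str) -> str:
    """Replace values that look like secrets with a placeholder."""
    for pattern in _SECRET_PATTERNS:
        text = pattern.sub(r"\1[REDACTED]", text)
    return text


def bounded(text: str, limit: int = MAX_LOGGED_CHARS) -> str:
    """Truncate text to at most `limit` characters plus a short marker."""
    if len(text) <= limit:
        return text
    return text[:limit] + f"... [truncated {len(text) - limit} chars]"


def safe_for_log(text: str) -> str:
    """Redact first, then truncate, so a secret is never cut in half."""
    return bounded(redact(text))
```

Redacting before truncating is deliberate: truncating first could leave a partial secret that no longer matches any pattern. Pattern-based redaction will not catch every form of PII, so treat it as a backstop rather than the primary control.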
Data layer
- provision `agent_logs` with indexes for `user_id`, `trace_id`, and `parent_run_id`
- configure DB connection limits for your request volume
- monitor write failures from psycopg connection attempts
- set retention policies for trace data
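
As a rough sketch of the index provisioning step, the helper below generates one index statement per column and applies them through a connection. The index naming scheme and the `apply_indexes` helper are hypothetical; the connection is assumed to be one you opened yourself (for example with psycopg), and the actual `agent_logs` schema may call for composite or partial indexes instead.

```python
INDEXED_COLUMNS = ("user_id", "trace_id", "parent_run_id")


def index_statements(table: str = "agent_logs") -> list[str]:
    """Build one CREATE INDEX statement per lookup column.

    The idx_<table>_<column> naming is an assumption, not a requirement.
    """
    return [
        f"CREATE INDEX IF NOT EXISTS idx_{table}_{col} ON {table} ({col})"
        for col in INDEXED_COLUMNS
    ]


def apply_indexes(conn, table: str = "agent_logs") -> None:
    """Run the statements on an open connection whose cursors are
    context managers (psycopg cursors are), then commit."""
    with conn.cursor() as cur:
        for stmt in index_statements(table):
            cur.execute(stmt)
    conn.commit()
```

`IF NOT EXISTS` keeps the helper safe to re-run during deploys. On a large, busy table you may prefer to create indexes outside a transaction with PostgreSQL's `CREATE INDEX CONCURRENTLY`.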
LangChain behavior
- keep `dev_mode=True` outside production if trigger calls are not desired
- document callback attachment points in your app architecture
- verify root chain events are consistently produced in your chains
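
One way to check the last item is to scan logged records for traces that never produced a root entry. The sketch below assumes each record is a dict with `trace_id` and `parent_run_id` keys, and that a root chain record is one whose `parent_run_id` is `None`; both the record shape and the function name are hypothetical.

```python
from collections import defaultdict


def traces_missing_root(records):
    """Return sorted trace IDs that have no record with parent_run_id None."""
    root_counts = defaultdict(int)
    seen = set()
    for record in records:
        trace_id = record["trace_id"]
        seen.add(trace_id)
        if record.get("parent_run_id") is None:
            root_counts[trace_id] += 1
    return sorted(t for t in seen if root_counts[t] == 0)
```

Running this over a sample of recent rows can show whether a callback is attached below the root chain, since in that case child records appear without a matching root.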
Observability
- alert on sustained insert failures
- sample event volumes per trace
- track average and p95 trace depth
- define dashboards for error-heavy event categories
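
For the trace depth metric, a minimal sketch is shown below. It assumes rows of `(trace_id, run_id, parent_run_id)` read from the log table and defines depth as the number of runs on the longest parent chain in a trace; the function names and the nearest-rank percentile method are illustrative choices, not something the source prescribes.

```python
import math
from collections import defaultdict
from statistics import mean


def trace_depths(rows):
    """Map each trace_id to its maximum nesting depth (a lone root is 1)."""
    parent = {}
    by_trace = defaultdict(list)
    for trace_id, run_id, parent_run_id in rows:
        parent[run_id] = parent_run_id
        by_trace[trace_id].append(run_id)

    def depth(run_id):
        d = 1
        visited = {run_id}
        p = parent.get(run_id)
        # Walk up until the root, a missing parent record, or a cycle.
        while p is not None and p in parent and p not in visited:
            visited.add(p)
            d += 1
            p = parent.get(p)
        return d

    return {t: max(depth(r) for r in runs) for t, runs in by_trace.items()}


def p95(values):
    """Nearest-rank 95th percentile of a non-empty collection."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("p95 requires at least one value")
    rank = math.ceil(0.95 * len(ordered))
    return ordered[rank - 1]


def depth_summary(rows):
    """Return (average depth, p95 depth) across all traces in rows."""
    depths = list(trace_depths(rows).values())
    return mean(depths), p95(depths)
```

For large tables it is usually cheaper to compute depth in the database or at write time; this in-memory version is intended for sampled data or offline checks.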