Operations And Monitoring
Less than 1 minute
Operations And Monitoring
1. Operations Scope
Operations work covers service health, log inspection, database and Redis status, broker status, time-series database, video service, scheduled tasks, backup, and upgrade preparation.
2. Daily Checklist
| Area | Check Item |
|---|---|
| Backend service | Process status, error logs, API latency, memory, and thread usage. |
| Frontend service | Nginx status, static asset loading, cache, and domain certificate. |
| MySQL | Connections, slow queries, storage usage, and backup status. |
| Redis | Memory usage, key expiration, connection count, and persistence. |
| MQTT broker | Listener, online clients, authentication failures, and message throughput. |
| TDengine | Data write, retention policy, disk usage, and query latency. |
| Video service | ZLMediaKit process, hook callback, stream status, and recording storage. |
| Scheduled tasks | OTA, maintenance, report, cleanup, and retry tasks. |
3. Operations Flow
4. Backup Recommendations
| Data | Recommendation |
|---|---|
| MySQL | Full backup plus incremental or binlog backup. |
| Redis | Enable persistence according to deployment policy. |
| Uploaded files | Back up local storage or OSS bucket policies. |
| Time-series data | Back up according to retention and business value. |
| Configuration | Keep deployment scripts, environment files, Nginx, broker, and ZLM configs versioned. |
5. Incident Handling
For production issues, record the symptom, affected scope, time, logs, recent changes, rollback point, and final resolution. Avoid changing multiple dependencies at once without evidence.
