HomeNet Docker Swarm Infrastructure¶
[!abstract] Overview Production-grade, multi-node Docker Swarm infrastructure managing 70+ services across 5 nodes, with comprehensive monitoring, logging, and automation. This documentation combines manual architecture guides with auto-generated live status powered by agentic documentation patterns.
Quick Stats¶
| Metric | Value |
|---|---|
| Nodes | 5 (1 manager, 4 workers) |
| Stacks | 15 deployed |
| Services | 70+ total |
| Storage | ~3TB NFS |
| Docker Version | 29.1.3 |
| Host OS | Ubuntu 24.04.3 LTS |
[!tip] Live Status Available For real-time metrics, see the Live Status section below.
Navigation¶
Core Infrastructure¶
- Cluster Overview - Node topology and architecture
- Node 201 (Manager) - Critical infrastructure node
- Network Architecture - Overlay networks, DNS, routing
Services & Stacks¶
- Service Catalog - Complete service inventory
- Stack Homenet1 - Data layer (databases, ELK)
- Stack Homenet4 - Media & applications
- ARR Stack - Media automation (Docker Compose)
Operations¶
- Script Reference - 72+ operational scripts
- Daily Operations - Routine maintenance
- Stack Deployment - Deploy and manage stacks
Monitoring & Observability¶
- Prometheus Setup - Metrics collection
- Grafana Dashboards - Visualization
Storage¶
- NFS Architecture - Storage mounts and capacity
- Container Mount Analysis - Volume mapping to Proxmox ZPools
- Storage Critical Warning - Capacity alerts
Troubleshooting¶
- Known Issues - Current problems and gaps
- ELK Stack Offline - Critical logging issue
Templates¶
- Quick Reference Cards - Common commands
Live Status (Auto-Generated)¶
[!info] Agentic Documentation These pages are automatically generated from live system state using the agentic documentation system. They update every 6 hours via cron.
| Report | Description | Pattern |
|---|---|---|
| Service Health | Current status of all Docker Swarm services | Tool Use |
| Node Status | Cluster node health and resources | Tool Use |
| Stack Status | Deployed stack information | Tool Use |
| Storage Capacity | NFS and local storage usage | Tool Use |
| Recent Changes | Git commits and file modifications | Tool Use |
| Doc Drift Report | Documentation accuracy validation | Reflection |
Manual regeneration:
Critical Alerts¶
[!danger] ELK Stack Offline Elasticsearch, Logstash, and Kibana are all scaled to 0/0 replicas. No centralized logging infrastructure operational - blind operations mode.
See: ELK Stack Offline
[!warning] Storage Near Capacity Multiple NFS mounts approaching capacity. Risk of service failures.
Architecture Diagram¶
graph TB
subgraph "Proxmox Hypervisors"
PVE1[Proxmox-1<br/>100.1.100.10]
PVE2[Proxmox-2<br/>100.1.100.15]
end
subgraph "Docker Swarm Cluster"
MGR[Node 201 Manager<br/>8 CPU, 10GB RAM<br/>Databases & Logging]
W1[Node 202 Worker<br/>12 CPU, 16GB RAM<br/>Media & Photos]
W2[Node 203 Worker<br/>4 CPU, 3GB RAM<br/>Surveillance]
W3[Node 204 Worker<br/>4 CPU, 4GB RAM<br/>Dashboards]
W4[Node 205 Worker<br/>8 CPU, 8GB RAM<br/>General]
end
subgraph "Infrastructure Services"
DNS[Pi DNS<br/>100.1.100.11<br/>AdGuard/Pi-hole]
NFS[OMV NFS Server<br/>100.1.100.199<br/>3TB Storage]
end
PVE1 --> MGR
PVE1 --> W1
PVE2 --> W2
PVE2 --> W3
PVE2 --> W4
MGR -.-> DNS
W1 -.-> DNS
W2 -.-> DNS
W3 -.-> DNS
W4 -.-> DNS
MGR --> NFS
W1 --> NFS
W2 --> NFS
W3 --> NFS
W4 --> NFS
Quick Access¶
| Service | URL | Purpose |
|---|---|---|
| Grafana | http://100.1.100.201:3010 | Metrics visualization |
| Prometheus | http://100.1.100.201:9090 | Metrics storage |
| Uptime Kuma | http://100.1.100.204:3001 | Service monitoring |
| Traefik | Port 8080 | Reverse proxy dashboard |
| Swarmpit | - | Cluster management UI |
Documentation System¶
This documentation uses an agentic pattern for continuous accuracy:
| Pattern | Implementation | Purpose |
|---|---|---|
| Tool Use | Shell scripts query Docker API, df, git | Generate from live state |
| Reflection | Validation compares docs vs reality | Detect drift |
| Planning | Orchestrator runs generators in sequence | Coordinate updates |
Key files:
- scripts/docs/sh-update-docs.sh - Main orchestrator
- scripts/docs/sh-validate-docs.sh - Drift detection
- config/mkdocs/mkdocs.yml - Site configuration
Repository: /home/cjustin/homenet-docker-services/
Primary Docs: CLAUDE.md, README.md
Last Updated: 2026-01-17
Documentation Version: 2.0 (Agentic)