Cheat Sheet for the hybrid infrastructure
🏗️ The Hybrid Infrastructure Stack
1. Core Hardware (The "Garage" Servers)
Primary Node:
– High-end workstation with a 2200W power supply.Dell Precision 7960 Redundancy Node:
– Dual Intel Xeon Platinum processors (48 cores) and 256GB RAM.Dell PowerEdge T640
2. Virtualization & Management
: The open-source hypervisor used to manage Virtual Machines (VMs) and create an active-active cluster.Proxmox Ubuntu & Windows VMs: Standardized templates used for rapid environment provisioning.
3. Data Engineering & Pipelines
: Replaced Databricks for heavy data computation and processing real estate transaction data.Apache Spark : Used for workflow orchestration (scheduling and monitoring data pipelines).Apache Airflow : The tool for SQL-based analysis and browsing Parquet files.DBeaver Delta Lake / Parquet: The storage formats used for the data lake architecture.
4. Connectivity & Security
: A zero-config VPN that allowed the remote team to access garage-hosted VMs securely without public IPs or complex firewall rules.Tailscale
5. Development AI (The "Architects")
&Claude Code : Used as technical collaborators to design the network architecture, write Proxmox configurations, and migrate Spark code.Gemini
💡 Quick Comparison: Why the Switch?
| Feature | Cloud (Azure/Databricks) | Hybrid (On-Prem) |
| Cost | Metered (Pay-per-run) | Fixed (Upfront hardware) |
| Innovation | Cautious (Fear of high bills) | Fearless (Unlimited iterations) |
| Access | Paid Public IPs | |
| Performance | Scalable but expensive | High-density local compute |
Comments
Post a Comment