Cheat Sheet for the hybrid infrastructure

🏗️ The Hybrid Infrastructure Stack 

1. Core Hardware (The "Garage" Servers)

2. Virtualization & Management

  • Proxmox: The open-source hypervisor used to manage Virtual Machines (VMs) and create an active-active cluster.

  • Ubuntu & Windows VMs: Standardized templates used for rapid environment provisioning.

3. Data Engineering & Pipelines

  • Apache Spark: Replaced Databricks for heavy data computation and processing real estate transaction data.

  • Apache Airflow: Used for workflow orchestration (scheduling and monitoring data pipelines).

  • DBeaver: The tool for SQL-based analysis and browsing Parquet files.

  • Delta Lake / Parquet: The storage formats used for the data lake architecture.

4. Connectivity & Security

  • Tailscale: A zero-config VPN that allowed the remote team to access garage-hosted VMs securely without public IPs or complex firewall rules.

5. Development AI (The "Architects")

  • Claude Code & Gemini: Used as technical collaborators to design the network architecture, write Proxmox configurations, and migrate Spark code.


💡 Quick Comparison: Why the Switch?

FeatureCloud (Azure/Databricks)Hybrid (On-Prem)
CostMetered (Pay-per-run)Fixed (Upfront hardware)
InnovationCautious (Fear of high bills)Fearless (Unlimited iterations)
AccessPaid Public IPsTailscale (Secure/Private)
PerformanceScalable but expensiveHigh-density local compute
https://bezit.substack.com/p/the-16k-decision-that-saved-my-startup?r=7xveqk&utm_campaign=post&utm_medium=web&triedRedirect=true

Comments

Popular posts from this blog

MVP 2021 ; Progress and Work performed