<-- Back to career page

DevOps Engineer

Medium Level

Location: United States or Turkey only. We do not hire outside these countries.

What you will deliver (first 90 days)

  • Infrastructure as Code baseline for at least two client environments with Terraform modules and drift detection.
  • CI/CD pipelines (GitHub Actions or GitLab CI) with build-to-prod lead time <30 minutes and one-click rollback.
  • Observability stack live: logs, metrics, traces, and alerts for the four golden signals; on-call runbook published.
  • SLOs set for critical services (target ≥99.9%); incident MTTR <30 minutes for P1 after playbooks go live.
  • Secrets and IAM hardened: no secrets in repos, least-privilege roles, automated rotation.
  • Cost controls: 90%+ tag coverage, budgets and alerts, and a documented 10–20% monthly cost reduction plan.
  • Backup and DR: RPO/RTO defined, backups verified with at least one restore test per environment.

Responsibilities

  • Build and maintain cloud infrastructure (AWS or GCP preferred) using Terraform and GitOps workflows.
  • Own CI/CD: artifact management, environment promotion, canary/blue-green, and automated rollback.
  • Run Kubernetes or container platforms (EKS/GKE) including Helm/ArgoCD, autoscaling, and networking.
  • Implement observability: OpenTelemetry/Prometheus/Grafana/ELK, alert routing, SLOs, and incident response.
  • Secure the stack: network policy, IAM, secrets management (SSM/Secrets Manager/Vault), image and dependency scanning, SBOMs.
  • Optimize cost: rightsizing, lifecycle policies, savings plans/RIs, storage tiers, usage reports to stakeholders.
  • Standardize environments: dev/stage/prod parity, ephemeral preview environments, reproducible builds.
  • Document everything: runbooks, architecture diagrams, readiness/DoD/DoR checklists.
  • Collaborate with product, QA, and engineering; support sales with accurate assumptions and effort ranges.

Requirements

  • 4+ years as DevOps, SRE, or Platform Engineer at a startup or agency.
  • Expert with Terraform and Git; comfortable reviewing plans and enforcing policy-as-code.
  • Strong with containers and orchestration (Docker, Kubernetes) and service networking (ingress, DNS, TLS).
  • CI/CD proficiency: GitHub Actions or GitLab CI, artifact registries, caching, and pipeline optimization.
  • Observability skills: metrics, logs, traces, alerting, and on-call hygiene.
  • Security fundamentals: IAM, secrets, image scanning, supply-chain controls.
  • Scripting ability in Bash and one higher-level language (Python, Go, or Node.js).
  • Solid networking and Linux fundamentals.
  • Time-zone overlap: minimum 4 hours with Pacific Time or Türkiye time, depending on team.
  • Based in the United States or Turkey.

Nice to have

  • ArgoCD/Flux, Helm, Kustomize; service mesh (Istio/Linkerd).
  • FinOps experience and unit economics reporting.
  • SOC 2/ISO 27001 readiness work.
  • Data platform operations (Airflow/DBT/warehouse) or mobile CI (Flutter/React Native).
  • Edge/IoT deployment pipelines and OTA update systems.

How we work

  • Two-week sprints, trunk-based development, small PRs, frequent releases.
  • Everything as code. Changes flow through PRs with automated checks.
  • Blameless postmortems and action items with owners and dates.

What to include in your application

  • Resume or LinkedIn.
  • A small Terraform module or pipeline snippet you wrote (redact secrets).
  • A short note on an incident you resolved or a cost reduction you achieved, with metrics and the playbook you used.

Horizon Labs builds MVPs, modernizes legacy systems, and integrates complex stacks for startups and SMBs. If you like fast feedback loops and reliable systems, apply.