Site Reliability Engineer | Platform Engineer | Cloud Architect
Professional with 10+ years of experience in managing distributed systems, infrastructure automation, and production reliability at scale in Cloud.
- Infrastructure: AWS (+10yrs), Container orchestration (Kubernetes, ECS), Terraform, Ansible.
- Observability: Prometheus, Thanos, Tempo, Grafana, OTEL, Datadog.
- CI/CD: GitOps (ArgoCD), GitHub Actions, GitLab CI, TeamCity.
- Datastore: PostgreSQL, Redshift, Redis, Kafka, DynamoDB.
- Rust: Learning phase. If you are a Rust enthusiast, here are the mandatory starting links:
- AI Agent ecosystem and LLM-powered tools: Automated remediation and developer productivity.
- You can check my kagent Lab
- Kubernetes As A Service: Understanding how hyperscalers manage Kubernetes infrastructure.
- You can check my KAAS with CAPI and Sveltos Lab
